<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">REMSE</journal-id>
<journal-title>Remote Sensing</journal-title>
<issn pub-type="epub">2072-4292</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/rs4020532</article-id>
<article-id pub-id-type="publisher-id">remotesensing-04-00532</article-id>
<article-categories>
<subj-group>
<subject>Article</subject></subj-group></article-categories>
<title-group>
<article-title>Two Linear Unmixing Algorithms to Recognize Targets Using Supervised Classification and Orthogonal Rotation in Airborne Hyperspectral Images</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Averbuch</surname><given-names>Amir</given-names></name><xref ref-type="corresp" rid="c1-remotesensing-04-00532"><sup>★</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Zheludev</surname><given-names>Michael</given-names></name></contrib>
<aff id="af1-remotesensing-04-00532">School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel; E-Mail: <email>mzheludev@gmail.com</email></aff></contrib-group>
<author-notes>
<corresp id="c1-remotesensing-04-00532">
<label>★</label> Author to whom correspondence should be addressed; E-Mail: <email>amir@math.tau.ac.il</email>; Tel.: +972-54-569-4455; Fax: +972-3-642-2020.</corresp></author-notes>
<pub-date pub-type="collection">
<year>2012</year></pub-date>
<pub-date pub-type="epub">
<day>21</day>
<month>02</month>
<year>2012</year></pub-date>
<volume>4</volume>
<issue>2</issue>
<fpage>532</fpage>
<lpage>560</lpage>
<history>
<date date-type="received">
<day>05</day>
<month>12</month>
<year>2011</year></date>
<date date-type="rev-recd">
<day>31</day>
<month>01</month>
<year>2012</year></date>
<date date-type="accepted">
<day>01</day>
<month>02</month>
<year>2012</year></date></history>
<copyright-statement>© 2012 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
<copyright-year>2012</copyright-year>
<license>
<p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (<ext-link xlink:href="http://creativecommons.org/licenses/by/3.0/" ext-link-type="uri">http://creativecommons.org/licenses/by/3.0/</ext-link>.)</p></license>
<abstract>
<p>The goal of the paper is to detect pixels that contain targets of known spectra. The target can be present in a sub- or above pixel. Pixels without targets are classified as background pixels. Each pixel is treated via the content of its neighborhood. A pixel whose spectrum is different from its neighborhood is classified as a “suspicious point”. In each suspicious point there is a mix of target(s) and background. The main objective in a supervised detection (also called “target detection”) is to search for a specific given spectral material (target) in hyperspectral imaging (HSI) where the spectral signature of the target is known <italic>a priori</italic> from laboratory measurements. In addition, the fractional abundance of the target is computed. To achieve this we present two linear unmixing algorithms that recognize targets with known (given) spectral signatures. The CLUN is based on automatic feature extraction from the target’s spectrum. These features separate the target from the background. The ROTU algorithm is based on embedding the spectra space into a special space by random orthogonal transformation and on the statistical properties of the embedded result. Experimental results demonstrate that the targets’ locations were extracted correctly and these algorithms are robust and efficient.</p></abstract>
<kwd-group>
<kwd>hyperspectral imaging</kwd>
<kwd>unmixing</kwd>
<kwd>spectral signature</kwd>
<kwd>target recognition</kwd>
<kwd>sub-above pixel</kwd>
<kwd>supervised classification</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<sec sec-type="methods">
<label>1.1.</label>
<title>Data Representation and Extraction of Spectral Information</title>
<p>Hyperspectral remote sensing exploits the fact that all materials reflect, absorb and emit electromagnetic energy at specific wavelengths. In comparison to a typical camera that uses red, green and blue colors as three wavelength bands, hyperspectral imaging (HSI) sensors acquire digital images in many contiguous and very narrow spectral bands that typically span the visible, near-infrared and mid-infrared portions of the spectrum. This enables to construct essentially continuous radiance spectrum for every pixel in the scene. The goal of the paper is to identify and classify materials that are present in HSI and to detect targets of interest.</p>
<p>Material and object detection using remotely sensed spectral information has many military and civilian applications. Detection algorithms can be divided into two classes: supervised and unsupervised. The unsupervised detection is also called “anomaly detection”. Anomalies are defined as patterns in the data that do not conform to a well defined notion of normal data behavior. In anomaly detection tasks, hyperspectral pixels have to be classified as either background (normal behavior) or anomalies. Every pixel or group of pixels, which are spectrally different in a meaningful way from the abundant (background) spectral material, are classified as anomalies. A survey of unsupervised detection algorithms is given in [<xref ref-type="bibr" rid="b1-remotesensing-04-00532">1</xref>].</p>
<p>In a supervised detection (also called “target detection”), the main objective is to search for a specific given spectral material (target) in HSI. The spectral signature of the target is known <italic>a priori</italic> from laboratory measurements. In this paper, we focus on supervised target detection in HSI.</p>
<p>The paper has the following structure. Problem statement including HSI descriptions, which are used for testing the proposed algorithms, are given in Section 2. Related work are described in Section 3. Section 4 describes the needed mathematical background that is used for the constructed solution. Section 5 presents the ROTU algorithm that is based on orthogonal rotations. Classification for an unmixing algorithm is given in Section 6 and its experimental results are presented in Section 6.4.</p></sec></sec>
<sec>
<label>2.</label>
<title>Problem Statement</title>
<p>The goal in this paper is to detect pixels that contain targets of a known spectrum. The target can be present in a sub- or in a whole pixel. Pixels without targets are classified as background pixels. Each pixel is treated via the content of its neighborhood. A pixel whose spectrum is different from its neighborhood according to [<xref ref-type="bibr" rid="b2-remotesensing-04-00532">2</xref>] is classified as a “suspicious point”. In each suspicious point there is a mix of target(s) and background. For example, <xref ref-type="fig" rid="f1-remotesensing-04-00532">Figures 1</xref> and <xref ref-type="fig" rid="f2-remotesensing-04-00532">2</xref> display a scene from 2nd figure in Section 2.1 with an exemplary point. The background, the target and the mixed pixel spectra are displayed in <xref ref-type="fig" rid="f3-remotesensing-04-00532">Figure 3</xref>.</p>
<p>Linear mixing is the most widely used as spectral mixing model, which assumes that the observed reflectance spectrum of a given pixel is generated by a linear combination of a small number of unique constituent deterministic spectral signatures. Endmembers are the outputs from the spectral unmixing. This model is defined with constraints in the following way:
<disp-formula id="FD1">
<label>(1)</label>
<mml:math id="mm1" display="block">
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>M</mml:mi></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>a</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>s</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo>+</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi>S</mml:mi>
<mml:mi>a</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi>w</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>M</mml:mi></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>a</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mtext>additivity constraint</mml:mtext>
<mml:mo>,</mml:mo>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>a</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo>≥</mml:mo>
<mml:mn>0</mml:mn>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mrow>
<mml:mtext>positivity constraint</mml:mtext>
<mml:mo>,</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:mrow></mml:math></disp-formula>where <italic>s</italic><sub>1</sub>, <italic>s</italic><sub>2</sub>, . . . , <italic>s<sub>M</sub></italic> are the M endmembers spectra which are assumed to be linearly independent, <italic>a</italic><sub>1</sub>, <italic>a</italic><sub>2</sub>, . . . , <italic>a<sub>M</sub></italic> are the corresponding abundances (cover the material fractions) and <italic>w</italic> is an additive-noise vector. Endmembers may be obtained from spectral libraries, in-scene spectra, or geometrical techniques.</p>
<p>A mixed spectrum means a spectrum that contains background and target spectra. We consider a simplified version of <xref ref-type="disp-formula" rid="FD1">Equation (1)</xref> by defining a simple mixing model that describes the relation between the target and its background. Assume <italic>P</italic> is a pixel of a mixed spectrum and <italic>T</italic> is a given target’s spectrum. Consider three spectra: the background average spectrum 
<inline-formula>
<mml:math id="mm2" display="inline">
<mml:mrow>
<mml:mi>B</mml:mi>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mo>∑</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>M</mml:mi></mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi>a</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>B</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, the mixed pixel spectrum (spectrum of a suspicious point) <italic>P</italic>, the target’s spectrum <italic>T</italic> and the fractional abundance <italic>t</italic>. They are related by the following model
<disp-formula id="FD2">
<label>(2)</label>
<mml:math id="mm3" display="block">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi>t</mml:mi>
<mml:mi>B</mml:mi>
<mml:mo>+</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi>t</mml:mi>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>M</mml:mi></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>c</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>B</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo>+</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>T</mml:mi></mml:mrow></mml:mrow></mml:math></disp-formula>which is a modified version of <xref ref-type="disp-formula" rid="FD1">Equation (1)</xref>, where <italic>a</italic><sub>1</sub> = <italic>t</italic> and <italic>s</italic><sub>1</sub> = <italic>T</italic>, <italic>t</italic> ∈ 𝕉<italic>, t</italic> ∈ (0<italic>,</italic> 1). Each <italic>B<sub>k</sub></italic><italic>, k</italic> = 1, . . . , <italic>M</italic>, was taken from the pixel’s neighborhood, therefore, they are close to each other and have a similar feature.</p>
<p>We are given the target’s spectrum <italic>T</italic> and the mixed pixel spectrum <italic>P</italic>. Our goal is to estimate <italic>t</italic>, denoted by <italic>t̂</italic>, which will satisfy <xref ref-type="disp-formula" rid="FD2">Equation (2)</xref> provided that <italic>B</italic> and <italic>T</italic> have some independent features. Once <italic>t̂</italic> is found, the estimate of the unknown background spectrum <italic>B</italic>, denoted by <italic>B̂</italic>, can be calculated by <italic>B̂</italic> = (<italic>P − t̂T</italic>)<italic>/</italic>(1 <italic>− t̂</italic>). The estimated parameter <italic>t</italic> in <xref ref-type="disp-formula" rid="FD2">Equation (2)</xref> is called fractional abundance.</p>
<sec>
<label>2.1.</label>
<title>Motivation and Research Approach</title>
<p>The related algorithms, which appeared in Section 3 (also in [<xref ref-type="bibr" rid="b2-remotesensing-04-00532">2</xref>]), assume that spectra of different materials are statistically independent. Weaker assumptions for the relations between spectra were considered also in [<xref ref-type="bibr" rid="b2-remotesensing-04-00532">2</xref>]. A weaker assumption is the orthogonality condition expressed later in Proposition 4.3: Assume that <italic>B</italic> is an unknown background spectrum, <italic>T</italic> and <italic>P</italic> are the known spectra of the target and the mixed pixel, respectively. Assume that the parameter <italic>t</italic> satisfies the linear mixing model <italic>P</italic> = <italic>tT</italic> + (1 <italic>− t</italic>)<italic>B</italic>. If <italic>B</italic> and <italic>T</italic> are orthogonal, then <italic>t</italic> = <italic>〈T</italic>, <italic>P 〉</italic>/‖<italic>T</italic>‖<sup>2</sup>.</p>
<p>This proposition implies a solution if the background and the target have statistically independent spectra. Note that if <italic>〈T, B〉</italic> = <italic>∊</italic>, then the variance of this estimation error is <italic>∊</italic> /‖<italic>T</italic>‖<sup>2</sup>.</p>
<p>The linear unmixing algorithms, which are presented in [<xref ref-type="bibr" rid="b2-remotesensing-04-00532">2</xref>], assume only linear independency between different spectral materials. But they can have big mutually correlated coefficients. We assume that the spectra of different materials have different sections in their spectrum: some sections are mutually correlated, some sections are smooth, some sections have a Gaussian random behavior and some sections have oscillatory behavior. First and second large derivatives are used as characteristic features to achieve spectral classification.</p>
<p>The first algorithm in [<xref ref-type="bibr" rid="b2-remotesensing-04-00532">2</xref>] (Section 3), titled WDR, works well but does not detect sub-pixel targets. The second algorithm in [<xref ref-type="bibr" rid="b2-remotesensing-04-00532">2</xref>] (Section 4), titled UNSP, works well but it is computationally expensive due to the need to perform eigenvectors decomposition in each pixel’s neighborhood. The UNSP method, which has to construct an orthocomplement of the linear span of the principal directions of the background of each pixel, is computationally expensive.</p>
<p>In this paper, we present two new methods based on the same assumption about the relation between spectra of different materials as was done in [<xref ref-type="bibr" rid="b2-remotesensing-04-00532">2</xref>]. The CLUN algorithm is based on classification by learning. The learning phase finds the set of wavebands which satisfies independency between the target and the background. By using this set, we map the spectra space to another space, which provides independency between the target’s spectrum and the background’s spectra. All the computationally expensive steps in this algorithm belong to the learning phase. The target detection phase is fast. ROTU is based on random orthogonal projections. This algorithm does not require a learning phase. It is less reliable than WDR and UNSP, which were described in [<xref ref-type="bibr" rid="b2-remotesensing-04-00532">2</xref>], since it generates more false alarms. On the other hand, it is faster. The ROTU algorithm assumes that the first and the second derivatives of the spectra from different materials are close to be orthogonal. In this algorithm, as opposed to the previous algorithm, several targets can be detected. This algorithm scans through all the scene and searches each pixel without taking into consideration its neighborhood.</p>
<p>The experiments in this paper including performance evaluation were performed on three real hyper-spatial datasets titled: “desert” (<xref ref-type="fig" rid="f4-remotesensing-04-00532">Figure 4</xref>), “city” (<xref ref-type="fig" rid="f5-remotesensing-04-00532">Figure 5</xref>) and “field” (<xref ref-type="fig" rid="f6-remotesensing-04-00532">Figure 6</xref>), which were captured by the Specim camera [<xref ref-type="bibr" rid="b3-remotesensing-04-00532">3</xref>]. Their properties with a display of one waveband per dataset is given in <xref ref-type="fig" rid="f4-remotesensing-04-00532">Figures 4</xref>–<xref ref-type="fig" rid="f6-remotesensing-04-00532">6</xref>.</p></sec></sec>
<sec>
<label>3.</label>
<title>Related Work</title>
<p>Up-to-date overview on hyperspectral unmixing is given in [<xref ref-type="bibr" rid="b4-remotesensing-04-00532">4</xref>,<xref ref-type="bibr" rid="b5-remotesensing-04-00532">5</xref>]. The challenges related to target detection, which is the main focus of this paper, are described in the survey papers [<xref ref-type="bibr" rid="b6-remotesensing-04-00532">6</xref>,<xref ref-type="bibr" rid="b7-remotesensing-04-00532">7</xref>]. They provide tutorial review on state-of-the-art target detection algorithms for HSI applications. The main obstacles in having effective detection algorithms are the inherent variability target and background spectra. Adaptive algorithms are effective to solve some of these problems. The solution provided in this paper meet some of the challenges mentioned in [<xref ref-type="bibr" rid="b6-remotesensing-04-00532">6</xref>].</p>
<p>In the rest of this section, we divided the many existing algorithms into several groups. We wish to show some trends but do not attempt to cover the avalanche of related work on unmixing and target detection.</p>
<list list-type="simple">
<list-item>
<p><bold>Linear approach:</bold> Under the linear mixing model, where the number of endmembers and their spectral signatures are known, hyperspectral unmixing is a linear problem, which can be addressed, for example, by the ML setup [<xref ref-type="bibr" rid="b8-remotesensing-04-00532">8</xref>] and by the constrained least squares approach [<xref ref-type="bibr" rid="b9-remotesensing-04-00532">9</xref>]. These methods do not supply sufficiently accurate estimates and do not reflect the physical behavior. Distinction between different material’s spectra is conditioned generally by the distinction in the behavior of the first and second derivatives and not by a trend.</p></list-item>
<list-item>
<p><bold>Independent component analysis (ICA)</bold> is an unsupervised source separation process that finds a linear decomposition of the observed data yielding statistically independent components [<xref ref-type="bibr" rid="b10-remotesensing-04-00532">10</xref>,<xref ref-type="bibr" rid="b11-remotesensing-04-00532">11</xref>]. It has been applied successfully to blind source separation, to feature extraction and to unsupervised recognition such as in [<xref ref-type="bibr" rid="b12-remotesensing-04-00532">12</xref>], where the endmember signatures are treated as sources and the mixing matrix is composed by the abundance fractions. Numerous works including [<xref ref-type="bibr" rid="b13-remotesensing-04-00532">13</xref>] show that ICA cannot be used to unmix hyperspectral data.</p></list-item>
<list-item>
<p><bold>Geometric approach:</bold> Assume a linear mixing scenario where each observed spectral vector is given by <bold>r</bold> = <bold>x+n</bold>=<bold>M</bold><italic>γ</italic><bold>a+n</bold>, <italic>γ</italic><bold>a</bold> = <bold>s</bold>, where <bold>r</bold> is an <italic>L</italic>-vector (<italic>L</italic> is the number of bands), <bold>M</bold> = [<bold>m</bold><sub>1</sub>, <bold>m</bold><sub>2</sub>, . . . <bold>m</bold><italic><sub>p</sub></italic>] is the mixing matrix (<bold>m</bold><italic><sub>i</sub></italic> denotes the <italic>i</italic>th endmember signature and <italic>p</italic> is the number of endmembers present in the sensed area), <bold>s</bold> ≜ <italic>γ</italic><bold>a</bold> (<italic>γ</italic> is a scale factor that models illumination variability due to a surface topography), <bold>a</bold> = [<italic>a</italic><sub>1</sub>, <italic>a</italic><sub>2</sub>, . . . <italic>a<sub>p</sub></italic>]<italic><sup>T</sup></italic> is the abundance vector that contains the fractions of each endmember ((<italic>·</italic>)<italic><sup>T</sup></italic> denotes a transposed vector) and <bold>n</bold> is the system’s additive noise. Owing to physical constraints, abundance fractions are nonnegative and satisfy the so-called positivity constraint 
<inline-formula>
<mml:math id="mm4" display="inline">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mo>∑</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>p</mml:mi></mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi>a</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:math></inline-formula>. Each pixel can be viewed as a vector in a <italic>L</italic>-dimensional Euclidean space, where each channel is assigned to one axis. Since the set {<bold>a</bold> ∈ 𝕉<italic><sup>p</sup></italic> : 
<inline-formula>
<mml:math id="mm5" display="inline">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mo>∑</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>p</mml:mi></mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi>a</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:math></inline-formula><italic>, a<sub>k</sub></italic> <italic>&gt;</italic> 0 for all <italic>k</italic>} is a simplex, then the set <italic>S<sub>x</sub></italic> ≜ {<bold>x</bold> ∈ 𝕉<italic><sup>L</sup></italic> : <bold>x</bold> = <bold>Ma</bold>, 
<inline-formula>
<mml:math id="mm6" display="inline">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mo>∑</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>p</mml:mi></mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi>a</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:math></inline-formula>, <italic>a<sub>k</sub></italic> <italic>&gt;</italic> 0 for all <italic>k</italic>} is also a simplex whose vertices correspond to endmembers.</p>
<p>Several approaches [<xref ref-type="bibr" rid="b14-remotesensing-04-00532">14</xref>,<xref ref-type="bibr" rid="b15-remotesensing-04-00532">15</xref>,<xref ref-type="bibr" rid="b16-remotesensing-04-00532">16</xref>] exploited this geometric feature of hyperspectral mixtures. The minimum volume transform (MVT) algorithm [<xref ref-type="bibr" rid="b16-remotesensing-04-00532">16</xref>] determines the simplex of a minimal volume that contains the data. The method presented in [<xref ref-type="bibr" rid="b17-remotesensing-04-00532">17</xref>] is also of MVT type, but by introducing the notion of bundles, it takes into account the endmember variability that is usually present in hyperspectral mixtures.</p>
<p>The MVT type approaches are complex from computational point of view. Usually, these algorithms first find the convex hull defined by the observed data and then fit a minimum volume simplex to it. Aiming at a lower computational complexity, some algorithms such as the pixel purity index (PPI) [<xref ref-type="bibr" rid="b15-remotesensing-04-00532">15</xref>] and the N-FINDR [<xref ref-type="bibr" rid="b18-remotesensing-04-00532">18</xref>] still find the maximum volume simplex that contains the data cloud. They assume the presence of at least one pure pixel of each endmember in the data. This is a strong assumption that may not be true in general. In any case, these algorithms find the set of most of the pure pixels in the data.</p></list-item>
<list-item>
<p><bold>Extending subspace approach:</bold> A fast unmixing algorithm, termed <italic>vertex component analysis</italic> (VCA), is described in [<xref ref-type="bibr" rid="b19-remotesensing-04-00532">19</xref>]. The algorithm is unsupervised and utilizes two facts: 1. The endmembers are the vertices of a simplex; 2. The affine transformation of a simplex is also a simplex. It works with projected and unprojected data. As PPI and N-FINDR algorithms, VCA also assumes the presence of pure pixels in the data. The algorithm iteratively projects data onto a direction orthogonal to the subspace spanned by the endmembers already detected. The new endmember’s signature corresponds to the extreme projection. The algorithm iterates until all the endmembers are exhausted. VCA performs much better than PPI and better than or comparable to N-FINDR. Yet, its computational complexity is between one and two orders of magnitude lower than N-FINDR.</p>
<p>If the image is of size approximately 300 <italic>×</italic> 2<italic>,</italic> 000 pixels, then this method, which builds linear span in each step, is too computationally expensive. In addition, it relies on “pure” spectra which are not available all the time.</p></list-item>
<list-item>
<p><bold>Statistical methods:</bold> In the statistical framework, spectral unmixing is formulated as a statistical inference problem by adopting a Bayesian methodology where the inference engine is the posterior density of the random objects to be estimated as described for example in [<xref ref-type="bibr" rid="b20-remotesensing-04-00532">20</xref>,<xref ref-type="bibr" rid="b21-remotesensing-04-00532">21</xref>,<xref ref-type="bibr" rid="b22-remotesensing-04-00532">22</xref>].</p></list-item></list>
<sec>
<label>3.1.</label>
<title>Linear Classification for Threshold Optimization</title>
<p>The description in this section, which is based on [<xref ref-type="bibr" rid="b23-remotesensing-04-00532">23</xref>], is given since it is being used later in our development of the proposed methodology.</p>
<p>According to [<xref ref-type="bibr" rid="b23-remotesensing-04-00532">23</xref>], a binary classification is frequently performed by using a real-valued function <italic>f : X ⊆</italic> 𝕉<italic><sup>n</sup></italic> <italic>→</italic> 𝕉 in the following way: the input <italic>x</italic> = (<italic>x</italic><sub>1</sub><italic>, ..., x<sub>n</sub></italic>)<italic><sup>T</sup></italic> is assigned to a positive class if <italic>f</italic>(<italic>x</italic>) ≥ 0, otherwise, to a negative class. We consider the case where <italic>f</italic>(<italic>x</italic>) is a linear function of <italic>x</italic> with the parameters <italic>w</italic> and <italic>b</italic> such that
<disp-formula id="FD3">
<label>(3)</label>
<mml:math id="mm7" display="block">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>〈</mml:mo>
<mml:mrow>
<mml:mi>w</mml:mi>
<mml:mo>⋅</mml:mo>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mo>〉</mml:mo></mml:mrow>
<mml:mo>+</mml:mo>
<mml:mi>b</mml:mi>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>n</mml:mi></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>+</mml:mo>
<mml:mi>b</mml:mi></mml:mrow></mml:mrow></mml:math></disp-formula>where (<italic>w, b</italic>) <italic>∈</italic> 𝕉<italic><sup>n</sup></italic> <italic>×</italic> 𝕉 are the parameters that control the function. The decision rule is given by <italic>sgn</italic>(<italic>f</italic>(<italic>x</italic>)). <italic>w</italic> is assumed to be the weight vector and <italic>b</italic> is the threshold, then according to [<xref ref-type="bibr" rid="b23-remotesensing-04-00532">23</xref>].</p>
<p><bold>Definition 3.1</bold> <italic>A training set is a collection of training examples (data)</italic>
<disp-formula id="FD4">
<label>(4)</label>
<mml:math id="mm8" display="block">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>y</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mi>l</mml:mi></mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>y</mml:mi></mml:mrow>
<mml:mi>l</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>⊆</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo>×</mml:mo>
<mml:mi>Y</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>l</mml:mi></mml:msup></mml:mrow></mml:math></disp-formula><italic>where l is the number of examples, X ⊆</italic> 𝕉<italic><sup>n</sup></italic> <italic>and Y</italic> = {−1, 1} <italic>is the output domain.</italic></p>
<p>The Rosenblatt’s Perceptron algorithm ([<xref ref-type="bibr" rid="b23-remotesensing-04-00532">23</xref>,<xref ref-type="bibr" rid="b24-remotesensing-04-00532">24</xref>], pages 12 and 8, respectively) creates an hyperplane 〈<italic>w ·x</italic>〉+<italic>b</italic> = 0 with respect to the training set <italic>S</italic>. It creates the best linear separation between positive and negative examples via minimization of measurement function of “margin” distribution <italic>λ<sub>i</sub></italic> = <italic>y<sub>i</sub></italic>(<italic>〈w, x<sub>i</sub></italic><italic>〉</italic> + <italic>b</italic>). The classification for (<italic>x<sub>i</sub></italic><italic>, y<sub>i</sub></italic>) is correct when <italic>λ<sub>i</sub></italic> <italic>&gt;</italic> 0.</p>
<p>The Perceptron algorithm is guaranteed to converge only if the training data are linearly separable. A procedure that does not suffer from this limitation is the Linear Discriminant Analysis (LDA) via Fisher’s discriminant functional [<xref ref-type="bibr" rid="b23-remotesensing-04-00532">23</xref>]. The aim is to find the hyperplane (<italic>w, b</italic>) on which the projection of the data is maximally separated. The cost function (the Fisher’s function) to be optimized is:
<disp-formula id="FD5">
<label>(5)</label>
<mml:math id="mm9" display="block">
<mml:mrow>
<mml:mi>F</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>m</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>m</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>σ</mml:mi></mml:mrow>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>σ</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula>where <italic>m<sub>i</sub></italic> and <italic>σ<sub>i</sub></italic> are the mean and the standard deviation, respectively, of the function output values <italic>P<sub>i</sub></italic> = {<italic>〈w · x<sub>j</sub></italic><italic>〉</italic> + <italic>b : y<sub>j</sub></italic> = <italic>i</italic>} for the two classes <italic>P<sub>i</sub></italic>, <italic>i</italic> = 1<italic>, −</italic>1. From [<xref ref-type="bibr" rid="b23-remotesensing-04-00532">23</xref>] we get the following definition.</p>
<p><bold>Definition 3.2</bold> <italic>The dataset S in <xref ref-type="disp-formula" rid="FD4">Equation 4</xref> is linearly separable if the hyperplane 〈w · x〉</italic> + <italic>b</italic> = 0<italic>, which is obtained via the LDA algorithm ([<xref ref-type="bibr" rid="b23-remotesensing-04-00532">23</xref>]), correctly classifies the training data. It means that λ<sub>i</sub></italic> = <italic>y<sub>i</sub></italic>(<italic>〈w, x<sub>i</sub></italic><italic>〉</italic> + <italic>b</italic>) <italic>&gt;</italic> 0<italic>, i</italic> = 1<italic>, . . . , l. In this case, b is the separation threshold. If λ<sub>i</sub></italic> <italic>&lt;</italic> 0<italic>, then the dataset is linearly inseparable.</italic></p>
<p><bold>Definition 3.3</bold> <italic>The vector x ∈</italic> 𝕉<italic><sup>n</sup></italic> <italic>is isolated from the set P</italic> = {<italic>p</italic><sub>1</sub><italic>, ..., p<sub>k</sub></italic>} ⊆ 𝕉<italic><sup>n</sup></italic> <italic>if the training set S</italic> = ((<italic>x,</italic> 1)<italic>,</italic> (<italic>p</italic><sub>1</sub><italic>, −</italic>1)<italic>, ...,</italic> (<italic>p<sub>k</sub></italic><italic>, −</italic>1)) <italic>is linearly separable according to Definition 3.2. In this case, the absolute value of b is the separation threshold.</italic></p>
<p>Suppose that we have a set <italic>S</italic> = {<italic>x</italic><sub>1</sub><italic>, .., x<sub>n</sub></italic>} of <italic>n</italic> samples. First, we want to partition the data into exactly two disjoint subsets <italic>S</italic><sub>1</sub> and <italic>S<sub>−</sub></italic><sub>1</sub>. Each subset represents a cluster. The solution is based on the K-means algorithm ([<xref ref-type="bibr" rid="b25-remotesensing-04-00532">25</xref>]). K-means maximizes the function <italic>J</italic>(<italic>e</italic>) (<xref ref-type="disp-formula" rid="FD6">Equation (6)</xref>) where <italic>e</italic> is a partition. The value of <italic>J</italic>(<italic>e</italic>) depends on how the samples are grouped into clusters and on the number of clusters (see [<xref ref-type="bibr" rid="b25-remotesensing-04-00532">25</xref>]).
<disp-formula id="FD6">
<label>(6)</label>
<mml:math id="mm10" display="block">
<mml:mrow>
<mml:mi>J</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>e</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">tr</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>B</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></disp-formula>where 
<inline-formula>
<mml:math id="mm11" display="inline">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>W</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mo>∑</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>l</mml:mi></mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mo>∑</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>m</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>m</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> is called “within-cluster scatter matrix” ([<xref ref-type="bibr" rid="b25-remotesensing-04-00532">25</xref>]), <italic>l</italic> is the number of classes, <italic>S<sub>i</sub></italic> are the classes and <italic>m<sub>i</sub></italic> are the center of each class. <italic>S<sub>B</sub></italic> is called “between-cluster scatter matrix” ([<xref ref-type="bibr" rid="b25-remotesensing-04-00532">25</xref>]), where 
<inline-formula>
<mml:math id="mm12" display="inline">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>B</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mo>∑</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>l</mml:mi></mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi>n</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>m</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>−</mml:mo>
<mml:mi>m</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>m</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>−</mml:mo>
<mml:mi>m</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, <italic>n<sub>i</sub></italic> is the cardinality of the class <italic>i</italic> and <italic>m</italic> is the center for the dataset.</p>
<p><bold>Definition 3.4</bold> <italic>Let</italic> (<italic>w, b</italic>) <italic>be the best separation for the set S</italic> = {<italic>x</italic><sub>1</sub><italic>, ..., x<sub>n</sub></italic>} <italic>⊆</italic> 𝕉<italic><sup>n</sup></italic> <italic>via K-means and Fisher’s discriminant analyses [<xref ref-type="bibr" rid="b23-remotesensing-04-00532">23</xref>,<xref ref-type="bibr" rid="b24-remotesensing-04-00532">24</xref>].</italic> (<italic>w, b</italic>) <italic>is called the Fisher’s separation and b the Fisher’s threshold for the data P.</italic></p>
<p>When is a dataset separable? One criterion is when <italic>m</italic><sub>1</sub> <italic>−m<sub>−</sub></italic><sub>1</sub> <italic>&gt; max</italic>(<italic>diam</italic>(<italic>P</italic><sub>1</sub>)<italic>, diam</italic>(<italic>P<sub>−</sub></italic><sub>1</sub>)), where the notation in <xref ref-type="disp-formula" rid="FD5">Equation (5)</xref> is used. Another criterion is:</p>
<p><bold>Definition 3.5</bold> <italic>([<xref ref-type="bibr" rid="b25-remotesensing-04-00532">25</xref>]) A dataset is separable if from <xref ref-type="disp-formula" rid="FD6">Equation (6)</xref>, J</italic>(<italic>e</italic><sub>1</sub>) <italic>&lt; J</italic>(<italic>e</italic><sub>2</sub>) <italic>where e</italic><sub>1</sub> <italic>is the partition and the number of classes is</italic> 1 <italic>and e</italic><sub>2</sub> <italic>is the best partition into two classes. If J</italic>(<italic>e</italic><sub>1</sub>) ≥ <italic>J</italic>(<italic>e</italic><sub>2</sub>) <italic>then the dataset is inseparable and Fisher’s separation is incorrect.</italic></p></sec></sec>
<sec>
<label>4.</label>
<title>Mathematical Preliminaries</title>
<p>This section describes the new mathematical background constructions that are needed for the proposed solution in this paper.</p>
<sec>
<label>4.1.</label>
<title>Definitions</title>
<p>Denote by <italic>pr<sub>W</sub></italic> (<italic>y</italic>) the orthogonal projection of the vector <italic>y ∈</italic> 𝕉<italic><sup>n</sup></italic> on the subspace <italic>W ⊂</italic> 𝕉<italic><sup>n</sup></italic>.</p>
<p><bold>Definition 4.1</bold> <italic>Given a vector x</italic> = (<italic>x</italic><sub>1</sub><italic>, x</italic><sub>2</sub><italic>, ..., x<sub>n</sub></italic>)<italic>. Denote mode<sub>ɛ</sub></italic>(<italic>x</italic>) ≜ <italic>argmax<sub>p</sub></italic>(<italic>card</italic>{<italic>i ∈</italic> [1<italic>, .., n</italic>] : |<italic>p − x<sub>i</sub></italic><italic>| &lt; ɛ</italic>}) <italic>where p is scalar. mode<sub>ɛ</sub></italic>(<italic>x</italic>) <italic>is called the mode with ɛ-error of the vector x.</italic></p>
<p>Let <italic>q</italic> be a positive number. A vector <italic>x</italic> is called <italic>q − ɛ − sparse</italic> if the cardinality of its support {<italic>i : |x<sub>i</sub></italic><italic>| &gt; ɛ</italic>} is less or equal to <italic>q</italic>.</p>
<p>The norm of the linear operator <italic>L</italic>, <italic>L</italic> : 𝕉<italic><sup>n</sup></italic> <italic>→</italic> 𝕉<italic><sup>n</sup></italic>, is ‖<italic>L</italic>‖ ≜ sup<sub>‖<italic>x</italic>‖</sub><sub>=1</sub> ‖<italic>Lx</italic>‖.</p>
<p><bold>Definition 4.2</bold> <italic>Let G be a vector space and assume that there is a finite subset ℑ</italic> ⊂ <italic>G. The set ℑ is ɛ-dense if for any element x</italic> ∈ <italic>G there is s ∈ ℑ such that</italic> ‖<italic>x − s</italic>‖ &lt; <italic>ɛ.</italic></p>
<p><bold>Definition 4.3</bold> <italic>Let G be a vector space and assume that the set ℑ is ɛ-dense in G. Let</italic> 
<inline-formula>
<mml:math id="mm13" display="inline">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>V</mml:mi></mml:mrow>
<mml:mi>G</mml:mi>
<mml:mi>r</mml:mi></mml:msubsup>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> = {<italic>x ∈ G :</italic> ‖<italic>x − s</italic>‖ <italic>&lt; r</italic>}<italic>. ℑ is homogeneous if for any s</italic><sub>1</sub><italic>, s</italic><sub>2</sub> ∈ <italic>ℑ and r &gt; ɛ,</italic> 
<inline-formula>
<mml:math id="mm14" display="inline">
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>V</mml:mi></mml:mrow>
<mml:mi>G</mml:mi>
<mml:mi>r</mml:mi></mml:msubsup>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>s</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>∩</mml:mo>
<mml:mi>ℑ</mml:mi>
<mml:mo>|</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo>|</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>V</mml:mi></mml:mrow>
<mml:mi>G</mml:mi>
<mml:mi>r</mml:mi></mml:msubsup>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>s</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>∩</mml:mo>
<mml:mi>ℑ</mml:mi>
<mml:mo>|</mml:mo></mml:mrow></mml:math></inline-formula> <italic>holds. In this notation,</italic> 
<inline-formula>
<mml:math id="mm15" display="inline">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>V</mml:mi></mml:mrow>
<mml:mi>G</mml:mi>
<mml:mi>r</mml:mi></mml:msubsup>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> <italic>is called the r − neighborhood of s.</italic></p>
<p>For example, the integers set 𝕑 is <italic>ɛ</italic>-dense in the real numbers set 𝕉 when <italic>ɛ</italic> = 1. The set of rational numbers 𝕈 is <italic>ɛ</italic>-dense in 𝕉 for any <italic>ɛ</italic>. 𝕑 and 𝕈 are homogeneous.</p>
<p>Given two vector <italic>v</italic><sub>1</sub> and <italic>v</italic><sub>2</sub> in 𝕉<italic><sup>n</sup></italic>. Assume that <italic>S</italic><sub>1</sub> and <italic>S</italic><sub>2</sub> are two linear operators in 𝕉<italic><sup>n</sup></italic> and <italic>L</italic> is a linear span of {<italic>v</italic><sub>1</sub><italic>, v</italic><sub>2</sub>}. We say that <italic>S</italic><sub>1</sub> and <italic>S</italic><sub>2</sub> are equivalent in relation to <italic>v</italic><sub>1</sub> and <italic>v</italic><sub>2</sub>, denoted by <italic>S</italic><sub>1</sub> <italic>∼ S</italic><sub>2</sub>, if <italic>S</italic><sub>1</sub>(<italic>L</italic>) = <italic>S</italic><sub>2</sub>(<italic>L</italic>).</p>
<sec>
<label>4.1.1.</label>
<title>Sparse-Independency</title>
<p><bold>Definition 4.4</bold> <italic>Given two vectors y</italic><sub>1</sub> <italic>and y</italic><sub>2</sub> <italic>where their components are denoted by y<sub>j</sub></italic>(<italic>i</italic>)<italic>, j</italic> = 1<italic>,</italic> 2<italic>. Let S</italic><sub>1</sub> = {<italic>i : |y</italic><sub>1</sub>(<italic>i</italic>)<italic>| &gt; ɛ</italic>} <italic>and S</italic><sub>2</sub> = {<italic>i : |y</italic><sub>2</sub>(<italic>i</italic>)<italic>| &gt; ɛ</italic>}<italic>. If S</italic><sub>1</sub> ∩ <italic>S</italic><sub>2</sub> = ∅<italic>, then we say that y</italic><sub>1</sub> <italic>and y</italic><sub>2</sub> <italic>are ɛ − sparse − independent. If ɛ</italic> = 0 <italic>then they are called sparse − independent.</italic></p>
<p>Let {<italic>e</italic><sub>1</sub><italic>, e</italic><sub>2</sub><italic>, ..., e<sub>n</sub></italic>} be a basis of 𝕉<italic><sup>n</sup></italic>. Then, any subspace <italic>W</italic> = <italic>e</italic><sub><italic>i</italic><sub>1</sub></sub>𝕉⊕<italic>e</italic><sub><italic>i</italic><sub>2</sub></sub>𝕉 ⊕ ⋯ ⊕<italic>e</italic><sub><italic>i</italic><sub><italic>k</italic></sub></sub>𝕉, where <italic>i</italic><sub>1</sub><italic>, i</italic><sub>2</sub><italic>, ..., i<sub>k</sub></italic> is any subset of {1<italic>,</italic> 2<italic>, ..., n</italic>}, is called a basic subspace of 𝕉<italic><sup>n</sup></italic>.</p>
<p>For example, in the 3D space with the standard basis {<italic>e</italic><sub>1</sub><italic>, e</italic><sub>2</sub><italic>, e</italic><sub>3</sub>}, the plane, which is spanned by the vectors {<italic>e</italic><sub>1</sub><italic>, e</italic><sub>2</sub>}, is a basic subspace. The plane, which is spanned by the vectors {(<italic>e</italic><sub>1</sub> + <italic>e</italic><sub>3</sub>)<italic>/</italic>2<italic>,</italic> (<italic>e</italic><sub>2</sub> + <italic>e</italic><sub>3</sub>)<italic>/</italic>2}, is not a basic subspace. The basic vectors {<italic>e</italic><sub>1</sub><italic>, e</italic><sub>2</sub><italic>, ..., e<sub>n</sub></italic>} of a basis subspace must be a subset of the standard basis 𝕉<italic><sup>n</sup></italic>.</p>
<p>Given a pair of vectors <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> in 𝕉<italic><sup>n</sup></italic>. Assume that <italic>L</italic>(<italic>y</italic><sub>1</sub><italic>, y</italic><sub>2</sub>) is the linear span of <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub>. The basic subspace <italic>W ⊂</italic> 𝕉<italic><sup>n</sup></italic> is called a dependent basic subspace of <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> if <italic>dim</italic>(<italic>Pr<sub>W</sub></italic> <italic>L</italic>(<italic>y</italic><sub>1</sub><italic>, y</italic><sub>2</sub>)) = 1.</p>
<p>Given a pair of vectors <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> in 𝕉<italic><sup>n</sup></italic>. A natural number <italic>r</italic> is called a basis dependent rank of the vectors <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub>, if the two conditions hold:
<list list-type="order">
<list-item>
<p>If <italic>W</italic> is a dependent basic subspace of <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> then <italic>dim</italic>(<italic>W</italic>) ≤ <italic>r</italic>;</p></list-item>
<list-item>
<p>There exists a dependent basic subspace <italic>W</italic> of <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> where <italic>dim</italic>(<italic>W</italic>) = <italic>r</italic>.</p></list-item></list></p>
<p>For example, assume that <italic>y</italic><sub>1</sub> = (1<italic>,</italic> 2<italic>,</italic> 2<italic>,</italic> 3) and <italic>y</italic><sub>2</sub> = (2<italic>,</italic> 1<italic>,</italic> 1<italic>, −</italic>1) belong to 𝕉<sup>4</sup> = <italic>e</italic><sub>1</sub>𝕉 ⊕ ⋯ ⊕ <italic>e</italic><sub>4</sub>𝕉. Let <italic>W</italic> = <italic>e</italic><sub>2</sub>𝕉⊕<italic>e</italic><sub>3</sub>𝕉. Then, <italic>z</italic><sub>1</sub> = <italic>Pr<sub>W</sub></italic> (<italic>y</italic><sub>1</sub>) = (2<italic>,</italic> 2), <italic>z</italic><sub>2</sub> = <italic>Pr<sub>W</sub></italic> (<italic>y</italic><sub>2</sub>) = (1<italic>,</italic> 1) and <italic>z</italic><sub>1</sub> = 2<italic>z</italic><sub>2</sub>. Thus, we obtain that <italic>W</italic> is a dependent basic subspace for <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub>. It is clear that the dependency rank of the vectors <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> is 2.</p>
<p>If a pair of vectors <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> is sparse-independent, then their basis dependent rank is zero.</p>
<p>Assume that <italic>y ∈</italic> 𝕉<italic><sup>n</sup></italic>. The set <italic>S<sub>ɛ</sub></italic>(<italic>y</italic>) = {<italic>i : |y</italic>(<italic>i</italic>)<italic>| &gt; ɛ</italic>} is called the <italic>ɛ</italic>-support of <italic>y</italic>.</p>
<p>Given a pair of vectors <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> in 𝕉<italic><sup>n</sup></italic> and <italic>ɛ &gt;</italic> 0. Let <italic>r</italic> = <italic>min</italic>[<italic>card</italic>(<italic>S</italic><sub><italic>ɛ</italic><sup>2</sup></sub> (<italic>y</italic><sub>1</sub>)<italic>\</italic>(<italic>S<sub>ɛ</sub></italic>(<italic>y</italic><sub>1</sub>) ∩ <italic>S<sub>ɛ</sub></italic>(<italic>y</italic><sub>2</sub>))<italic>, card</italic>(<italic>S</italic><sub><italic>ɛ</italic><sup>2</sup></sub> (<italic>y</italic><sub>2</sub>)<italic>\</italic>(<italic>S<sub>ɛ</sub></italic>(<italic>y</italic><sub>1</sub>) ∩ <italic>S<sub>ɛ</sub></italic>(<italic>y</italic><sub>2</sub>))]. <italic>r</italic> is called the <italic>ɛ</italic>-sparse-independent-rank of the vectors <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub>.</p>
<p><bold>Definition 4.5</bold> <italic>Given a pair of vectors y</italic><sub>1</sub> <italic>and y</italic><sub>2</sub><italic>. Assume that r is their ɛ-sparse-independent-rank and b is a basis dependent rank of the vectors y</italic><sub>1</sub> <italic>and y</italic><sub>2</sub><italic>. These two vectors are called a ɛ-sparse-ergodic independent if r &gt; b.</italic>
<list list-type="simple">
<list-item>
<p>For example, sparse-independent vectors have a basis dependent rank zero and they are <italic>ɛ</italic>-sparse-ergodic independent.</p></list-item>
<list-item>
<p>Denote by <italic>Pr<sub>W</sub></italic> (<italic>y</italic>) the orthogonal projection of the vector <italic>y ∈</italic> 𝕉<italic><sup>n</sup></italic> onto the subspace <italic>W ⊂</italic> 𝕉<italic><sup>n</sup></italic>.</p></list-item>
<list-item>
<p>Let <italic>q</italic> be a positive value. A vector <italic>x</italic> is called <italic>q − ɛ − sparse</italic> if the cardinality of its support {<italic>i</italic> : |<italic>x<sub>i</sub>| &gt; ɛ</italic>} is less or equal to <italic>q</italic>.</p></list-item></list></p></sec>
<sec>
<label>4.1.2.</label>
<title>Classification</title>
<p>Assume that Γ is a set of vectors from 𝕉<italic><sup>n</sup></italic> that has zero average and a singular norm. Γ is called the set of weak classifiers. Assume that Φ and Ω are two subsets in 𝕉<italic><sup>n</sup></italic>. The vector <italic>ξ ∈</italic> Γ is called a optimal weak classifier for Φ and Ω relative to Γ if for any vector <italic>a ∈</italic> Φ and <italic>b ∈</italic> Ω, <italic>ξ</italic> = <italic>argmax</italic><sub><italic>ζ</italic>∈Γ</sub> <italic>〈a, ζ〉</italic> holds and <italic>〈ξ, b〉</italic> = 0.</p>
<p>Our goal is to separate between the sets Φ and Ω. The distinction between these two sets can be described in different ways. Behavior of the first or second derivatives of the vectors can be one way. Another way can be the characteristics of the sets such as the relation between the maximum and the minimum in the vectors. These properties define a set of weak classifiers Γ.</p>
<p>Next, we introduce two families of sensors.</p>
<p>The set Γ ⊆ 𝕉<italic><sup>n</sup></italic> is called a <italic>d − support local − characteristic</italic> set of weak classifiers (<italic>d ≪ n</italic>) if for any <italic>ξ ∈</italic> Γ, <italic>diam</italic>{<italic>i :| ξ<sub>i</sub></italic> <italic>|&gt;</italic> 0} <italic>≤ d</italic> holds.</p>
<p>Given <italic>ɛ &gt;</italic> 0. The set Γ ⊆ 𝕉<italic><sup>n</sup></italic> is called a <italic>d − support global − characteristic</italic> set of weak classifiers if for any <italic>ξ ∈</italic> Γ, <italic>card</italic>{<italic>i :| ξ<sub>i</sub></italic> <italic>|&gt;</italic> 0} = <italic>d</italic> holds and {<italic>i :| ξ<sub>i</sub></italic> <italic>|&gt;</italic> 0} is an <italic>∊ − dense</italic> (Definition 4.2) in the set {1<italic>,</italic> 2<italic>, ..., n</italic>}.</p>
<p><bold>Definition 4.6</bold> <italic>Assume that</italic> {<italic>ξ</italic><sub>1</sub><italic>, ξ</italic><sub>2</sub><italic>, ..., ξ<sub>m</sub></italic>} <italic>are the optimal weak classifiers for two sets</italic> Φ <italic>and</italic> Ω<italic>. Define a linear operator</italic> Ξ : 𝕉<italic><sup>n</sup></italic> <italic>→</italic> 𝕉<italic><sup>m</sup></italic> <italic>such that</italic> Ξ(<italic>a</italic>) = (<italic>〈a, ξ</italic><sub>1</sub><italic>〉, 〈a, ξ</italic><sub>2</sub><italic>〉, ..., 〈a, ξ<sub>m</sub></italic><italic>〉</italic>)<italic>. This linear operator is called the separator projection.</italic></p>
<p><bold>Proposition 4.1</bold> <italic>Assume T is the target’s spectrum, B and P are the spectra of the background and the mixed pixel, respectively. Parameter t obeys the linear mixing model P</italic> = <italic>tT</italic> + (1 <italic>− t</italic>)<italic>B. Let</italic> Φ = {<italic>T</italic>} <italic>and B ∈</italic> Ω<italic>, where</italic> Ω <italic>is a set of background vectors. Assume that</italic> {<italic>ξ</italic><sub>1</sub><italic>, ξ</italic><sub>2</sub><italic>, ..., ξ<sub>m</sub></italic>} <italic>are optimal weak classifiers for two sets</italic> Φ <italic>and</italic> Ω. Ξ = [<italic>ξ</italic><sub>1</sub><italic>, ξ</italic><sub>2</sub><italic>, ..., ξ<sub>m</sub></italic>] <italic>is a projection separator and T</italic><sub>1</sub> = Ξ (<italic>T</italic>)<italic>, P</italic><sub>1</sub> = Ξ(<italic>P</italic>)<italic>, B</italic><sub>1</sub> = Ξ (<italic>B</italic>). <italic>Then</italic>, ‖<italic>P</italic><sub>1</sub>‖ = <italic>t</italic>‖<italic>T</italic><sub>1</sub>‖ <italic>and corr</italic>(<italic>P</italic><sub>1</sub><italic>, T</italic><sub>1</sub>) = 1.</p>
<p><bold>Proof:</bold> <italic>P</italic><sub>1</sub> = Ξ(<italic>tT</italic> + (1 <italic>− t</italic>)<italic>B</italic>) = <italic>t</italic>Ξ(<italic>T</italic>) + (1 <italic>− t</italic>) Ξ(<italic>B</italic>). Since Ω <italic>⊂</italic> ker(Ξ) and <italic>B ∈</italic> Ω, then (1 <italic>− t</italic>) Ξ(<italic>B</italic>) = 0 and <italic>P</italic><sub>1</sub> = <italic>t</italic>Ξ(<italic>T</italic>) = <italic>tT</italic><sub>1</sub>.</p></sec></sec>
<sec>
<label>4.2.</label>
<title>The Main Propositions</title>
<p>The goal of this section is to generate the conditions for which the vectors provide a solution for the unmixing problem given by <xref ref-type="disp-formula" rid="FD2">Equation (2)</xref>.</p>
<p><bold>Lemma 4.1</bold> <italic>Let B be an unknown background spectrum, T and P are the known spectra of the target and of the mixed pixel, respectively. The parameter t satisfies the linear mixing model P</italic> = <italic>tT</italic> +(1<italic>−t</italic>)<italic>B. If for any fixed positive value γ &gt;</italic> 0<italic>,</italic> 0 <italic>&lt; ɛ &lt;</italic> 1 <italic>, S</italic><sub>1</sub> = {<italic>i : |B</italic>(<italic>i</italic>)<italic>| &gt; γ</italic>} <italic>and S</italic><sub>2</sub> = {<italic>i : |T</italic> (<italic>i</italic>)<italic>| &gt;γ/ɛ</italic>} <italic>we have that card</italic>(<italic>S</italic><sub>2</sub> <italic>\ S</italic><sub>1</sub>) ≥ <italic>card</italic>(<italic>S</italic><sub>1</sub> ∩ <italic>S</italic><sub>2</sub>)<italic>, then t</italic> = <italic>mode<sub>ɛ</sub></italic><italic>ζ(Definition 4.1) where ζ is a vector whose coordinates are</italic> 
<inline-formula>
<mml:math id="mm16" display="inline">
<mml:mrow>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mrow>
<mml:mo>}</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> <italic>and T</italic><sub>1</sub> = {<italic>T</italic> |‖<italic>T</italic> ‖ <italic>&gt;γ/ɛ</italic>}.</p>
<p><bold>Proof:</bold> Let 
<inline-formula>
<mml:math id="mm17" display="inline">
<mml:mrow>
<mml:mi>ξ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfrac>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>T</mml:mi>
<mml:mo>+</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>B</mml:mi></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfrac>
<mml:mo>=</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>+</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mfrac>
<mml:mi>B</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:math></inline-formula> and let 
<inline-formula>
<mml:math id="mm18" display="inline">
<mml:mrow>
<mml:mi>ζ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mi>B</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:math></inline-formula>, then <italic>ξ</italic> = <italic>t</italic> + (1 <italic>− t</italic>) <italic>ζ</italic>. Let <italic>W</italic> = (<italic>S</italic><sub>2</sub>\<italic>S</italic><sub>1</sub>) and <italic>V</italic> = (<italic>S</italic><sub>2</sub> ∩ <italic>S</italic><sub>1</sub>), then |<italic>W</italic>| ≥ |<italic>V</italic>|, <italic>W</italic> ∩ <italic>V</italic> = ∅, {<italic>ζ</italic>(<italic>i</italic>)}<italic><sub>i∈W</sub></italic> ∩{<italic>ζ</italic>(<italic>i</italic>)}<italic><sub>i∈V</sub></italic> = ∅ and {<italic>ζ</italic>(<italic>i</italic>)} ={<italic>ζ</italic>(<italic>i</italic>)}<italic><sub>i∈W</sub></italic> ∪ {<italic>ζ</italic>(<italic>i</italic>)}<italic><sub>i∈V</sub></italic>. If <italic>i ∈ W</italic> then |<italic>ζ</italic>(<italic>i</italic>)| = 
<inline-formula>
<mml:math id="mm19" display="inline">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>B</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac></mml:mrow></mml:math></inline-formula> &lt; <italic>γ</italic>/(<italic>γ</italic>/<italic>ɛ</italic>) = <italic>ɛ</italic> and <italic>card</italic>({<italic>i</italic> : |<italic>ζ</italic>(<italic>i</italic>)| ≥ <italic>ɛ</italic>}) ≤ |<italic>V</italic>| ≤ |<italic>W</italic>|. Thus, <italic>mode<sub>ɛ</sub></italic>(<italic>ζ</italic>) = 0 and <italic>mode<sub>ɛ</sub></italic>(<italic>ξ</italic>) = <italic>t</italic>.</p>
<p>¿From Lemma 4.1 we get Proposition 4.2.</p>
<p><bold>Proposition 4.2</bold> <italic>Let B be an unknown background spectrum and T and P be the known spectra of the target and of the mixed pixel, respectively. The parameter t satisfies the linear mixing model P</italic> = <italic>tT</italic> + (1 <italic>− t</italic>)<italic>B. Let T</italic><sub>1</sub> = {<italic>T</italic> |‖<italic>T</italic> ‖ &gt; <italic>γ</italic>} <italic>and ξ is a vector with the coordinates</italic> 
<inline-formula>
<mml:math id="mm20" display="inline">
<mml:mrow>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mrow>
<mml:mo>}</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula><italic>. Assume that B and P are ɛ − sparce − independent. Then, mode<sub>ɛ/γ</sub></italic> (<italic>ξ</italic>) = <italic>t.</italic></p>
<p><bold>Proposition 4.3</bold> <italic>Assume that B is the unknown background spectrum where T and P are the known spectra of the target and of the mixed pixel, respectively. Assume that the parameter t satisfies the linear mixing model P</italic> = <italic>tT</italic> + (1 <italic>− t</italic>)<italic>B. If B and T are orthogonal, then t</italic> = <italic>〈T, P 〉/</italic>‖<italic>T</italic> ‖<sup>2</sup>.</p>
<p><bold>Proof:</bold> <italic>〈T, P 〉</italic> = <italic>〈T, t · T</italic> + (1 <italic>− t</italic>)<italic>B〉</italic> = <italic>t · 〈T, T 〉</italic> = <italic>t ·</italic> ‖<italic>T</italic> ‖<sup>2</sup>.</p>
<p>Note that if <italic>〈T, B〉</italic> = <italic>∊</italic>, then the variance of this estimation error is <italic>∊</italic> /‖<italic>T</italic>‖<sup>2</sup>. “Sparse-independency” and orthogonality are strong conditions. Theorem 4.1 shows that “<italic>ɛ</italic>-sparse-ergodic independency” also provides a solution for the linear unmixing problem.</p>
<p><bold>Theorem 4.1</bold> <italic>Given ɛ &gt;</italic> 0<italic>, B is an unknown background spectrum, T and P are the known spectra of the target and of the mixed pixel, respectively. The parameter t satisfies the linear mixing model P</italic> = <italic>tT</italic> + (1 <italic>− t</italic>)<italic>B. Assume B and P are ɛ-sparse-ergodic independent (see Definition 4.5). T<sub>i</sub></italic> <italic>denotes the ith coordinate of the vector T. Let T</italic><sub>1</sub> = {<italic>T<sub>i</sub></italic> <italic>: |T<sub>i</sub></italic><italic>| &gt; ɛ</italic>} <italic>and ξ is a vector with the coordinates</italic> 
<inline-formula>
<mml:math id="mm21" display="inline">
<mml:mrow>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mrow>
<mml:mo>}</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>. <italic>Then, mode<sub>ɛ</sub></italic>(<italic>ξ</italic>) = <italic>t.</italic></p>
<p><bold>Proof:</bold> Let 
<inline-formula>
<mml:math id="mm22" display="inline">
<mml:mrow>
<mml:mi>ξ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo>{</mml:mo>
<mml:mfrac>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfrac>
<mml:mo>}</mml:mo></mml:mrow></mml:math></inline-formula> and 
<inline-formula>
<mml:math id="mm23" display="inline">
<mml:mrow>
<mml:mi>ζ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo>{</mml:mo>
<mml:mfrac>
<mml:mi>B</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfrac>
<mml:mo>}</mml:mo></mml:mrow></mml:math></inline-formula>. <italic>ξ</italic> = <italic>t</italic> + (1 <italic>− t</italic>)<italic>ζ</italic> , then we only need to prove that <italic>mode<sub>ɛ</sub></italic>(<italic>ζ</italic>) = 0. Assume that <italic>p ≠</italic> 0 is a real number. Let Ω = {<italic>i|B<sub>i</sub></italic> = <italic>pT<sub>i</sub></italic>} and Λ = {<italic>i|B<sub>i</sub></italic> <italic>&lt; ɛ</italic><sup>2</sup> <italic>∧ T<sub>i</sub></italic> <italic>&gt; ɛ</italic>}. The <italic>ɛ</italic>-sparse-ergodic independency (see Definition 4.5) of <italic>B</italic> and <italic>T</italic> follows from the fact that <italic>card</italic>(Ω) <italic>&lt; card</italic>(Λ), because if <italic>W</italic> = ⊕<sub><italic>i</italic>∈Ω</sub> <italic>e<sub>i</sub></italic>𝕉, then <italic>W</italic> is a dependent basic subspace of <italic>T</italic> and <italic>B</italic> and <italic>dim</italic>(<italic>W</italic>) = <italic>card</italic>(Ω). Note that {<italic>i|B<sub>i</sub></italic> <italic>&lt; ɛ</italic><sup>2</sup> <italic>∧ T<sub>i</sub></italic> <italic>&gt; ɛ</italic>} = {<italic>i|ζ<sub>i</sub></italic> <italic>&lt; ɛ</italic>} and {<italic>i|B<sub>i</sub></italic> = <italic>pT<sub>i</sub></italic>} = {<italic>i|ζ<sub>i</sub></italic> = <italic>p</italic>}. From the condition <italic>card</italic>({<italic>i|ζ<sub>i</sub></italic> = <italic>p</italic>}) <italic>&lt; card</italic>({<italic>i|ζ<sub>i</sub></italic> <italic>&lt; ɛ</italic>}) follows that <italic>mode<sub>ɛ</sub></italic>(<italic>ζ</italic>) = 0.</p>
<p>Now we show how to obtain the “<italic>ɛ</italic>-sparse-ergodic independency” (see Definition 4.5).</p>
<p><bold>Theorem 4.2</bold> <italic>Assume that</italic> {<italic>v</italic><sub>1</sub><italic>, v</italic><sub>2</sub>} <italic>⊂</italic> 𝕉<italic><sup>n</sup></italic><italic>, 〈v</italic><sub>1</sub><italic>, v</italic><sub>2</sub><italic>〉</italic> = 0<italic>, ɛ &gt;</italic> 0<italic>. Let</italic> Λ = {<italic>S</italic><sub>1</sub><italic>, S</italic><sub>2</sub><italic>, ..., S<sub>r</sub></italic>} <italic>be the ɛ-dense and homogeneous set of isometrics operators in</italic> 𝕉<italic><sup>n</sup></italic><italic>. Then,</italic></p>
<p><italic>w</italic><sub>1</sub> ≜ [<italic>v</italic><sub>1</sub><italic>, S</italic><sub>1</sub>(<italic>v</italic><sub>1</sub>)<italic>, S</italic><sub>2</sub>(<italic>v</italic><sub>1</sub>)<italic>, ..., S<sub>r</sub></italic>(<italic>v</italic><sub>1</sub>)] <italic>and w</italic><sub>2</sub> ≜ [<italic>v</italic><sub>2</sub><italic>, S</italic><sub>1</sub>(<italic>v</italic><sub>2</sub>)<italic>, S</italic><sub>2</sub>(<italic>v</italic><sub>2</sub>)<italic>, ..., S<sub>r</sub></italic>(<italic>v</italic><sub>2</sub>)] <italic>are ɛ-sparse-ergodic independent.</italic></p>
<p>The rest of this section proves Theorem 4.2.</p>
<p><bold>Proposition 4.4</bold> <italic>For any pair of orthogonal vectors</italic> {<italic>v</italic><sub>1</sub><italic>, v</italic><sub>2</sub>} <italic>⊂</italic> 𝕉<sup>2</sup> <italic>such that 〈v</italic><sub>1</sub><italic>, v</italic><sub>2</sub><italic>〉</italic> = 0<italic>, for any linear operator S</italic> : 𝕉<sup>2</sup> <italic>→</italic> 𝕉<sup>1</sup> <italic>where</italic> ‖<italic>S</italic>‖ = 1 <italic>and for any real t ≠</italic> 0<italic>, the inequality P</italic> [<italic>S</italic>(<italic>v</italic><sub>2</sub>) = <italic>t · S</italic>(<italic>v</italic><sub>1</sub>)] <italic>&lt; P</italic> [<italic>S</italic>(<italic>v</italic><sub>2</sub>) = 0] <italic>holds when P is probability.</italic></p>
<p><bold>Proof:</bold> Any linear operator <italic>S</italic> : 𝕉<sup>2</sup> <italic>→</italic> 𝕉<sup>1</sup> with ‖<italic>S</italic>‖ = 1 can be represented as <italic>S</italic>(<italic>x</italic>) = <italic>〈s, x〉</italic> where <italic>s</italic> = (<italic>s</italic><sub>1</sub><italic>, s</italic><sub>2</sub>) is a vector with ‖<italic>s</italic>‖ = 1.</p>
<p>Consider
<disp-formula>
<mml:math id="mm24" display="block">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>S</mml:mi>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>s</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>s</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>s</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>s</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mi mathvariant="italic">cos</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>φ</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mi mathvariant="italic">sin</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>φ</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">sin</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>φ</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mi mathvariant="italic">cos</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>φ</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>to be a rotation of 𝕉<sup>2</sup>. This rotation splits the linear operator <italic>S</italic> into a product of the rotation operator with the orthogonal projection such that <italic>S</italic> = <italic>Ŝ</italic> ○ <italic>Pr</italic><sub>1</sub> where <italic>Pr</italic><sub>1</sub>(<italic>x</italic><sub>1</sub><italic>, x</italic><sub>2</sub>) ≜ (<italic>x</italic><sub>1</sub><italic>,</italic> 0).</p>
<p>We can choose the basis {<italic>e</italic><sub>1</sub><italic>, e</italic><sub>2</sub>} of 𝕉<sup>2</sup> such that <italic>v</italic><sub>2</sub> = (<italic>a,</italic> 0) and <italic>v</italic><sub>1</sub> = (0<italic>, b</italic>). In this representation, 
<inline-formula>
<mml:math id="mm25" display="inline">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>v</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>v</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac>
<mml:mo>=</mml:mo>
<mml:mo>−</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mo>⋅</mml:mo>
<mml:mi mathvariant="italic">sin</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>φ</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mo>⋅</mml:mo>
<mml:mi mathvariant="italic">cos</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>φ</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac>
<mml:mo>=</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi>a</mml:mi>
<mml:mo>/</mml:mo>
<mml:mi>b</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mtext>tan</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>φ</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula>. For any <italic>t ∈</italic> 𝕉, the solution of <italic>t</italic> = (<italic>−a/b</italic>) tan(<italic>φ</italic>) exists.</p>
<p>In addition, 
<inline-formula>
<mml:math id="mm26" display="inline">
<mml:mrow>
<mml:msup>
<mml:mtext>tan</mml:mtext>
<mml:mo>′</mml:mo></mml:msup>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>φ</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mtext>cos</mml:mtext></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>φ</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac></mml:mrow></mml:math></inline-formula> has a minimum in zero. Hence, the size of the set {<italic>φ : |</italic> tan(<italic>φ</italic>)<italic>| &lt; ɛ</italic>} is more than the size of the set {<italic>φ : |</italic> tan(<italic>φ</italic>) <italic>− t| &lt; ɛ</italic>} for <italic>t ≠</italic> 0.</p>
<p><bold>Corollary 4.1</bold> <italic>Let</italic> {<italic>S</italic><sub>1</sub><italic>, S</italic><sub>2</sub><italic>..., S<sub>r</sub></italic>} <italic>be ɛ-dense and homogeneous set (Definitions 4.2 and 4.3) of the linear operators</italic> 𝕉<sup>2</sup> <italic>→</italic> 𝕉<sup>1</sup> <italic>and</italic> {<italic>v</italic><sub>1</sub><italic>, v</italic><sub>2</sub>} <italic>is orthogonal in</italic> 𝕉<sup>2</sup><italic>. Then, the vector ς with the coordinates</italic> 
<inline-formula>
<mml:math id="mm27" display="inline">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>ς</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>v</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>v</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac></mml:mrow></mml:math></inline-formula><italic>, i =</italic> 1<italic>, …, r, has mode<sub>ɛ</sub></italic><italic>(ς) = 0.</italic></p>
<p><bold>Proposition 4.5</bold> <italic>Given a pair of orthogonal vectors</italic> {<italic>v</italic><sub>1</sub><italic>, v</italic><sub>2</sub>} <italic>⊂</italic> 𝕉<italic><sup>n</sup></italic><italic>, 〈v</italic><sub>1</sub><italic>, v</italic><sub>2</sub><italic>〉</italic> = 0 <italic>and ɛ &gt;</italic> 0<italic>. Assume that S</italic><sub>1</sub><italic>, S</italic><sub>2</sub><italic>, ..., S<sub>r</sub></italic> <italic>is an ɛ-dense and homogeneous set of isometrics operators in</italic> 𝕉<italic><sup>n</sup></italic><italic>. Then, the vector ς, which has the coordinates</italic>
<inline-formula>
<mml:math id="mm28" display="inline">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>ς</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>α</mml:mi></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>χ</mml:mi></mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>α</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, <italic>i</italic> = 1<italic>, . . . , n, α</italic> = 1<italic>, . . . , r, has mode<sub>ɛ</sub></italic>(<italic>ς<sub>i,α</sub></italic>) = 0 <italic>where</italic>
<inline-formula>
<mml:math id="mm29" display="inline">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>χ</mml:mi></mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>α</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> <italic>is the ith coordinate of the vector</italic> 
<inline-formula>
<mml:math id="mm30" display="inline">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>χ</mml:mi></mml:mrow>
<mml:mi>α</mml:mi></mml:msup>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>α</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>v</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>α</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>v</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfrac>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>α</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>v</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>n</mml:mi></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>α</mml:mi></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>v</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>.</p>
<p><bold>Proof:</bold> Denote Λ = {<italic>S</italic><sub>1</sub><italic>, S</italic><sub>2</sub><italic>, ..., S<sub>r</sub></italic>}. The set of <italic>χ</italic> coordinates is divided into the union of the subsets <italic>τ<sub>i</sub></italic> where <italic>τ<sub>i</sub></italic> = {<italic>ς<sub>i,α</sub></italic>}<sub><italic>S</italic><sub><italic>α∈Λ</italic></sub></sub> α = 1<italic>, . . . , r</italic>. We have to show that <italic>mode<sub>ɛ</sub></italic>(<italic>τ<sub>i</sub></italic>) = 0, <italic>i</italic> = 1<italic>, . . . , n</italic>. Assume that <italic>θ<sub>i</sub></italic> is a projection to coordinate <italic>i</italic>, such that for <italic>x</italic> = (<italic>x</italic><sub>1</sub><italic>, ..., x<sub>n</sub></italic>), <italic>θ<sub>i</sub></italic><italic>x</italic> = <italic>x<sub>i</sub></italic>. Assume that <italic>L</italic> is a linear span of {<italic>v</italic><sub>1</sub><italic>, v</italic><sub>2</sub>}. Define <italic>R<sub>α</sub></italic> = <italic>θ<sub>i</sub></italic> ○ <italic>S<sub>α</sub></italic> where <italic>R<sub>α</sub></italic>: 𝕉<sup>2</sup> <italic>→</italic> 𝕉<sup>1</sup>. Our assumption was that Λ = {<italic>S</italic><sub>1</sub><italic>, S</italic><sub>2</sub><italic>, ..., S<sub>r</sub></italic>} is <italic>ɛ</italic>-dense and homogeneous. Therefore, {<italic>R</italic><sub>1</sub><italic>, R</italic><sub>2</sub><italic>, ..., R<sub>r</sub></italic>} is <italic>ɛ</italic>-dense and homogeneous too. Then, Corollary 4.1 implies <italic>mode<sub>ɛ</sub></italic>(<italic>τ<sub>i</sub></italic>) = 0.</p>
<p><bold>Proof: proof of Theorem 4.2:</bold> Let <italic>L</italic> be the linear span of <italic>w</italic><sub>1</sub> and <italic>w</italic><sub>2</sub> from 𝕉<italic><sup>nr</sup></italic> = ⊕<italic><sub>S⊂Λ</sub></italic> 𝕉<italic><sup>n</sup></italic>. If the theorem is not true, then there is basic subspace <italic>W</italic> in ⊕<italic><sub>S⊂Λ</sub></italic> 𝕉<italic><sup>n</sup></italic> such that <italic>dim</italic>(<italic>W</italic>) = <italic>q</italic> and <italic>dim</italic>(<italic>Pr<sub>w</sub></italic>(<italic>L</italic>)) = 1. It means that <italic>w</italic><sub>1</sub><italic>|<sub>W</sub></italic> = <italic>s</italic> and <italic>w</italic><sub>2</sub><italic>|<sub>W</sub></italic> = <italic>t · s</italic>, <italic>t ≠</italic> 0, such that <italic>dim</italic>(<italic>W</italic>) <italic>&gt; card</italic>{<italic>i|</italic>(<italic>w</italic><sub>2</sub>)(<italic>i</italic>) = 0 and (<italic>w</italic><sub>1</sub>)(<italic>i</italic>) <italic>≠</italic> 0}. This means that a vector with the coordinates 
<inline-formula>
<mml:math id="mm31" display="inline">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>ς</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac></mml:mrow></mml:math></inline-formula>, <italic>i</italic> =1<italic>, . . . , nr</italic>, has <italic>mode<sub>ɛ</sub></italic>(<italic>ς<sub>i</sub></italic>) = <italic>t ≠</italic> 0. This contradicts Proposition 4.5.</p></sec></sec>
<sec>
<label>5.</label>
<title>Random Orthogonal Transformation Algorithm for Unmixing (ROTU)</title>
<p>We denote the first derivative of a spectrum of a pixel <italic>X</italic> by <italic>d</italic>(<italic>X</italic>) and it is called the <italic>d − spectrum</italic> of the pixel <italic>X</italic>.</p>
<p>For independent spectra and for special type of dependent spectra, which will be defined below, we can use a faster and less computationally expensive method than [<xref ref-type="bibr" rid="b2-remotesensing-04-00532">2</xref>] (Section 4). This is described in this section. This method works well if the <italic>d − spectra</italic> of different materials are related by the so-called <italic>sparse − independent</italic> (see Definition 4.4). This relation will be described below. If they are not <italic>sparse − independent</italic> then this method may generate false alarms.</p>
<p>We assume now that the conditions in Section 2 hold. We are given the target’s spectrum <italic>T</italic> and the mixed pixel spectrum <italic>P</italic>. Their relationship is described by <xref ref-type="disp-formula" rid="FD2">Equation (2)</xref> via the average background spectrum <italic>B</italic>, which is a mix of the background spectra without the presence of the target, such that 
<inline-formula>
<mml:math id="mm32" display="inline">
<mml:mrow>
<mml:mi>B</mml:mi>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>M</mml:mi></mml:msubsup>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>a</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>B</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mrow></mml:math></inline-formula> and
<disp-formula id="FD7">
<label>(7)</label>
<mml:math id="mm33" display="block">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi>t</mml:mi>
<mml:mi>B</mml:mi>
<mml:mo>+</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi>t</mml:mi>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>M</mml:mi></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>c</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>B</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo>+</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo>.</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>Our goal is to estimate <italic>t</italic>, denoted by <italic>t̂</italic>, which will satisfy <xref ref-type="disp-formula" rid="FD7">Equation (7)</xref> provided that <italic>B</italic> and <italic>T</italic> have some independent features. Once <italic>t̂</italic> is found, the estimation of an unknown average background spectrum <italic>B</italic>, denoted by <italic>B̂</italic>, can be calculated by <italic>B̂</italic> = (<italic>P − t̂T</italic>)<italic>/</italic>(1 <italic>− t̂</italic>).</p>
<sec>
<label>5.1.</label>
<title>The Unmixing Algorithm</title>
<p>Generally speaking, spectra of different materials have different sections in them: some sections are mutually correlated, some sections are smooth and some have sections with oscillatory behavior. Large first and second derivatives are used as characteristic features for a spectral classification.</p>
<p>Some section in the spectrum can be autocorrelated. Let <italic>D</italic> = (<italic>D</italic><sub>1</sub><italic>, D</italic><sub>2</sub>) be a pair of the first and the second derivatives operator. Our assumptions imply that <italic>D</italic>(<italic>T</italic>) and <italic>D</italic>(<italic>B</italic>) are more independent than <italic>T</italic> and <italic>B</italic>. The <italic>D</italic> operator reduces the correlation between spectra and makes them more independent.</p>
<p>Let <italic>x</italic> be a spectrum. We denote by a <italic>d − spectrum</italic> the vector <italic>Dx</italic>. If two <italic>d − spectra</italic> of different materials are statistically independent (or orthogonal), then estimating <italic>t</italic> can be obtained by using Proposition 4.3. Statistical independency is a strong condition. Sometimes we have <italic>sparse − independency</italic> for the <italic>d − spectra</italic>. The sparse-independency condition yields its estimation from Proposition 4.2. In general, we have <italic>ɛ</italic> deviation from orthogonality and the condition of <italic>sparse − independency</italic> never holds.</p>
<p>The unmixing algorithm has the following steps:</p>
<p><bold>Step 1:</bold> The operator <italic>D</italic> is applied to <italic>T</italic> and <italic>P</italic>. We assume that <italic>|corr</italic>(<italic>DT, DB</italic>)<italic>| &lt; ɛ</italic>.</p>
<p><bold>Step 2:</bold> Assume that Λ = {<italic>S</italic><sub>1</sub><italic>, S</italic><sub>2</sub><italic>, ..., S<sub>r</sub></italic>} is random isometric rotations in 𝕉<italic><sup>n</sup></italic>, <italic>W</italic> : 𝕉<italic><sup>n</sup></italic> ↣ 𝕉<sup>2</sup><italic><sup>nr</sup></italic> = ⊕<italic><sub>S⊂Λ</sub></italic> 𝕉<italic><sup>n</sup></italic> is the embedding of 𝕉<italic><sup>n</sup></italic> into 𝕉<sup>2</sup><italic><sup>rn</sup></italic> such that
<disp-formula id="FD8">
<label>(8)</label>
<mml:math id="mm34" display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mi>T</mml:mi></mml:msub>
<mml:mo>≜</mml:mo>
<mml:mi>W</mml:mi>
<mml:mi>T</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo stretchy="false">[</mml:mo>
<mml:mi>D</mml:mi>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>r</mml:mi></mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>T</mml:mi>
<mml:mo stretchy="false">]</mml:mo>
<mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
<disp-formula id="FD9">
<label>(9)</label>
<mml:math id="mm35" display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mi>B</mml:mi></mml:msub>
<mml:mo>≜</mml:mo>
<mml:mi>W</mml:mi>
<mml:mi>B</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo stretchy="false">[</mml:mo>
<mml:mi>D</mml:mi>
<mml:mi>B</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>B</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>B</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>r</mml:mi></mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>B</mml:mi>
<mml:mo stretchy="false">]</mml:mo>
<mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
<disp-formula id="FD10">
<label>(10)</label>
<mml:math id="mm36" display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mi>P</mml:mi></mml:msub>
<mml:mo>≜</mml:mo>
<mml:mi>W</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo stretchy="false">[</mml:mo>
<mml:mi>D</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>r</mml:mi></mml:msub>
<mml:mi>D</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo stretchy="false">]</mml:mo>
<mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p><bold>Step 3:</bold> Construct the vector <italic>ς</italic> with the coordinates
<disp-formula id="FD11">
<label>(11)</label>
<mml:math id="mm37" display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>ς</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>α</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mi>P</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mi>T</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac>
<mml:mo>,</mml:mo>
<mml:mi>α</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi>n</mml:mi>
<mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p><bold>Step 4:</bold> Estimation of <italic>t</italic>, denoted by <italic>t̂</italic>, is computed by 
<inline-formula>
<mml:math id="mm38" display="inline">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>t</mml:mi>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">mode</mml:mi></mml:mrow>
<mml:mi>ɛ</mml:mi></mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mi>P</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mi>T</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>, <italic>i</italic> = 1, …, 2<italic>rn. B</italic> is estimated by <italic>B̂</italic> = (<italic>P − t̂T</italic>)<italic>/</italic>(1 <italic>− t̂</italic>).</p>
<p>The vectors <italic>w<sub>T</sub></italic> , <italic>w<sub>B</sub></italic> and <italic>w<sub>P</sub></italic> are related by <italic>w<sub>P</sub></italic> = <italic>t · w<sub>T</sub></italic> + (1 <italic>− t</italic>) <italic>· w<sub>B</sub></italic>, where <italic>w<sub>B</sub></italic> is unknown. Proposition 4.5 and Theorems 4.1 and 4.2 yield that {<italic>w<sub>T</sub></italic><italic>, w<sub>B</sub></italic>} are <italic>ɛ</italic>-sparse-ergodic independent (Definition 4.5) and the <italic>mode<sub>ɛ</sub></italic>(<italic>ς</italic>) of the vector <italic>ς</italic>, which have the coordinates
<disp-formula id="FD12">
<label>(12)</label>
<mml:math id="mm39" display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>ς</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>α</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>α</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>D</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mi>B</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi></mml:mrow>
<mml:mi>α</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>D</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub>
<mml:mi>T</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac>
<mml:mo>,</mml:mo>
<mml:mi>α</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>r</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi>n</mml:mi>
<mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>is <italic>mode<sub>ɛ</sub></italic>(<italic>ς</italic>) = 0.</p>
<p>It follows that 
<inline-formula>
<mml:math id="mm40" display="inline">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">mode</mml:mi></mml:mrow>
<mml:mi>ɛ</mml:mi></mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mi>P</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mi>T</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac>
<mml:mo>}</mml:mo></mml:mrow></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>r</mml:mi>
<mml:mi>n</mml:mi></mml:mrow></mml:msubsup></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> estimates <italic>t</italic>. As the number of operators is increased then this estimate becomes more accurate.</p>
<p><bold>Definition 5.1</bold> <italic>Equations (<xref ref-type="disp-formula" rid="FD8">8</xref>–<xref ref-type="disp-formula" rid="FD11">11</xref>) denote the vector ζ</italic> = (<italic>ζ<sub>k</sub></italic>) <italic>with the set of coordinates</italic> 
<inline-formula>
<mml:math id="mm41" display="inline">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>ς</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mi>P</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>k</mml:mi></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mi>T</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:math></inline-formula><italic>, k</italic> = 1<italic>, . . . ,</italic> 2<italic>rn. Assume ɛ &gt;</italic> 0 <italic>and b</italic> = {0<italic>, ɛ,</italic> 2<italic>ɛ,</italic> 3<italic>ɛ,</italic> 4<italic>ɛ, . . . ,</italic> 1 = <italic>mɛ</italic>} <italic>is a partition of</italic> [0<italic>,</italic> 1]<italic>. Then, the vector H such that H<sub>i</sub></italic> = <italic>card</italic>{<italic>k|iɛ − ɛ/</italic>2 <italic>&lt; ζ<sub>k</sub></italic> <italic>≤ iɛ</italic> + <italic>ɛ/</italic>2}<italic>, i</italic> = 1<italic>, . . . , m, is an histogram of ζ<sub>k</sub></italic> <italic>related to the partition b. This histogram is called the “unmixing histogram”.</italic></p>
<p><xref ref-type="fig" rid="f7-remotesensing-04-00532">Figure 7</xref> illustrates the “unmixing histogram” for <italic>B</italic> and <italic>P</italic> from <xref ref-type="fig" rid="f3-remotesensing-04-00532">Figure 3</xref>.</p>
<p>In <xref ref-type="fig" rid="f7-remotesensing-04-00532">Figure 7</xref>, we can see the distribution of the values of the vector <italic>ζ</italic> on the interval [0<italic>,</italic> 1], where the partition step is 0.01. This histogram shows that the mode of this distribution is 0.4 = 40 <italic>·</italic> 0.01. Thus, <italic>t</italic> is estimated by <italic>t̂</italic> = <italic>mode<sub>ɛ</sub></italic>(<italic>ξ</italic>) = 0.4, where the vector <italic>ξ</italic> is computed by <xref ref-type="disp-formula" rid="FD11">Equation (11)</xref>. As long as <italic>t̂</italic> is known, we can estimate the spectrum of the background <italic>B̂</italic> = (<italic>P − t̂T</italic>)<italic>/</italic>(1 <italic>− t̂</italic>). The comparison between <italic>B</italic> and <italic>B̂</italic> is shown in <xref ref-type="fig" rid="f8-remotesensing-04-00532">Figure 8</xref>.</p></sec>
<sec sec-type="results">
<label>5.2.</label>
<title>Experimental Results</title>
<p><bold>1. Experiment with random spectral signatures.</bold> Assume the vectors <italic>B</italic> and <italic>T</italic> were randomly generated. <italic>B</italic> is unknown. The vector <italic>P</italic> was computed by <italic>P</italic> = <italic>tT</italic> + (1 <italic>− t</italic>)<italic>B</italic> for <italic>t</italic> = 0.1<italic>,</italic> 0.3<italic>,</italic> 0.7<italic>,</italic> 0.9. <xref ref-type="fig" rid="f9-remotesensing-04-00532">Figure 9</xref> displays the plots of <italic>P</italic> , <italic>T</italic> and <italic>B</italic> for <italic>t</italic> = 0.7. The vector <italic>P</italic> corresponds to the spectrum of the mixed pixel, <italic>T</italic> corresponds to the known target spectrum and <italic>B</italic> corresponds to an unknown background spectrum.</p>
<p>The ROTU algorithm, which estimates <italic>t</italic>, is applied to the pair <italic>P</italic> and <italic>T</italic> with different <italic>t</italic> values. Then, the estimations of <italic>t̂</italic>and <italic>t</italic> are compared. Once <italic>t̂</italic> is obtained then the unknown <italic>B</italic> is estimated by <italic>B̂</italic> = (<italic>P − t̂T</italic>)<italic>/</italic>(1 <italic>− t̂</italic>).</p>
<p>The “unmixing histogram” from Definition 5.1 is displayed in <xref ref-type="fig" rid="f10-remotesensing-04-00532">Figure 10</xref> for several <italic>t</italic> values.</p>
<p>From <xref ref-type="fig" rid="f10-remotesensing-04-00532">Figure 10</xref> we get that the estimations for the true values of <italic>t</italic> by <italic>t̂</italic> are:</p>
<p><italic>t</italic> = 0.3 <italic>vs. t̂</italic> = 0.3, <italic>t</italic> = 0.7 <italic>vs. t̂</italic> = 0.7, <italic>t</italic> = 0.9 <italic>vs. t̂</italic> = 0.9 and <italic>t</italic> = 0.1 <italic>vs. t̂</italic> = 0.1. In all these cases, <italic>t̂</italic> = <italic>t</italic>. It means that the reliability of our algorithm on these datasets is 100%.</p>
<p><bold>2. Experiment with Spectra of Real Materials.</bold> The spectra <italic>B</italic> and <italic>T</italic> of two materials were taken from a database of signatures. They are presented in <xref ref-type="fig" rid="f11-remotesensing-04-00532">Figure 11</xref>. <italic>P</italic> is computed by <italic>P</italic> = <italic>tT</italic> + (1 <italic>− t</italic>)<italic>B</italic> for <italic>t</italic> = 0.3<italic>,</italic> 0.55<italic>,</italic> 0.7<italic>,</italic> 0.8.</p>
<p><italic>P</italic> and <italic>T</italic> are known while <italic>B</italic> is unknown. The ROTU algorithm is applied to the pair <italic>P</italic> and <italic>T</italic> for <italic>t</italic> = 0.3<italic>,</italic> 0.55<italic>,</italic> 0.7<italic>,</italic> 0.8. The ROTU algorithm estimates <italic>t</italic> by <italic>t̂</italic>. Once <italic>t̂</italic> is obtained, then the unknown <italic>B</italic> can be estimated from <italic>B̂</italic> = (<italic>P − t̂T</italic>)<italic>/</italic>(1 <italic>− t̂</italic>).</p>
<p>The “unmixing histograms” from Definition 5.1 and the comparison between <italic>B</italic> and its estimate <italic>B̂</italic> are given below. In each of the <xref ref-type="fig" rid="f12-remotesensing-04-00532">Figures 12</xref>–<xref ref-type="fig" rid="f15-remotesensing-04-00532">15</xref>, the horizontal axes of the left images represent the <italic>b</italic> partition of [0, 1], <italic>ɛ</italic> = 0.01. The vertical axes of the left images represent the cardinality of the coordinate’s set that belongs to each interval in the <italic>b</italic> partition.</p>
<p><bold>Case 1.</bold> True value is t = 0.3 where its estimate is <italic>t̂</italic>= 0.28.</p>
<p><bold>Case 2.</bold> True value is t = 0.7 where its estimate is <italic>t̂</italic>= 0.69.</p>
<p><bold>Case 3.</bold> True value is t = 0.49 where its estimate is <italic>t̂</italic>= 0.45.</p>
<p><bold>Case 4.</bold> True value is t = 0.8 where its estimate is <italic>t̂</italic>= 0.82.</p>
<p><bold>3. Experiment with sub- and above-pixel’s target recognition.</bold> We present the results from the application of the ROTU algorithm to two different types of scenes: (1). “Desert” (see <xref ref-type="fig" rid="f4-remotesensing-04-00532">Figure 4</xref>) has targets that occupy more than a pixel. It is shown in <xref ref-type="fig" rid="f16-remotesensing-04-00532">Figure 16</xref>. “Field” (see <xref ref-type="fig" rid="f6-remotesensing-04-00532">Figure 6</xref>) has a subpixel target. It is shown in <xref ref-type="fig" rid="f17-remotesensing-04-00532">Figure 17</xref>. In each figure, the parameter <italic>t</italic>, which was introduced in <xref ref-type="disp-formula" rid="FD7">Equation (7)</xref>, means the portion of the target’s material in the detected pixel (fractional abundance).</p></sec></sec>
<sec>
<label>6.</label>
<title>Classification for an Unmixing (CLUN) Algorithm</title>
<p>In this section, we present another unmixing algorithm that is based on classification. This method works fast and does not require any conditions on the relation between the background and the target spectra. The spectra of the target and the background can be correlated and their derivatives can be correlated too. We outline the following consecutive steps in a target identification via classification by HSI learning:
<list list-type="order">
<list-item>
<p>Learning phase: Finding typical background features by the application of PCA to the spectra of randomly selected multi-pixels from the scene where a multi-pixel is one pixel with all its wavelengths;</p></list-item>
<list-item>
<p>Separation of the target’s spectra from the background via a linear operator, which is obtained from the “learning” phase;</p></list-item>
<list-item>
<p>Construct a set of sensors which are highly sensitive to targets and insensitive to background;</p></list-item>
<list-item>
<p>Embedding the spectra from the scene into a space generated by “good” sensors vectors and detect the points that contain the target that has a big norm value.</p></list-item></list>We do not need to assume that spectra of different materials are dependent. If they are independent then all the related works, which were mentioned in Section 3 and [<xref ref-type="bibr" rid="b2-remotesensing-04-00532">2</xref>], work well. Now we describe in details the above outlined method.</p>
<sec>
<label>6.1.</label>
<title>Sensor Building Algorithm (SBA)</title>
<p>Let <italic>V</italic> = {<italic>v</italic><sub>1</sub><italic>, ..., v<sub>s</sub></italic>} be a set of multi-pixels spectra that were randomly chosen. We assume that the majority of the pixels from the scene are background pixels. Hence, after the application of PCA to <italic>V</italic> we get information about the background and not about the targets. Assume that Ω = {<italic>w</italic><sub>1</sub><italic>, ..., w<sub>p</sub></italic>} are the principal components of the PCA of <italic>V</italic> and Φ is a set of the known targets spectra. In our approach, Φ = {<italic>T</italic>} has only one vector, which is the vector of the target’s signature. Our goal is to separate between the two sets Ω and Φ via the sensor building algorithm (SBA) that is discussed next. Algorithm 1: Outline of SBA
<list list-type="order">
<list-item>
<p>Let <italic>T</italic> be the target’s spectrum: Construct the function
<disp-formula id="FD13">
<label>(13)</label>
<mml:math id="mm42" display="block">
<mml:mrow>
<mml:mi>F</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>ξ</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>〈</mml:mo>
<mml:mrow>
<mml:mi>ξ</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mo>〉</mml:mo></mml:mrow>
<mml:mo>−</mml:mo>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mo>‖</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mo>〈</mml:mo>
<mml:mrow>
<mml:mi>ξ</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow>
<mml:mo>〉</mml:mo></mml:mrow>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mo>〈</mml:mo>
<mml:mrow>
<mml:mi>ξ</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msub></mml:mrow>
<mml:mo>〉</mml:mo></mml:mrow>
<mml:mo>,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mo>〈</mml:mo>
<mml:mrow>
<mml:mi>ξ</mml:mi>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi></mml:mrow>
<mml:mi>p</mml:mi></mml:msub></mml:mrow>
<mml:mo>〉</mml:mo></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mo>‖</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>such that <italic>ξ</italic> maximizes <italic>F</italic> (<italic>ξ</italic>) and <italic>W</italic> is a weight coefficient.</p></list-item>
<list-item>
<p>Given a set of weak classifiers Γ: Perform <italic>s</italic> iterations to collect the <italic>s</italic> functions from Γ each of which provides a local supremum for the function <italic>F</italic>. Each function <italic>ξ</italic> from Γ is named a “weak classifier”. If <italic>s</italic> is big (for example, <italic>s</italic> = 1000), then our algorithm becomes more accurate while the computational time becomes longer.</p></list-item>
<list-item>
<p>Construct the “strong classifier”: It is a linear operator Ξ = [<italic>ξ</italic><sub>1</sub><italic>, ξ</italic><sub>2</sub><italic>, ..., ξ<sub>s</sub></italic>], where each <italic>ξ<sub>i</sub></italic> was obtained in step 2. This operator is a “projection separator” (see Proposition 4.1).</p></list-item>
<list-item>
<p>Construct <italic>T</italic><sub>1</sub> = Ξ (<italic>T</italic>) (see Definition 4.6), where <italic>T</italic> is the spectrum of the target.</p></list-item></list></p>
<p>Detailed description of the “weak classifier” collection:
<list list-type="order">
<list-item>
<p>Choose an integer <italic>r &gt;</italic> 1000 and a value for <italic>∊ &gt;</italic> 0.</p></list-item>
<list-item>
<p>First iteration: Select a subset Γ̂ of <italic>r</italic> vectors from Γ via random choices.</p></list-item>
<list-item>
<p>Let <italic>ξ</italic><sub>1</sub> = <italic>argmax</italic>{<italic>F</italic> (<italic>ξ</italic>)<italic>|ξ ∈</italic> Γ̂} (<xref ref-type="disp-formula" rid="FD13">Equation (13)</xref>).</p></list-item>
<list-item>
<p>Assume <italic>s−</italic>1 iterations took place. We describe now the <italic>s<sup>th</sup></italic> step. We select a subset Γ̂ of <italic>r</italic> vectors from Γ via a random choice such that for each <italic>ξ<sub>i</sub></italic>, <italic>i</italic> = 1<italic>, .., s −</italic> 1, <italic>dist</italic>(Γ̂, <italic>ξ<sub>i</sub></italic>) <italic>&gt; ∊</italic> holds.</p></list-item>
<list-item>
<p>Let <italic>ξ<sub>s</sub></italic> = <italic>argmax</italic>{<italic>F</italic> (<italic>ξ</italic>)<italic>|ξ ∈</italic> Γ̂}.</p></list-item></list></p>
<p>Assume <italic>P</italic> is the current pixel’s spectrum. Proposition 4.1 contains the following two properties where Ξ is defined in Definition 4.6.</p>
<list list-type="order">
<list-item>
<p><italic>corr</italic>(Ξ(<italic>P</italic>)<italic>, T</italic><sub>1</sub>) is close to 1.</p></list-item>
<list-item>
<p><italic>t</italic> = ‖Ξ(<italic>P</italic>)‖/‖<italic>T</italic><sub>1</sub>‖ is the portion of the target in the current pixel.</p></list-item></list></sec>
<sec>
<label>6.2.</label>
<title>Description of the CLUN Algorithm</title>
<p>We assume that we have the same conditions as in Section 2. We are given the target’s spectrum <italic>T</italic> and the mixed pixel spectrum <italic>P</italic> . Their relationship is described by <xref ref-type="disp-formula" rid="FD2">Equation (2)</xref> using the background’s average spectrum <italic>B</italic>, which is the mix of the background’s spectra without the presence of the target, such that 
<inline-formula>
<mml:math id="mm43" display="inline">
<mml:mrow>
<mml:mi>B</mml:mi>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>M</mml:mi></mml:msubsup>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>a</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>B</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mrow></mml:math></inline-formula> and
<disp-formula id="FD14">
<label>(14)</label>
<mml:math id="mm44" display="block">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi>t</mml:mi>
<mml:mi>B</mml:mi>
<mml:mo>+</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi>t</mml:mi>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>M</mml:mi></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>c</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>B</mml:mi></mml:mrow>
<mml:mi>k</mml:mi></mml:msub>
<mml:mo>+</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo>.</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>Our goal is to estimate <italic>t</italic>, denoted by <italic>t̂</italic>, which will satisfy <xref ref-type="disp-formula" rid="FD14">Equation (14)</xref> provided that <italic>B</italic> and <italic>T</italic> have some independent features. Once <italic>t̂</italic> is found, the estimation of an unknown background average spectrum <italic>B</italic>, denoted by <italic>B̂</italic>, is computed by <italic>B̂</italic> = (<italic>P − t̂T</italic>)<italic>/</italic>(1 <italic>− t̂</italic>).</p>
<p>The CLUN algorithm contains the following steps:
<list list-type="order">
<list-item>
<p>Apply the sensor building algorithm (SBA - Algorithm 1).</p></list-item>
<list-item>
<p>Construct an operator that is the projection separator Ξ(<italic>P</italic>) from Definition 4.6.</p></list-item>
<list-item>
<p>Apply the operator Ξ(<italic>P</italic>) to each pixel’s spectrum.</p></list-item></list></p>
<p>Let {<italic>P</italic><sub>1</sub><italic>, . . . , P<sub>ω</sub></italic>} be the spectra dataset from all the pixels in the scene. Denote <italic>Y<sub>i</sub></italic> = ‖Ξ (<italic>X<sub>i</sub></italic>)‖. There are two possibilities for the dataset {<italic>Y</italic><sub>1</sub><italic>, . . . , Y<sub>ω</sub></italic>} <italic>⊆</italic> 𝕉<sup>+</sup>:
<list list-type="order">
<list-item>
<p>The set {<italic>Y</italic><sub>1</sub><italic>, . . . , Y<sub>ω</sub></italic>} is separable according to Definition 3.5;</p></list-item>
<list-item>
<p>The set {<italic>Y</italic><sub>1</sub><italic>, . . . , Y<sub>ω</sub></italic>} is inseparable according to Definition 3.5.</p></list-item></list></p>
<p>From Definition 3.5, a dataset is separable if <italic>J</italic>(<italic>e</italic><sub>1</sub>) <italic>&lt; J</italic>(<italic>e</italic><sub>2</sub>) where <italic>e</italic><sub>1</sub> is the partition and the number of classes is 1 and <italic>e</italic><sub>2</sub> is the best partition for the two classes. The function <italic>J</italic>(<italic>e</italic>) measures the separation quality between the clusters. If <italic>J</italic>(<italic>e</italic><sub>1</sub>) ≥ <italic>J</italic>(<italic>e</italic><sub>2</sub>), then the dataset is inseparable and Fisher’s separation is incorrect.</p>
<p>In the first case, (<italic>w, b</italic>) is the best separation between two clusters of the set <italic>S</italic> = {<italic>Y</italic><sub>1</sub><italic>, . . . , Y<sub>ω</sub></italic>} <italic>⊆</italic> 𝕉<sup>+</sup> according to Definition 3.4 and <italic>b</italic> is the Fisher’s threshold for this separation. Then, {<italic>i|Y<sub>i</sub></italic> <italic>&gt; b</italic>} is the set of pixels that contains the targets. In the other case, there are no pixels that contain the targets.</p>
<p>Assume <italic>P<sub>i</sub></italic> is the spectrum of the current pixel. Assume ‖Ξ(<italic>X<sub>i</sub></italic>)‖ &gt; <italic>b</italic>. Then, according to our assumption <italic>P<sub>i</sub></italic> contains the target’s spectrum as a linear component. It means that <italic>i</italic> is a suspicious pixel. <italic>corr</italic>(Ξ(<italic>P</italic>)<italic>, T</italic><sub>1</sub>) is close to 1, then <italic>t</italic> = ‖Ξ(<italic>P</italic>)‖/‖<italic>T</italic><sub>1</sub>‖ is the portion that contains <italic>T</italic> in the current pixel.</p></sec>
<sec>
<label>6.3.</label>
<title>Implementations of the CLUN Algorithm</title>
<p>We present the results from the application of the CLUN algorithm to hyperspectral images where the target once occupies more than a pixel and once it occupies a subpixel. In <xref ref-type="fig" rid="f18-remotesensing-04-00532">Figures 18</xref> and <xref ref-type="fig" rid="f19-remotesensing-04-00532">19</xref>, the parameter <italic>t</italic>, which was introduced in <xref ref-type="disp-formula" rid="FD7">Equation (7)</xref>, is the portion of the target in the current pixel.</p>
<p>To separate between Ω and Φ, we utilize the following sets types Γ:
<list list-type="order">
<list-item>
<p><italic>d − support local − characteristic</italic> is the set of weak classifiers.</p></list-item>
<list-item>
<p><italic>d − support global − characteristic</italic> is the set of weak classifiers.</p></list-item>
<list-item>
<p>Assume that Γ = {<italic>ξ<sub>i,j</sub></italic>} is the set of weak classifiers where 〈<italic>ξ<sub>i,j</sub></italic>, <italic>X</italic>〉 = <italic>X<sub>i</sub></italic> − <italic>X<sub>j</sub></italic>. This set of weak classifiers works well if the vectors from Φ and Ω have a low entropy. It means that many coordinates have equal values and random differences between some coordinates provide sparse and independent behavior.</p></list-item></list></p>
<p><xref ref-type="fig" rid="f20-remotesensing-04-00532">Figures 20</xref> and <xref ref-type="fig" rid="f21-remotesensing-04-00532">21</xref> illustrate the separation between Ω and Φ. In this example, we used the <italic>d−support local − characteristic</italic> as the set of weak classifiers to achieve this separation.</p>
<p><xref ref-type="fig" rid="f22-remotesensing-04-00532">Figure 22</xref> illustrates the other type of situation. The spectrum was taken from the scene “City” (<xref ref-type="fig" rid="f5-remotesensing-04-00532">Figure 5</xref>). Here the features, which are located in small segments, have noise. Hence, global features sensors are required to separate between the target’s spectrum and the background’s spectrum.</p>
<p><xref ref-type="fig" rid="f23-remotesensing-04-00532">Figure 23</xref> shows how the wavelengths after the projection by the application of PCA.</p></sec>
<sec sec-type="results">
<label>6.4.</label>
<title>Experimental Results for Sub- and Above Pixel Target Recognition</title>
<p>We present the results from the application of the CLUN algorithm to three different types of scenes (<xref ref-type="fig" rid="f4-remotesensing-04-00532">Figures 4</xref>–<xref ref-type="fig" rid="f6-remotesensing-04-00532">6</xref>). In the first scene (<xref ref-type="fig" rid="f18-remotesensing-04-00532">Figure 18</xref>), the target occupies more than a pixel. The other two scenes (<xref ref-type="fig" rid="f19-remotesensing-04-00532">Figures 19</xref> and <xref ref-type="fig" rid="f24-remotesensing-04-00532">24</xref>) contain subpixel targets. In each figure, the parameter <italic>t</italic>, which was introduced in <xref ref-type="disp-formula" rid="FD7">Equation (7)</xref>, is the portion of the target in each pixel.</p></sec></sec>
<sec sec-type="conclusions">
<label>7.</label>
<title>Conclusions</title>
<p>The paper presents two linear unmixing algorithms ROTU and CLUN to recognize targets in hyperspectral images where each given target’s spectral signature can be located in a sub- or above pixel. The CLUN algorithm is based on automatic extraction of features from the target’s spectrum. These features separate the target from the background. The ROTU algorithm is based on embedding the spectra space into a special space by random orthogonal transformation and by examining the statistical properties of the embedding’s result. In addition, we also compute the portion of the target’s material in the detected pixel (fractional abundance).The ROTU algorithm works well and fast if the spectra of the sought after target has sparse derivatives. It also works well if the fractional abundance of the target was 10, 30, 70 and 90%. It also detects sub- and above pixels with minimal false alarms. The CLUN algorithm works better but slower since it needs training phase and does background modeling.</p></sec></body>
<back>
<ref-list>
<title>References</title>
<ref id="b1-remotesensing-04-00532"><label>1</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Madar</surname><given-names>E</given-names></name></person-group><source>Literature Survey on Anomaly Detection in Hyperspectral Imaging</source><comment>Research Report</comment><publisher-name>Technion</publisher-name><publisher-loc>Haifa, Israel</publisher-loc><year>2009</year></citation></ref>
<ref id="b2-remotesensing-04-00532"><label>2</label><citation citation-type="other"><person-group person-group-type="author"><name><surname>Averbuch</surname><given-names>A.</given-names></name><name><surname>Zheludev</surname><given-names>M.</given-names></name><name><surname>Zheludev</surname><given-names>V</given-names></name></person-group><source>Unmix and target recognition in hyperspectral images</source><year>2011</year><comment>submitted</comment></citation></ref>
<ref id="b3-remotesensing-04-00532"><label>3</label><citation citation-type="web"><comment>Spectral Imaging Ltd. Specim Camera; 2006. Available online: <ext-link xlink:href="http://www.specim.fi/" ext-link-type="uri">http://www.specim.fi/</ext-link> (accessed on 1 February 2012).</comment></citation></ref>
<ref id="b4-remotesensing-04-00532"><label>4</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Bioucas-Dias</surname><given-names>J.</given-names></name><name><surname>Plaza</surname><given-names>A</given-names></name></person-group><article-title>An Overview on Hyperspectral Unmixing: Geometrical, Statistical, and Sparse Regression Based Approaches</article-title><conf-name>Proceeding of 2011 IEEE International Geoscience and Remote Sensing Symposium (IGARSS)</conf-name><conf-loc>Vancouver, BC, Canada</conf-loc><conf-date>24–29 July 2011</conf-date><fpage>1135</fpage><lpage>1138</lpage></citation></ref>
<ref id="b5-remotesensing-04-00532"><label>5</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bioucas-Dias</surname><given-names>J.</given-names></name><name><surname>Plaza</surname><given-names>A</given-names></name></person-group><article-title>Hyperspectral unmixing: Geometrical, statistical and sparse regression-based approaches</article-title><source>Proc. SPIE</source><year>2010</year><volume>7830</volume><fpage>78300A-78300A-15</fpage></citation></ref>
<ref id="b6-remotesensing-04-00532"><label>6</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Manolakis</surname><given-names>D.</given-names></name><name><surname>Shaw</surname><given-names>G.</given-names></name></person-group><article-title>Detection algorithms for hyperspectral imaging applications</article-title><source>IEEE Signal Proc. Mag</source><year>2002</year><volume>19</volume><fpage>29</fpage><lpage>43</lpage><pub-id pub-id-type="doi">10.1109/79.974724</pub-id></citation></ref>
<ref id="b7-remotesensing-04-00532"><label>7</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Manolakis</surname><given-names>D.</given-names></name><name><surname>Marden</surname><given-names>D.</given-names></name><name><surname>Shaw</surname><given-names>G.</given-names></name></person-group><article-title>Hyperspectral image processing for automatic target detection applications</article-title><source>Lincoln Lab J</source><year>2003</year><volume>14</volume><fpage>79</fpage><lpage>114</lpage></citation></ref>
<ref id="b8-remotesensing-04-00532"><label>8</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Settle</surname><given-names>J.J.</given-names></name></person-group><article-title>On the relationship between spectral unmixing and subspace projection</article-title><source>IEEE Trans. Geosci. Remote Sens</source><year>1996</year><volume>34</volume><fpage>1045</fpage><lpage>1046</lpage><pub-id pub-id-type="doi">10.1109/36.508422</pub-id></citation></ref>
<ref id="b9-remotesensing-04-00532"><label>9</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Chang</surname><given-names>C.I.</given-names></name></person-group><source>Hyperspectral Imaging: Techniques for Spectral Detection and Classification</source><publisher-name>Kluwer Academic</publisher-name><publisher-loc>New York, NY, USA</publisher-loc><year>2003</year></citation></ref>
<ref id="b10-remotesensing-04-00532"><label>10</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Common</surname><given-names>P.</given-names></name></person-group><article-title>Independent component analysis: A new concept</article-title><source>Signal Process</source><year>1994</year><volume>36</volume><fpage>287</fpage><lpage>314</lpage><pub-id pub-id-type="doi">10.1016/0165-1684(94)90029-9</pub-id></citation></ref>
<ref id="b11-remotesensing-04-00532"><label>11</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Hyvarinen</surname><given-names>A.</given-names></name><name><surname>Karhunen</surname><given-names>J.</given-names></name><name><surname>Oja</surname><given-names>E</given-names></name></person-group><source>Independent Component Analysis</source><publisher-name>John Wiley &amp; Sons Inc</publisher-name><publisher-loc>New York, NY, USA</publisher-loc><year>2001</year></citation></ref>
<ref id="b12-remotesensing-04-00532"><label>12</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bayliss</surname><given-names>J.D.</given-names></name><name><surname>Gualtieri</surname><given-names>J.A.</given-names></name><name><surname>Cromp</surname><given-names>R.F.</given-names></name></person-group><article-title>Analysing hyperspectral data with independent component analysis</article-title><source>Proc. SPIE</source><year>1997</year><volume>3240</volume><fpage>133</fpage><lpage>143</lpage></citation></ref>
<ref id="b13-remotesensing-04-00532"><label>13</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nascimento</surname><given-names>J.M.P.</given-names></name><name><surname>Bioucas-Dias</surname><given-names>J.M.P.</given-names></name></person-group><article-title>Does independent component analysis play a role in unmixing hyperspectral data?</article-title><source>IEEE Trans. Geosci. Remote Sens</source><year>2005</year><volume>43</volume><issue>1</issue><fpage>175</fpage><lpage>187</lpage><pub-id pub-id-type="doi">10.1109/TGRS.2004.839806</pub-id></citation></ref>
<ref id="b14-remotesensing-04-00532"><label>14</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ifarraguerri</surname><given-names>A.</given-names></name><name><surname>Chang</surname><given-names>C.I.</given-names></name></person-group><article-title>Multispectral and hyper-spectral image analysis with convex cones</article-title><source>IEEE Trans. Geosci. Remote Sens</source><year>1999</year><volume>37</volume><fpage>756</fpage><lpage>770</lpage></citation></ref>
<ref id="b15-remotesensing-04-00532"><label>15</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Boardman</surname><given-names>J</given-names></name></person-group><article-title>Automating Spectral Unmixing of AVIRIS Data Using Convex Geometry Concepts</article-title><conf-name>Proceedings of the Fourth Annunl Airborne Gcoscicnce Workshop</conf-name><conf-loc>Washington, DC</conf-loc><conf-date>25–29 October 1993</conf-date><fpage>11</fpage><lpage>14</lpage></citation></ref>
<ref id="b16-remotesensing-04-00532"><label>16</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Craig</surname><given-names>M.D.</given-names></name></person-group><article-title>Minimum-volume transforms for remotely sensed data</article-title><source>IEEE Trans. Geosci. Remote Sens</source><year>1994</year><volume>32</volume><fpage>542</fpage><lpage>552</lpage></citation></ref>
<ref id="b17-remotesensing-04-00532"><label>17</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bateson</surname><given-names>C.</given-names></name><name><surname>Asner</surname><given-names>G.</given-names></name><name><surname>Wessman</surname><given-names>C.</given-names></name></person-group><article-title>Endmember bundles: A new approach to incorporating endmember variability into spectral mixture analysis</article-title><source>IEEE Trans. Geosci. Remote Sens</source><year>2000</year><volume>38</volume><fpage>1083</fpage><lpage>1094</lpage></citation></ref>
<ref id="b18-remotesensing-04-00532"><label>18</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Winter</surname><given-names>M.E.</given-names></name></person-group><article-title>N-FINDR: An algorithm for fast autonomous spectral end-member determination in hyperspectral data</article-title><source>Proc. SPIE</source><year>1999</year><volume>3753</volume><fpage>266</fpage><lpage>275</lpage></citation></ref>
<ref id="b19-remotesensing-04-00532"><label>19</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nascimento</surname><given-names>M.P.</given-names></name><name><surname>Bioucas-Dias</surname><given-names>M.</given-names></name></person-group><article-title>Vertex component analysis: A fast algorithm to unmix hyperspectral data</article-title><source>IEEE Trans. Geosci. Remote Sens</source><year>2005</year><volume>4</volume><fpage>898</fpage><lpage>910</lpage></citation></ref>
<ref id="b20-remotesensing-04-00532"><label>20</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dobigeon</surname><given-names>N.</given-names></name><name><surname>Moussaoui</surname><given-names>S.</given-names></name><name><surname>Coulon</surname><given-names>M.</given-names></name><name><surname>Tourneret</surname><given-names>J.-Y.</given-names></name><name><surname>Hero</surname><given-names>A.O.</given-names></name></person-group><article-title>Joint Bayesian endmember extraction and linear unmixing for hyperspectral imagery</article-title><source>IEEE Trans. Signal Process</source><year>2009</year><volume>57</volume><fpage>4355</fpage><lpage>4368</lpage><pub-id pub-id-type="doi">10.1109/TSP.2009.2025797</pub-id></citation></ref>
<ref id="b21-remotesensing-04-00532"><label>21</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moussaoui</surname><given-names>S.</given-names></name><name><surname>Carteretb</surname><given-names>C.</given-names></name><name><surname>Briea</surname><given-names>D.</given-names></name><name><surname>Mohammad-Djafaric</surname><given-names>A.</given-names></name></person-group><article-title>Bayesian analysis of spectral mixture data using markov chain monte carlo methods</article-title><source>Chemometr. Intell. Lab</source><year>2006</year><volume>81</volume><fpage>137</fpage><lpage>148</lpage></citation></ref>
<ref id="b22-remotesensing-04-00532"><label>22</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Arngren</surname><given-names>M.</given-names></name><name><surname>Schmidt</surname><given-names>M.N.</given-names></name><name><surname>Larsen</surname><given-names>J</given-names></name></person-group><article-title>Bayesian Nonnegative Matrix Factorization with Volume Prior for Unmixing of Hyperspectral Images</article-title><conf-name>Proceedings of IEEE Workshop on Machine Learning for Signal Processing (MLSP)</conf-name><conf-loc>Grenoble, France</conf-loc><conf-date>2–4 September 2009</conf-date><fpage>1</fpage><lpage>6</lpage></citation></ref>
<ref id="b23-remotesensing-04-00532"><label>23</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Cristianini</surname><given-names>N.</given-names></name><name><surname>Shawe-Taylor</surname><given-names>J</given-names></name></person-group><source>Support Vector Machines and Other Kernel-Based Learning Methods</source><publisher-name>Cambridge University Press</publisher-name><publisher-loc>Cambridge, UK</publisher-loc><year>2000</year></citation></ref>
<ref id="b24-remotesensing-04-00532"><label>24</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Burges</surname><given-names>J.C.</given-names></name></person-group><article-title>A tutorial on Support Vector Machines for pattern recognition</article-title><source>Data Min. Knowl. Disc</source><year>1998</year><volume>2</volume><fpage>121</fpage><lpage>167</lpage></citation></ref>
<ref id="b25-remotesensing-04-00532"><label>25</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Duda</surname><given-names>R.O.</given-names></name><name><surname>Hart</surname><given-names>P.E.</given-names></name><name><surname>Stork</surname><given-names>D.G.</given-names></name></person-group><source>Pattern Classification</source><publisher-name>John Wiley &amp; Sons Inc</publisher-name><publisher-loc>New York, NY, USA</publisher-loc><year>2001</year></citation></ref></ref-list>
<sec sec-type="display-objects">
<title>Figures</title>
<fig id="f1-remotesensing-04-00532" position="float">
<label>Figure 1</label>
<caption>
<p>Exemplary points that were taken from the area pointed by the arrow (full image is given in 2nd figure in Section 2.1).</p></caption>
<graphic xlink:href="remotesensing-04-00532f1.gif"/></fig>
<fig id="f2-remotesensing-04-00532" position="float">
<label>Figure 2</label>
<caption>
<p>Zoom on the exemplary point in <xref ref-type="fig" rid="f1-remotesensing-04-00532">Figure 1</xref>. Left: The marked pixel contains the target with the spectrum <italic>T</italic> as a whole pixel. Center: The marked pixel is a mixed pixel with the spectrum <italic>P</italic>. Right: The marked pixel is a pure background pixel with the spectrum <italic>B</italic>.</p></caption>
<graphic xlink:href="remotesensing-04-00532f2.gif"/></fig>
<fig id="f3-remotesensing-04-00532" position="float">
<label>Figure 3</label>
<caption>
<p>Spectra of the background, the mixed pixel and the target from <xref ref-type="fig" rid="f1-remotesensing-04-00532">Figures 1</xref> and <xref ref-type="fig" rid="f2-remotesensing-04-00532">2</xref>.</p></caption>
<graphic xlink:href="remotesensing-04-00532f3.gif"/></fig>
<fig id="f4-remotesensing-04-00532" position="float">
<label>Figure 4</label>
<caption>
<p>The dataset “desert” is an hyperspectral image of a desert place taken by an airplane flying 10,000 feet above sea level. The resolution is 1.3 m/pixel, 286 <italic>×</italic> 2,640 pixels per waveband with 168 wavebands.</p></caption>
<graphic xlink:href="remotesensing-04-00532f4.gif"/></fig>
<fig id="f5-remotesensing-04-00532" position="float">
<label>Figure 5</label>
<caption>
<p>The dataset “city” is an hyperspectral image of a city taken by an airplane flying 10,000 feet above sea level. The resolution is 1.5 m/pixel, 294 <italic>×</italic> 501 pixels per waveband with 28 wavebands.</p></caption>
<graphic xlink:href="remotesensing-04-00532f5.gif"/></fig>
<fig id="f6-remotesensing-04-00532" position="float">
<label>Figure 6</label>
<caption>
<p>The dataset “field” is an hyperspectral image of a field taken by an airplane flying 9,500 feet above sea level. The resolution is 1.2 m/pixel, 286 <italic>×</italic> 300 pixels per waveband with 50 wavebands.</p></caption>
<graphic xlink:href="remotesensing-04-00532f6.gif"/></fig>
<fig id="f7-remotesensing-04-00532" position="float">
<label>Figure 7</label>
<caption>
<p>The “unmixing histogram” for <xref ref-type="fig" rid="f3-remotesensing-04-00532">Figure 3</xref>. The <italic>x</italic>-axis is the <italic>b</italic> partition of [0<italic>,</italic> 1], <italic>ɛ</italic> = 0.01. The <italic>y</italic>-axis is the number of coordinates that belongs to each interval in the <italic>b</italic> partition</p></caption>
<graphic xlink:href="remotesensing-04-00532f7.gif"/></fig>
<fig id="f8-remotesensing-04-00532" position="float">
<label>Figure 8</label>
<caption>
<p>The solid line corresponds to the real spectrum of the background and the dotted line corresponds to the estimated spectrum of the background</p></caption>
<graphic xlink:href="remotesensing-04-00532f8.gif"/></fig>
<fig id="f9-remotesensing-04-00532" position="float">
<label>Figure 9</label>
<caption>
<p>Random spectral vector signatures.</p></caption>
<graphic xlink:href="remotesensing-04-00532f9.gif"/></fig>
<fig id="f10-remotesensing-04-00532" position="float">
<label>Figure 10</label>
<caption>
<p>The “unmixing histogram” from Definition 5.1 for different <italic>t</italic> values. The horizontal and the vertical axes are the <italic>b</italic> partition of [0<italic>,</italic> 1], <italic>ɛ</italic> = 0.01. The cardinality of the coordinate’s set belongs to each interval in the <italic>b</italic> partition. The boxes inside the graphs is the estimated results from the application of the ROTU algorithm.</p></caption>
<graphic xlink:href="remotesensing-04-00532f10.gif"/></fig>
<fig id="f11-remotesensing-04-00532" position="float">
<label>Figure 11</label>
<caption>
<p>The spectra <italic>T</italic> and <italic>B</italic> of different materials from the database of signatures, where the x- and y-axes are the wavebands and their values, respectively</p></caption>
<graphic xlink:href="remotesensing-04-00532f11.gif"/></fig>
<fig id="f12-remotesensing-04-00532" position="float">
<label>Figure 12</label>
<caption>
<p>Left: The “unmixing histogram” from Definition 5.1. Right: Comparison between <italic>T</italic> and its estimate <italic>T̂</italic>.</p></caption>
<graphic xlink:href="remotesensing-04-00532f12.gif"/></fig>
<fig id="f13-remotesensing-04-00532" position="float">
<label>Figure 13</label>
<caption>
<p>Left: The “unmixing histogram” from Definition 5.1. Right: Comparison between <italic>B</italic> and its estimation <italic>B̂</italic></p></caption>
<graphic xlink:href="remotesensing-04-00532f13.gif"/></fig>
<fig id="f14-remotesensing-04-00532" position="float">
<label>Figure 14</label>
<caption>
<p>Left: The “unmixing histogram” from Definition 5.1. Right: Comparison between <italic>B</italic> and its estimation <italic>B̂</italic></p></caption>
<graphic xlink:href="remotesensing-04-00532f14.gif"/></fig>
<fig id="f15-remotesensing-04-00532" position="float">
<label>Figure 15</label>
<caption>
<p>Left: The “unmixing histogram” from Definition 5.1. Right: Comparison between <italic>B</italic> and its estimation <italic>B̂</italic></p></caption>
<graphic xlink:href="remotesensing-04-00532f15.gif"/></fig>
<fig id="f16-remotesensing-04-00532" position="float">
<label>Figure 16</label>
<caption>
<p>Left: The scene from <xref ref-type="fig" rid="f4-remotesensing-04-00532">Figure 4</xref>. Right: Points that contain a target in above pixel when <italic>t</italic> = 1.</p></caption>
<graphic xlink:href="remotesensing-04-00532f16.gif"/></fig>
<fig id="f17-remotesensing-04-00532" position="float">
<label>Figure 17</label>
<caption>
<p>Left: The scene from <xref ref-type="fig" rid="f5-remotesensing-04-00532">Figure 5</xref>. Right: Points that contain a target in a sub-pixel when <italic>t &gt;</italic> 0.5.</p></caption>
<graphic xlink:href="remotesensing-04-00532f17.gif"/></fig>
<fig id="f18-remotesensing-04-00532" position="float">
<label>Figure 18</label>
<caption>
<p>Left: The source image. Right: The result from the application of the projection separators. The white points contain targets with <italic>t ≥</italic> 0.7.</p></caption>
<graphic xlink:href="remotesensing-04-00532f18.gif"/></fig>
<fig id="f19-remotesensing-04-00532" position="float">
<label>Figure 19</label>
<caption>
<p>Left: The source image. Right: The result from the application of the projection separators. The white points contain a targets with <italic>t ≥</italic> 0.7.</p></caption>
<graphic xlink:href="remotesensing-04-00532f19.gif"/></fig>
<fig id="f20-remotesensing-04-00532" position="float">
<label>Figure 20</label>
<caption>
<p>Left: The red line is the target’s spectrum and the black line measures the background (it was taken from the scene in <xref ref-type="fig" rid="f4-remotesensing-04-00532">Figure 4</xref>). Right: The same two vectors after the projection of the separating mapping. The x-axis represents the wavelengths and the y-axis represents the wavelengths amplitude before (left) and after (right) the application of the projection.</p></caption>
<graphic xlink:href="remotesensing-04-00532f20.gif"/></fig>
<fig id="f21-remotesensing-04-00532" position="float">
<label>Figure 21</label>
<caption>
<p>Representation of 20 sensors (“weak classifiers”) with small supports. They were obtained as shown in Section 6.1. The x-axis represents the wavelengths index and the y-axis represents the values of the sensor-function.</p></caption>
<graphic xlink:href="remotesensing-04-00532f21.gif"/></fig>
<fig id="f22-remotesensing-04-00532" position="float">
<label>Figure 22</label>
<caption>
<p>Left: The target’s spectrum. Right: The principal components of the background’s spectra. The spectra was taken from the “city” image (<xref ref-type="fig" rid="f5-remotesensing-04-00532">Figure 5</xref>).</p></caption>
<graphic xlink:href="remotesensing-04-00532f22.gif"/></fig>
<fig id="f23-remotesensing-04-00532" position="float">
<label>Figure 23</label>
<caption>
<p>Left: This plot represents 200 <italic>d</italic>-support global-characteristic sensors, <italic>d</italic> = 4. They were obtained as shown in Section 6.1. Right: The spectrum of the target (red) and the PCA of the background (black) after the application of the projection separators. The x-axis represents the wavelengths and the y-axis represents the wavelengths amplitude before (left) and after (right) the application of the projection.</p></caption>
<graphic xlink:href="remotesensing-04-00532f23.gif"/></fig>
<fig id="f24-remotesensing-04-00532" position="float">
<label>Figure 24</label>
<caption>
<p>Left: The source image. Right: The white points contain a target with <italic>t &gt;</italic> 0.5 and <italic>corr &gt;</italic> 0.8.</p></caption>
<graphic xlink:href="remotesensing-04-00532f24.gif"/></fig></sec></back></article>
