<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">information</journal-id>
      <journal-title>Information</journal-title>
      <abbrev-journal-title abbrev-type="publisher">Information</abbrev-journal-title>
      <abbrev-journal-title abbrev-type="pubmed">Information</abbrev-journal-title>
      <issn pub-type="epub">2078-2489</issn>
      <publisher>
        <publisher-name>MDPI</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.3390/info3040756</article-id>
      <article-id pub-id-type="publisher-id">information-03-00756</article-id>
      <article-categories>
        <subj-group>
          <subject>Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Quaternionic Multilayer Perceptron with Local Analyticity </article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Isokawa</surname>
            <given-names>Teijiro</given-names>
          </name>
          <xref rid="af1-information-03-00756" ref-type="aff">1</xref>
          <xref rid="c1-information-03-00756" ref-type="corresp">*</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Nishimura</surname>
            <given-names>Haruhiko</given-names>
          </name>
          <xref rid="af2-information-03-00756" ref-type="aff">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Matsui</surname>
            <given-names>Nobuyuki</given-names>
          </name>
          <xref rid="af1-information-03-00756" ref-type="aff">1</xref>
        </contrib>
      </contrib-group>
      <aff id="af1-information-03-00756"><label>1</label> Graduate School of Engineering, University of Hyogo, 2167 Shosha, Himeji, Hyogo 671-2280, Japan; Email: <email>matsui@eng.u-hyogo.ac.jp</email> </aff>
      <aff id="af2-information-03-00756"><label>2</label> Graduate School of Applied Informatics, University of Hyogo, 7-1-28 Minatojima-minamimachi, Chuo-ku,Kobe, Hyogo 650-0047, Japan; Email: <email>haru@ai.u-hyogo.ac.jp</email> </aff>
      <author-notes>
        <corresp id="c1-information-03-00756"><label>*</label> Author  to whom correspondence should be addressed; Email: <email>isokawa@eng.u-hyogo.ac.jp</email>; Tel.: +81-79-267-4952; Fax: +81-79-267-4975.</corresp>
      </author-notes>
      <pub-date pub-type="epub">
        <day>28</day>
        <month>11</month>
        <year>2012</year>
      </pub-date>
      <pub-date pub-type="collection"><month>12</month>
        <year>2012</year>
      </pub-date>
      <volume>3</volume>
      <issue>4</issue>
      <fpage>756</fpage>
      <lpage>770</lpage>
      <history>
        <date date-type="received">
          <day>11</day>
          <month>09</month>
          <year>2012</year>
        </date>
        <date date-type="rev-recd">
          <day>13</day>
          <month>11</month>
          <year>2012</year>
        </date>
        <date date-type="accepted">
          <day>20</day>
          <month>11</month>
          <year>2012</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>©  2012 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
        <copyright-year>2012</copyright-year>
        <license xmlns:xlink="http://www.w3.org/1999/xlink" license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/">
          <p>This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p>
        </license>
      </permissions>
      <abstract>
        <p>A multi-layered perceptron type neural network is presented and analyzed in this paper. All neuronal parameters such as input, output, action potential and connection weight are encoded by quaternions, which are a class of hypercomplex number system. Local analytic condition is imposed on the activation function in updating neurons’ states in order to construct learning algorithm for this network. An error back-propagation algorithm is introduced for modifying the connection weights of the network. </p>
      </abstract>
      <kwd-group>
        <kwd>quaternion</kwd>
        <kwd>local analyticity</kwd>
        <kwd>Wirtinger calculus</kwd>
        <kwd>multilayer perceptron</kwd>
        <kwd>error back-propagation </kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec sec-type="intro">
      <title>1. Introduction</title>
      <p>Processing multi-dimensional data is an important problem for artificial neural networks. A single neuron can take only one real value as its input, thus a network should be configured so that several neurons are used for accepting multi-dimensional data. This type of configuration is sometimes unnatural in applications of artificial neural networks to engineering problem, such as processing of acoustic signals or coordinates in the plane. Thus, complex number systems have been utilized to represent two-dimensional data elements as a single entity. Application of complex numbers to neural networks have been extensively investigated, as summarized in the references [<xref ref-type="bibr" rid="B1-information-03-00756">1</xref>,<xref ref-type="bibr" rid="B2-information-03-00756">2</xref>,<xref ref-type="bibr" rid="B3-information-03-00756">3</xref>]. </p>
      <p>Though complex values can treat two-dimensional data elements as a single entity, what we should treat data with more than two-dimension in artificial neural networks? Although this problem can of course be solved by applying several real-valued or complex-valued neurons, it would be useful to introduce a number system with higher dimensions, the so-called hypercomplex number systems. </p>
      <p>Quaternion is a four-dimensional hypercomplex number system introduced by Hamilton [<xref ref-type="bibr" rid="B4-information-03-00756">4</xref>,<xref ref-type="bibr" rid="B5-information-03-00756">5</xref>]. This number system has been extensively employed in several fields, such as modern mathematics, physics, control of satellites, computer graphics, <italic>etc. </italic> [<xref ref-type="bibr" rid="B6-information-03-00756">6</xref>,<xref ref-type="bibr" rid="B7-information-03-00756">7</xref>,<xref ref-type="bibr" rid="B8-information-03-00756">8</xref>]. One of the benefits provided by quaternions is that affine transformations of geometric figures in three-dimensional spaces, especially spatial rotations, can be represented compactly and efficiently. Applying quaternions to the field of neural networks has been recently explored in an effort to naturally represent high-dimensional information, such as color and three-dimensional body coordinates, by a quaternionic neuron, rather than complex-valued or real-valued neurons. </p>
      <p>Thus, there has been a growing number of studies concerning the use of quaternions in neural networks. Multilayer perceptron (MLP) models have been developed in [<xref ref-type="bibr" rid="B9-information-03-00756">9</xref>,<xref ref-type="bibr" rid="B10-information-03-00756">10</xref>,<xref ref-type="bibr" rid="B11-information-03-00756">11</xref>,<xref ref-type="bibr" rid="B12-information-03-00756">12</xref>,<xref ref-type="bibr" rid="B13-information-03-00756">13</xref>,<xref ref-type="bibr" rid="B14-information-03-00756">14</xref>]. The use of quaternion in MLP models has been applied to several engineering problems such as control problems [<xref ref-type="bibr" rid="B10-information-03-00756">10</xref>], color image compression [<xref ref-type="bibr" rid="B12-information-03-00756">12</xref>], color night vision [<xref ref-type="bibr" rid="B15-information-03-00756">15</xref>,<xref ref-type="bibr" rid="B16-information-03-00756">16</xref>], and predictions for the output of chaos circuits and winds in three-dimensional space [<xref ref-type="bibr" rid="B13-information-03-00756">13</xref>,<xref ref-type="bibr" rid="B14-information-03-00756">14</xref>]. Other types of network models has also been explored, such as the computational ability of a single quaternionic neuron [<xref ref-type="bibr" rid="B17-information-03-00756">17</xref>] and the existence condition of an energy function in continuous-time and continuous-state recurrent networks [<xref ref-type="bibr" rid="B18-information-03-00756">18</xref>]. There are several types of quaternionic Hopfield-type networks with discrete-time driven, such as bipolar state [<xref ref-type="bibr" rid="B19-information-03-00756">19</xref>,<xref ref-type="bibr" rid="B20-information-03-00756">20</xref>], continuous state [<xref ref-type="bibr" rid="B21-information-03-00756">21</xref>,<xref ref-type="bibr" rid="B22-information-03-00756">22</xref>], multistate by phase representation [<xref ref-type="bibr" rid="B23-information-03-00756">23</xref>,<xref ref-type="bibr" rid="B24-information-03-00756">24</xref>]. Learning schemes for these networks have also been proposed in [<xref ref-type="bibr" rid="B25-information-03-00756">25</xref>].</p>
      <p>One of the difficulties in constructing neural networks in the quaternionic domain is about the introduction of suitable functions for the activation function in updating the neurons’ states. A typical type of activation function is the so-called “split” type function, in which a real-valued function is appliedto update each component of a quaternionic value [<xref ref-type="bibr" rid="B10-information-03-00756">10</xref>]. Real-valued sigmoidal function and hyperbolic function, which are analytic (differentiable) functions, are often used for this purpose. However, as an activation function, the split-type quaternionic function is not appropriate due to lack of analyticity. Thus, it is necessary to define other types of differential functions in order to construct learning schemes such as the error back-propagation algorithm. The Cauthy-Riemann-Fueter (CRF) equation defines the analytic condition for the quaternionic functions, which corresponds to the Cauthy-Riemann equation for the complex-valued functions. The functions satisfying the CRF equation turn out to be only linear functionsor constants; therefore it is impossible to introduce non-linearity into the updates of the neurons’ states.</p>
      <p>Recently, another class of analyticity for the quaternionic functions has been developed [<xref ref-type="bibr" rid="B26-information-03-00756">26</xref>,<xref ref-type="bibr" rid="B27-information-03-00756">27</xref>,<xref ref-type="bibr" rid="B28-information-03-00756">28</xref>]. Called <italic>“local analyticity”</italic>, this analytic condition is derived at a quaternionic point with its local coordinate, rather than in a quaternionic space with a global coordinate. The derivation of local analytic condition shows that a quaternion in the local coordinate system is isomorphic to the complex number system and thus it can be treated as a complex value. A neural network with an activation function with local analyticity has been first proposed and analyzed in [<xref ref-type="bibr" rid="B13-information-03-00756">13</xref>,<xref ref-type="bibr" rid="B14-information-03-00756">14</xref>] in terms of MLP-type network. It is shown that the performance for this network is superior to the network with a split-type activation function through several applications. Another application of analytic activation function to quaternionic neural networks has been investigated in terms of Hopfield-type network [<xref ref-type="bibr" rid="B22-information-03-00756">22</xref>]. The local analytic condition is constructed based on [<xref ref-type="bibr" rid="B28-information-03-00756">28</xref>], and the stability conditions are derived in the case of quaternionic tanh function being used for an activation function is deduced in this network, in which the complex-valued tanh function used in [<xref ref-type="bibr" rid="B29-information-03-00756">29</xref>] can be used as a quaternionic function. </p>
      <p>This paper presents an MLP-type quaternionic neural network with locally analytic activation function. All variables in this network, such as input, output, action potential and connection weights, are encoded by quaternions. A learning scheme, a quaternionic equivalent of error back-propagation algorithm, is presented and theoretically explored. The derivation of the learning scheme in this papera dopts the Wirtinger calculus [<xref ref-type="bibr" rid="B30-information-03-00756">30</xref>], which has been invented in the field of complex analysis, where a quaternionic value and its conjugate are treated independent of each other. This calculus enables the derivations to be more straightforward of its description, than by using the conventional description, <italic>i.e.</italic>, Cartesian representation.</p>
    </sec>
    <sec>
      <title>2. Quaternionic Algebra</title>
      <sec>
        <title>2.1. Definition of Quaternion</title>
        <p>Quaternions forma class of hypercomplex numbers consistingofareal number and three imaginary numbers-<italic>i</italic>, <italic>j</italic>, and <italic>k</italic>. Formally, a quaternion number is defined as a vector x in a four-dimensional vector space,
        <disp-formula id="information-03-00756-i001"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i001.tif"/><label>(1)</label></disp-formula>
        where <italic>x</italic><sup>(</sup><sup><italic>e</italic></sup><sup>)</sup>, <italic>x</italic><sup>(</sup><sup><italic>i</italic></sup><sup>)</sup>, <italic>x</italic><sup>(</sup><sup><italic>j</italic></sup><sup>)</sup>, and <italic>x</italic><sup>(</sup><sup><italic>k</italic></sup><sup>) </sup>are real numbers. The division ring of quaternions, <italic><bold>H</bold></italic>, constitutes the four-dimensional vector space over the real numbers with bases 1, <italic><bold>i</bold></italic>, <italic><bold>j</bold></italic>, and <italic><bold>k</bold></italic>. Equation (1) can also be written using 4-tuple or 2-tuple notation as 
        <disp-formula id="information-03-00756-i002"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i002.tif"/><label>(2)</label></disp-formula>
        where <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i046.tif"/> = {<italic> x</italic><sup>(</sup><sup><italic>i</italic></sup><sup>)</sup><italic>, x</italic><sup>(</sup><sup><italic>j</italic></sup><sup>)</sup><italic>, x</italic><sup>(</sup><sup><italic>k</italic></sup><sup>)</sup>}. In this representation, <italic>x</italic><sup>(</sup><sup><italic>e</italic></sup><sup>) </sup>is the scalar part of <italic>x</italic>, and <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i046.tif"/> forms the vector part. The quaternion conjugate is defined as
        <disp-formula id="information-03-00756-i003"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i003.tif"/><label>(3)</label></disp-formula></p>
        <p>Quaternion bases satisfy the following identities, 
        <disp-formula id="information-03-00756-i004"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i004.tif"/><label>(4)</label></disp-formula>
        <disp-formula id="information-03-00756-i005"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i005.tif"/><label>(5)</label></disp-formula>
        known as the Hamilton rule. From these rules, it follows immediately that multiplication of quaternions is not commutative.</p>
        <p>Next, we define the operations between quaternions <bold><italic>p</italic></bold> = (<italic>p</italic><sup>(</sup><sup><italic>e</italic></sup><sup>)</sup>, <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i047.tif"/>) = (<italic>p</italic><sup>(</sup><sup><italic>e</italic></sup><sup>)</sup>, <italic>p</italic><sup>(</sup><sup><italic>i</italic></sup><sup>)</sup>, <italic>p</italic><sup>(</sup><sup><italic>j</italic></sup><sup>)</sup>, <italic>p</italic><sup>(</sup><sup><italic>k</italic></sup><sup>)</sup>) and <bold><italic>q</italic></bold> = (<italic>q</italic><sup>(</sup><sup><italic>e</italic></sup><sup>)</sup>, <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i047.tif"/>) = (<italic>q</italic><sup>(</sup><sup><italic>e</italic></sup><sup>)</sup>, <italic>q</italic><sup>(</sup><sup><italic>i</italic></sup><sup>)</sup>, <italic>q</italic><sup>(</sup><sup><italic>j</italic></sup><sup>)</sup>, <italic>q</italic><sup>(</sup><sup><italic>k</italic></sup><sup>)</sup>). The addition and subtraction of quaternions are defined in a similar manner as for complex-valued numbers or vectors, <italic>i.e.</italic>, 
        <disp-formula id="information-03-00756-i006"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i006.tif"/><label>(6)</label></disp-formula>
        <disp-formula id="information-03-00756-i007"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i007.tif"/><label>(7)</label></disp-formula></p>
        <p>The product of <italic><bold>p</bold></italic> and <italic><bold>q</bold></italic> is determined by Equation (5) as 
        <disp-formula id="information-03-00756-i008"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i008.tif"/><label>(8)</label></disp-formula>
        where <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i047.tif"/> · <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i048.tif"/> and <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i047.tif"/> × <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i048.tif"/> denote the dot and cross products, respectively, between three-dimensional vectors <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i047.tif"/> and <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i048.tif"/>. The conjugate of the product is given as 
        <disp-formula id="information-03-00756-i009"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i009.tif"/><label>(9)</label></disp-formula></p>
        <p>The quaternion norm of <italic>x</italic>, denoted by |<italic>x</italic>|, is defined as 
        <disp-formula id="information-03-00756-i010"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i010.tif"/><label>(10)</label></disp-formula></p>
      </sec>
      <sec>
        <title>2.2. Quaternionic Analyticity</title>
        <p>It is important to introduce an analytic function (or differentiable function) to serve as the activation function in the neural network. This section describes the required analyticity of the function in the quaternionic domain, in order to construct activation functions for quaternionic neural networks. </p>
        <p>The condition for differentiability of the quaternionic function <bold><italic>f</italic></bold> is given by 
        <disp-formula id="information-03-00756-i011"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i011.tif"/><label>(11)</label></disp-formula></p>
        <p>The analytic condition for the quaternionic function, called the Cauchy-Riemann-Fueter (CRF) equation, yields: 
        <disp-formula id="information-03-00756-i012"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i012.tif"/><label>(12)</label></disp-formula></p>
        <p>This is an extension of the Cauchy-Riemann (CR) equations defined for the complex domain. However, only linear functions and constants satisfy the CRF equation [<xref ref-type="bibr" rid="B14-information-03-00756">14</xref>,<xref ref-type="bibr" rid="B26-information-03-00756">26</xref>,<xref ref-type="bibr" rid="B27-information-03-00756">27</xref>]. </p>
        <p>An alternative approach to assure analyticity in the quaternionic domain has been explored in [<xref ref-type="bibr" rid="B26-information-03-00756">26</xref>,<xref ref-type="bibr" rid="B27-information-03-00756">27</xref>,<xref ref-type="bibr" rid="B28-information-03-00756">28</xref>]. This approach is called <italic>local analyticity </italic>and is distinguished from the standard analyticity, <italic>i.e.</italic>, global analyticity. In the following, we introduce local derivatives with Wirtinger representation and analytic conditions for quaternionic functions, with reference to [<xref ref-type="bibr" rid="B28-information-03-00756">28</xref>]. </p>
        <p>A quaternion <italic>x</italic> can be alternatively represented as: 
        <disp-formula id="information-03-00756-i013"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i013.tif"/><label>(13)</label></disp-formula>
        <disp-formula id="information-03-00756-i014"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i014.tif"/><label>(14)</label></disp-formula>
        <disp-formula id="information-03-00756-i015"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i015.tif"/><label>(15)</label></disp-formula></p>
        <p>From the definition in Equation (15), we deduce that <italic>u</italic><sup>2 </sup><sub>x </sub>= −1. If <italic>u</italic><italic><sub>x</sub></italic> holds a commutative property against a difference of <italic>x</italic>, then the system with <italic>u</italic><italic><sub>x</sub></italic> can be regarded as locally isomorphic to the complex number system. </p>
        <p>A quaternionic difference of <italic>x</italic>, denoted by <italic>dx</italic> =(<italic>dx</italic><sup>(</sup><sup><italic>e</italic></sup><sup>)</sup>, <italic>dx</italic><sup>(</sup><sup><italic>i</italic></sup><sup>)</sup>, <italic>dx</italic><sup>(</sup><sup><italic>j</italic></sup><sup>)</sup>, <italic>dx</italic><sup>(</sup><sup><italic>k</italic></sup><sup>)</sup>), can be decomposed by using: 
        <disp-formula id="information-03-00756-i016"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i016.tif"/><label>(16)</label></disp-formula>
        where,
        <disp-formula id="information-03-00756-i033"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i033.tif"/></disp-formula></p>
        <p>Then, the following relations hold: 
        <disp-formula id="information-03-00756-i034"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i034.tif"/></disp-formula>
        when we set <italic>dx</italic><sub>⊥</sub> = 0, <italic>i.e.</italic>, <italic>dx</italic> + <italic>u</italic><italic><sub>x</sub></italic><italic>dx</italic><italic>u</italic><italic><sub>x</sub></italic> = 0,which results in <italic>u</italic><italic><sub>x</sub></italic><italic>dx</italic> = <italic>dx</italic><italic>u</italic><italic><sub>x</sub></italic>. This leads to <italic>u</italic><italic><sub>x</sub></italic> × <italic>d</italic><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i046.tif"/> =0, because <italic>u</italic><sub>x</sub> is a quaternion without a real part. Thus, <italic>u</italic><sub>x</sub> and <italic>d</italic><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i046.tif"/> are parallel to each other. Then, <italic>d</italic><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i046.tif"/> = <italic>u</italic><italic><sub>x</sub></italic>δ can be obtained, where <italic>δ</italic> is a real-valued constant. From Equation (14), it follows that 
        <disp-formula id="information-03-00756-i035"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i035.tif"/></disp-formula></p>
        <p>Then, 
        <disp-formula id="information-03-00756-i036"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i036.tif"/></disp-formula></p>
        <p>Considering <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i046.tif"/> = <italic>u</italic><italic><sub>x</sub></italic><italic>r</italic> and <italic>d</italic><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i046.tif"/> = <italic>u</italic><italic><sub>x</sub></italic><italic>δ</italic>, we obtain 
        <disp-formula id="information-03-00756-i037"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i037.tif"/></disp-formula></p>
        <p>Hence, <italic>δ</italic> = <italic>dr</italic> is derived and <italic>d</italic><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i046.tif"/> = <italic>u</italic><italic><sub>x</sub></italic><italic>dr</italic> is obtained. <italic>dx</italic> is represented as <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i049.tif"/>.</p>
        <p><xref ref-type="fig" rid="information-03-00756-f001">Figure 1</xref> shows a schematic coordinate system for defining a local complex plane. The component <italic>k</italic> is omitted (<italic>x</italic><sup>(</sup><sup><italic>k</italic></sup><sup>) </sup>= 0) in this figure due to difficulties in representing a four-dimensional vector space. In this example, for a given quaternion <italic>x</italic>, its unit vector <italic>u</italic><sub>x</sub> is defined in the <italic>i</italic>-<italic>j</italic> plane. Then, a complex plane is defined by spanning the components <italic>x</italic><sup>(</sup><sup><italic>e</italic></sup><sup>)</sup>(real axis) and <italic>x</italic><sup>(</sup><sup><italic>r</italic></sup><sup>) </sup>in the quaternionic space, and the analytic condition is constrained in this plane.</p>
        <fig id="information-03-00756-f001" position="anchor">
          <label>Figure 1</label>
          <caption>
            <p>A schematic illustration of local complex plane in a quaternionic space, where the component <italic>k</italic> is omitted for simplicity.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-g001.tif"/>
        </fig>
        <p>The local derivative operators are introduced, corresponding to the form of <italic>dx</italic><sub>‖</sub>, as follows:
        <disp-formula id="information-03-00756-i038"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i038.tif"/></disp-formula>
        where
        <disp-formula id="information-03-00756-i039"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i039.tif"/></disp-formula>
        with the properties
        <disp-formula id="information-03-00756-i040"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i040.tif"/></disp-formula></p>
        <p>Note that the variables <italic>x</italic> and <italic>x</italic><italic><sup>∗</sup></italic> turn out to be independent of each other. These derivative operators are quaternionic equivalents to the well-known <italic>Wirtinger derivative </italic>in the complex domain [<xref ref-type="bibr" rid="B30-information-03-00756">30</xref>]. </p>
        <p><italic><bold>F</bold></italic> (<italic>x</italic> + <italic>dx</italic>) can be expanded using the above-mentioned representations as 
        <disp-formula id="information-03-00756-i017"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i017.tif"/><label>(17)</label></disp-formula>
        where, 
        <disp-formula id="information-03-00756-i041"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i041.tif"/></disp-formula></p>
        <p>When <italic>dx</italic><sub>⊥</sub> = 0, the local derivative of <italic><bold>F</bold></italic> (<italic>x</italic>) is written as 
        <disp-formula id="information-03-00756-i042"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i042.tif"/></disp-formula>
        and the local analytic condition for the function <italic><bold>F</bold></italic> (<italic>x</italic>) is given by 
        <disp-formula id="information-03-00756-i043"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i043.tif"/></disp-formula>
        in the corresponding local complex plane. This result corresponds to the one presented in [<xref ref-type="bibr" rid="B27-information-03-00756">27</xref>], where <italic>dx</italic><sub>⊥</sub> =0 always holds. </p>
        <p>Moreover, if <italic><bold>F</bold></italic> is a function with the two arguments, <italic>x</italic> and <italic>x</italic><sup>∗</sup>, it becomes: 
        <disp-formula id="information-03-00756-i018"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i018.tif"/><label>(18)</label></disp-formula>
        with <italic>x</italic> and <italic>x</italic><sup>∗</sup> being independent of each other. As a result, we can treat quaternionic functions in the same manner as complex-valued functions under the condition of local analyticity. </p>
      </sec>
    </sec>
    <sec>
      <title>3. Quaternionic Multilayer Perceptron</title>
      <sec>
        <title>3.1. Network Model</title>
        <p>The structure of the network assumed in this paper is shown in <xref ref-type="fig" rid="information-03-00756-f002">Figure 2</xref>. This network is a so-called multilayer perceptron network with one hidden layer, and the parameters in the network are encoded by quaternionic values. </p>
        <p>The numbers of neurons in the input, hidden and output layers are set to <italic>M</italic>, <italic>N</italic> and <italic>K</italic>, respectively. A set of quaternionic signals denoted by <italic>z</italic> is input to the neurons in the input layer of the network. The outputs of the neurons in the input layer are the same as input <italic>z</italic>’s. In the hidden layer, each neuron takes the weighted sum of the output signals from the input layer. The (connection) weight from the <italic>n</italic>-th neuron in the input layer to the <italic>m</italic>-th neuron in the hidden layer is denoted by <italic>v</italic><italic><sub>nm</sub></italic>. The output of the neuron in the hidden layer, denoted by <italic>x</italic><italic><sub>n</sub></italic>, is determined by 
        <disp-formula id="information-03-00756-i019"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i019.tif"/><label>(19)</label></disp-formula>
        where <italic>g</italic> is a quaternionic activation function introducing non-linearity between the action potential and output in the neuron. This function satisfies the following condition: 
        <disp-formula id="information-03-00756-i020"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i020.tif"/><label>(20)</label></disp-formula></p>
        <fig id="information-03-00756-f002" position="anchor">
          <label>Figure 2</label>
          <caption>
            <p>The structure of the multilayer perceptron in this paper.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-g002.tif"/>
        </fig>
        <p>Processing the neurons’ outputs in the output layer can be defined in the same manner as in the hidden layer. The output of the neuron in the output layer, <italic>y</italic><italic><sub>k</sub></italic>, is defined as 
        <disp-formula id="information-03-00756-i021"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i021.tif"/><label>(21)</label></disp-formula>
        where the function <italic>h</italic> is a quaternionic activation function from the action potential to the output, and <italic>w</italic><italic><sub>kn</sub></italic> is the connection weight between the <italic>n</italic>-th neuron in the hidden layer and the <italic>k</italic>-th neuron in the output layer. The function <italic>h</italic> also satisfies 
        <disp-formula id="information-03-00756-i022"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i022.tif"/><label>(22)</label></disp-formula></p>
        <p>The connection weights should be modified by the so-called learning algorithms, in order to obtain the desired output signals with respect to the input signals. One of the learning algorithms for MLP-type networks is the error back-propagation (EBP) algorithm. The following section describes our derivation of this algorithm for the presented network. </p>
      </sec>
      <sec>
        <title>3.2. Learning Algorithm</title>
        <p>An EBP algorithm works so that the output error, calculated by the neurons’ outputs at the output layer and the desired output signals, is minimized. In the case of networks with three layers as shown in <xref ref-type="fig" rid="information-03-00756-f002">Figure 2</xref>, the connection weights between the hidden and output layers are first modified, and then the weights between the input and hidden layers are modified. In general (networks with n layers), EBP algorithms first modify the connection weights between the <italic>n</italic>-th layer (the output layer) and (<italic>n</italic> − 1)-th layer, and then between the (<italic>n</italic> − 1)-th layer and the (<italic>n</italic> − 2)-th layer, <italic>etc. </italic>This section only describes the three-layers case. </p>
        <p>First, let <italic>d</italic><italic><sub>k</sub></italic> be a quaternionic desired signal for the <italic>k</italic>-th output neuron when <italic>z</italic>’s are input to the network. The connection weights affect the output signals with respect to a set of input signals, thus the error <italic>E</italic> is regarded as a function with arguments <italic>w</italic><italic><sub>kn</sub></italic>’s and <italic>w</italic><italic><sub>kn</sub></italic><sup> ∗</sup>’s. The output error E at the time t is then defined as 
        <disp-formula id="information-03-00756-i023"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i023.tif"/><label>(23)</label></disp-formula></p>
        <p>The output error should be real-valued so that it can be minimized.</p>
        <p>Suppose that the connection weights are updated at the time (<italic>t </italic>+ 1) by
        <disp-formula id="information-03-00756-i024"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i024.tif"/><label>(24)</label></disp-formula>
        where Δ<italic>w</italic><italic><sub>kn</sub></italic> is a quantity in updating. Then, the output error at the time (<italic>t </italic>+ 1) can be written as
        <disp-formula id="information-03-00756-i025"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i025.tif"/><label>(25)</label></disp-formula></p>
        <p>Note that the local analytic condition in quaternionic domain should be satisfied in calculating the derivatives. Thus, if we set Δ<italic>w</italic><italic><sub>kn</sub></italic> as 
        <disp-formula id="information-03-00756-i026"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i026.tif"/><label>(26)</label></disp-formula>
        where <italic>µ</italic> is a quaternionic constant, the temporal difference of the output error, ∆<italic>E</italic>, becomes 
        <disp-formula id="information-03-00756-i027"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i027.tif"/><label>(27)</label></disp-formula></p>
        <p>If the real part of <italic>µ</italic> is positive, ∆<italic>E</italic> ≤ 0 holds. This indicates that the output error would decrease upon updating weights according to Equations (24) and (26). For calculating the updated quantity in Equation (26), the component <italic>∂E</italic>/<italic>∂</italic><italic>w</italic><sub>kn</sub><sup>∗</sup><sub>‖</sub> are expanded by using chain rule of derivative and ∂<italic>y</italic>/<italic>∂</italic><italic>w</italic><sup>∗</sup><sub>‖</sub>= 0 from the local analytic condition is applied: 
        <disp-formula id="information-03-00756-i028"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i028.tif"/><label>(28)</label></disp-formula>
        where <italic>h</italic>′ is the (local) derivative of the activation function <italic>h</italic>, and <italic>δ</italic><italic><sub>k</sub></italic> is defined as <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i050.tif"/>.</p>
        <p>Similarly, the updates for the connection weights <italic>v</italic>’s can be deduced. The output error function <italic>E</italic> isa function with arguments <italic>v</italic><italic><sub>nm</sub></italic>’s and <italic>v</italic><italic><sup>∗</sup><sub>nm</sub></italic>’s, thus the output error at the time (<italic>t</italic> + 1) can be represented by 
        <disp-formula id="information-03-00756-i029"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i029.tif"/><label>(29)</label></disp-formula></p>
        <p>Hence, <italic>v</italic><italic><sub>nm</sub></italic> is updated with the quantity ∆<italic>v</italic><italic><sub>nm</sub></italic>, 
        <disp-formula id="information-03-00756-i030"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i030.tif"/><label>(30)</label></disp-formula>
        <disp-formula id="information-03-00756-i031"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i031.tif"/><label>(31)</label></disp-formula></p>
        <p>This leads to ∆<italic>E</italic> ≤ 0 under the condition of the real part of <italic>µ</italic> being positive. The component <italic>∂E</italic>/<italic>∂<sup>*</sup></italic><italic>v</italic><italic><sub>nm</sub></italic><sub>‖ </sub>can be expanded by chain rules and local analytic conditions, <italic>∂</italic><italic>x</italic><italic>/∂</italic><italic>v</italic><sup>*</sup><sub>‖</sub> = 0, <italic>∂</italic><italic>y</italic>/<italic>∂</italic><italic>x</italic><sup>*</sup><sub>‖</sub> = 0, and <italic>∂</italic><italic>x</italic><italic>/∂</italic><italic>v</italic><sup>*</sup><sub>‖</sub> = 0 are applied:
        <disp-formula id="information-03-00756-i044"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i044.tif"/></disp-formula></p>
        <p>Using the derivatives <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i051.tif"/>, and <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i052.tif"/>, we finally obtain 
        <disp-formula id="information-03-00756-i032"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="information-03-00756-i032.tif"/><label>(32)</label></disp-formula></p>
        <p>Once a set of network output {<italic>y</italic><italic><sub>k</sub></italic>} is obtained for a set of net work input, the output error with respect to a target set {<italic>d</italic><italic><sub>k</sub></italic>} can be calculated by Equation (23). Then, the connection weights between hidden and output layers are modified by Equations (24-28). The connection weights between input and hidden layers are finally modified by Equations (30-32). </p>
      </sec>
      <sec>
        <title>3.3. Universal Approximation Capability</title>
        <p>As an example of activation functions for neurons (<italic><bold>g </bold></italic> and <italic><bold>h</bold></italic>), the quaternionic tanh function [<xref ref-type="bibr" rid="B22-information-03-00756">22</xref>] can be used. Other types of activation functions are also available, because complex-valued functions can be used for the presented network and the properties of several functions have been explored for the activation functions in [<xref ref-type="bibr" rid="B29-information-03-00756">29</xref>]. It is important to consider the capability of the proposed quaternionic network with these activation functions, <italic>i.e.</italic>, whether the proposed network can approximate given functions. </p>
        <p>This concern is known as the universal approximation theorem [<xref ref-type="bibr" rid="B31-information-03-00756">31</xref>,<xref ref-type="bibr" rid="B32-information-03-00756">32</xref>]. In the real-valued MLPs with single hidden layer,the universality has been proven with a so-called sigmoidal function being used as an activation function of neurons. Other than sigmoidal functions, the bounded and differentiable functions are also available for activation functions. </p>
        <p>This theorem is also discussed in the case of complex-valued networks [<xref ref-type="bibr" rid="B29-information-03-00756">29</xref>]. The condition for boundedness in the real-valued MLPs is not required in some complex-valued MLPs. There are three types of activation functions discussed in [<xref ref-type="bibr" rid="B29-information-03-00756">29</xref>], which are categorized by the properties of complex-valued functions, and for each of them, it is shown that universal approximation can be achieved. </p>
        <p>The first type of complex-valued functions concerns the functions without anysingular points. These functions can be used as activation functions and the networks with this type of activation functions are shown as good approximators. Although some of the functions are not bounded, they can be used by introducing bounding operation for their regions. The second type concerns the functions having the bounded singular points, e.g., the discontinuous functions. These singularities can be removed and thus they can also be used for activation functions and can achieve their universality. The last type is for the functions with the so-called essential singularities, <italic>i.e.</italic>, their singularities cannot be removed. These functions can also be used as activation functions, with the consideration of restricting the regions for them so that their regions never cover their singularities. </p>
        <p>In the proposed quaternionic MLPs, for example, a quaternionic tanh function can be used as an activation function. This function is unbounded and may contain several kinds of singularities as in the case of complex-valued functions described above. Thus this quaternionic MLPs would face the same problem, <italic>i.e.</italic>, the existence of singularities, but this can also be handled similarly as in the case of complex-valued MLPs for the removal or avoidance of such singularities. It could be possible to show the universality with handling singularities for the proposed MLPs according to the ways adopted in the complex-valued MLPs [<xref ref-type="bibr" rid="B29-information-03-00756">29</xref>],but it remains as our future work. </p>
      </sec>
    </sec>
    <sec sec-type="conclusions">
      <title>4. Conclusions and Discussion</title>
      <p>This paper has proposed a multilayer type neural network and an error back-propagation algorithm for its learning scheme in the quaternionic domain. The neurons in this network adopt locally analytic activation functions. The quaternionic functions with local analytic conditions are isomorphic to the complex functions, thus several activation functions, such as complex-valued tanh function, can be used extendedly in the quaternionic domain. The Wirtinger calculus, where a quaternion and its conjugate are treated as independent of each other, makes the derivation of the learning scheme clear and compact. </p>
      <p>Analytic conditions for quaternionic functions are derived by defining a complex plane at a quaternionic point, which is a kind of reduction from quaternionic domain to complex domain. There exists another type of reduction in quaternionic domains, such as the commutative quaternion, which is a four-dimensional hypercomplex number system with commutativity in its multiplication [<xref ref-type="bibr" rid="B33-information-03-00756">33</xref>,<xref ref-type="bibr" rid="B34-information-03-00756">34</xref>,<xref ref-type="bibr" rid="B35-information-03-00756">35</xref>,<xref ref-type="bibr" rid="B36-information-03-00756">36</xref>]. A principal property is that a commutative quaternion can be decomposed and represented by two complex numbers with two linearly independent bases (called the decomposed form by idempotent bases). Commutative quaternions have been applied to neural networks only in terms of Hopfield-type network [<xref ref-type="bibr" rid="B24-information-03-00756">24</xref>], but defining multilayer perceptron type networks can alsobe straightforward. Thus, it will be interesting to explore the relationship between commutative quaternion-based networks and networks with the local analyticity. </p>
      <p>Showing the universality of the proposed network is also an important issue. Quaternionic functions, such as tanh function, may contain several kinds of singularities where the values of functions or their differentials are not defined in particular regions. In complex-valued networks with fully complex-valued function [<xref ref-type="bibr" rid="B29-information-03-00756">29</xref>], the universality of the networks can be shown by dealing with these singularities, so that such singularities are removed or avoided by restricting the regions. It is expected to show the universality of the proposed network, in a similar way to the case of complex-valued networks. </p>
      <p>Also, it is necessary to investigate the performances of the proposed network, though in this paper the experimental exploration could not be accomplished. The proposed network is similar to the networks proposed in [<xref ref-type="bibr" rid="B13-information-03-00756">13</xref>,<xref ref-type="bibr" rid="B14-information-03-00756">14</xref>], due to the introduction of local analytic function in quaternionic domain, thus the performances for both types of networks may have similar tendencies. Performance comparisons can also be conducted between the proposed network and the ones in [<xref ref-type="bibr" rid="B29-information-03-00756">29</xref>], because both networks adopt the same representation in their constructions, <italic>i.e.</italic>, Wirtinger calculus. A wide variety of activation functions have been investigated including the split-type function, phase-preserving function [<xref ref-type="bibr" rid="B37-information-03-00756">37</xref>], and circular-type function [<xref ref-type="bibr" rid="B38-information-03-00756">38</xref>]. Similar experiments should be conducted for the quaternionic networks. </p>
      <p>Application of the presented network to engineering problems is also challenging. The processing of three or four dimensional vector data, such as color/multi-spectral image processing, predictions for three-dimensional protein structures, and controls of motion in three-dimensional space, will be the candidates from now on. </p>
    </sec>
  </body>
  <back>
    <ack>
      <title>Acknowledgments</title>
      <p>This study was financially supported by Japan Society for the Promotion of Science (Grant-in-Aids for Young Scientists (B) 24700227 and Scientific Research (C) 23500286). </p>
    </ack>
    <ref-list>
      <title>References</title>
      <ref id="B1-information-03-00756">
        <label>1.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Hirose</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <source>Complex-Valued Neural Networks: Theories and Application</source>
          <publisher-name>World Scientific Publishing</publisher-name>
          <publisher-loc>Singapore</publisher-loc>
          <year>2003</year>
        </citation>
      </ref>
      <ref id="B2-information-03-00756">
        <label>2.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Hirose</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <source>Complex-Valued Neural Networks</source>
          <publisher-name>Springer-Verlag</publisher-name>
          <publisher-loc>Berlin, Germany</publisher-loc>
          <year>2006</year>
        </citation>
      </ref>
      <ref id="B3-information-03-00756">
        <label>3.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Nitta</surname>
              <given-names>T.</given-names>
            </name>
          </person-group>
          <source>Complex-Valued Neural Networks: Utilizing High-Dimensional Parameters</source>
          <publisher-name>Information Science Reference</publisher-name>
          <publisher-loc>New York, NY, USA</publisher-loc>
          <year>2009</year>
        </citation>
      </ref>
      <ref id="B4-information-03-00756">
        <label>4.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Hamilton</surname>
              <given-names>W.R.</given-names>
            </name>
          </person-group>
          <source>Lectures on Quaternions</source>
          <publisher-name>Hodges and Smith</publisher-name>
          <publisher-loc>Dublin, Ireland</publisher-loc>
          <year>1853</year>
        </citation>
      </ref>
      <ref id="B5-information-03-00756">
        <label>5.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Hankins</surname>
              <given-names>T.L.</given-names>
            </name>
          </person-group>
          <source>Sir William Rowan Hamilton</source>
          <publisher-name>Johns Hopkins University Press</publisher-name>
          <publisher-loc>Baltimore, MD, USA</publisher-loc>
          <year>1980</year>
        </citation>
      </ref>
      <ref id="B6-information-03-00756">
        <label>6.</label>
        <citation citation-type="confproc">
          <person-group person-group-type="author">
            <name>
              <surname>Mukundan</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>Quaternions: From classical mechanics to computer graphics, and beyond</article-title>
          <source>Proceedings of the 7th Asian Technology Conference in Mathematics</source>
          <conf-loc>Melaka, Malaysia</conf-loc>
          <conf-date>17-21 December 2002</conf-date>
          <fpage>97</fpage>
          <lpage>105</lpage>
        </citation>
      </ref>
      <ref id="B7-information-03-00756">
        <label>7.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Kuipers</surname>
              <given-names>J.B.</given-names>
            </name>
          </person-group>
          <source>Quaternions and Rotation Sequences: A Primer with Applications to Orbits, Aerospace and Virtual Reality</source>
          <publisher-name>Princeton University Press</publisher-name>
          <publisher-loc>Princeton, NJ, USA</publisher-loc>
          <year>1998</year>
        </citation>
      </ref>
      <ref id="B8-information-03-00756">
        <label>8.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Hoggar</surname>
              <given-names>S.G.</given-names>
            </name>
          </person-group>
          <source>Mathematics for Computer Graphics</source>
          <publisher-name>Cambridge University Press</publisher-name>
          <publisher-loc>Cambridge, MA, USA</publisher-loc>
          <year>1992</year>
        </citation>
      </ref>
      <ref id="B9-information-03-00756">
        <label>9.</label>
        <citation citation-type="confproc">
          <person-group person-group-type="author">
            <name>
              <surname>Nitta</surname>
              <given-names>T.</given-names>
            </name>
          </person-group>
          <article-title>An extension of the back-propagation algorithm to quaternions</article-title>
          <source>Proceedings of International Conference on Neural Information Processing (ICONIP’96)</source>
          <conf-loc>Hong Kong, China</conf-loc>
          <conf-date>24-27 September 1996</conf-date>
          <volume>1</volume>
          <fpage>247</fpage>
          <lpage>250</lpage>
        </citation>
      </ref>
      <ref id="B10-information-03-00756">
        <label>10.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Arena</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Fortuna</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Muscato</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Xibilia</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Multilayer perceptronsto approximate quaternion valued functions</article-title>
          <source>Neural Netw.</source>
          <year>1997</year>
          <volume>10</volume>
          <fpage>335</fpage>
          <lpage>342</lpage>
          <pub-id pub-id-type="doi">10.1016/S0893-6080(96)00048-2</pub-id>
        </citation>
      </ref>
      <ref id="B11-information-03-00756">
        <label>11.</label>
        <citation citation-type="confproc">
          <person-group person-group-type="author">
            <name>
              <surname>Buchholz</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Sommer</surname>
              <given-names>G.</given-names>
            </name>
          </person-group>
          <article-title>Quaternionic spinor MLP</article-title>
          <source>Proceeding of 8th European Symposium on Artificial Neural Networks (ESANN 2000)</source>
          <conf-loc>Bruges, Belgium</conf-loc>
          <conf-date>26-28 April 2000</conf-date>
          <fpage>377</fpage>
          <lpage>382</lpage>
        </citation>
      </ref>
      <ref id="B12-information-03-00756">
        <label>12.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Matsui</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Isokawa</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Kusamichi</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Peper</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Nishimura</surname>
              <given-names>H.</given-names>
            </name>
          </person-group>
          <article-title>Quaternion neural network with geometrical operators</article-title>
          <source>J. Intell. Fuzzy Syst.</source>
          <year>2004</year>
          <volume>15</volume>
          <fpage>149</fpage>
          <lpage>164</lpage>
        </citation>
      </ref>
      <ref id="B13-information-03-00756">
        <label>13.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Mandic</surname>
              <given-names>D.P.</given-names>
            </name>
            <name>
              <surname>Jahanchahi</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Took</surname>
              <given-names>C.C.</given-names>
            </name>
          </person-group>
          <article-title>A quaternion gradient operator and its applications</article-title>
          <source>IEEE Signal Proc. Lett.</source>
          <year>2011</year>
          <volume>18</volume>
          <fpage>47</fpage>
          <lpage>50</lpage>
          <pub-id pub-id-type="doi">10.1109/LSP.2010.2091126</pub-id>
        </citation>
      </ref>
      <ref id="B14-information-03-00756">
        <label>14.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ujang</surname>
              <given-names>B.C.</given-names>
            </name>
            <name>
              <surname>Took</surname>
              <given-names>C.C.</given-names>
            </name>
            <name>
              <surname>Mandic</surname>
              <given-names>D.P.</given-names>
            </name>
          </person-group>
          <article-title>Quaternion-valued nonlinear adaptive filtering</article-title>
          <source>IEEE Trans. Neural Netw.</source>
          <year>2011</year>
          <volume>22</volume>
          <fpage>1193</fpage>
          <lpage>1206</lpage>
          <pub-id pub-id-type="doi">10.1109/TNN.2011.2157358</pub-id>
        </citation>
      </ref>
      <ref id="B15-information-03-00756">
        <label>15.</label>
        <citation citation-type="confproc">
          <person-group person-group-type="author">
            <name>
              <surname>Kusamichi</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Isokawa</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Matsui</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Ogawa</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Maeda</surname>
              <given-names>K.</given-names>
            </name>
          </person-group>
          <article-title>Anewschemeforcolornight vision by quaternion neural network</article-title>
          <source>Proceedings of the 2nd International Conferenceon Autonomous Robots and Agents (ICARA2004)</source>
          <conf-loc>Palmerston North, New Zealand</conf-loc>
          <conf-date>13-15 December 2004</conf-date>
          <fpage>101</fpage>
          <lpage>106</lpage>
        </citation>
      </ref>
      <ref id="B16-information-03-00756">
        <label>16.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Isokawa</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Matsui</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Nishimura</surname>
              <given-names>H.</given-names>
            </name>
          </person-group>
          <article-title>Quaternionic neural networks: Fundamental properties and applications</article-title>
          <source>Complex-Valued Neural Networks: Utilizing High-Dimensional Parameters</source>
          <person-group person-group-type="editor">
            <name>
              <surname>Nitta</surname>
              <given-names>T.</given-names>
            </name>
          </person-group>
          <publisher-name>Information Science Reference</publisher-name>
          <publisher-loc>New York, NY, USA</publisher-loc>
          <year>2009</year>
          <fpage>411</fpage>
          <lpage>439</lpage>
          <comment>Chapter XVI</comment>
        </citation>
      </ref>
      <ref id="B17-information-03-00756">
        <label>17.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Nitta</surname>
              <given-names>T.</given-names>
            </name>
          </person-group>
          <article-title>A solution to the 4-bit parity problem with a single quaternary neuron</article-title>
          <source>Neural Inf. Process. Lett. Rev.</source>
          <year>2004</year>
          <volume>5</volume>
          <fpage>33</fpage>
          <lpage>39</lpage>
        </citation>
      </ref>
      <ref id="B18-information-03-00756">
        <label>18.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Yoshida</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Kuroe</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Mori</surname>
              <given-names>T.</given-names>
            </name>
          </person-group>
          <article-title>Models of hopfield-type quaternion neural networks and their energy functions</article-title>
          <source>Int. J. Neural Syst.</source>
          <year>2005</year>
          <volume>15</volume>
          <fpage>129</fpage>
          <lpage>135</lpage>
          <pub-id pub-id-type="doi">10.1142/S012906570500013X</pub-id>
        </citation>
      </ref>
      <ref id="B19-information-03-00756">
        <label>19.</label>
        <citation citation-type="confproc">
          <person-group person-group-type="author">
            <name>
              <surname>Isokawa</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Nishimura</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Kamiura</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Matsui</surname>
              <given-names>N.</given-names>
            </name>
          </person-group>
          <article-title>Fundamental properties of quaternionic hopfield neural network</article-title>
          <source>Proceedings of 2006 International Joint Conference on Neural Networks</source>
          <conf-loc>Vancouver BC, USA</conf-loc>
          <conf-date>30 October 2006</conf-date>
          <fpage>610</fpage>
          <lpage>615</lpage>
        </citation>
      </ref>
      <ref id="B20-information-03-00756">
        <label>20.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Isokawa</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Nishimura</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Kamiura</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Matsui</surname>
              <given-names>N.</given-names>
            </name>
          </person-group>
          <article-title>Associative memoryin quaternionic hopfield neural network</article-title>
          <source>Int. J. Neural Syst.</source>
          <year>2008</year>
          <volume>18</volume>
          <fpage>135</fpage>
          <lpage>145</lpage>
          <pub-id pub-id-type="doi">10.1142/S0129065708001440</pub-id>
        </citation>
      </ref>
      <ref id="B21-information-03-00756">
        <label>21.</label>
        <citation citation-type="confproc">
          <person-group person-group-type="author">
            <name>
              <surname>Isokawa</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Nishimura</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Kamiura</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Matsui</surname>
              <given-names>N.</given-names>
            </name>
          </person-group>
          <article-title>Dynamics of discrete-time quaternionic hopfield neural networks</article-title>
          <source>Proceedings of 17th International Conference on Artificial Neural Networks</source>
          <conf-loc>Porto, Portugal</conf-loc>
          <conf-date>9-13 September 2007</conf-date>
          <fpage>848</fpage>
          <lpage>857</lpage>
        </citation>
      </ref>
      <ref id="B22-information-03-00756">
        <label>22.</label>
        <citation citation-type="confproc">
          <person-group person-group-type="author">
            <name>
              <surname>Isokawa</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Nishimura</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Matsui</surname>
              <given-names>N.</given-names>
            </name>
          </person-group>
          <article-title>On the fundamental properties of fully quaternionic hopfield network</article-title>
          <source>Proceedings of IEEE World Congress on Computational Intelligence (WCCI2012)</source>
          <conf-loc>Brisbane, Australia</conf-loc>
          <conf-date>10-15 June 2012</conf-date>
          <fpage>1246</fpage>
          <lpage>1249</lpage>
        </citation>
      </ref>
      <ref id="B23-information-03-00756">
        <label>23.</label>
        <citation citation-type="confproc">
          <person-group person-group-type="author">
            <name>
              <surname>Isokawa</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Nishimura</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Saitoh</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Kamiura</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Matsui</surname>
              <given-names>N.</given-names>
            </name>
          </person-group>
          <article-title>On the scheme of quaternionic multistate hopfield neural network</article-title>
          <source>Proceedings of Joint 4th International Conference on Soft Computing and Intelligent Systems and 9th International Symposium on Advanced Intelligent Systems (SCIS&amp;ISIS 2008)</source>
          <conf-loc>Nagoya, Japan</conf-loc>
          <conf-date>17-21 September 2008</conf-date>
          <fpage>809</fpage>
          <lpage>813</lpage>
        </citation>
      </ref>
      <ref id="B24-information-03-00756">
        <label>24.</label>
        <citation citation-type="confproc">
          <person-group person-group-type="author">
            <name>
              <surname>Isokawa</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Nishimura</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Matsui</surname>
              <given-names>N.</given-names>
            </name>
          </person-group>
          <article-title>Commutative quaternion and multistate hopfield neural networks</article-title>
          <source>Proceedings of IEEE World Congress on Computational Intelligence (WCCI2010)</source>
          <conf-loc>Barcelona, Spain</conf-loc>
          <conf-date>18-23 July 2010</conf-date>
          <fpage>1281</fpage>
          <lpage>1286</lpage>
        </citation>
      </ref>
      <ref id="B25-information-03-00756">
        <label>25.</label>
        <citation citation-type="confproc">
          <person-group person-group-type="author">
            <name>
              <surname>Isokawa</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Nishimura</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Matsui</surname>
              <given-names>N.</given-names>
            </name>
          </person-group>
          <article-title>An iterative learning schemefor multistate complex-valued and quaternionic hopfield neural networks</article-title>
          <source>Proceedings of International Joint Conference on Neural Networks (IJCNN2009)</source>
          <conf-loc>Atlanta, GA, USA</conf-loc>
          <conf-date>14-19 June 2009</conf-date>
          <fpage>1365</fpage>
          <lpage>1371</lpage>
        </citation>
      </ref>
      <ref id="B26-information-03-00756">
        <label>26.</label>
        <citation citation-type="web">
          <person-group person-group-type="author">
            <name>
              <surname>Leo</surname>
              <given-names>S.D.</given-names>
            </name>
            <name>
              <surname>Rotelli</surname>
              <given-names>P.P.</given-names>
            </name>
          </person-group>
          <article-title>Local hypercomplex analyticity</article-title>
          <year>1997</year>
          <access-date>(accessed on 20 November 2012)</access-date>
          <comment>Available online:<ext-link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://arxiv.org/abs/funct-an/9703002" ext-link-type="uri">http://arxiv.org/abs/funct-an/9703002</ext-link></comment>
        </citation>
      </ref>
      <ref id="B27-information-03-00756">
        <label>27.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Leo</surname>
              <given-names>S.D.</given-names>
            </name>
            <name>
              <surname>Rotelli</surname>
              <given-names>P.P.</given-names>
            </name>
          </person-group>
          <article-title>Quaternonic analyticity</article-title>
          <source>Appl. Math. Lett.</source>
          <year>2003</year>
          <volume>16</volume>
          <fpage>1077</fpage>
          <lpage>1081</lpage>
          <pub-id pub-id-type="doi">10.1016/S0893-9659(03)90097-8</pub-id>
        </citation>
      </ref>
      <ref id="B28-information-03-00756">
        <label>28.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Schwartz</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>Calculus with a quaternionic variable</article-title>
          <source>J. Math. Phys.</source>
          <year>2009</year>
          <volume>50</volume>
          <fpage>013523:1</fpage>
          <lpage>013523:11</lpage>
        </citation>
      </ref>
      <ref id="B29-information-03-00756">
        <label>29.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kim</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Adalı</surname>
              <given-names>T.</given-names>
            </name>
          </person-group>
          <article-title>Approximationby fully complex multilayer perceptrons</article-title>
          <source>Neural Comput.</source>
          <year>2003</year>
          <volume>15</volume>
          <fpage>1641</fpage>
          <lpage>1666</lpage>
          <pub-id pub-id-type="doi">10.1162/089976603321891846</pub-id>
        </citation>
      </ref>
      <ref id="B30-information-03-00756">
        <label>30.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wirtinger</surname>
              <given-names>W.</given-names>
            </name>
          </person-group>
          <article-title>Zur formalen theorie der funktionen von mehr komplexen veränderlichen</article-title>
          <source>Math. Ann.</source>
          <year>1927</year>
          <volume>97</volume>
          <fpage>357</fpage>
          <lpage>375</lpage>
          <pub-id pub-id-type="doi">10.1007/BF01447872</pub-id>
        </citation>
      </ref>
      <ref id="B31-information-03-00756">
        <label>31.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cybenko</surname>
              <given-names>G.</given-names>
            </name>
          </person-group>
          <article-title>Approximations by superpositions of sigmoidal functions</article-title>
          <source>Math. Control Signals Syst.</source>
          <year>1989</year>
          <volume>2</volume>
          <fpage>303</fpage>
          <lpage>314</lpage>
          <pub-id pub-id-type="doi">10.1007/BF02551274</pub-id>
        </citation>
      </ref>
      <ref id="B32-information-03-00756">
        <label>32.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Hornik</surname>
              <given-names>K.</given-names>
            </name>
          </person-group>
          <article-title>Approximation capabilities of multilayer feedforward networks</article-title>
          <source>Neural Netw.</source>
          <year>1991</year>
          <volume>4</volume>
          <fpage>215</fpage>
          <lpage>257</lpage>
        </citation>
      </ref>
      <ref id="B33-information-03-00756">
        <label>33.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Segre</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>The real representations of complex elements and extension to bicomplex systems</article-title>
          <source>Math. Ann.</source>
          <year>1892</year>
          <volume>40</volume>
          <fpage>322</fpage>
          <lpage>335</lpage>
        </citation>
      </ref>
      <ref id="B34-information-03-00756">
        <label>34.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Catoni</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Cannata</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Zampetti</surname>
              <given-names>P.</given-names>
            </name>
          </person-group>
          <article-title>An Introduction to commutative quaternions</article-title>
          <source>Adv. Appl. CliffordAlgebras</source>
          <year>2006</year>
          <volume>16</volume>
          <fpage>1</fpage>
          <lpage>28</lpage>
          <pub-id pub-id-type="doi">10.1007/s00006-006-0002-y</pub-id>
        </citation>
      </ref>
      <ref id="B35-information-03-00756">
        <label>35.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Davenport</surname>
              <given-names>C.M.</given-names>
            </name>
          </person-group>
          <article-title>A commutative hypercomplex algebra with associated function theory</article-title>
          <source>Clifford Algebra With Numeric and Symbolic Computation</source>
          <person-group person-group-type="editor">
            <name>
              <surname>Ablamowicz</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <publisher-name>Birkhauser</publisher-name>
          <publisher-loc>Boston, MA, USA</publisher-loc>
          <year>1996</year>
          <fpage>213</fpage>
          <lpage>227</lpage>
        </citation>
      </ref>
      <ref id="B36-information-03-00756">
        <label>36.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Pei</surname>
              <given-names>S.C.</given-names>
            </name>
            <name>
              <surname>Chang</surname>
              <given-names>J.H.</given-names>
            </name>
            <name>
              <surname>Ding</surname>
              <given-names>J.J.</given-names>
            </name>
          </person-group>
          <article-title>Commutative reduced biquaternions and their Fourier Transformfor signal and image processing applications</article-title>
          <source>IEEE Trans. Signal Proc.</source>
          <year>2004</year>
          <volume>52</volume>
          <fpage>2012</fpage>
          <lpage>2031</lpage>
          <pub-id pub-id-type="doi">10.1109/TSP.2004.828901</pub-id>
        </citation>
      </ref>
      <ref id="B37-information-03-00756">
        <label>37.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Hirose</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Continuous complex-valued back-propagation learning</article-title>
          <source>Electron. Lett.</source>
          <year>1992</year>
          <volume>28</volume>
          <fpage>1854</fpage>
          <lpage>1855</lpage>
          <pub-id pub-id-type="doi">10.1049/el:19921186</pub-id>
        </citation>
      </ref>
      <ref id="B38-information-03-00756">
        <label>38.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Georgiou</surname>
              <given-names>G.M.</given-names>
            </name>
            <name>
              <surname>Koutsougeras</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>Complex domain backpropagation</article-title>
          <source>IEEE Trans. Circuits Syst. II</source>
          <year>1992</year>
          <volume>39</volume>
          <fpage>330</fpage>
          <lpage>334</lpage>
          <pub-id pub-id-type="doi">10.1109/82.142037</pub-id>
        </citation>
      </ref>
    </ref-list>
  </back>
</article>
