<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="review-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Journal of Low Power Electronics and Applications</journal-id>
<journal-title>Journal of Low Power Electronics and Applications</journal-title>
<issn pub-type="epub">2079-9268</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/jlpea1010150</article-id>
<article-id pub-id-type="publisher-id">jlpea-01-00150</article-id>
<article-categories>
<subj-group>
<subject>Review</subject></subj-group></article-categories>
<title-group>
<article-title>Data-Driven Approaches for Computation in Intelligent Biomedical Devices: A Case Study of EEG Monitoring for Chronic Seizure Detection</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Verma</surname><given-names>Naveen</given-names></name><xref ref-type="aff" rid="af1-jlpea-01-00150"><sup>1</sup></xref><xref ref-type="corresp" rid="c1-jlpea-01-00150"><sup>*</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Lee</surname><given-names>Kyong Ho</given-names></name><xref ref-type="aff" rid="af1-jlpea-01-00150"><sup>1</sup></xref><xref ref-type="corresp" rid="c1-jlpea-01-00150"><sup>*</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Shoeb</surname><given-names>Ali</given-names></name><xref ref-type="aff" rid="af2-jlpea-01-00150"><sup>2</sup></xref></contrib></contrib-group>
<aff id="af1-jlpea-01-00150">
<label>1</label> Princeton University, Engineering Quadrangle B-226, Olden Street, Princeton, NJ 08544, USA</aff>
<aff id="af2-jlpea-01-00150">
<label>2</label> Massachusetts General Hospital &amp; MIT, 77 Massachusetts Avenue, Cambridge, MA 02139, USA;E-Mail: <email>ashoeb@mit.edu</email></aff>
<author-notes>
<corresp id="c1-jlpea-01-00150">
<label>*</label> Authors to whom correspondence should be addressed; E-Mails: <email>nverma@princeton.edu</email> (N.V.); <email>kyonglee@princeton.edu</email> (K.H.L.); Tel.: +1-609-258-1424.</corresp></author-notes>
<pub-date pub-type="collection">
<year>2011</year></pub-date>
<pub-date pub-type="epub">
<day>26</day>
<month>04</month>
<year>2011</year></pub-date>
<volume>1</volume>
<issue>1</issue>
<fpage>150</fpage>
<lpage>174</lpage>
<history>
<date date-type="received">
<day>25</day>
<month>11</month>
<year>2010</year></date>
<date date-type="rev-recd">
<day>13</day>
<month>04</month>
<year>2011</year></date>
<date date-type="accepted">
<day>20</day>
<month>04</month>
<year>2011</year></date></history>
<permissions>
<copyright-statement>©2011 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
<copyright-year>2011</copyright-year>
<license>
<p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>Intelligent biomedical devices implies systems that are able to detect specific physiological processes in patients so that particular responses can be generated. This closed-loop capability can have enormous clinical value when we consider the unprecedented modalities that are beginning to emerge for sensing and stimulating patient physiology. Both delivering therapy (e.g., deep-brain stimulation, vagus nerve stimulation, <italic>etc.</italic>) and treating impairments (e.g., neural prosthesis) requires computational devices that can make clinically relevant inferences, especially using minimally-intrusive patient signals. The key to such devices is algorithms that are based on data-driven signal modeling as well as hardware structures that are specialized to these. This paper discusses the primary application-domain challenges that must be overcome and analyzes the most promising methods for this that are emerging. We then look at how these methods are being incorporated in ultra-low-energy computational platforms and systems. The case study for this is a seizure-detection SoC that includes instrumentation and computation blocks in support of a system that exploits patient-specific modeling to achieve accurate performance for chronic detection. The SoC samples each EEG channel at a rate of 600 Hz and performs processing to derive signal features on every two second epoch, consuming 9 <italic>μ</italic>J/epoch/channel. Signal feature extraction reduces the data rate by a factor of over 40×, permitting wireless communication from the patient's head while reducing the total power on the head by 14×.</p></abstract>
<kwd-group>
<kwd>biomedical</kwd>
<kwd>body-area networks</kwd>
<kwd>electroencephalography</kwd>
<kwd>implantable</kwd>
<kwd>low-noise amplifiers</kwd>
<kwd>low-power electronics</kwd>
<kwd>machine-learning</kwd>
<kwd>subthreshold</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<p>Sensors and stimulators are emerging that are allowing biomedical devices to advance to new frontiers. Much of this has been initiated by innovations in biologically compliant materials and technologies, but its relevance has become apparent thanks to clinical work that has led to the conception of new <italic>systems</italic> that are beginning to take advantage of the unprecedented modalities that these technologies offer for delivering therapy and for sensing physiology. Deep-brain stimulation (DBS), for instance, involves the implantation of a stimulation electrode in targeted locations of brain as well as the use of a simple control device (similar to a pacemaker) that actuates stimulation. Devices employing DBS have shown astounding efficacy in some patient towards the treatment of a range of neurological conditions from Parkinson's disease to dystonia [<xref ref-type="bibr" rid="b1-jlpea-01-00150">1</xref>]. Vagus nerve stimulation (VNS), is an alternate approach that has been used to treat epilepsy patients; it avoids the need for brain surgery by using an implanted stimulator in the neck that is actuated either using a similar control device or through manual control by the patient [<xref ref-type="bibr" rid="b2-jlpea-01-00150">2</xref>]. Similarly, sensors are emerging that are capable of recording extremely rich physiological signals over long time periods as a part of chronic disease management (<italic>i.e.</italic>, as a long-term means of managing pathophysiologic patient states). Flexible materials, such as silk [<xref ref-type="bibr" rid="b3-jlpea-01-00150">3</xref>] and silicone polydimethylsiloxane (PDMS) [<xref ref-type="bibr" rid="b4-jlpea-01-00150">4</xref>], that are also histologically promising, provide a substrate both for electrodes and active electronics; this, for instance, forms the basis for high-resolution, multi-channel recording systems for bio-potentials that manifest on curvilinear biological surfaces (such as the cortex of the brain or the surface of cardiac tissue [<xref ref-type="bibr" rid="b3-jlpea-01-00150">3</xref>,<xref ref-type="bibr" rid="b5-jlpea-01-00150">5</xref>]).</p>
<p>In order to take advantage of such technologies in <italic>intelligent, closed-loop</italic> systems, however, we require the ability to detect specific physiological processes with a high level of accuracy. Identifying specific, clinically-relevant indicators in bio-potential signals that are available in implantable and wearable systems currently poses a dominating limitation. As we describe in this paper, the signals themselves are physiologically complex, and the correlations are both diverse and subject to noise. An important concern for chronic wearable and implantable systems is also that the computations must be achieved with very low power consumption; the total power budget for wearable devices, for instance, is 1–10 mW, and that for implantable devices is 10–100 <italic>μ</italic>W. These power levels are derived from the energy delivery options that are viable from application to application (ranging from primary and chargeable storage sources (e.g., thin-film batteries) to wireless power transfer), as described in [<xref ref-type="bibr" rid="b6-jlpea-01-00150">6</xref>]. Ultimately, the power and/or energy constraints make it essential to carefully consider the algorithmic approach and the computations required in order to identify opportunities for reducing the complexity or improving the efficiency.</p>
<p>Understanding the algorithmic approaches is also important when we consider how limited current clinical decision-support systems are in terms of accuracy. Current bedside hospital monitors, for instance, employ models that are insufficient for detecting acute events with high specificity [<xref ref-type="bibr" rid="b7-jlpea-01-00150">7</xref>]. Alarm fatigue is thus one of the most prominent problems in intensive care units [<xref ref-type="bibr" rid="b8-jlpea-01-00150">8</xref>]. Nonetheless, automatic patient monitoring remains an important aspect for managing the scale of healthcare delivery, even in the hospital setting. Beyond this, chronic out-patient monitoring (following hospital discharge) is also emerging as an important mode of healthcare [<xref ref-type="bibr" rid="b9-jlpea-01-00150">9</xref>]. In the face of limited clinical resources, it further stresses the importance for accuracy in order to manage the resulting increase in scale (<italic>i.e.</italic>, false-alarm rates must be low when monitoring large patient populations that can only be accessed through out-of-hospital response). It also places greater emphasis on the need for low-energy platforms due to the mobility required in the monitoring devices.</p>
<p>A powerful development in the domain of clinical decision support has been the emergence of <italic>data-driven</italic> modeling techniques. These refer to methods of modeling physiological signals based on observing and analyzing data. This is in contrast to attempts at modeling the underlying processes, which are exceedingly complex, and therefore tend to yield low-order models. In the remainder of this paper, we describe why data-driven techniques are a promising means to achieve the accuracy required in practical chronic biomedical devices. We then describe the computational approaches that can be used to take advantage of these. Finally, we consider a practical device that attempts to exploit data-driven modeling. An SoC for chronic seizure detection is described, illustrating the design considerations and methodologies involved. The SoC employs an architecture that performs local acquisition of patient electroencephalographs (EEGs), analog-to-digital conversion, and signal feature extraction. Local feature extraction results in a compressed signal representation for data-driven classification, making wireless transmission from the scalp viable.</p></sec>
<sec>
<label>2.</label>
<title>The Need for Data-Driven Techniques</title>
<p>Currently, clinical decision support systems rely on algorithms that employ very simple models of the physiological signals in order to detect the correlations of interest. Often, for instance, the models are based on pre-determined thresholds of specific signal parameters [<xref ref-type="bibr" rid="b10-jlpea-01-00150">10</xref>]. These parameters can typically be extracted through signal processing, and thus low-energy DSP has been the predominant computational approach in current biomedical devices [<xref ref-type="bibr" rid="b11-jlpea-01-00150">11</xref>]. In this section, however, we highlight three key challenges in biomedical detection applications that motivate a change in focus from <italic>signal processing</italic>, which involves computations to transform signals, to <italic>inference</italic>, which involves computations to model and detect specific signal correlations. This leads to the emphasis on data-driven modeling techniques.</p>
<sec>
<label>2.1.</label>
<title>Detection Challenges in Biomedical Applications</title>
<p>Clinically relevant processes that must be detected in biomedical applications are generally physiologically complex. The ability to model these, and their manifestations in the signals that are available through chronic sensing, poses a primary challenge. Thus, the concern, as we try to achieve higher accuracy, is not just the ability to extract specific signal parameters with high signal-to-noise ratio, but rather the ability to discriminate the meaning of those parameters. Although the development of physiology-based models has played an important role in understanding the underlying processes, their viability in practical detection systems is limited. First, these have typically resulted in low-order models [<xref ref-type="bibr" rid="b12-jlpea-01-00150">12</xref>]; this leads to simple threshold rules that yield limited accuracy. Second, even after the physiological processes have been modeled, the specific manifestations in the signals must also be modeled. The primarily signals of interest in low-power chronic biomedical devices are collaterally representative of the target physiological processes. As an example, neural electrophysiology appears in scalp electroencephalographs (EEG) after volume conduction through brain tissue and then signal transduction across the tissue membranes and skull that surround the brain. The physiological sources thus manifest in the signals through complex coupling mechanisms. Physiology-based models thus face severe complexities that limit the accuracy that they have been able to provide.</p>
<p>A second challenge is that that signals available through low-power chronic sensing typically represent the superposition of numerous physiological sources. Thus, the models must be able to reject these with very high specificity, usually higher than (linear) signal processing affords. In the case of EEG-based seizure detection, for instance, electrical activity from muscles (<italic>i.e.</italic>, electromyographic (EMG) activity) is a major concern. As an example, <xref ref-type="fig" rid="f1-jlpea-01-00150">Figure 1</xref> is from the case-study in [<xref ref-type="bibr" rid="b13-jlpea-01-00150">13</xref>], where amplitude-integrated EEG (AEEG) is used to monitor a neonatal patient. AEEG is a method of presenting the EEG over long time periods (using filtering, peak rectification, and semi-logarithmic compression). Using a threshold-margin-based detection approach, seizure activity was not detected in the initial period despite observing abnormal symptoms in the patient. The EEG trace, exhibited the artifact shown below, which was suspected to be EMG activity. In the following periods, the electrode position was changed, and then a muscle relaxant was administered, suppressing the artifact. At this point, the AEEG margins indicated electrographic seizure activity, consistent with the clinical symptoms. In addition to physiological interference sources, mechanical motion artifacts and coupling of electrical environmental noise are prominent concerns [<xref ref-type="bibr" rid="b14-jlpea-01-00150">14</xref>]. Although these degrade the performance of detection algorithms that are based on simplistic rules, data-driven algorithms that explicitly model artifact-corrupted signals can retain their accuracy and thus mitigate the need for further instrumentation and/or signal processing to suppress the artifacts [<xref ref-type="bibr" rid="b15-jlpea-01-00150">15</xref>].</p>
<p>The third challenge is that efficient methods for developing and adapting the detection models are required. This can play a critical role in improving accuracy by allowing patient-specific factors to be more readily incorporated. The importance of patient-specific factors is seen across applications. In [<xref ref-type="bibr" rid="b10-jlpea-01-00150">10</xref>], even very simple adaption of model thresholds based on sparse sampling from a target patient group shows to yield substantial performance improvement for cardiac monitoring applications. Adaption has shown to be beneficial even when very rich models are possible. For instance, a great deal of clinical work has gone into characterizing the typical behavior exhibited in a patient's EEG during a seizure. Most importantly, a characteristic frequency range of spectral activity has been identified as a strong indicator [<xref ref-type="bibr" rid="b16-jlpea-01-00150">16</xref>]. <xref ref-type="fig" rid="f2-jlpea-01-00150">Figure 2</xref>, however, shows two actual EEG bursts from a patient. Although, no clinical seizure corresponds with the burst in <xref ref-type="fig" rid="f2-jlpea-01-00150">Figure 2a</xref>, the electrographic activity has the rhythmic behavior typically associated with a seizure. On the other hand, <xref ref-type="fig" rid="f2-jlpea-01-00150">Figure 2b</xref> corresponds to a clinical seizure, and, although its rhythmic behavior is consistent with that typically exhibited during seizures, it is more specific, both in terms of the spectral content and in terms of the spatial EEG channels that are excited. Thus, modeling the background and seizure characteristics, and doing this on a patient-by-patient basis improves the accuracy of seizure detection [<xref ref-type="bibr" rid="b17-jlpea-01-00150">17</xref>]. The ability to efficiently and systematically develop new models is therefore a key aspect for practical detection systems.</p></sec>
<sec>
<label>2.2.</label>
<title>Exploiting the Availability of Medical Data</title>
<p>With respect to the challenges faced in biomedical detection applications, two important developments have recently emerged. The first is the availability of patient data in the healthcare domain, and the second the advancement of machine-learning techniques for data-analysis. Hospitals today widely employ data-bases where patient signals recorded from bedside monitors (e.g., EEG, ECG, <italic>etc.</italic>) are logged along with observational and/or diagnostic annotations [<xref ref-type="bibr" rid="b12-jlpea-01-00150">12</xref>,<xref ref-type="bibr" rid="b18-jlpea-01-00150">18</xref>]. As a result, a large amount of data is available, corresponding to patients with a broad range of clinical conditions. Practices for annotating the datasets are also improving, leading to high-quality, clinically-relevant labels are being directly applied to the signals [<xref ref-type="bibr" rid="b18-jlpea-01-00150">18</xref>]. A technological advancement of importance is low-power recording technologies that can provide long-term access to the same physiologically-indicative signals recorded by hospital bedside monitors [<xref ref-type="bibr" rid="b14-jlpea-01-00150">14</xref>]. Through these, biomedical devices can directly take advantage of the annotated databases without the need for supplementary models to correlate with alternate signals.</p>
<p>Machine-learning techniques provide powerful methods for exploiting the large-scale availability of patient data. These have particularly advanced thanks to applied work related to physiological data analysis [<xref ref-type="bibr" rid="b19-jlpea-01-00150">19</xref>]. More specifically for chronic biomedical devices, the <italic>data-modeling</italic> and <italic>classification</italic> methods (<italic>i.e.</italic>, methods for applying the models) have seen substantial development [<xref ref-type="bibr" rid="b20-jlpea-01-00150">20</xref>]. The efficiency with which machine-learning techniques allow the development of quantitative models from large datasets, means that patient-specific models can be readily developed as patient-specific data becomes available, and thus a high degree of specificity can be achieved. In [<xref ref-type="bibr" rid="b17-jlpea-01-00150">17</xref>], for instance, a patient-specific seizure detector achieves a sensitivity of 93%, a specificity of 0.3 ± 0.7 false alarms per hour, and a latency of 6.77 ± 3.0 s, compared to 66%, 2.0 ± 5.3 false alarms per hour, and 30.1 ± 15 s for a patient-generic detector.</p>
<p>The generality of the modeling and classification methods means that they can be applied across a wide range of biomedical applications. <xref ref-type="fig" rid="f3-jlpea-01-00150">Figure 3</xref> shows the generic structure of machine-learning-based detection algorithms. The seizure detection application is based on [<xref ref-type="bibr" rid="b21-jlpea-01-00150">21</xref>], where a sensitivity of 96%, a specificity of 2 false detections per day, and a latency of 3.0 s is achieved across 24 patients, and the arrhythmia beat classification application is based on [<xref ref-type="bibr" rid="b22-jlpea-01-00150">22</xref>], where a sensitivity above 75% and a false positive rate below 1.5% is achieved. The two primary aspects of the algorithms are (1) biomarker extraction and (2) biomarker interpretation. Biomarkers correspond to defined parameters in the signals that are determined to have some correlation with the target physiological patient state. The precise correlation, however, is determined by constructing a model of the biomarkers by employing a training dataset. Real-time interpretation of the biomarkers is then achieved by applying that model using a classifier. The classifier thus requires a training phase; this, however, does not face real-time detection-throughput constraints and can be performed offline, either once or periodically to track slowly evolving physiology [<xref ref-type="bibr" rid="b23-jlpea-01-00150">23</xref>].</p>
<p>By treating the biomarkers as a feature vector, a machine-learning classifier can be used. Various options for machine-learning-based classification are available, and these offer the possibility to efficiently scale the models to handle a large number of features with diverse correlations. The following section focuses on classification approaches that can be applied in biomedical devices. Biomarkers, however, are closely coupled to the clinical factors, and therefore their precise choice varies from application to application. For instance, in <xref ref-type="fig" rid="f3-jlpea-01-00150">Figure 3</xref>, two applications are considered. For seizure detection, information is contained in the spectral content of the patient's electroencephalograph (EEG), while for cardiac arrhythmia detection information is contained in the waveform morphology of the patient's electrocardiograph (ECG); the biomarkers are thus chose accordingly. Even within an application such as arrhythmia detection, which is associated with a range of cardiovascular conditions, various biomarkers have been used based on the physiological factors they represent [<xref ref-type="bibr" rid="b22-jlpea-01-00150">22</xref>]. Finally, the optimal biomarkers can also be patient dependant; for instance, seizures in some patients have exhibited notable correlation with cardiac beat-rate in addition to EEG activity. Thus, detection accuracy improves when beat-rate features (extracted from the ECG) are included [<xref ref-type="bibr" rid="b21-jlpea-01-00150">21</xref>]. As a result, programmable platforms that can support a wide range of biomarker extraction processing can play an important role.</p></sec></sec>
<sec>
<label>3.</label>
<title>Data-Driven Classification Frameworks</title>
<p>Biomedical signals are non-stationary; as mentioned, they reflect the state of numerous dynamic physiological processes. Thus, an approach based on simply detecting changes in the signal characteristics is not sufficient. Instead, biomedical devices must overlook <italic>physiologic</italic> signal changes, and recognize specific <italic>pathophysiologic</italic> signal changes that are expressed due to a dynamic disease state that is of interest. As an example, the EEG trace (over 18 channels) in <xref ref-type="fig" rid="f4-jlpea-01-00150">Figure 4</xref> exhibits a burst at the 2994 s mark corresponding to the start of an epileptic seizure. This is characterized by rhythmic activity (most prominent on the FP2-F4 and T8-P8 channels). Earlier rhythmic activity, for instance that between the 2989–2992 marks, is distinct from typical background activity. Nonetheless, it in fact represents a normal characteristic of sleep EEG (known as a <italic>spindle</italic>), and should be differentiated by a detection device.</p>
<p>Biomedical signals can also exhibit a wide range of characteristics within both the physiologic and pathophysiologic states. For instance, in the physiologic state, EEG exhibits different characteristics depending on whether an individual is awake or asleep. Similarly, in the pathophysiologic state, the EEG observed at the onset, middle, and end of a seizure may be very different [<xref ref-type="bibr" rid="b24-jlpea-01-00150">24</xref>]. Depending on the application, however, a system may be required to differentiate between the physiologic and pathophysiologic states, or to discriminate among various sub-states within each of these two states. Differentiating between the physiologic and pathophysiologic states of a signal amounts to solving a binary classification problem. A binary classifier aims to label an observation as a member of one of two classes in a manner that optimizes a chosen performance objective; for example, this could be minimization of the misclassification probability. In the seizure onset detection problem, the observation is an EEG epoch (<italic>i.e.</italic>, some window of time) and the two classes are seizure (Class <italic>C</italic><sub>1</sub>) and non-seizure (Class <italic>C</italic><sub>2</sub>).</p>
<p>The non-seizure class, in actual fact, includes EEG gathered during awake, sleep, and ambulatory periods. Discriminating among the sub-states of the physiologic and pathophysiologic states amounts to solving a multi-class classification problem. However, since the most powerful and robust classification algorithms available are formulated within the setting of binary classification, multi-class classification problems are often solved using a collection of binary classifiers [<xref ref-type="bibr" rid="b25-jlpea-01-00150">25</xref>]. As an example, consider differentiating between <italic>K</italic> classes using binary classification. One approach, known as <italic>one vs. all</italic>, involves combining the outputs of <italic>K</italic> classifiers, each of which discriminates between a class (e.g., <italic>C</italic><sub>1</sub>) and the union of the remaining <italic>K</italic> − 1 classes (e.g., <italic>C</italic><sub>2</sub> ∪ <italic>C</italic><sub>3</sub> ∪ … <italic>C<sub>K</sub></italic>).</p>
<p>For classification, rather than directly using the observed signal, the signal is represented by biomarkers that correspond to salient signal properties. This generically reduces the complexity of the observation without attempting to discriminate the data, and it creates a representation that is appropriate for the specific task of model-based classification via the formalizations described below. Namely, the observation is converted into a set of <italic>features</italic>. The choice of features strongly affects the success of a binary classifier. Ideally, the distributions of the feature values should be as distinct as possible for observation belonging to the different classes. Then, by letting a feature be denoted by <italic>x<sub>i</sub></italic> and a vector of features by <italic>x</italic>⃗ = [<italic>x</italic><sub>1</sub> <italic>x<sub>2</sub></italic> <italic>x<sub>3</sub></italic> <italic>x<sub>k</sub></italic>], we can represent a binary classifier as a function that maps a feature vector <italic>x</italic>⃗ to Class <italic>C</italic><sub>1</sub> or <italic>C</italic><sub>2</sub>. More succinctly
<disp-formula id="FD1">
<label>(1)</label>
<mml:math id="mm1" display="block">
<mml:semantics id="sm1">
<mml:mrow>
<mml:mi>F</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>→</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow>
<mml:mo>}</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>Several techniques exist for determining the mapping function. Below, we discuss the merits of the generative and discriminative approaches. While the generative approach may provide classification outputs that indicate class membership and <italic>degree</italic> of membership, the discriminative approach simply aims to provide an indication of class membership; this is both sufficient for many of the medical devices envisioned, and, as described below, it is more viable for model construction based on the data typically available [<xref ref-type="bibr" rid="b26-jlpea-01-00150">26</xref>,<xref ref-type="bibr" rid="b27-jlpea-01-00150">27</xref>].</p>
<sec>
<label>3.1.</label>
<title>Discriminative Approach</title>
<p>Discriminative approaches aim to derive a <italic>decision boundary</italic> or boundaries that divide the feature space into regions containing feature vectors from the same class. A powerful algorithm for determining decision boundaries is the support-vector machine (SVM) algorithm [<xref ref-type="bibr" rid="b28-jlpea-01-00150">28</xref>]. In its simplest form, the SVM algorithm determines a decision boundary in the form of a hyper-plane [<xref ref-type="bibr" rid="b29-jlpea-01-00150">29</xref>]. The hyper-plane is chosen to maximize the classification margin, which is the geometric distance between the hyper-plane and the boundary data instances for each class that are observed during training [<xref ref-type="bibr" rid="b30-jlpea-01-00150">30</xref>]. The chosen boundary cases are known as the support vectors.</p>
<p>As an example, <xref ref-type="fig" rid="f5-jlpea-01-00150">Figure 5a</xref> shows the linear decision boundary separating intra-cranial EEG (IEEG) seizure feature vectors (circles) from non-seizure feature vectors (crosses). Here, the <italic>x</italic> and <italic>y</italic>-axis features represent the energy in the spectral band 8 ± 8 Hz and 25 ± 11 Hz, respectively. The seizure feature vectors are distributed into a cluster that lies away from the non-seizure data as well as several sparse points that lie in close proximity to the non-seizure data. The seizure vectors closest to the non-seizure vectors are associated with the onset period while those further away are associated with later stages of the seizure (e.g., the vectors numbered 1–5 are extracted from the first 7 s of a seizure). It can be seen that the two classes are not well separated by a linear decision boundary, as numerous non-seizures data points are classified as seizure feature vectors. SVMs, however, allow nonlinear boundaries to be evoked through a kernel transformation, improving the classifier performance (by, for instance, reducing misclassification of the non-seizure feature vectors, as shown in <xref ref-type="fig" rid="f5-jlpea-01-00150">Figure 5b</xref>).</p>
<p>The mapping function that establishes whether a feature vector falls within the region of the feature space defined by seizure activity is expressed below for the case of a linear boundary. Here, the vector <italic>W</italic> (which is normal to the separating hyper-plane) and the bias term <italic>b</italic> are parameters determined by the SVM learning algorithm.</p>
<p>
<disp-formula id="FD2">
<label>(2)</label>
<mml:math id="mm2" display="block">
<mml:semantics id="sm2">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>,</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>if</mml:mtext>
<mml:mspace width="0.2em"/>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mi>T</mml:mi></mml:msup>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo>+</mml:mo>
<mml:mi>b</mml:mi>
<mml:mo>≥</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>,</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>if</mml:mtext>
<mml:mspace width="0.2em"/>
<mml:msup>
<mml:mi>W</mml:mi>
<mml:mi>T</mml:mi></mml:msup>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo>+</mml:mo>
<mml:mi>b</mml:mi>
<mml:mo>&lt;</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>A highly flexible nonlinear boundary can be achieved using a radial-basis function kernel, where the discriminant function takes the form shown in the equation below. The coefficient <italic>α<sub>i</sub></italic>, the support-vectors <italic>x</italic>⃗<italic><sub>i</sub></italic> and the bias term <italic>b</italic> are determined by the SVM learning algorithm based on <italic>observed</italic> data. The parameter <italic>γ</italic>, which controls how tightly the nonlinear boundary circumscribes a class, is user-defined.
<disp-formula id="FD3">
<label>(3)</label>
<mml:math id="mm3" display="block">
<mml:semantics id="sm3">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>,</mml:mo>
<mml:mtext>if</mml:mtext>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>N</mml:mi></mml:msubsup>
<mml:mrow>
<mml:msub>
<mml:mi>α</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mo>γ</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:msup>
<mml:mo>+</mml:mo>
<mml:mi>b</mml:mi>
<mml:mo>≥</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>2</mml:mi></mml:msub>
<mml:mo>,</mml:mo>
<mml:mtext>if</mml:mtext>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>N</mml:mi></mml:msubsup>
<mml:mrow>
<mml:msub>
<mml:mi>α</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mo>γ</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:msup>
<mml:mo>+</mml:mo>
<mml:mi>b</mml:mi>
<mml:mo>&lt;</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>The number of support-vectors <italic>N</italic> is strongly affected by the complexity of the classification task. As the similarity between the physiologic and pathophysiologic data increases, more support-vectors may be needed to define a more complex decision boundary, and the computational cost scales accordingly.</p>
<p>Nonetheless, the SVM algorithm is well suited for biomedical detection tasks because its learning mechanism focuses on determining a decision boundary by identifying boundary data cases from among those that are <italic>observed</italic> for each class. This readily allows the generation of the models. It has also been shown to perform well in high-dimensionality classification problems [<xref ref-type="bibr" rid="b31-jlpea-01-00150">31</xref>] and in settings where one class has a much smaller number of training samples, as is often the case for the pathophysiologic class [<xref ref-type="bibr" rid="b32-jlpea-01-00150">32</xref>].</p></sec>
<sec>
<label>3.2.</label>
<title>Generative Approach</title>
<p>In contrast to discriminative approaches (which focus on boundary data cases), generative approaches first model the probability distributions of each class and then label an observation as a member of the class whose model is most likely to have generated an observation [<xref ref-type="bibr" rid="b26-jlpea-01-00150">26</xref>]. Mathematically, this logic is represented as follows [<xref ref-type="bibr" rid="b33-jlpea-01-00150">33</xref>]:
<disp-formula id="FD4">
<label>(4)</label>
<mml:math id="mm4" display="block">
<mml:semantics id="sm4">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>,</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>if</mml:mtext>
<mml:mspace width="0.2em"/>
<mml:mi>P</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>≥</mml:mo>
<mml:mi>P</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>,</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>if</mml:mtext>
<mml:mspace width="0.2em"/>
<mml:mi>P</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>&lt;</mml:mo>
<mml:mi>P</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>|</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>P(<italic>C<sub>i</sub></italic>| <italic>x</italic>⃗), which is known as the <italic>a posteriori</italic> probability, is the probability that the source of the observation is the class <italic>C<sub>i</sub></italic> given that we have observed the feature vector <italic>x</italic>⃗. The <italic>a posteriori</italic> probability of a class is typically not known. It can, however, be computed using Bayes' Rule. Bayes' Rule formulates the <italic>a posteriori</italic> probability in terms of probability functions and densities that can be more readily estimated from data:
<disp-formula id="FD5">
<label>(5)</label>
<mml:math id="mm5" display="block">
<mml:semantics id="sm5">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">|</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo>|</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>P</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></disp-formula>The <italic>prior P</italic>(<italic>C<sub>i</sub></italic>) is the probability that observations originate from the class <italic>C<sub>i</sub></italic>. The <italic>likelihood function f</italic>(<italic>x</italic>⃗|<italic>C<sub>i</sub></italic>) describes how likely it is to observe the feature vector <italic>x</italic>⃗ assuming that the source is the class <italic>C<sub>i</sub></italic>. A tractable and flexible choice for <italic>f</italic>(<italic>x</italic>⃗|<italic>C<sub>i</sub></italic>) is a mixture of gaussians:
<disp-formula id="FD6">
<label>(6)</label>
<mml:math id="mm6" display="block">
<mml:semantics id="sm6">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo stretchy="false">|</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>K</mml:mi></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mi>π</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mi mathvariant="script">N</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>μ</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mo>Λ</mml:mo>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo></mml:mrow>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi>i</mml:mi></mml:mrow>
<mml:mi>K</mml:mi></mml:munderover>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>π</mml:mi>
<mml:mi>j</mml:mi></mml:msub></mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi>π</mml:mi>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn></mml:mrow></mml:msup>
<mml:mo stretchy="false">|</mml:mo>
<mml:msub>
<mml:mo>Λ</mml:mo>
<mml:mi>j</mml:mi></mml:msub>
<mml:msup>
<mml:mo stretchy="false">|</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfrac></mml:mrow>
<mml:msup>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:mfrac>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>μ</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mi>T</mml:mi></mml:msup>
<mml:msubsup>
<mml:mo>Λ</mml:mo>
<mml:mi>j</mml:mi>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msubsup>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>μ</mml:mi>
<mml:mi>j</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>The parameters of the mixture model (number of gaussian densities <italic>K</italic> and the weight <italic>π<sub>j</sub></italic>, mean <italic>μ<sub>j</sub></italic>, and covariance matrix Λ<italic><sub>j</sub></italic> of each gaussian density) can be estimated using training feature vectors derived from the class <italic>C<sub>i</sub></italic> by using the Expectation Maximization (EM) algorithm [<xref ref-type="bibr" rid="b34-jlpea-01-00150">34</xref>]. As an example, consider the scalp EEG seizure (circles) and non-seizure (crosses) feature vectors shown in <xref ref-type="fig" rid="f6-jlpea-01-00150">Figure 6a</xref>. These feature vectors were originally 126 dimensional but have been projected onto a 2 dimensional space through principal components analysis for illustration. <xref ref-type="fig" rid="f6-jlpea-01-00150">Figure 6b</xref> shows the likelihood function for the non-seizure class estimated using the EM algorithm with a mixture of 3 gaussians; notice how the gaussian mixture facilitates the approximation of the multimodal, non-gaussian distribution of the non-seizure class. In general, it is important to remember that reliably estimating the parameters of a gaussian mixture model in the original high-dimensional feature space (which may be necessary in order to model the relevant physiology) is much more challenging and requires an abundance of data; this challenge is often referred to as the <italic>curse of dimensionality</italic>. In the biomedical context, gathering an abundance of data can be difficult, especially for the pathophysiologic class.</p>
<p>Using the prior and likelihood function we can rewrite the <italic>a posteriori</italic> probability and mapping function as follows:
<disp-formula id="FD7">
<label>(7)</label>
<mml:math id="mm7" display="block">
<mml:semantics id="sm7">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>if</mml:mtext>
<mml:mspace width="0.2em"/>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo>|</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>P</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>≥</mml:mo>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo>|</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>P</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>if</mml:mtext>
<mml:mspace width="0.2em"/>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo>|</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>P</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>&lt;</mml:mo>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo>|</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mi>P</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula></p>
<p>If the prior probabilities <italic>P</italic>(<italic>C<sub>i</sub></italic>) are not known, then the above decision rule can be reduced to comparing the ratio of the likelihood functions to a threshold λ. The value of λ is determined experimentally (or analytically if mathematically tractable forms of <italic>f</italic>(<italic>x</italic>⃗|<italic>C<sub>i</sub></italic>) are chosen [<xref ref-type="bibr" rid="b33-jlpea-01-00150">33</xref>]) using training data to yield a desired probability of true and false detections:
<disp-formula id="FD8">
<label>(8)</label>
<mml:math id="mm8" display="block">
<mml:semantics id="sm8">
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>,</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>if</mml:mtext>
<mml:mspace width="0.2em"/>
<mml:mfrac>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo>|</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo>|</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac>
<mml:mo>≥</mml:mo>
<mml:mo>λ</mml:mo></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>,</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtext>if</mml:mtext>
<mml:mspace width="0.2em"/>
<mml:mfrac>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo>|</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>→</mml:mo></mml:mover>
<mml:mo>|</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac>
<mml:mo>&lt;</mml:mo>
<mml:mo>λ</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>From the above decision rule, we can derive a boundary that separates the values of <italic>x</italic>⃗ where <italic>F</italic>(<italic>x</italic>⃗) = <italic>C<sub>1</sub></italic> from the values of <italic>x</italic>⃗ where <italic>F</italic>(<italic>x</italic>⃗) = <italic>C<sub>2</sub></italic>. As in the case of a discriminative classifier, this decision boundary can be a line, an open curve, or a closed curve depending on the form of the term <italic>f</italic>(<italic>x</italic>⃗|<italic>C<sub>i</sub></italic>). Beyond this, however, we can also use the likelihood functions <italic>f</italic>(<italic>x</italic>⃗|<italic>C<sub>i</sub></italic>) to assess an observation's degree membership in the class <italic>C<sub>i</sub></italic>. Unlike a discriminative classifier, which must assign an observation to a class, a generative classifier can thus use the degree of membership of an observation to implement a <italic>reject option</italic>. This can be used to state that an observation is unlikely to belong to either class whenever values of <italic>f</italic>(<italic>x</italic>⃗|<italic>C<sub>i</sub></italic>) are very small. Such an option can be useful when classifying signals exhibiting many regimes peripheral to those of concern. This arises in the context of brain-machine interface applications [<xref ref-type="bibr" rid="b35-jlpea-01-00150">35</xref>].</p>
<p>For illustration, <xref ref-type="fig" rid="f7-jlpea-01-00150">Figure 7</xref> shows the decision boundary resulting from applying the above (generative) decision rule with <italic>f</italic>(<italic>x</italic>⃗|<italic>C<sub>seizure</sub></italic>) modeled using a single gaussian<bold>,</bold> <italic>f</italic>(<italic>x</italic>⃗|<italic>C<sub>nonseizure</sub></italic>) modeled using a mixture of 3 gaussians, and λ = 30. A smaller λ would result in a wider boundary that includes more seizure feature vectors, and potentially, more non-seizure feature vectors. Also shown in <xref ref-type="fig" rid="f7-jlpea-01-00150">Figure 7</xref> is the boundary resulting from using a discriminative classifier (SVM using radial basis function with γ = 3). Note how the model assumption of a single guassian for <italic>f</italic>(<italic>x</italic>⃗|<italic>C<sub>seizure</sub></italic>) forces the decision boundary to be an ellipsoid, while the SVM results in a more curved decision boundary that encloses more seizure feature vectors. However, when there is confidence that the proposed form of <italic>f</italic>(<italic>x</italic>⃗|<italic>C<sub>i</sub></italic>) describes the data well, a generative model may outperform a discriminative model [<xref ref-type="bibr" rid="b26-jlpea-01-00150">26</xref>].</p>
<p>In summary, the discriminative approach can be more readily applied to biomedical applications characterized by high-dimensional feature vectors and limited training data; this is thanks to the limited characterization required of the statistics underlying the data. However, when information about the statistics is available (thanks to sufficient training data), the generative approach can provide models that produce accurate results for classification <italic>as well as</italic> degree of class membership.</p></sec></sec>
<sec>
<label>4.</label>
<title>Case Study: Chronic Seizure Detection System</title>
<p>Since data-driven techniques introduce distinct benefits for addressing the challenges mentioned in Section 2.1, we are motivated to incorporate these in practical biomedical devices. In this context, system- and application-level considerations for the devices also become important. For instance, recognizing the energy limitations and obtrusiveness of instrumenting patients with many sensing channels, SVM-based approaches have been investigated that minimize the number of electrodes from which continuous recording is required. In the seizure detection system of [<xref ref-type="bibr" rid="b36-jlpea-01-00150">36</xref>], coarse detection is thus performed continuously using only 60% (on average) EEG channels; the resulting detector misses only 7 out of 143 seizures, has a detection latency of 7.0 s, and achieves a false alarm rate of 0.11 per hour (compared to 4 out of 143, 6.0 s, and 0.07 per hour, respectively, for a full 18-channel system). In addition, low-energy devices that exploit the topology of the analog instrumentation front end to also directly extract typical biomarkers have been reported for EEG-sensing applications [<xref ref-type="bibr" rid="b37-jlpea-01-00150">37</xref>] and ECG sensing applications [<xref ref-type="bibr" rid="b38-jlpea-01-00150">38</xref>]. Anticipating the need for low-energy classification, studies have also been pursued investigating simplified classifier implementations that reduce computational complexity through the use of approximate models (e.g., employing a reduced set of support vectors for SVM classification in seizure detection [<xref ref-type="bibr" rid="b39-jlpea-01-00150">39</xref>]); although this compromises the classification model, limiting generality and accuracy, it can lead to low-power implementations.</p>
<p>This section describes details of an SoC designed at the circuit and system levels to take advantage of data-driven modeling for chronic seizure onset detection. The SoC incorporates a low-noise instrumentation amplifier for EEG acquisition, an ADC, and a feature extraction processor for SVM-based classification. Each SoC corresponds to one EEG channel, and the number of channels required ranges from 2–18 depending on the patient. Seizure onset detection can serve to generate signals to alert the patient and/or caregivers, control automatic data-logging and charting systems, or actuate therapeutic stimulators in a closed-loop manner (e.g., vagus-nerve stimulators [<xref ref-type="bibr" rid="b17-jlpea-01-00150">17</xref>]).</p>
<p>The system-level partitioning of the detector is driven by patient safety considerations. Namely, no cables originating from the patient's scalp are permitted since these can pose a strangulation hazard in case the patient convulses during a seizure. Additionally, the battery size and weight on the patient's head and neck must be minimal in order to reduce the risk of injury in case the patient loses balance. To minimize energy consumption on the scalp, the SoC performs EEG acquisition and feature-extraction processing locally, but leaves the classification computation for a device worn away from the patient's head. As described below, local feature extraction plays an important role in reducing the energy of wireless communication from the head, mitigating the need in this application for further data-rate reduction through classification; however, the form factor and/or implantation protocols in future systems can make the integration of classifiers compelling.</p>
<sec>
<label>4.1.</label>
<title>Algorithm and System Design</title>
<p>Clinical studies have determined that seizure onset information is contained in the spectral energy distribution of a patient's EEG [<xref ref-type="bibr" rid="b16-jlpea-01-00150">16</xref>]. The SoC thus extracts biomarkers corresponding to the spectral energy distribution in seven frequency bins from each EEG channel. The seizure detection algorithm in <xref ref-type="fig" rid="f3-jlpea-01-00150">Figure 3</xref>, summarizes the overall processing employed in the system. The seven features from each channel are combined to form a total feature vector of up to 126 dimensions (since as many as 18 channels may be used). For continuous seizure detection, each feature vector is classified in real-time by a pre-trained SVM (utilizing a radial-basis function kernel). The SVM is trained based on patient-specific data, and this is verified to yield substantial improvement in sensitivity and specificity compared to patient-generic detectors (testing is based on 536 hours of patient data across 16 patients from Boston Children's Hospital [<xref ref-type="bibr" rid="b17-jlpea-01-00150">17</xref>]).</p>
<p>As mentioned, wireless transmission of information from the scalp sensors is essential to avoid the presence of cables. <xref ref-type="table" rid="t1-jlpea-01-00150">Table 1</xref> considers the total power for a full-scale, 18-channel EEG monitoring system, illustrating the opportunity afforded by local biomarker (feature) extraction as compared to complete wireless EEG transmission. In the case of wireless EEG, the radio transmitter power dominates (assuming a low-power Zigbee radio [<xref ref-type="bibr" rid="b40-jlpea-01-00150">40</xref>]). Local feature extraction, however, reduces the transmission data-rate by a factor of over forty at the cost of negligible digital computation power. As a result, the total system power on the scalp is reduced by a factor of fourteen. The remaining radio power is no longer dominant, and thus further data-rate reduction is not essential. As a result, the classification computation is left to be performed away from the scalp where the battery size and weight constraints are less severe. A direct implementation of the SVM classification kernel on an MSP430 processor consumes approximately 4 mW.</p></sec>
<sec>
<label>4.2.</label>
<title>Low-Noise EEG Acquisition</title>
<p>As shown in <xref ref-type="table" rid="t1-jlpea-01-00150">Table 1</xref>, the power consumption of the instrumentation amplifier is an important concern. The amplifier employed thus aims to maximize power efficiency by operating at a low supply voltage (&lt;1 V) and by using a voltage-feedback topology where a minimum number of noise contributing devices are required. For a chronic wearable system, robustness to signal interferences (such as 60 Hz environmental coupling, motion artifacts on the electrodes, <italic>etc.</italic>) are also primary concerns. Since data-driven modeling affords a high detection specificity with respect to the biomarkers, some degree of signal interference is tolerated. Nonetheless, stability in the acquisition system is critical so that the learned correlations remain valid during real-time detection.</p>
<p><xref ref-type="fig" rid="f8-jlpea-01-00150">Figure 8</xref> summarizes the key instrumentation considerations, which include 1/<italic>f</italic> and wideband device noise sources, 60 Hz coupling, interfering electrographic scalp activity (which may be correlated between recording and reference electrodes), and the electrode offsets. For the electrodes, low-cost Ag/AgCl electrodes, which are the most common, are presumed. In the equivalent circuit model shown, signal transduction is dominated by the capacitor (<italic>C<sub>P</sub></italic>) and the resistor (<italic>R<sub>S</sub></italic>). For on-scalp electrodes, the worst case value of these can be as low as 50 nF and as high as 2 MΩ, respectively [<xref ref-type="bibr" rid="b42-jlpea-01-00150">42</xref>]. The input impedance of the instrumentation amplifier is constrained accordingly (to have much lower capacitance and much higher resistance). In addition, transduction through <italic>C<sub>P</sub></italic> requires the formation of a charge double layer at the electrode surface, which causes a low-frequency electrode offset voltage (EOV). The EOV can be as high as 100 mV, posing the possibility of easily saturating a low-voltage design.</p>
<p><xref ref-type="fig" rid="f9-jlpea-01-00150">Figure 9</xref> show the architecture and first stage of the instrumentation amplifier. Chopper stabilization [<xref ref-type="bibr" rid="b43-jlpea-01-00150">43</xref>] is used to handle low-frequency (1/<italic>f</italic>) noise, and input capacitors (<italic>C<sub>IN</sub></italic>) are used to passively isolate large electrode offsets from the active portion of the low-voltage amplifier. Several other implementations of chopper stabilized EEG amplifiers have recently been reported. In [<xref ref-type="bibr" rid="b44-jlpea-01-00150">44</xref>], input capacitors are also used, but chopper modulation is performed before these. This yields very high common-mode rejection ratio by mitigating the effect of capacitor mismatch [<xref ref-type="bibr" rid="b14-jlpea-01-00150">14</xref>], but it limits the EOV tolerable and degrades the amplifier's input resistance (compromising the ability to handle large electrode impedances in on-scalp sensing applications). Alternatively, the design in [<xref ref-type="bibr" rid="b45-jlpea-01-00150">45</xref>] uses current-feedback to handle EOV while maintaining very high input impedance through the use of input transconductors. The magnitude of EOV cancelation achievable through active feedback, however, is limited in low supply voltage designs, and the use of current-feedback transconductors degrades the noise efficiency somewhat.</p>
<p>In the topology of <xref ref-type="fig" rid="f9-jlpea-01-00150">Figure 9b</xref>, the voltage-feedback is retained for noise efficiency, but input chopper modulation is performed after the input capacitors, at the virtual ground node. Although this degrades the resistance between the op-amp nodes, the input resistance seen by the electrodes remains high since feedback ensures that electrodes do not need to cause substantial voltage swing at the virtual ground node. As shown, however, input offset voltage in the op-amp does cause an offset current (<italic>I<sub>OS,CHOP</sub></italic>) between its input nodes. To avoid <italic>I<sub>OS,CHOP</sub></italic> saturating the amplifier, the <italic>G<sub>M</sub></italic> − <italic>C</italic> servo loop is used in feedback, providing a cancelation current. Further details of the amplifier, including noise analysis, are provided in [<xref ref-type="bibr" rid="b41-jlpea-01-00150">41</xref>].</p>
<p>The ADC used to digitize the instrumentation amplifier output is a differential, 12-bit SAR converter (circuit details are provided in [<xref ref-type="bibr" rid="b46-jlpea-01-00150">46</xref>]). The resolution was selected based on algorithm performance from testing (in a manner similar to the feature-extraction parameter analysis described below). It was determined that detection performance improved up to 10-bits; thus, the 12-bit ADC with 10.6 effective number of bits (ENOB) was employed. The instrumentation amplifier has a bandwidth of 200 Hz (required for generic EEG acquisition). In order to ease the anti-aliasing filtering requirements in the instrumentation front-end, the ADC samples at 600 Hz, since (as shown in <xref ref-type="table" rid="t1-jlpea-01-00150">Table 1</xref>) its power consumption is not a dominating concern.</p></sec>
<sec>
<label>4.3.</label>
<title>Feature-extraction Processing</title>
<p>Although the importance of spectral energy biomarkers in the EEG is well established for seizure onset detection, the precise parameters required for their representation (assuming an SVM classifier) is not well known. In order to determine this, analysis is pursued to evaluate the performance of the algorithm over the parameter space. The parameters considered include the dynamic range of the EEG samples, the precision of the feature-extraction computation, the bin resolution of the spectral energy distribution, and the frequency response of the spectral analysis filters. Due the data-driven nature of the detection algorithm employed, its performance over the parameter space depends on the characteristics of the data. Thus, the analysis is approached by testing the performance using available patient data. As an example, <xref ref-type="fig" rid="f10-jlpea-01-00150">Figure 10</xref> considers the analysis pertaining to the bin resolution for the spectral energy distribution. The critical metrics for seizure onset detection include sensitivity, latency, and specificity (represented by the number of false alarms per hour). As shown, these are determined for bin resolutions ranging from 2–8, in a 0–20 Hz band (<italic>i.e.</italic>, the band of interest for seizure detection [<xref ref-type="bibr" rid="b16-jlpea-01-00150">16</xref>]). Based on over 1100 h of data across 30 patients, the results show that the performance improves substantially up to a resolution of 6 bins. Thus, in the implementation, a bin resolution of seven filters is chosen.</p>
<p>In addition to the performance metrics, <xref ref-type="fig" rid="f10-jlpea-01-00150">Figure 10</xref> also shows the number of support vectors required. The number of support vectors is representative of the classification model complexity. Although SVM classification is performed away from the patient's head, it is worth noting that there is a trade-off between the quality of the biomarkers and complexity of the classification task. Nonetheless, in the range of bin resolutions required for sufficient performance, the number of support vectors required does not change dramatically.</p>
<p><xref ref-type="fig" rid="f11-jlpea-01-00150">Figure 11</xref> shows the implementation of the feature extraction processor. In order to determine the precision and frequency response required for the spectral analysis filters, analysis similar to <xref ref-type="fig" rid="f10-jlpea-01-00150">Figure 10</xref> is performed, evaluating the performance over various filter parameters. Although the input EEG is sampled at a rate of 600 Hz (as described above), the band of interest for seizure detection is 0–20 Hz. In order to ease the implementation of the filters, the input is first down-sampled by eight using a decimation filter of order 48. Each of the seven spectral analysis filters then require an order of 46 to achieve the chosen frequency response. The spectral analysis filters have the following frequency response: bandwidth of 3 Hz, transition band of 3 Hz, and stop-band attenuation of −25 dB. The filters are all implemented using a folded-delay line FIR structure.</p>
<p><xref ref-type="table" rid="t2-jlpea-01-00150">Table 2</xref>, shows the final parameters used in the feature-extraction processing, which have been chosen based on analysis of the algorithm performance with patient data.</p></sec>
<sec>
<label>4.4.</label>
<title>SoC Performance</title>
<p>The SoC is implemented in a 5M2P 180nm CMOS process. <xref ref-type="fig" rid="f12-jlpea-01-00150">Figure 12</xref> shows the die photograph along with a performance summary. Although the feature extraction processor occupies a large portion of the die area, its energy is very low, making local feature extraction for communication data-rate reduction compelling from the perspective of the energy and safety concerns related to the patient's head and neck.</p>
<p><xref ref-type="fig" rid="f13-jlpea-01-00150">Figure 13</xref> shows a demonstration of EEG acquisition, digitization, feature extraction, and classification. A recording electrode at the occipital location (O2) and a reference electrode at the scalp mid-line indicate a relaxed eyes-closed state in the subject through the appearance of an <italic>α</italic>-wave in the EEG. The <italic>α</italic>-wave corresponds to a 10 Hz rhythm. In the test shown, the subject periodically closes and opens his eyes to evoke the onset of the <italic>α</italic>-wave. An SVM has been trained (using 20 s of data) to detect the subject's relaxed eyes-closed state. The waveforms show the EEG acquired using the on-chip instrumentation amplifier and ADC. The presence of the <italic>α</italic>-wave is thus annotated. The second waveform indicates the output of the classifier so that the accuracy and latency of the detector can be characterized. Over a five minute run, the <italic>α</italic>-wave is correctly detected with less than 2.5 s of latency.</p></sec>
<sec>
<label>4.5.</label>
<title>Low-energy Programmable Processors Through Voltage Scaling</title>
<p>In the SoC described, a dedicated processor is used for biomarker extraction. This leads to very low energy, motivating the computation <italic>vs.</italic> communication trade-off of <xref ref-type="table" rid="t1-jlpea-01-00150">Table 1</xref>. As described in Section 2.2, however, the optimal choice of biomarkers depends on a wide range of considerations from application-to-application. Programmable platforms can thus help cover the range of choices pertaining to biomarker selection and processing. The challenge is that programmability incurs much higher energy. A generic approach that is valuable for reducing energy is voltage reduction. Voltage reduction quadratically lowers the active switching energy 
<inline-formula>
<mml:math id="mm9" display="inline">
<mml:semantics id="sm9">
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi mathvariant="italic">CV</mml:mi>
<mml:mrow>
<mml:mi mathvariant="italic">DD</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></inline-formula>. Although, this degrades circuit speed, allowing leakage power sources to integrate over a longer period, energy is typically minimized at voltages near the subthreshold regime, since here the circuit speed begins degrading rapidly [<xref ref-type="bibr" rid="b47-jlpea-01-00150">47</xref>]. This is shown, for instance in the case of an MSP430 (implemented in low-power 65 nm CMOS) [<xref ref-type="bibr" rid="b48-jlpea-01-00150">48</xref>], where the energy is minimized at a <italic>V<sub>DD</sub></italic> of 0.5 V where an operating frequency of 400 kHz is achieved.</p>
<p>Since many physiological signals of interest have low-bandwidths (<italic>i.e.</italic>, below 10 kHz) [<xref ref-type="bibr" rid="b14-jlpea-01-00150">14</xref>]. The energy savings afforded by aggressive voltage reduction are compelling. The ability to operate circuits at ultra-low-voltages, however, requires specialized circuit techniques to tolerate the loss of digital noise margins. Voltage scaling dramatically exacerbates sensitivity to process variation. This is most severe in memory circuits, such as SRAMs, which are relied on heavily in programmable platforms. Due to the high number of devices used in large arrays, it is beneficial to use specialized bit-cells (e.g., 8-transistor topologies). Although these require more devices than the standard 6-transistor topologies, they achieve better area efficiency at low voltages, since the wider noise margins enable aggressive device sizing which would otherwise exacerbate variability to an intolerable level [<xref ref-type="bibr" rid="b49-jlpea-01-00150">49</xref>]. Logic-gate upsizing to ensure noise margins in the logic circuits may also be required, and can be achieved using a Monte Carlo simulation-based modeling and validation methodology [<xref ref-type="bibr" rid="b50-jlpea-01-00150">50</xref>].</p></sec></sec>
<sec sec-type="conclusions">
<label>5.</label>
<title>Conclusions and Challenges</title>
<p>The emergence of advanced biomedical sensing and stimulation technologies raises the opportunity for chronic biomedical devices that are clinically compelling. A critical aspect in systems that aim to take advantage of these, however, is the ability to discern specific physiological processes so that proper therapy can be automatically delivered or so that clinically relevant states can be continuously tracked. Interpreting the correlations in the patient signals that are available through chronic sensors is challenging. Namely, physiological processes, and the mechanisms through which they are exhibited in the signals, are typically too complex to model accurately. Further, the precise correlations exhibited may be variable from patient-to-patient, and these must be discriminated with high specificity since numerous physiological processes simultaneously affect the signals. The ability to efficiently and accurately model the data is thus critical. Fortunately, thanks to recent clinical practices, databases now exist where patient signals are annotated with diagnostic information. Additionally, efficient algorithms (from the domain of machine-learning) have emerged that allow us to exploit such databases to model the clinically-relevant correlations between the data and the physiological patient states. This paves the way for promising detection algorithms that can allow chronic biomedical devices to be intelligent.</p>
<p>Incorporating such algorithms in practical devices introduces energy, instrumentation, form-factor, and computational resource constraints. To demonstrate system design, the case study of an SoC for a chronic seizure detection system is considered. The SoC incorporates a low-noise instrumentation amplifier for EEG acquisition, an ADC, and a processor for spectral feature extraction. Analysis, based on patient data testing, is presented for determining the feature-extraction parameters and the computational precision required. This leads to a system that is viable for chronic seizure detection; however, several open challenges remain for future systems. The implementation of low-power machine-learning classifiers is opposed by the complexity of the classification task. The ability to efficiently scale the classifier model for the datasets of interest is thus important in order to avoid compromising the accuracy of a detector [<xref ref-type="bibr" rid="b39-jlpea-01-00150">39</xref>]. In the case study considered, the power consumption of classification motivated moving its implementation off the scalp, where the battery size and weight limitations are the most severe. Additionally, although data-driven modeling provides a high degree of specificity by training based on previously observed data, physiological changes in the patient can invalidate the generated model. Although, this is not observed to limit seizure detection applications (where seizures in a given patient appear to be stereotypical and stable), it can be important in applications where patient monitoring is required following an acute event to which the physiology responds by evolving over time [<xref ref-type="bibr" rid="b23-jlpea-01-00150">23</xref>]. Nonetheless, patient data is one of the most important resources in modern healthcare. Exploiting it to achieve accurate patient monitoring systems is emerging as a promising approach. Incorporating these methods in chronic biomedical devices is thus highly compelling, since new modalities for delivering therapy and sensing patient signals could make such devices invaluable for effective healthcare delivery.</p></sec></body>
<back>
<sec sec-type="display-objects">
<title>Figures and Tables</title>
<fig id="f1-jlpea-01-00150" position="float">
<label>Figure 1.</label>
<caption>
<p>Seizure detection based on amplitude EEG margins. The electromyographic (EMG) activity in the EEG trace (initially observed) blocked seizure detection until a muscle relaxant was administered [<xref ref-type="bibr" rid="b13-jlpea-01-00150">13</xref>].</p></caption>
<graphic xlink:href="jlpea-01-00150f1.gif"/></fig>
<fig id="f2-jlpea-01-00150" position="float">
<label>Figure 2.</label>
<caption>
<p>EEG from an epileptic patient showing (<bold>a</bold>) typical background burst resembling a seizure and (<bold>b</bold>) burst corresponding to a clinical seizure, having specific spectral and spatial characteristics.</p></caption>
<graphic xlink:href="jlpea-01-00150f2.gif"/></fig>
<fig id="f3-jlpea-01-00150" position="float">
<label>Figure 3.</label>
<caption>
<p>The typical structure of data-driven biomedical detection algorithms consists of biomarker (feature) extraction followed by biomarker interpretation, which is based on a generated model.</p></caption>
<graphic xlink:href="jlpea-01-00150f3.gif"/></fig>
<fig id="f4-jlpea-01-00150" position="float">
<label>Figure 4.</label>
<caption>
<p>Biomedical signals represent various physiologic (e.g., 2989–2992 s marks) and pathophysiologic states (e.g., 2994 s mark). Thus, simple detection of signal changes (<italic>i.e.</italic>, anomaly detection) is insufficient.</p></caption>
<graphic xlink:href="jlpea-01-00150f4.gif"/></fig>
<fig id="f5-jlpea-01-00150" position="float">
<label>Figure 5.</label>
<caption>
<p>Support-vector machine decision boundaries for intracranial EEG from a seizure detection application using (<bold>a</bold>) a linear classification kernel and (<bold>b</bold>) a radial-basis function classification kernel (seizure data is represented by circles and non-seizure data is represented by crosses).</p></caption>
<graphic xlink:href="jlpea-01-00150f5.gif"/></fig>
<fig id="f6-jlpea-01-00150" position="float">
<label>Figure 6.</label>
<caption>
<p>(<bold>a</bold>) 126 dimensional seizure (red) and non-seizure (blue) feature vectors extracted from Scalp EEG and projected onto a two dimensional space spanned by the first two principal components; (<bold>b</bold>) Three gaussian mixture model of the non-seizure likelihood function (<italic>f</italic>(<italic>x</italic>⃗|<italic>C<sub>nonseizure</sub></italic>)) estimated using the EM algorithm.</p></caption>
<graphic xlink:href="jlpea-01-00150f6.gif"/></fig>
<fig id="f7-jlpea-01-00150" position="float">
<label>Figure 7.</label>
<caption>
<p>Seizure (red) and Non-seizure (blue) feature vectors from <xref ref-type="fig" rid="f6-jlpea-01-00150">Figure 6a</xref> separated using decision boundaries estimated using a generative and discriminative approach. The decision boundary derived using the generative approach (black) with <italic>f</italic>(<italic>x</italic>⃗|<italic>C<sub>seizure</sub></italic>) modeled using a single gaussian, <italic>f</italic>(<italic>x</italic>⃗|<italic>C<sub>nonseizure</sub></italic>) modeled using a mixture of 3 gaussians, and λ = 30. The decision boundary derived using the discriminative approach used a Support Vector Machine and radial basis kernel with parameter <italic>γ</italic> = 3.</p></caption>
<graphic xlink:href="jlpea-01-00150f7.gif"/></fig>
<fig id="f8-jlpea-01-00150" position="float">
<label>Figure 8.</label>
<caption>
<p>Instrumentation challenges with EEG sensing.</p></caption>
<graphic xlink:href="jlpea-01-00150f8.gif"/></fig>
<fig id="f9-jlpea-01-00150" position="float">
<label>Figure 9.</label>
<caption>
<p>The instrumentation amplifier showing (<bold>a</bold>) the overall architecture and (<bold>b</bold>) the first stage low-noise amplifier.</p></caption>
<graphic xlink:href="jlpea-01-00150f9.gif"/></fig>
<fig id="f10-jlpea-01-00150" position="float">
<label>Figure 10.</label>
<caption>
<p>Analysis (from 1136 h of patient data) to determine the feature-extraction parameters required in order to accurately classify the data (the analysis for spectral bin resolution is shown).</p></caption>
<graphic xlink:href="jlpea-01-00150f10.gif"/></fig>
<fig id="f11-jlpea-01-00150" position="float">
<label>Figure 11.</label>
<caption>
<p>Implementation of the feature-extraction processor (with parameters determined from analysis).</p></caption>
<graphic xlink:href="jlpea-01-00150f11.gif"/></fig>
<fig id="f12-jlpea-01-00150" position="float">
<label>Figure 12.</label>
<caption>
<p>Performance summary and die photograph of seizure detection SoC.</p></caption>
<graphic xlink:href="jlpea-01-00150f12.gif"/></fig>
<fig id="f13-jlpea-01-00150" position="float">
<label>Figure 13.</label>
<caption>
<p>Operation of seizure detection system including EEG acquisition, digitization, feature extraction, and classification of <italic>α</italic>-state (wired transmission of the feature vectors is employed to simplify demonstration of the detection system).</p></caption>
<graphic xlink:href="jlpea-01-00150f13.gif"/></fig>
<table-wrap id="t1-jlpea-01-00150" position="float">
<label>Table 1.</label>
<caption>
<p>Power comparison showing the benefit of local feature extraction <italic>vs</italic>. complete wireless EEG [<xref ref-type="bibr" rid="b41-jlpea-01-00150">41</xref>]. An 18-channel EEG recording system is assumed, and the aggregate power over all channels is reported.</p></caption>
<table frame="box" rules="all">
<thead>
<tr>
<th align="left" valign="top"/>
<th align="center" valign="top"><bold>No Local Processing</bold></th>
<th align="center" valign="top"><bold>Local Feature Extraction</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top"><bold>Instrumentation amps</bold></td>
<td align="center" valign="top"><bold>72 <italic>μ</italic>W</bold></td>
<td align="center" valign="top"><bold>72 <italic>μ</italic>W</bold></td></tr>
<tr>
<td align="left" valign="top"><bold>ADCs (12 b, 600 Hz/ch.)</bold></td>
<td align="center" valign="top"><bold>3 <italic>μ</italic>W</bold></td>
<td align="center" valign="top"><bold>3 <italic>μ</italic>W</bold></td></tr>
<tr>
<td align="left" valign="top"><bold>Digital Processors</bold></td>
<td align="center" valign="top"><bold>-</bold></td>
<td align="center" valign="top"><bold>2.1 <italic>μ</italic>W</bold></td></tr>
<tr>
<td align="left" valign="top"><bold>Radio (CC2550)</bold>
<list list-type="simple">
<list-item>
<label>-</label>
<p>Active: <italic>bit-rate</italic> × 40 nJ/bit</p></list-item>
<list-item>
<label>-</label>
<p>Start-up: 4.8 <italic>μ</italic>W (avg./s)</p></list-item>
<list-item>
<label>-</label>
<p>Idle mode: 0.46 <italic>μ</italic>W</p></list-item></list></td>
<td align="center" valign="top"><bold>1733 <italic>μ</italic>W</bold>
<list list-type="simple">
<list-item>
<label>-</label>
<p>Active: 43.2 kb/s <xref ref-type="table-fn" rid="tfn1-jlpea-01-00150">1</xref> × 40 nJ/bit</p></list-item>
<list-item>
<label>-</label>
<p>Start-up: 4.8 <italic>μ</italic>W</p></list-item>
<list-item>
<label>-</label>
<p>Idle mode: 0.46 <italic>μ</italic>W</p></list-item></list></td>
<td align="center" valign="top"><bold>43 <italic>μ</italic>W</bold>
<list list-type="simple">
<list-item>
<label>-</label>
<p>Active: 2 kb/2 s × 40 nJ/bit</p></list-item>
<list-item>
<label>-</label>
<p>Start-up: 4.8 <italic>μ</italic>W/2 s</p></list-item>
<list-item>
<label>-</label>
<p>Idle mode: 0.46 <italic>μ</italic>W</p></list-item></list></td></tr>
<tr>
<td align="left" valign="top"><bold>Total</bold></td>
<td align="center" valign="top"><bold>1808 <italic>μ</italic>W</bold></td>
<td align="center" valign="top"><bold>120 <italic>μ</italic>W</bold></td></tr></tbody></table>
<table-wrap-foot><fn id="tfn1-jlpea-01-00150">
<label>1</label>
<p>Assumes EEG waveforms are downsampled to 200 Hz (as in typical EEG recording interfaces).</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="t2-jlpea-01-00150" position="float">
<label>Table 2.</label>
<caption>
<p>Parameters selected for feature-extraction processing based on analysis of algorithm performance.</p></caption>
<table frame="box" rules="all">
<thead>
<tr>
<th align="left" valign="top"><bold>Feature Extraction Parameters</bold></th>
<th align="center" valign="top"><bold>Parameter Values</bold></th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">Decimation filter input rate</td>
<td align="center" valign="top">600 Hz</td></tr>
<tr>
<td align="left" valign="top">Decimation filter output rate</td>
<td align="center" valign="top">75 Hz</td></tr>
<tr>
<td align="left" valign="top">Decimation filter BW</td>
<td align="center" valign="top">150 Hz</td></tr>
<tr>
<td align="left" valign="top">Decimation filter input precision</td>
<td align="center" valign="top">12-bits</td></tr>
<tr>
<td align="left" valign="top">Decimation filter output precision</td>
<td align="center" valign="top">16-bits</td></tr>
<tr>
<td align="left" valign="top">Decimation filter order</td>
<td align="center" valign="top">48</td></tr>
<tr>
<td align="left" valign="top">Spectral analysis band</td>
<td align="center" valign="top">0–20 Hz</td></tr>
<tr>
<td align="left" valign="top">Spectral analysis bins</td>
<td align="center" valign="top">7</td></tr>
<tr>
<td align="left" valign="top">Spectral analysis filter BW</td>
<td align="center" valign="top">3 Hz</td></tr>
<tr>
<td align="left" valign="top">Spectral analysis filter pass band</td>
<td align="center" valign="top">3 Hz</td></tr>
<tr>
<td align="left" valign="top">Spectral analysis filter output precision</td>
<td align="center" valign="top">16-bits</td></tr>
<tr>
<td align="left" valign="top">Spectral analysis filter order</td>
<td align="center" valign="top">46</td></tr></tbody></table></table-wrap></sec>
<ack>
<p>The authors acknowledge the support of the Gigascale Systems Research Center, one of six research centers funded under the Focus Center Research Program (FCRP), a Semiconductor Research Corporation entity.</p></ack>
<ref-list>
<title>References and Notes</title>
<ref id="b1-jlpea-01-00150"><label>1.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Benabid</surname><given-names>A.L.</given-names></name></person-group><article-title>Deep brain stimulation for Parkinson's disease</article-title><source>Curr. Opin. Neurobiol.</source><year>2003</year><volume>13</volume><fpage>696</fpage><lpage>706</lpage><pub-id pub-id-type="doi">10.1016/j.conb.2003.11.001</pub-id><pub-id pub-id-type="pmid">14662371</pub-id></citation></ref>
<ref id="b2-jlpea-01-00150"><label>2.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schachter</surname><given-names>S.C.</given-names></name></person-group><article-title>Vagus nerve stimulator</article-title><source>Epilepsia</source><year>1998</year><volume>39</volume><fpage>677</fpage><lpage>686</lpage><pub-id pub-id-type="doi">10.1111/j.1528-1157.1998.tb01151.x</pub-id><pub-id pub-id-type="pmid">9670894</pub-id></citation></ref>
<ref id="b3-jlpea-01-00150"><label>3.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname><given-names>D.H.</given-names></name><name><surname>Viventi</surname><given-names>J.</given-names></name><name><surname>Amsden</surname><given-names>J.</given-names></name><name><surname>Xiao</surname><given-names>J.</given-names></name><name><surname>Vigeland</surname><given-names>L.</given-names></name><name><surname>Kim</surname><given-names>Y.S.</given-names></name><name><surname>Blanco</surname><given-names>J.</given-names></name><name><surname>Frechette</surname><given-names>E.</given-names></name><name><surname>Contreras</surname><given-names>D.</given-names></name><name><surname>Kaplan</surname><given-names>D.</given-names></name><etal/></person-group><article-title>Dissolvable films of silk fibroin for ultrathin, conformal bio-integrated electronics</article-title><source>Nat. Mater.</source><year>2010</year><volume>9</volume><fpage>511</fpage><lpage>517</lpage><pub-id pub-id-type="doi">10.1038/nmat2745</pub-id><pub-id pub-id-type="pmid">20400953</pub-id></citation></ref>
<ref id="b4-jlpea-01-00150"><label>4.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Graudejus</surname><given-names>O.</given-names></name><name><surname>Yu</surname><given-names>Z.</given-names></name><name><surname>Jones</surname><given-names>J.</given-names></name><name><surname>Morrison</surname><given-names>B.</given-names></name><name><surname>Wagner</surname><given-names>S.</given-names></name></person-group><article-title>Characterization of an elastically stretchable microelectrode array and its application to neural field potential recordings</article-title><source>J. Electrochem. Soc.</source><year>2009</year><volume>156</volume><fpage>85</fpage><lpage>94</lpage><pub-id pub-id-type="doi">10.1149/1.3115465</pub-id></citation></ref>
<ref id="b5-jlpea-01-00150"><label>5.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Viventi</surname><given-names>J.</given-names></name><name><surname>Kim</surname><given-names>D.H.</given-names></name><name><surname>Moss</surname><given-names>J.</given-names></name><name><surname>Kim</surname><given-names>Y.S.</given-names></name><name><surname>Blanco</surname><given-names>J.</given-names></name><name><surname>Anetta</surname><given-names>N.</given-names></name><name><surname>Hicks</surname><given-names>A.</given-names></name><name><surname>Xiao</surname><given-names>J.</given-names></name><name><surname>Huang</surname><given-names>Y.</given-names></name><name><surname>Callans</surname><given-names>D.J.</given-names></name><etal/></person-group><article-title>Dissolvable films of silk fibroin for ultrathin, conformal bio-integrated electronics</article-title><source>Sci. Trans. Med.</source><year>2010</year><volume>2</volume><fpage>2694</fpage><lpage>2699</lpage></citation></ref>
<ref id="b6-jlpea-01-00150"><label>6.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chandrakasan</surname><given-names>A.</given-names></name><name><surname>Verma</surname><given-names>N.</given-names></name><name><surname>Daly</surname><given-names>D.</given-names></name></person-group><article-title>Ultralow-power electronics for biomedical applications</article-title><source>Annu. Rev. Biomed. Eng.</source><year>2008</year><volume>10</volume><fpage>247</fpage><lpage>274</lpage><pub-id pub-id-type="doi">10.1146/annurev.bioeng.10.061807.160547</pub-id><pub-id pub-id-type="pmid">18647116</pub-id></citation></ref>
<ref id="b7-jlpea-01-00150"><label>7.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leong</surname><given-names>T.</given-names></name><name><surname>Aronsky</surname><given-names>D.</given-names></name><name><surname>Michael Shabot</surname><given-names>M.</given-names></name></person-group><article-title>Guest editorial: Computer-based decision support for critical and emergency care</article-title><source>J. Biomed. Inform.</source><year>2008</year><volume>41</volume><fpage>409</fpage><lpage>412</lpage><pub-id pub-id-type="doi">10.1016/j.jbi.2008.04.006</pub-id><pub-id pub-id-type="pmid">18479974</pub-id></citation></ref>
<ref id="b8-jlpea-01-00150"><label>8.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wallis</surname><given-names>L.</given-names></name></person-group><article-title>Alarm fatigue linked to patient's death</article-title><source>Am. J. Nurs.</source><year>2010</year><volume>110</volume><fpage>16</fpage><pub-id pub-id-type="pmid">20980881</pub-id></citation></ref>
<ref id="b9-jlpea-01-00150"><label>9.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dishman</surname><given-names>E.</given-names></name></person-group><article-title>Inventing wellness systems for aging in place</article-title><source>IEEE Comput.</source><year>2004</year><volume>37</volume><fpage>34</fpage><lpage>41</lpage></citation></ref>
<ref id="b10-jlpea-01-00150"><label>10.</label><citation citation-type="thesis"><person-group person-group-type="author"><name><surname>Tsien</surname><given-names>C.</given-names></name></person-group><article-title>TrendFinder: Automated detection of alarmable trends</article-title><source>Ph.D. Thesis</source><publisher-name>Massachusetts Institute of Technology</publisher-name><publisher-loc>Cambridge, MA, USA</publisher-loc><year>2000</year></citation></ref>
<ref id="b11-jlpea-01-00150"><label>11.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Tompkins</surname><given-names>W.J.</given-names></name></person-group><source>Biomedical Digital Signal Processing: C Language Examples and Laboratory Experiments for the IBM PC</source><publisher-name>Prentice-Hall</publisher-name><publisher-loc>Bergen County, NJ, USA</publisher-loc><year>1993</year></citation></ref>
<ref id="b12-jlpea-01-00150"><label>12.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hau</surname><given-names>D.</given-names></name><name><surname>Coiera</surname><given-names>E.</given-names></name></person-group><source>Learning Qualitative Models from Physiological Signals</source><comment>AAAI Technical Report SS-94-01</comment><publisher-name>AAAI Press</publisher-name><publisher-loc>Menlo Park, CA, USA</publisher-loc><year>1994</year><fpage>67</fpage><lpage>71</lpage></citation></ref>
<ref id="b13-jlpea-01-00150"><label>13.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hagmann</surname><given-names>C.F.</given-names></name><name><surname>Robertson</surname><given-names>N.J.</given-names></name><name><surname>Azzopardi</surname><given-names>D.</given-names></name></person-group><article-title>Artifacts on electroencephalograms may influence the amplitude-integrated EEG classification: A qualitative analysis in neonatal encephalopathy</article-title><source>Pediatrics</source><year>2006</year><volume>118</volume><fpage>2552</fpage><lpage>2554</lpage><pub-id pub-id-type="doi">10.1542/peds.2006-2519</pub-id><pub-id pub-id-type="pmid">17142543</pub-id></citation></ref>
<ref id="b14-jlpea-01-00150"><label>14.</label><citation citation-type="thesis"><person-group person-group-type="author"><name><surname>Yazicioglu</surname><given-names>R.</given-names></name></person-group><article-title>Biopotential Readout Circuits for Portable Acquisition Systems</article-title><source>Ph.D. Thesis</source><publisher-name>Katholieke Universiteit</publisher-name><publisher-loc>Leuven, Belgium</publisher-loc><year>2008</year></citation></ref>
<ref id="b15-jlpea-01-00150"><label>15.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shoeb</surname><given-names>A.</given-names></name><name><surname>Schachter</surname><given-names>S.</given-names></name><name><surname>Schomer</surname><given-names>D.</given-names></name><name><surname>Bourgeois</surname><given-names>B.</given-names></name><name><surname>Treves</surname><given-names>S.T.</given-names></name><name><surname>Guttag</surname><given-names>J.</given-names></name></person-group><article-title>Detecting seizure onset in the ambulatory setting: Demonstrating feasibility</article-title><source>Conf. Proc. IEEE Eng. Med. Biol. Soc.</source><year>2005</year><volume>4</volume><fpage>3546</fpage><lpage>3550</lpage><pub-id pub-id-type="pmid">17280990</pub-id></citation></ref>
<ref id="b16-jlpea-01-00150"><label>16.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gotman</surname><given-names>J.</given-names></name><name><surname>Ives</surname><given-names>J.</given-names></name><name><surname>Gloor</surname><given-names>G.</given-names></name></person-group><article-title>Frequency content of eeg and emg at seizure onset: Possibility of removal of EMG artefact by digital filtering</article-title><source>EEG Clin. Neurophysiol.</source><year>1981</year><volume>52</volume><fpage>626</fpage><lpage>639</lpage><pub-id pub-id-type="doi">10.1016/0013-4694(81)91437-1</pub-id></citation></ref>
<ref id="b17-jlpea-01-00150"><label>17.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Shoeb</surname><given-names>A.</given-names></name><name><surname>Bourgeois</surname><given-names>B.</given-names></name><name><surname>Treves</surname><given-names>S.</given-names></name><name><surname>Schachter</surname><given-names>S.</given-names></name><name><surname>Guttag</surname><given-names>J.</given-names></name></person-group><article-title>Impact of patient-specificity on seizure onset detection performance</article-title><conf-name>Proceedings of the International Conference of the IEEE Engineering in Medicine and Biology Society</conf-name><conf-loc>Lyon, France</conf-loc><conf-date>2007</conf-date><fpage>4110</fpage><lpage>4114</lpage></citation></ref>
<ref id="b18-jlpea-01-00150"><label>18.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meyfroidt</surname><given-names>G.</given-names></name><name><surname>Guiza</surname><given-names>F.</given-names></name><name><surname>Ramon</surname><given-names>J.</given-names></name><name><surname>Bruynooghe</surname><given-names>M.</given-names></name></person-group><article-title>Machine learning techniques to examine large patient databases</article-title><source>Best Pract. Res. Clin. Anaesthesiol.</source><year>2009</year><volume>23</volume><fpage>127</fpage><lpage>143</lpage><pub-id pub-id-type="pmid">19449621</pub-id></citation></ref>
<ref id="b19-jlpea-01-00150"><label>19.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sajda</surname><given-names>P.</given-names></name></person-group><article-title>Machine learning for diagnosis and detection of disease</article-title><source>Annu. Rev. Biomed. Eng.</source><year>2006</year><volume>8</volume><fpage>537</fpage><lpage>565</lpage><pub-id pub-id-type="doi">10.1146/annurev.bioeng.8.061505.095802</pub-id><pub-id pub-id-type="pmid">16834566</pub-id></citation></ref>
<ref id="b20-jlpea-01-00150"><label>20.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Podgorelec</surname><given-names>V.</given-names></name><name><surname>Druzovec</surname><given-names>T.W.</given-names></name></person-group><article-title>Some applications of intelligent systems in medicine</article-title><conf-name>Proceedings of the IEEE 3rd International Conference on Computational Cybernetics</conf-name><conf-loc>Mauritius</conf-loc><conf-date>2005</conf-date><fpage>35</fpage><lpage>39</lpage></citation></ref>
<ref id="b21-jlpea-01-00150"><label>21.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Shoeb</surname><given-names>A.</given-names></name><name><surname>Guttag</surname><given-names>J.</given-names></name></person-group><article-title>Application of Machine Learning to Epileptic Seizure Detection</article-title><conf-name>Proceedings of the International Conference on Machine Learning</conf-name><conf-loc>Haifa, Israel</conf-loc><conf-date>2010</conf-date></citation></ref>
<ref id="b22-jlpea-01-00150"><label>22.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Chazal</surname><given-names>P.</given-names></name><name><surname>O'Dwyer</surname><given-names>M.</given-names></name><name><surname>Reilly</surname><given-names>R.B.</given-names></name></person-group><article-title>Automatic classification of heartbeats using ECG morphology and heartbeat interval features</article-title><source>IEEE Trans. Biomed. Eng.</source><year>2004</year><volume>51</volume><fpage>1196</fpage><lpage>1206</lpage><pub-id pub-id-type="doi">10.1109/TBME.2004.827359</pub-id><pub-id pub-id-type="pmid">15248536</pub-id></citation></ref>
<ref id="b23-jlpea-01-00150"><label>23.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sorenson</surname><given-names>D.</given-names></name><name><surname>Grissom</surname><given-names>C.</given-names></name><name><surname>Carpenter</surname><given-names>L.</given-names></name><name><surname>Austin</surname><given-names>A.</given-names></name><name><surname>Sward</surname><given-names>K.</given-names></name><name><surname>Napoli</surname><given-names>L.</given-names></name><name><surname>Warner</surname><given-names>H.</given-names></name><name><surname>Morris</surname><given-names>A.</given-names></name></person-group><article-title>and for the Reengineering Clinical Research in Critical Care Investigators. A frame-based representation for a bedside ventilator weaning protocol</article-title><source>J. Biomed. Inform.</source><year>2008</year><volume>41</volume><fpage>461</fpage><lpage>468</lpage><pub-id pub-id-type="doi">10.1016/j.jbi.2008.02.002</pub-id><pub-id pub-id-type="pmid">18358789</pub-id></citation></ref>
<ref id="b24-jlpea-01-00150"><label>24.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Jouny</surname><given-names>C.</given-names></name><name><surname>Bergey</surname><given-names>G.</given-names></name></person-group><article-title>Dynamics of Epileptic Seizures During Evolution And Propagation</article-title><source>Computational Neuroscience in Epilepsy</source><edition>1st ed.</edition><person-group person-group-type="editor"><name><surname>Soltesz</surname><given-names>I.</given-names></name><name><surname>Staley</surname><given-names>K.</given-names></name></person-group><publisher-name>Academic Press</publisher-name><publisher-loc>San Diego, CA, USA</publisher-loc><year>2008</year><volume>28</volume><fpage>457</fpage><lpage>470</lpage></citation></ref>
<ref id="b25-jlpea-01-00150"><label>25.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hsu</surname><given-names>C.W.</given-names></name><name><surname>Lin</surname><given-names>C.J.</given-names></name></person-group><article-title>A comparison of methods for multiclass support vector machines</article-title><source>IEEE Trans. Neural Networks</source><year>2002</year><volume>13</volume><fpage>415</fpage><lpage>425</lpage><pub-id pub-id-type="doi">10.1109/72.991427</pub-id></citation></ref>
<ref id="b26-jlpea-01-00150"><label>26.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Rubinstein</surname><given-names>Y.D.</given-names></name><name><surname>Hastie</surname><given-names>T.</given-names></name></person-group><article-title>Discriminative vs Informative Learning</article-title><conf-name>Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining</conf-name><publisher-name>AAAI Press</publisher-name><publisher-loc>Menlo Park, CA, USA</publisher-loc><year>1997</year><fpage>49</fpage><lpage>53</lpage></citation></ref>
<ref id="b27-jlpea-01-00150"><label>27.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ng</surname><given-names>A.Y.</given-names></name><name><surname>Jordan</surname><given-names>M.I.</given-names></name></person-group><article-title>On Discriminative <italic>vs</italic>. Generative classifiers: A comparison of logistic regression and naive Bayes</article-title><source>Adv. Neural Inf. Process. Syst.</source><year>2001</year><volume>14</volume><fpage>841</fpage><lpage>848</lpage></citation></ref>
<ref id="b28-jlpea-01-00150"><label>28.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Cristianini</surname><given-names>N.</given-names></name><name><surname>Shawe-Taylor</surname><given-names>J.</given-names></name></person-group><source>An Introduction to Support Vector Machines and Other Kernel Based Learning Methods</source><publisher-name>Cambridge University Press</publisher-name><publisher-loc>Cambridge, UK</publisher-loc><year>2000</year></citation></ref>
<ref id="b29-jlpea-01-00150"><label>29.</label><note><p>If the classes cannot be well separated by a hyper-plane, the SVM can be used to determine a quadratic or circumferential decision boundary. The SVM determines a nonlinear boundary in the input feature space by solving for linear boundary in higher-dimensional feature space, which is formed by introducing a <italic>kernel</italic>.</p></note></ref>
<ref id="b30-jlpea-01-00150"><label>30.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Theodoridis</surname><given-names>S.</given-names></name><name><surname>Koutroumbas</surname><given-names>K.</given-names></name></person-group><article-title>Linear Classifiers</article-title><source>Pattern Recognition</source><publisher-name>Elsevier</publisher-name><publisher-loc>Oxford, UK</publisher-loc><year>2009</year><comment>Chapter 3</comment></citation></ref>
<ref id="b31-jlpea-01-00150"><label>31.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Joachims</surname><given-names>T.</given-names></name></person-group><article-title>Text Categorization with Support-Vector Machines: Learning with Many Relevant Features</article-title><conf-name>Proceedings of the European Conference on Machine Learning</conf-name><publisher-name>Springer</publisher-name><publisher-loc>Berlin, Germany</publisher-loc><year>1998</year><fpage>137</fpage><lpage>142</lpage></citation></ref>
<ref id="b32-jlpea-01-00150"><label>32.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Akbani</surname><given-names>R.</given-names></name><name><surname>Kwek</surname><given-names>S.</given-names></name><name><surname>Japkowicz</surname><given-names>N.</given-names></name></person-group><article-title>Applying Support-Vector Machines to Imbalanced Datasets</article-title><conf-name>Proceedings of the European Conference on Machine Learning</conf-name><publisher-name>Springer</publisher-name><publisher-loc>Berlin, Germany</publisher-loc><year>2004</year></citation></ref>
<ref id="b33-jlpea-01-00150"><label>33.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Kay</surname><given-names>S.M.</given-names></name></person-group><article-title>Cramer-Rao Lower Bound</article-title><source>Fundamentals of Statistical Signal Processing: Detection Theory</source><publisher-name>Prentice-Hall</publisher-name><publisher-loc>Bergen County, NJ, USA</publisher-loc><year>1998</year><comment>Chapter 3</comment><fpage>60</fpage><lpage>92</lpage></citation></ref>
<ref id="b34-jlpea-01-00150"><label>34.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Bishop</surname><given-names>C.M.</given-names></name></person-group><article-title>Probability Distributions</article-title><source>Pattern Recognition and Machine Learning</source><publisher-name>Springer</publisher-name><publisher-loc>Berlin, Germany</publisher-loc><year>2006</year><comment>Chapter 2</comment><fpage>67</fpage><lpage>127</lpage></citation></ref>
<ref id="b35-jlpea-01-00150"><label>35.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lotte</surname><given-names>F.</given-names></name><name><surname>Congedo</surname><given-names>M.</given-names></name><name><surname>Lauyer</surname><given-names>A.</given-names></name><name><surname>Lamarche</surname><given-names>F.</given-names></name><name><surname>Arnaldi</surname><given-names>B.</given-names></name></person-group><article-title>A Review of Classification Algorithms for EEG-Based Brain-Computer Interface</article-title><source>J. Neural Eng.</source><year>2007</year><volume>4</volume><fpage>R1</fpage><lpage>R13</lpage><pub-id pub-id-type="doi">10.1088/1741-2560/4/2/R01</pub-id><pub-id pub-id-type="pmid">17409472</pub-id></citation></ref>
<ref id="b36-jlpea-01-00150"><label>36.</label><citation citation-type="thesis"><person-group person-group-type="author"><name><surname>Shih</surname><given-names>E.I.</given-names></name></person-group><article-title>Reducing the Computational Demands of Medical Monitoring Classifiers by Examining Less Data</article-title><source>Ph.D. Thesis</source><publisher-name>Massachu-setts Institute of Technology</publisher-name><publisher-loc>Cambridge, MA, USA</publisher-loc><year>2010</year></citation></ref>
<ref id="b37-jlpea-01-00150"><label>37.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Avestruz</surname><given-names>A.T.</given-names></name><name><surname>Santa</surname><given-names>W.</given-names></name><name><surname>Carlson</surname><given-names>D.</given-names></name><name><surname>Jensen</surname><given-names>R.</given-names></name><name><surname>Stanslaski</surname><given-names>S.</given-names></name><name><surname>Helfenstine</surname><given-names>A.</given-names></name><name><surname>Denison</surname><given-names>T.</given-names></name></person-group><article-title>A 5 <italic>μ</italic>W/channel spectral analysis IC for chronic bidirectional brain-machine interfaces</article-title><source>IEEE J. Solid-State Circuits</source><year>2008</year><volume>43</volume><fpage>3006</fpage><lpage>3024</lpage><pub-id pub-id-type="doi">10.1109/JSSC.2008.2006460</pub-id></citation></ref>
<ref id="b38-jlpea-01-00150"><label>38.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Yazicioglu</surname><given-names>R.F.</given-names></name><name><surname>Kim</surname><given-names>S.</given-names></name><name><surname>Torfs</surname><given-names>T.</given-names></name><name><surname>Merken</surname><given-names>P.</given-names></name><name><surname>Hoof</surname><given-names>C.V.</given-names></name></person-group><article-title>A 30 <italic>μW</italic> analog signal processor ASIC for biomedical signal monitoring</article-title><conf-name>Proceedings of the IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC)</conf-name><conf-loc>San Francisco, CA, USA</conf-loc><conf-date>2010</conf-date><fpage>124</fpage><lpage>125</lpage></citation></ref>
<ref id="b39-jlpea-01-00150"><label>39.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Shoeb</surname><given-names>A.</given-names></name><name><surname>Carlson</surname><given-names>D.</given-names></name><name><surname>Panken</surname><given-names>E.</given-names></name><name><surname>Denison</surname><given-names>T.</given-names></name></person-group><article-title>A Micropower Support Vector Machine Based Seizure Detection Architecture for Embedded Medical Devices</article-title><conf-name>Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2009)</conf-name><conf-loc>Minneapolis, MN, USA</conf-loc><conf-date>2009</conf-date><fpage>4202</fpage><lpage>4205</lpage></citation></ref>
<ref id="b40-jlpea-01-00150"><label>40.</label><citation citation-type="web"><person-group person-group-type="author"><collab>Texas Instruments</collab></person-group><article-title>Low-cost low-power 2.4 GHz RF transmitter</article-title><comment>Available Online: <ext-link xlink:href="http://focus.ti.com/docs/prod/folders/print/cc2550.html" ext-link-type="uri">http://focus.ti.com/docs/prod/folders/print/cc2550.html</ext-link> (accessed on 1 September 2010)</comment></citation></ref>
<ref id="b41-jlpea-01-00150"><label>41.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Verma</surname><given-names>N.</given-names></name><name><surname>Shoeb</surname><given-names>A.</given-names></name><name><surname>Bohorquez</surname><given-names>J.</given-names></name><name><surname>Dawson</surname><given-names>J.</given-names></name><name><surname>Guttag</surname><given-names>J.</given-names></name><name><surname>Chandrakasan</surname><given-names>A.P.</given-names></name></person-group><article-title>A micro-power EEG acquisition SoC with integrated feature extraction processor for a chronic seizure detection system</article-title><source>IEEE J. Solid-State Circuits</source><year>2010</year><volume>45</volume><fpage>804</fpage><lpage>816</lpage><pub-id pub-id-type="doi">10.1109/JSSC.2010.2042245</pub-id></citation></ref>
<ref id="b42-jlpea-01-00150"><label>42.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Dozio</surname><given-names>R.</given-names></name><name><surname>Baba</surname><given-names>A.</given-names></name><name><surname>Assambo</surname><given-names>C.</given-names></name><name><surname>Burke</surname><given-names>M.J.</given-names></name></person-group><article-title>Time based measurement of the impedance of the skin-electrode interface for dry electrode ECG recording</article-title><conf-name>Proceedings of the 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2007)</conf-name><conf-loc>Lyon, France</conf-loc><conf-date>2007</conf-date><fpage>5001</fpage><lpage>5004</lpage></citation></ref>
<ref id="b43-jlpea-01-00150"><label>43.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Enz</surname><given-names>C.</given-names></name><name><surname>Temes</surname><given-names>G.</given-names></name></person-group><article-title>Circuit techniques for reducing the effects of op-amp imperfections: auto-zeroing, correlated double sampling, and chopper stabilization</article-title><source>Proc. IEEE</source><year>1996</year><volume>84</volume><fpage>1584</fpage><lpage>1614</lpage><pub-id pub-id-type="doi">10.1109/5.542410</pub-id></citation></ref>
<ref id="b44-jlpea-01-00150"><label>44.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Denison</surname><given-names>T.</given-names></name><name><surname>Consoer</surname><given-names>K.</given-names></name><name><surname>Santa</surname><given-names>W.</given-names></name><name><surname>Avestruz</surname><given-names>A.T.</given-names></name><name><surname>Cooley</surname><given-names>J.</given-names></name><name><surname>Kelly</surname><given-names>A.</given-names></name></person-group><article-title>A 2 <italic>μ</italic>W 100 nV/rtHz chopper-stabilized instrumentation amplifier for chronic measurement of neural field potentials</article-title><source>IEEE J. Solid-State Circuits</source><year>2007</year><volume>42</volume><fpage>2934</fpage><lpage>2945</lpage><pub-id pub-id-type="doi">10.1109/JSSC.2007.908664</pub-id></citation></ref>
<ref id="b45-jlpea-01-00150"><label>45.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yazicioglu</surname><given-names>R.</given-names></name><name><surname>Merken</surname><given-names>P.</given-names></name><name><surname>Puers</surname><given-names>R.</given-names></name><name><surname>Hoof</surname><given-names>C.V.</given-names></name></person-group><article-title>A 60 <italic>μ</italic>W 60 <inline-formula><mml:math id="mm10" display="inline"><mml:semantics id="sm10"><mml:mrow><mml:mtext>nV</mml:mtext><mml:mo stretchy="false">/</mml:mo><mml:msqrt><mml:mrow><mml:mi>H</mml:mi><mml:mi>z</mml:mi></mml:mrow></mml:msqrt></mml:mrow></mml:semantics></mml:math></inline-formula> readout front-end for portable biopotential acquisition system</article-title><source>IEEE J. Solid-State Circuits</source><year>2007</year><volume>42</volume><fpage>1100</fpage><lpage>1110</lpage><pub-id pub-id-type="doi">10.1109/JSSC.2007.894804</pub-id></citation></ref>
<ref id="b46-jlpea-01-00150"><label>46.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Verma</surname><given-names>N.</given-names></name><name><surname>Chandrakasan</surname><given-names>A.P.</given-names></name></person-group><article-title>An ultra low energy 12-bit rate-resolution scalable SAR ADC for wireless sensor nodes</article-title><source>IEEE J. Solid-State Circuits</source><year>2007</year><volume>42</volume><fpage>1196</fpage><lpage>1205</lpage><pub-id pub-id-type="doi">10.1109/JSSC.2007.897157</pub-id></citation></ref>
<ref id="b47-jlpea-01-00150"><label>47.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname><given-names>A.</given-names></name><name><surname>Chandrakasan</surname><given-names>A.P.</given-names></name></person-group><article-title>A 180 mV subthreshold FFT processor using a minimum energy design methodology</article-title><source>IEEE J. Solid-State Circuits</source><year>2005</year><volume>40</volume><fpage>310</fpage><lpage>319</lpage><pub-id pub-id-type="doi">10.1109/JSSC.2004.837945</pub-id></citation></ref>
<ref id="b48-jlpea-01-00150"><label>48.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kwong</surname><given-names>J.</given-names></name><name><surname>Ramadass</surname><given-names>Y.K.</given-names></name><name><surname>Verma</surname><given-names>N.</given-names></name><name><surname>Chandrakasan</surname><given-names>A.</given-names></name></person-group><article-title>A 65 nm Sub-V<italic><sub>t</sub></italic> microcontroller with integrated SRAM and switched capacitor DC-DC converter</article-title><source>IEEE J. Solid-State Circuits</source><year>2009</year><volume>44</volume><fpage>115</fpage><lpage>126</lpage><pub-id pub-id-type="doi">10.1109/JSSC.2008.2007160</pub-id></citation></ref>
<ref id="b49-jlpea-01-00150"><label>49.</label><citation citation-type="thesis"><person-group person-group-type="author"><name><surname>Verma</surname><given-names>N.</given-names></name></person-group><article-title>Ultra-Low-Power SRAM Design In High Variability Advanced CMOS</article-title><source>Ph.D. Thesis</source><publisher-name>Massachusetts Institute of Technology</publisher-name><publisher-loc>Cambridge, MA, USA</publisher-loc><year>2009</year></citation></ref>
<ref id="b50-jlpea-01-00150"><label>50.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Kwong</surname><given-names>J.</given-names></name><name><surname>Chandrakasan</surname><given-names>A.</given-names></name></person-group><article-title>Variation-Driven Device Sizing for Minimum Energy Sub-threshold Circuits</article-title><conf-name>Proceedings of the International Symposium on Low Power Electronics and Design</conf-name><conf-loc>Tegernsee, Germany</conf-loc><conf-date>2006</conf-date><fpage>8</fpage><lpage>13</lpage></citation></ref></ref-list></back></article>
