Next Article in Journal
Correction of Myopic Astigmatism with Topography-Guided Laser In Situ Keratomileusis (TOPOLINK)
Next Article in Special Issue
The Association between Olfactory Function and Cognitive Impairment in Older Persons with Cognitive Impairments: A Cross-Sectional Study
Previous Article in Journal
Predictors of Staying at Home during the COVID-19 Pandemic and Social Lockdown based on Protection Motivation Theory: A Cross-Sectional Study in Japan
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Robust Discriminant Framework Based on Functional Biomarkers of EEG and Its Potential for Diagnosis of Alzheimer’s Disease

Department of Medical Statistics, Sun Yat-sen University, Guangzhou 510080, China
*
Author to whom correspondence should be addressed.
Healthcare 2020, 8(4), 476; https://doi.org/10.3390/healthcare8040476
Submission received: 7 October 2020 / Revised: 8 November 2020 / Accepted: 9 November 2020 / Published: 11 November 2020
(This article belongs to the Special Issue Prevention and Clinical Treatment of Alzheimer's Disease)

Abstract

:
(1) Background: Growing evidence suggests that electroencephalography (EEG), recording the brain’s electrical activity, can be a promising diagnostic tool for Alzheimer’s disease (AD). The diagnostic biomarkers based on quantitative EEG (qEEG) have been extensively explored, but few of them helped clinicians in their everyday practice, and reliable qEEG markers are still lacking. The study aims to find robust EEG biomarkers and propose a systematic discrimination framework based on signal processing and computer-aided techniques to distinguish AD patients from normal elderly controls (NC). (2) Methods: In the proposed study, EEG signals were preprocessed firstly and Maximal overlap discrete wavelet transform (MODWT) was applied to the preprocessed signals. Variance, Pearson correlation coefficient, interquartile range, Hoeffding’s D measure, and Permutation entropy were extracted as the input of the candidate classifiers. The AD vs. NC discriminant performance of each model was evaluated and an automatic diagnostic framework was eventually developed. (3) Results: A classification procedure based on the extracted EEG features and linear discriminant analysis based classifier achieved the accuracy of 93.18 ± 3.65 (%), the AUC of 97.92 ± 1.66 (%), the F-measure of 94.06 ± 4.04 (%), separately. (4) Conclusions: The developed discrimination framework can identify AD from NC with high performance in a systematic routine.

1. Introduction

Alzheimer’s disease (AD) is an irreversible neurodegenerative disorder and the leading cause of dementia, which slowly destroys patients’ memory, thinking, and eventually, the ability to conduct basic bodily functions [1]. The progressive accumulation of disability affects the daily lives of patients, their families, and wider society [2]. The number of dementia patients is drastically increasing with the increasing life expectancy and the rapidly aging global population; 132 million people are expected to be affected in 2050 [3]. None of the treatments available today can prevent or block AD’s progression. The most effective method for preserving brain function is believed to be based on early diagnosis and active management started from the very beginning of the clinical spectrum of the disease [1]. The pathophysiological process of AD begins 20 years or more before noticeable symptoms such as memory loss and cognitive dysfunction [4,5]. Further, the AD symptoms are in many cases confused with a normal aging process, thus usually resulting in a late diagnosis when the disease is practically irreversible [6,7]. Early diagnosis is crucial for optimal treatment planning and may avoid rapid deterioration of the illness, thus improving the life quantity of both patients and caregivers. These have driven the search for early diagnosis biomarkers towards the timely detection of this insidious disease [8,9,10].
Evidence suggests that amyloid β protein (Aβ) accumulation and neuronal degeneration (elevated tau/phosphorylated tau) are two major biomarkers of AD, and also the core neuropathological elements of the disease [11]. The former is thought to develop first during the AD continuum, indicating the initiating events from decades prior to clinical symptoms, which can be measured in vivo by positron emission tomography (PET) amyloid imaging and low cerebrospinal fluid (CSF) Aβ42 [10]. The latter is shortly before the appearance of the clinical symptoms, with elevated CSF tau, decreased fluorodeoxyglucose (FDG) uptake on PET, and atrophy on structural magnetic resonance imaging (MRI) [12]. While CSF and neuroimaging markers are gold standards for the in vivo assessment of the patients, they are invasive and expensive and, therefore, have limited utility as frontline screening and diagnostic tools for AD [13]. Moreover, the recent study shifts focus on biomarkers that reveal AD-associated disturbances in brain function rather than simply in structure. Further, there is growing evidence that synaptic loss is most closely correlated with cognitive impairment, suggesting measures of brain functional deficits as early biomarker candidates of AD [14,15]. Electroencephalography (EEG) is a widely available and noninvasive diagnostic method that reflects the real-time functioning of brain synapses [16]. Quantitative EEG (qEEG) analysis provides multi-perspective information of EEG signals such as its frequency components (spectral domain), dynamic alterations (time domain), and source imaging (spatial domain). Numerous studies have demonstrated that qEEG as potential markers for brain function evaluation can detect the following abnormalities in AD patients: (a) modifications in EEG rhythms, (b) decrease in synchronization, (c) reduced complexity, and (d) neuromodulatory deficits [17,18]. Moreover, qEEG provides objective and quantifiable results that can be repeated in further studies, and also has the advantages of requiring simpler testing procedures and lower cost [14,19]. These make it optimal for large-scale screening and early diagnosis of AD. To this end, EEG as an appropriate tool for analysis of AD has been extensively studied in the last decades [17]. Nevertheless, as far as we know, few of the research results help clinicians in their everyday practice and decision support. The problem lies in the fact that the EEG signal is susceptible to noise and has characteristics of being non-stationary, making diagnosis challenging. Moreover, as a large variability exists - in inter-subject- [20,21], it is hard to discriminate artifacts and patterns form normal brain activity. Reliable biomarkers and systematic diagnostic procedures with the capability to extract useful information from rough EEG signals are still urgently required [22,23].
Wavelet Transform (WT) has been suggested as an effective time–frequency analysis method for EEG signal processing [24,25]. It is to convolve the EEG signal with a time window with adjustable width, using small window widths at higher frequencies and large window widths at low frequencies. This adapts well to the characteristics of EEG signals that are composed of short duration events of high-frequency and low-frequency events of long duration [26]. Time-frequency analysis of EEG signals integrated with Machine Learning (ML) techniques may promote a deeper comprehension of AD and aid the diagnosis [27,28,29]. Machine learning algorithms, such as feature selection (FS), removing redundant information from the high dimensional data, could avoid over-fitting and improve diagnosis precision. Motivated by the encouraging outcomes obtained with the WT analysis and ML methods [23,29,30], the study aims to explore robust functional biomarkers based on time-frequency features of qEEG and develop a computer-aided discriminant system for classifying EEG signals of AD and normal elderly controls (NC) automatically. In this study, EEG signals were preprocessed by down-sampling, filtering, segmenting, manual inspection, and independent components analysis firstly to get relatively clean epochs, and maximal overlap discrete wavelet transformation (MODWT) were applied to obtain the time-frequency information of EEG signals. Then, four wavelet features (Variance, Pearson correlation coefficient, interquartile range, Hoeffding’s D measure) and Permutation entropy, which aim at characterizing the change of complexity and synchronization of EEG due to AD, were extracted per channel per epoch. The 20 most discriminant features, selected by dimensionality reduction technique, were fed into conventional ML-based classifiers to realize automatic identification of AD from NC. The results showed that the proposed EEG features and framework could be used for diagnosis and disease progression monitoring of AD with high performance.

2. Materials and Methods

The workflow of the proposed discriminant framework is summarized in Figure 1. It mainly involves four steps: (1) EEG data acquisition and preprocessing; (2) Maximal overlap discrete wavelet transform; (3) Feature extraction and selection; (4) Classification and validation. The details were represented in this section.

2.1. EEG Data Acquisition

A total of 46 eligible subjects were enrolled between September 2012 and September 2019 from the First Affiliated Hospital of Sun Yat-sen University (SYSU) in Guangzhou (China) and their raw EEG data were collected. Ethics approval was obtained from the Ethics Committee of School of Public Health of SYSU (No.123), as well as consent from the participants in the study. The subjects comprised two groups: the first group (NC) consisted of 23 normal elderly controls without memory loss complaints and functional cognitive decline (14 females, aged 59 (50, 74) years), and the second group (AD) of 23 patients diagnosed with AD symptoms (12 females, aged 73 (56, 80) years). The diagnosis of AD patients were based on the Aging and the Alzheimer’s Association (NIA-AA) criteria [31]. The details were that: (1) the core clinical presentation of AD such as evident cognitive or behavioral (neuropsychiatric) symptoms were met through history-taking and the Mini Mental State Examination (MMSE), and (2) neuroimaging tools such as the structural or functional (fMRI) scans were performed for detecting AD biomarkers and excluding other brain diseases, and (3) excluding patients with other forms of dementia or accompanying Parkinson’s disease, epilepsy, psychiatric disorders, and serious organic disease.
All subjects were righthanded and homogeneous for sex and age; indeed, the Chi-square test did not show statistically any difference between the sex of two groups (χ2 = 0.35, p = 0.552). In addition, the two sided Mann–Whitney U test did not find any statistical difference in the age of the two groups (U = −183.50, p = 0.075). The details of the participants’ characteristics are shown in Table 1. The sample size was augmented to 631 epochs (NC epochs of 237 and AD of 394) by segmenting the continuous signals into smaller epochs introduced later in Section 2.2; this data augmentation process was commonly used in previous works [23,32,33].
Resting-state EEG was recorded using Nicolet EEG machine with 500 or 125 Hz sampling rate. Scalp electrodes were placed according to the international 10–20 montage system, and Cz electrodes were used as reference. Sixteen-channel EEG signals (Fp1, Fp2, F3, F4, C3, C4, P3, P4, O1, O2, F7, F8, T3, T4, T5, and T6) were acquired with the subjects awake, relaxed, and with their eyes closed for at least five minutes. The subjects were readily prompted if the EEG technicians who continuously monitored the EEG traces detected any signs of drowsiness. Under the circumstances, the brain background activity was recorded.

2.2. EEG Preprocessing

As EEG is prone to a variety of artifacts, the step of preprocessing and denoising is necessary before analysis. Firstly, EEG recordings with sampling frequency of 500 Hz were down-sampled to 125 Hz for the convenience of subsequent processing. Following, low EEG frequency artifacts, like those generated by eye blinking and movements, were attenuated by applying to all signals a high pass filter at 0.5 Hz using the pop_eegfiltnew function of the EEGLab toolbox in Matlab, and power grid interference of 50 Hz were removed by applying notch filter using filtfilt function. Afterward, every signal was segmented into s non-overlapping EEG epochs sized l = 10 s (where s depends on the EEG length) and baseline drift was removed. An overall of 631 epochs was finally obtained. The segmenting step not only augments the sample size but also assures that each piece-wise signal (epoch) is quasi-stationary [34]. The acquired EEG signals were later visually inspected to identify every 10-s epochs free of excessive artifacts. If there were less than three channels with excessive artifacts, they were then interpolated using spherical interpolation, otherwise the segments were rejected. The person performing the preprocessing was blinded to whether the participants were NC or AD. Lastly, automated artifact removal was performed on the manually processed dataset using the wavelet-enhanced independent components analysis (wICA) algorithm [35]. This automated enhancement method was shown to be the best for AD assessment [36]. The above was performed using the EEGLAB and wICA toolboxes in Matlab R2014a software, resulting in the preprocessed EEG dataset.

2.3. Maximal Overlap Discrete Wavelet Transform (MODWT)

The maximal overlap discrete wavelet transform (MODWT) is a modified version of the discrete wavelet transform (DWT) [37,38]. Both of them are defined in terms of the pyramid algorithm that decomposes the original signal into different layers by wavelet and scaling filters; each layer represents different frequency components of the input signal. The DWT requires the length of input time series must be an integer power of 2 and it is sensitive to the starting point of the time series due to the down-sampling. The MODWT eliminates this down-sampling and creates wavelet coefficients of which the length at each layer will be the same as the original time series, thus it is more applicable in most contexts.
Let the DWT wavelet and scaling filter be h j , l and g j , l , l = 0, 1, 2, …, L − 1, where L is the filter width and j the layer of decomposition. The MODWT wavelet h ˜ j , l   and scaling filters g ˜ j , l are directly defined by renormalizing the DWT filters: h ˜ j , l = h j , l / 2 j / 2 and g ˜ j , l = g j , l / 2 j / 2 . Scaling filter g ˜ l is a quadrature mirror filter that corresponds to wavelet filter h ˜ l by the equation g ˜ l = 1 l + 1 h ˜ L 1 l ; the inverse relationship is h ˜ l = 1 l g ˜ L 1 l . Then, the MODWT wavelet, W ˜ j , and scaling, V ˜ j , coefficients of layer j are defined as the convolution of the time series X = X t ,   t = 0 ,   1 ,   2 , , N 1 and the MODWT filters [39]:
W ˜ j , t = l = 0 L j 1 h ˜ j , l X t l   mod   N ,
V ˜ j , t = l = 0 L j 1 g ˜ j , l X t l   mod   N ,
where L j = 2 j 1 L 1 + 1 . From the above expressions, the MODWT wavelet coefficients is well-defined for any sample size N.
The wavelet detail (series of inverse wavelet coefficients), D ˜ j , and smooth (the inverse of the series of scaling coefficients), A ˜ j , at jth layer are defined as:
D ˜ j , t = l = 0 N 1 h ˜ j , l W ˜ j , t + l   mod   N ,
A ˜ j , t = l = 0 N 1 g ˜ j , l V ˜ j , t + l   mod   N .
The original time series can be decomposed from the multi-resolution analysis (MRA) of the MODWT as follows:
X = D ˜ 1 + D ˜ 2 + + D ˜ j + A ˜ j .
The choice of wavelet filters is important in conducting wavelet analysis. Different wavelet families have a trade-off between the degree of symmetry (i.e., linear phase characteristics of wavelets) and the degree to which ideal high-pass filters are approximated [40]. The selected wavelet should be well adapted to the events analyzed. Here, wavelet filters of length 2 of Haar, of lengths 4, 6, and 8 of the Daubechies family (D4, D6, and D8), of length 8 from the least asymmetric (LA8), and of length 6 from the Coiflets family (C6) were used as candidates to generate the MODWT coefficients, and hence the MODWT features.

2.4. Feature Extraction and Selection

Given an EEG epoch (n × L, with n = 16 channels, L = 1250 samples), the MODWT was taken on each channel of the epoch, according to Equations (1) and (2), coming up with 16 × *j wavelet coefficient per epoch, where j is the number of layer of decomposition depending on the selected wavelet filter. For jth wavelet coefficient, the following features are estimated:
  • Variance (VA):
V X , j 2 = 1 M j t = L j 1 N 1 W j , t ,
where M j N L j + 1 .
  • Pearson correlation coefficient (PCC):
ρ X Y , j = t = L j 1 N 1 W X , j , t W ¯ X , j W Y , j , t W ¯ Y , j V X , j * × V Y , j ,
  • Interquartile range (IQR):
I Q R X , j = P 75 W X , j , t P 25 W X , j , t ,
  • Hoeffding’s D measure (D):
D = 30 N 2 N 3 D 1 + D 2 2 N 2 D 3 N N 1 N 2 N 3 N 3 ,
where, D 1 = i = 1 N Q i Q i 1 , D 2 = i = 1 N R i R i 1 R i 2 S i 1 S i 2 , D 3 = i = 1 N R i 2 S i 2 Q i ; The R i and S i are ranks of the epoch W X , j , t and W Y , j , t , the Q i is the bivariate ranks.
Besides the above four time-frequency features based on wavelet coefficients, permutation entropy as a useful feature based on the rank of original signals was also extracted.
  • Permutation entropy (PE):
P E = j = 1 m ! p j ln p j ln m ! ,
where, p j = n j j = 1 m ! n j , m is the embedding dimension, n j is the number of times the jth permutation is occurring.
Table 2 shows the maximum number of decompose layer for the six filters selected in Section 2.3, the layer of 1–4 carrying nonsense information is excluded in this study. The total numbers of the features obtained for each epoch based on different filters are also showed in Table 2 (16 (# channel) × j (# layer) × 2 (for VA and IQR) + 120 (# combination of any two channels) × j (# layer) × 2 (for PCC and D) + 16 (for PE)).
High-dimensionality feature vectors often lead to bias and overfitting due to redundant information, especially when limited data is available. The stepwise implementation of [41] was used to perform feature reduction, i.e., to select the top 20 features for each classification algorithm.

2.5. Classification and Validation

A total of eight supervised learning classifiers were employed to identify EEG epochs as belonging to AD or HC subjects. They are linear discriminant analysis (LDA), logistic regression (Logreg), k-nearest neighbor (KNN), support vector machine (SVM), random forest (RF), naive Bayes (Nbayes), adabag boosting (Adaboost), and neural network (NNet). Classifier performance was evaluated using the five-fold cross-validation (CV5), which is particularly useful for small sample sizes. In CV5, The EEG epochs were initially divided into 5 folders, and 4 folders were then selected as training set and the remaining folder as test set. This step was repeated 5 times until each folder has been used as a test set. The dividing results of the sample set by CV5 were saved and used for each model evaluation for comparison. It should be noted here that all epochs of a subject were fixed as a “blocking” that would not be separated when dividing the folders. In this way, we were sure that each subject’s epochs were not simultaneously present in the training and test groups in each iteration, thus reducing over-fitting and avoiding misleading results. Hyperparameters optimization was also performed to limit overfitting and increase model generalizability.
For evaluating the classification performances of each model, the following metrics were adopted:
  • Accuracy, or correct rate:
A c c u r a c y = T P + T N T P + F P + T N + F N ,
  • Precision, or Positive Predictive Value:
P r e c i s i o n = T P T P + F P ,
  • Recall, True Positive Rate (TPR) or sensitivity:
R e c a l l = T P T P + F N ,
  • Specificity, True Negative Rate (TNR):
S p e c i f i c i t y = T N F P + T N ,
  • F-measure:
F - m e a s u r e = 2 × R e c a l l × P r e c i s i o n R e c a l l + P r e c i s i o n ,
  • Area under the curve (AUC) for the receiver operating characteristic curve:
A U C = i n s i p o s i t i v e c l a s s r a n k i n s i M × M + 1 2 M × N ,
where, M, N are the number of positive sample and negative sample, separately. True Positives (TP): the number of correctly classified AD; False Positives (FP): the number of incorrectly classified NC; True Negatives (TN): the number of correctly classified NC; False Negatives (FN): the number of misclassified AD.
Although accuracy is an intuitive measure used in several contexts, just getting a high accuracy is not enough to ensure good classification performance, since it is possible that it hides a low number of TP or TN [42]. To counter this problem, we also calculated the F-measure and AUC. A high value of F-measure implies that both Precision and Recall are close to 1, meaning the model performed well. The AUC can reflect the total effect of Sensitivity and Specificity.
The process, including classification, parameters tuning, and model validation, were achieved using R package “mlr”.

3. Results

3.1. Selection of Wavelets Filters

As described in Section 2.3, six wavelet filter candidates, including Haar, D4, D6, D8, LA6, and C6, were used for the wavelet-based procedures to generate the MODWT coefficients, and hence the MODWT features. Figure 2 shows the AD vs. NC discriminant performances of the six filters based on LDA in terms of six evaluation metrics. As can be seen, the LA8 filter performed best, i.e., all six evaluation measures of LA8 were the highest. The wavelet features got from LA8 were used in the following analysis. It should be noted here that the LA8 also performed well when using other classifiers (See Supplementary Materials Table S1 for more details).

3.2. Discrinimant Performance of Different Classifiers

Table 3 reports the AD vs. HC discriminant performances using the eight classifiers (linear discriminant analysis (LDA), logistic regression (Logreg), k-nearest neighbor (KNN), support vector machine (SVM), random forest (RF), naive bayes (Nbayes), adabag boosting (Adaboost), and neural network (NNet)) in terms of six model evaluation metrics. The five-fold cross-validation resulted in five subsets from the original dataset, so all classification measures are herein expressed in terms of mean value ± standard deviation (i.e., mean (Accuracy) ± std (Accuracy)) over five outputs.
It is noticeable that linear discriminant analysis is superior to all other classifiers mentioned here, i.e., the value of six evaluation measures from LDA are all higher than the other classifiers. Indeed, LDA reports the accuracy of 93.18 ± 3.65 (%), the AUC of 97.92 ± 1.66 (%), the F-measure of 94.06 ± 4.04 (%), the specificity of 91.45 ± 6.98 (%), the recall of 94.55 ± 3.85 (%), and the precision of 94.02 ± 7.95 (%). Besides, the results of Logreg and NNet is close to LDA, they give the second and the third highest accuracy of 92.44 ± 3.61 (%) and 91.47 ± 4.93 (%), separately. Other classifiers performed less well than these three, the contrast can be seen clearly in Figure 3. Moreover, the standard deviation of accuracy of LDA is similar to that of Logreg, and smaller than NNet, showing a relatively shorter boxplot in Figure 3, indicating the discriminant performance of LDA is stable in the 5 subsets of CV5.

3.3. Discrinimant Performance of Different Features

Furthermore, we are motivated to examine the inclusion of the five features to assess whether they provide useful information. The individual classification results for each feature as input to LDA are reported in Table 4, and this can be seen intuitively in Figure 4. The proposed parametric features (the combination of VA and PCC) and the nonparametric features (the combination of IQR, D, and PE) gave similar performance when discriminating NC and AD patients, showing the accuracy of 89.52 ± 6.32 (%) and 88.83 ± 4.18 (%), respectively. The PCC and D also gave relatively high accuracy of 88.18 ± 5.28 (%) and 84.75 ± 3.77 (%), compared with VA (71.32 ± 7.25 (%)), IQR (70.32 ± 9.38 (%)); and PE showed the lowest accuracy of 57.09 ± 19.75 (%) between the five features. The combination of all five features achieved the best accuracy of 93.18 ± 3.65 (%). The other five model evaluation measures gave similar comparison results.

4. Discussion

EEG recordings, which are non-invasive, cheap, and acceptable by the aging, is an ideal diagnostic tool for AD. Early detection of AD based on EEG biomarkers has been extensively studied. However, reliable markers extraction for better generalization remains challenging due to the noisy, nonlinear, non-stationary, and multidimensional nature of EEG signals [43]. The study aims to develop a robust discriminant framework based on quantitative EEG biomarkers by combining mature signal processing methods with modern machine learning techniques. Results show that the proposed routine is effective for differential diagnosis of AD patients and NC with an average accuracy of 93.18%.
An appropriate wavelet filter can well adapt the feature of the events present in signals, thus maximizing the information extracted [38]. The LA8 wavelet gives a moderately higher accuracy compared with the other five wavelets used in this study (Figure 2). The LA8 with a relatively large length improves the approximation to an ideal high-pass filter and can better match the characteristic features in the signals, thus, it can give suitable MODWT feature for detection of AD.
The linear classifier is the best choice in this research framework compared with other more complex ML classifiers mentioned here (Table 3 and Figure 3). This is not surprising, although EEG signals have the characteristics of randomness and non-stationarity, which are difficult to be processed by normal linear classifiers. In this study, before automatic classification, MODWT was used to decompose the signals in a locally nonlinear way, then the discriminative wavelet features were extracted, making the discriminant much easier [44]. Figure 5 shows that the input features satisfy the assumption of Gaussian distribution and are linearly separable. The classification results show quite favorable accuracy of 93.18 ± 3.65 (%) and AUC of 97.92 ± 1.66 (%) when using the LDA with all features together.
With multidimensional time series, such as 16 channels for an epoch here, the discriminant task is more complex because as well as taking into account the multiple series associated with each subject, it is of great interest to exploit the relationships between the components of each channel [45,46,47]. In this analysis the PCC and D of wavelet coefficients are used to describe the quantitative relationship between the multi-channels, reflecting the character of brain synchronization. When using each feature as input variable, the accuracy of 88.18 ± 5.28 (%) of PCC was the highest and the accuracy of 84.75 ± 3.77 (%) of D was the second highest between the five features (Table 4 and Figure 4). It is clearly an indication that these interrelationship features provide useful information about the inter-channel relations and hence make an important contribution to distinguishing between the patterns of EEG signals of individuals with AD, and those of NC. While PE gave the lowest accuracy of 57.09 ± 19.75 (%) when used alone, this may because that one epoch only has 16 PE features from 16 channels, extracted from the original series, not the transformed wavelet coefficient, resulting in the information from PE being much less than the other four wavelet features. The VA and IQR could assess the change of brain complexity in AD patients and can provide supplementary information for discrimination. The best classification results were obtained when all five features were used. Further, the proposed parametric features (the combination of VA and PCC) and the nonparametric features (the combination of IQR, HCC, and PE) exhibited similar performance.
In conclusion, a robust discriminant framework based on functional biomarkers of qEEG used to identify Alzheimer’s disease patients from normal elderly controls was developed in this study. The time-frequency analysis method of MODWT accompanying with LA8 wavelet better captured the characteristic features in the EEG signals of AD and thus well adapted to the classification task. The combination of five extracted features with the LDA classifier achieved the best average accuracy of 93.18%, the AUC of 97.92%, the F-measure of 94.06%, the specificity of 91.45%, the recall of 94.55%, and the precision of 94.02%, separately. This is an inspiring result because the features selected in the study and classifier of linear discriminant analysis are easy to calculate and interpret, indicating quantitative EEG combining signal processing techniques and machine learning methods could be a powerful tool. Further, the EEG with 16 channels used in this study were widely available in low-income and middle-income countries, making the proposed classification framework more practical. In a word, the proposed discriminant framework can achieve an automatic diagnosis of AD with high accuracy in a systematic way, as well as aid AD screening.
One of the major limitations of the current study is the small sample size used to train and test the model, but the data augmentation and five-fold CV compensate for this. By segmenting the continuous EEG signals, we finally obtained 631 epochs for analysis, and the cross-validation ensured the stability of the classification results. Larger studies are needed to generalize the findings of the study in the future. Moreover, in future work, EEG data of early patients, such as mild cognitive impairment (MCI) patients who have not yet developed into AD, need to be collected to verify whether the proposed framework could realize earlier detection of possible AD patients and perform a tri-classification task (AD vs. MCI vs. NC). Most of the patients came to the First Affiliated Hospital of SYSU (one of the best hospitals in South China) for severe clinical symptoms, which resulted in the absence of the MCI in this study. The gradual realization of medical information sharing in the future could help us track the information of patients with Alzheimer’s disease at early stages or before they become ill, thus promoting the early diagnosis of the disease. However, the classification framework proposed in the study based on the existing study still has potential for aiding the diagnosis of AD with high performance and can be easily reproduced in future data.

Supplementary Materials

The following are available online at https://www.mdpi.com/2227-9032/8/4/476/s1, Table S1: Discriminant results by different classifiers using 5-fold CV based on six candidate wavelet filters.

Author Contributions

Conceptualization, data collection and curation, J.-X.Z. and Q.G.; software and formal analysis, Q.G., Y.-X.G. and Z.-C.L.; writing—original draft preparation, Q.G.; writing—review and editing, J.-X.Z. and Q.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China (NSFC81773545) and Natural Science Foundation of Guangdong Province (2016A030313365).

Acknowledgments

The authors would like to thank the participants of the First Affiliated Hospital of Sun Yat-sen University (Guangzhou, China) for supporting the data of this study. The authors would like to acknowledge Xi-Lin Lu, Song-Jie Liao and Jiao-Xing Li for help with the data collection and sharing. The WMTSA Wavelet Toolkit developed by Cornish [48] was used to perform MODWT and to obtain the wavelet features.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Alzheimer’s Association. 2020 Alzheimer’s disease facts and figures. Alzheimer’s Dement. 2020, 16, 391–460. [Google Scholar] [CrossRef] [PubMed]
  2. Winblad, B.; Amouyel, P.; Andrieu, S.; Ballard, C.; Brayne, C.; Brodaty, H.; Cedazo-Minguez, A.; Dubois, B.; Edvardsson, D.; Feldman, H.; et al. Defeating Alzheimer’s disease and other dementias: A priority for European science and society. Lancet Neurol. 2016, 15, 455–532. [Google Scholar] [CrossRef] [Green Version]
  3. World Health Organization. Global Action Plan on the Public Health Response to Dementia 2017–2025; World Health Organization: Geneva, Switzerland, 2017; Volume 52. [Google Scholar]
  4. Morris, J.C. Early-stage and preclinical Alzheimer disease. Alzheimer Dis. Assoc. Disord. 2005, 19, 163–165. [Google Scholar] [CrossRef]
  5. Gordon, B.A.; Blazey, T.M.; Su, Y.; Hari-Raj, A.; Dincer, A.; Flores, S.; Christensen, J.; McDate, E.; Qiao, G.W.; Jie, X.C.; et al. Spatial patterns of neuroimaging biomarker change in individuals from families with autosomal dominant Alzheimer’s disease: A longitudinal study. Lancet Neurol. 2018, 17, 241–250. [Google Scholar] [CrossRef] [Green Version]
  6. Brooker, D.; Fontaine, J.L.; Evans, S.; Bray, J.; Saad, K. Public health guidance to facilitate timely diagnosis of dementia: Alzheimer’s cooperative valuation in Europe recommendations. Int. J. Geriatr. Psychiatr. 2014, 29, 682–693. [Google Scholar] [CrossRef]
  7. Alberdi, A.; Aztiria, A.; Basarab, A. On the early diagnosis of Alzheimer’s Disease from multimodal signals: A survey. Artif. Intell. Med. 2016, 71, 1–29. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Jack, C.R., Jr.; Bennett, D.A.; Blennow, K.; Carrillo, M.C.; Dunn, B.; Haeberlein, S.B.; Holtzman, D.M.; Jagust, W.; Jessen, F.; Karlawish, J.; et al. NIA-AA Research Framework: Toward a biological definition of Alzheimer’s disease. Alzheimer’s Dement. 2018, 14, 535–562. [Google Scholar] [CrossRef]
  9. Dubois, B.; Feldman, H.H.; Jacova, C.; Hampel, H.; Molinuevo, J.L.; Blennow, K.; DeKosky, S.T.; Gauthier, S.; Selkoe, D.; Bateman, R.; et al. Advancing research diagnostic criteria for Alzheimer’s disease: The IWG-2 criteria. Lancet Neurol. 2014, 13, 614–629. [Google Scholar] [CrossRef]
  10. Jack, C.R.; Albert, M.S.; Knopman, D.S.; McKhann, G.M.; Sperling, R.A.; Carrillo, M.C.; Thies, B.; Phelps, C.H. Introduction to the recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimer’s Dement. 2011, 7, 257–262. [Google Scholar] [CrossRef] [Green Version]
  11. Hampel, H.; Frank, R.; Broich, K.; Teipel, S.J.; Katz, R.G.; Hardy, J.; Herholz, K.; Bokde, A.L.; Jessen, F.; Hoessler, Y.C.; et al. Biomarkers for Alzheimer’s disease: Academic, industry and regulatory perspectives. Nat. Rev. Drug Discov. 2010, 9, 560–574. [Google Scholar] [CrossRef]
  12. Jack, C.R., Jr.; Knopman, D.S.; Jagust, W.J.; Shaw, L.M.; Aisen, P.S.; Weiner, M.W.; Petersen, R.C.; Trojanowski, J.Q. Hypothetical model of dynamic biomarkers of the Alzheimer’s pathological cascade. Lancet Neurol. 2010, 9, 119–128. [Google Scholar] [CrossRef] [Green Version]
  13. Laske, C.; Sohrabi, H.R.; Frost, S.M.; López-de-Ipiña, K.; Garrard, P.; Buscema, M.; Dauwels, J.; Soekadar, S.R.; Mueller, S.; Linnemann, C.; et al. Innovative diagnostic tools for early detection of Alzheimer’s disease. Alzheimer’s Dement. 2015, 11, 561–578. [Google Scholar] [CrossRef] [PubMed]
  14. Scheff, S.W.; Price, D.A.; Schmitt, F.A.; DeKosky, S.T.; Mufson, E.J. Synaptic alterations in CA1 in mild Alzheimer disease and mild cognitive impairment. Neurology 2007, 68, 1501–1508. [Google Scholar] [CrossRef] [PubMed]
  15. DeKosky, S.T.; Scheff, S.W. Synapse loss in frontal cortex biopsies in Alzheimer’s disease: Correlation with cognitive severity. Ann. Neurol. 1990, 27, 457–464. [Google Scholar] [CrossRef]
  16. Smailovic, U.; Jelic, V. Neurophysiological Markers of Alzheimer’s Disease: Quantitative EEG Approach. Neurol. Ther. 2019, 8, 37–55. [Google Scholar] [CrossRef] [Green Version]
  17. Cassani, R.; Estarellas, M.; San-Martin, R.; Fraga, F.J.; Falk, T.H. Systematic review on resting-state EEG for Alzheimer’s disease diagnosis and progression assessment. Dis. Markers 2018, 17, 241–250. [Google Scholar] [CrossRef] [Green Version]
  18. Rodriguez, G.; Arnaldi, D.; Picco, A. Brain functional network in Alzheimer’s disease: Diagnostic markers for diagnosis and monitoring. Int. J. Alzheimer’s Dis. 2011, 2011, 481903. [Google Scholar] [CrossRef] [Green Version]
  19. Vecchio, F.; Babiloni, C.; Lizio, R.; Fallani, F.D.V.; Blinowska, K.; Verrienti, G.; Frisoni, G.; Rossini, P.M. Resting state cortical EEG rhythms in Alzheimer’s disease: Toward EEG markers for clinical applications: A review. Suppl. Clin. Neurophysiol. 2013, 62, 223–236. [Google Scholar] [CrossRef]
  20. Latchoumane, C.F.V.; Vialatte, F.; Cichocki, A.; Jeong, J. Multiway analysis of Alzheimer’s disease: Classification based on space-frequency characteristics of EEG time series. Proc. World Congr. Eng. 2008, 2, 2–4. [Google Scholar]
  21. Bi, X.; Wang, H. Early Alzheimer’s disease diagnosis based on EEG spectral images using deep learning. Neural. Netw. 2019, 114, 119–135. [Google Scholar] [CrossRef]
  22. Dauwels, J.; Vialatte, F.; Cichocki, A. Diagnosis of Alzheimer’s disease from EEG signals: Where are we standing. Curr. Alzheimer Res. 2010, 7, 487–505. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Durongbhan, P.; Zhao, Y.; Chen, L.; Zis, P.; De Marco, M.; Unwin, Z.C.; Venneri, A.; He, X.; Li, S.; Zhao, Y.; et al. A dementia classification framework using frequency and time-frequency features based on EEG signals. IEEE Trans. Neural Syst. Rehabil. Eng. 2019, 27, 826–835. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Ahmadlou, M.; Adeli, H.; Adeli, A. Fractality and a wavelet-chaos-methodology for EEG-based diagnosis of Alzheimer disease. Alzheimer Dis. Assoc. Disord. 2011, 25, 85–92. [Google Scholar] [CrossRef] [PubMed]
  25. Sharma, N.; Kolekar, M.H.; Jha, K.; Kumar, Y. EEG and cognitive biomarkers based mild cognitive impairment diagnosis. IRBM 2019, 40, 113–121. [Google Scholar] [CrossRef]
  26. Hu, L.; Zhang, Z. EEG Signal Processing and Feature Extraction; Springer Nature Singapore Pte Ltd.: Singapore, 2019. [Google Scholar]
  27. Morabito, F.C.; Campolo, M.; Ieracitano, C.; Ebadi, J.M.; Bonanno, L.; Bramanti, A.; Desalvo, S.; Mammone, N.; Bramanti, P. Deep convolutional neural networks for classification of mild cognitive impaired and Alzheimer’s disease patients from scalp EEG recordings. In Proceedings of the 2016 IEEE 2nd International Forum on Research and Technologies for Society and Industry Leveraging a better tomorrow (RTSI), Bologna, Italy, 7–9 September 2016; pp. 1–6. [Google Scholar]
  28. Simpraga, S.; Alvarez-Jimenez, R.; Mansvelder, H.D.; Van Gerven, J.M.; Groeneveld, G.J.; Poil, S.S.; Linkenkaer-Hansen, K. EEG machine learning for accurate detection of cholinergic intervention and Alzheimer’s disease. Sci. Rep. 2017, 7, 5775. [Google Scholar] [CrossRef] [PubMed]
  29. Fiscon, G.; Weitschek, E.; Cialini, A.; Felici, G.; Bertolazzi, P.; De Salvo, S.; Bramanti, A.; Bramanti, P.; De Cola, M.C. Combining EEG signal processing with supervised methods for Alzheimer’s patients classification. BMC Med. Inform. Decis Mak. 2018, 18, 35. [Google Scholar] [CrossRef]
  30. Fiscon, G.; Weitschek, E.; Felici, G.; Bertolazzi, P.; De Salvo, S.; Bramanti, P.; De Cola, M.C. Alzheimer’s disease patients classification through EEG signals processing. In Proceedings of the 2014 IEEE Symposium on Computational Intelligence and Data Mining (CIDM), Orlando, FL, USA, 9–12 December 2014; pp. 105–112. [Google Scholar]
  31. McKhann, G.M.; Knopman, D.S.; Chertkow, H.; Hyman, B.T.; Jack, C.R., Jr.; Kawas, C.H.; Klunk, W.E.; Koroshetz, W.J.; Manly, J.; Mayeux, R.; et al. The diagnosis of dementia due to Alzheimer’s disease: Recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimer’s Dement. 2011, 7, 263–269. [Google Scholar] [CrossRef] [Green Version]
  32. Ieracitano, C.; Mammone, N.; Bramanti, A.; Hussain, A.; Morabito, F.C. A convolutional neural network approach for classification of dementia stages based on 2D-spectral representation of EEG recordings. Neurocomputing 2019, 323, 96–107. [Google Scholar] [CrossRef]
  33. Mammone, N.; Bonanno, L.; Salvo, S.; Marino, S.; Bramanti, P.; Bramanti, A.; Morabito, F. Permutation Disalignment Index as an Indirect, EEG-Based, Measure of Brain Connectivity in MCI and AD Patients. Int. J. Neural Syst. 2017, 27, 1750020. [Google Scholar] [CrossRef]
  34. Bodenstein, G.; Praetorius, H.M. Feature extraction from the electroencephalogram by adaptive segmentation. Proc. IEEE 1977, 65, 642–652. [Google Scholar] [CrossRef]
  35. Sorokin, J. wICA MATLAB Toolkits. 2020. Available online: https://www.mathworks.com/matlabcentral/fileexchange/55413-wica-data-varargin (accessed on 10 November 2020).
  36. Cassani, R.; Falk, T.H.; Fraga, F.J.; Kanda, P.A.; Anghinah, R. The effects of automated artifact removal algorithms on electroencephalography-based Alzheimer’s disease diagnosis. Front. Aging Neurosci. 2014, 6, 55. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Nason, G.P.; Sachs, R.V. Wavelets in time series analysis. Philos Trans. R. Soc. Lond. A 1999, 357, 2511–2526. [Google Scholar] [CrossRef]
  38. Percival, D.B.; Walden, A.T. Wavelet Methods for Time Series Analysis; Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
  39. Zhu, L.; Wang, Y.; Fan, Q. MODWT-ARMA model for time series prediction. Appl. Math. Model. 2014, 38, 1859–1865. [Google Scholar] [CrossRef]
  40. Alarcon-Aquino, V. Anomaly Detection and Prediction in Communication Networks Using Wavelet Transforms. Ph.D. Thesis, Imperial College London, University of London, London, UK, 2003. [Google Scholar]
  41. Strauss, R.E. Statistical Matlab Library; 2004. Available online: http://www.faculty.biol.ttu.edu/Strauss/Matlab/matlab.htm (accessed on 10 November 2020).
  42. Sokolova, M.; Japkowicz, N.; Szpakowicz, S. Beyond Accuracy, F-Score and ROC: A Family of Discriminant Measures for Performance Evaluation. In Australasian Joint Conference on Artificial Intelligence; Springer: Berlin/Heidelberg, Germany, 2006; pp. 1015–1021. [Google Scholar]
  43. Schreiber, T.; Kantz, H. Nonlinear Time Series Analysis; Cambridge University Press: Cambridge, UK, 1997. [Google Scholar]
  44. Serroukh, A.; Walden, A.T. Wavelet scale analysisof bivariate time series ii: Statistical properties for linear processes. J. Nonparametr. Stat. 2000, 13, 37–56. [Google Scholar] [CrossRef]
  45. Houmani, N.; Dreyfus, G.; Vialatte, F.B. Epoch-based Entropy for Early Screening of Alzheimer’s Disease. Int. J. Neural Syst. 2015, 25, 1550032. [Google Scholar] [CrossRef] [PubMed]
  46. Kakizawa, Y.; Shumway, R.H.; Taniguchi, M. Discrimination and clustering for multivariate time series. J. Am. Stat. Assoc. 1998, 93, 328–340. [Google Scholar] [CrossRef]
  47. Discriminant Analysis of Multivariate Time Series Using Wavelets. Available online: https://e-archivo.uc3m.es/bitstream/handle/10016/13359/ws120603.pdf?sequence=1&isAllowed=y (accessed on 10 November 2020).
  48. Cornish, C. WMTSA Wavelet Toolkit for MATLAB. 2004. Available online: http://staff.washington.edu/dbp/wmtsa.html (accessed on 10 November 2020).
Figure 1. The flow diagram of the discriminant framework.
Figure 1. The flow diagram of the discriminant framework.
Healthcare 08 00476 g001
Figure 2. AD vs. NC discriminant results of the six wavelet filters based on LDA (mean value).
Figure 2. AD vs. NC discriminant results of the six wavelet filters based on LDA (mean value).
Healthcare 08 00476 g002
Figure 3. Boxplot of discriminant results by different classifiers using five-fold CV based on LA8 wavelet filter in terms of: (a) Accuracy; (b) AUC; (c) F-measure; (d) Specificity; (e) Recall; (f) Precision. Note: Each boxplot is constructed of the box and the whiskers. The box is drawn from Q1 (25th percentile) to Q3 (75th percentile) with a horizontal line drawn in the middle to denote the median. The upper whisker extends to the largest value no further than 1.5 × IQR from the Q3 (where IQR is the inter-quartile range, IQR = Q3Q1). The lower whisker extends to the smallest value at most −1.5 × IQR from the Q1. The dots plotted individually are outliners whose values are beyond the end of whiskers (±1.5 × IQR).
Figure 3. Boxplot of discriminant results by different classifiers using five-fold CV based on LA8 wavelet filter in terms of: (a) Accuracy; (b) AUC; (c) F-measure; (d) Specificity; (e) Recall; (f) Precision. Note: Each boxplot is constructed of the box and the whiskers. The box is drawn from Q1 (25th percentile) to Q3 (75th percentile) with a horizontal line drawn in the middle to denote the median. The upper whisker extends to the largest value no further than 1.5 × IQR from the Q3 (where IQR is the inter-quartile range, IQR = Q3Q1). The lower whisker extends to the smallest value at most −1.5 × IQR from the Q1. The dots plotted individually are outliners whose values are beyond the end of whiskers (±1.5 × IQR).
Healthcare 08 00476 g003aHealthcare 08 00476 g003b
Figure 4. AD vs. NC discriminant results of the different features based on LDA (mean value).
Figure 4. AD vs. NC discriminant results of the different features based on LDA (mean value).
Healthcare 08 00476 g004
Figure 5. Distribution of linear combination of the features of AD or NC after LDA.
Figure 5. Distribution of linear combination of the features of AD or NC after LDA.
Healthcare 08 00476 g005
Table 1. The demographic characteristics of participants.
Table 1. The demographic characteristics of participants.
SubjectsNSex/N (%)Age/Years
FemaleMaleMedian (QL, QU)Range
HC2314 (60.9)9 (39.1)59 (50, 74)(44, 83)
AD2312 (52.2)11 (47.8)73 (56, 80)(49, 85)
Table 2. The maximum decompose layer for different wavelet filters.
Table 2. The maximum decompose layer for different wavelet filters.
Wavelet FiltersHaarD4D6D8LA8C6
Maximum decompose layer1087777
The number of layers used643333
The number of features extracted11481104832832832832
Table 3. Discriminant results by different classifiers using five-fold CV based on LA8 wavelet filter (mean ± standard deviation, %).
Table 3. Discriminant results by different classifiers using five-fold CV based on LA8 wavelet filter (mean ± standard deviation, %).
ClassifierAccuracyAUCF-MeasureSpecificityRecallPrecision
LDA93.18 ± 3.6597.92 ± 1.6694.06 ± 4.0491.45 ± 6.9894.55 ± 3.8594.02 ± 7.95
Logreg92.44 ± 3.6197.18 ± 1.7793.36 ± 4.3188.04 ± 9.7194.26 ± 5.0793.05 ± 8.32
KNN76.76 ± 5.8586.69 ± 4.6178.79 ± 9.2175.81 ± 2.7676.86 ± 12.0681.85 ± 10.45
SVM83.56 ± 4.8191.28 ± 3.9984.92 ± 7.7481.18 ± 7.7885.53 ± 11.0485.85 ± 11.43
RF79.28 ± 7.1888.05 ± 7.1282.01 ± 9.4966.30 ± 8.4387.04 ± 11.0778.78 ± 12.77
Nbayes78.97 ± 6.2289.12 ± 3.8177.28 ± 13.1185.68 ± 9.1469.26 ± 16.9988.88 ± 6.89
Adaboost79.73 ± 8.0387.40 ± 6.8581.69 ± 10.2175.41 ± 11.0082.89 ± 11.4082.00 ± 13.85
NNet91.47 ± 4.9396.60 ± 1.8492.68 ± 5.1885.02 ± 12.1394.34 ± 5.4991.76 ± 9.50
Table 4. Discriminant results by different features using five-fold CV and LDA based on LA8 wavelet filter (mean ± standard deviation, %).
Table 4. Discriminant results by different features using five-fold CV and LDA based on LA8 wavelet filter (mean ± standard deviation, %).
FeatureAccuracyAUCF-MeasureSpecificityRecallPrecision
VA71.32 ± 7.2584.13 ± 9.5575.67 ± 8.5770.04 ± 18.8578.26 ± 12.5777.32 ± 19.06
PCC88.18 ± 5.2895.65 ± 2.8289.87 ± 3.9986.67 ± 5.4590.66 ± 7.7989.89 ± 7.35
IQR70.32 ± 9.3877.20 ± 14.6973.61 ± 13.3759.51 ± 19.7176.99 ± 17.6573.26 ± 15.15
D84.75 ± 3.7792.58 ± 5.5387.03 ± 2.8185.18 ± 6.4887.05 ± 7.3888.02 ± 8.46
PE57.09 ± 19.7556.82 ± 21.3659.99 ± 21.8152.65 ± 23.6861.36 ± 34.1565.82 ± 18.38
VA + PCC89.52 ± 6.3296.43 ± 2.3091.03 ± 5.7487.80 ± 6.6691.65 ± 6.4490.98 ± 8.77
IQR + HCC + PE88.83 ± 4.1896.16 ± 2.2989.97 ± 5.3084.70 ± 4.3192.45 ± 6.1388.25 ± 9.06
ALL93.18 ± 3.6597.92 ± 1.6694.06 ± 4.0491.45 ± 6.9894.55 ± 3.8594.02 ± 7.95
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Ge, Q.; Lin, Z.-C.; Gao, Y.-X.; Zhang, J.-X. A Robust Discriminant Framework Based on Functional Biomarkers of EEG and Its Potential for Diagnosis of Alzheimer’s Disease. Healthcare 2020, 8, 476. https://doi.org/10.3390/healthcare8040476

AMA Style

Ge Q, Lin Z-C, Gao Y-X, Zhang J-X. A Robust Discriminant Framework Based on Functional Biomarkers of EEG and Its Potential for Diagnosis of Alzheimer’s Disease. Healthcare. 2020; 8(4):476. https://doi.org/10.3390/healthcare8040476

Chicago/Turabian Style

Ge, Qi, Zhuo-Chen Lin, Yong-Xiang Gao, and Jin-Xin Zhang. 2020. "A Robust Discriminant Framework Based on Functional Biomarkers of EEG and Its Potential for Diagnosis of Alzheimer’s Disease" Healthcare 8, no. 4: 476. https://doi.org/10.3390/healthcare8040476

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop