Investigation of the MobileNetV2 Optimal Feature Extraction Layer for EEG-Based Dementia Severity Classification: A Comparative Study

Al-Qazzaz, Noor Kamal; Ali, Sawal Hamid Bin Mohd; Ahmad, Siti Anom

doi:10.3390/a18100620

Open AccessArticle

Investigation of the MobileNetV2 Optimal Feature Extraction Layer for EEG-Based Dementia Severity Classification: A Comparative Study

by

Noor Kamal Al-Qazzaz

^1,*

,

Sawal Hamid Bin Mohd Ali

^2,3

and

Siti Anom Ahmad

^4,5

¹

Department of Biomedical Engineering, Al-Khwarizmi College of Engineering, University of Baghdad, Baghdad 47146, Iraq

²

Department of Electrical, Electronic and Systems Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Selangor, Malaysia

³

Centre of Advanced Electronic and Communication Engineering, Department of Electrical, Electronic and Systems Engineering, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Selangor, Malaysia

⁴

Department of Electrical and Electronic Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), Serdang 43400, Selangor, Malaysia

⁵

Malaysian Research Institute of Ageing (MyAgeing) TM, Universiti Putra Malaysia (UPM), Serdang 43400, Selangor, Malaysia

^*

Author to whom correspondence should be addressed.

Algorithms 2025, 18(10), 620; https://doi.org/10.3390/a18100620

Submission received: 27 July 2025 / Revised: 2 September 2025 / Accepted: 6 September 2025 / Published: 1 October 2025

(This article belongs to the Special Issue Machine Learning in Medical Signal and Image Processing (3rd Edition))

Download

Browse Figures

Review Reports Versions Notes

Abstract

Diagnosing dementia and recognizing substantial cognitive decline are challenging tasks. Thus, the objective of this study was to classify electroencephalograms (EEGs) recorded during a working memory task in 15 patients with mild cognitive impairment (MCogImp), 5 patients with vascular dementia (VasD), and 15 healthy controls (NC). Before creating spectrogram pictures from the EEG dataset, the data were subjected to preprocessing, which included preprocessing using conventional filters and the discrete wavelet transformation. The convolutional neural network (CNN) MobileNetV2 was employed in our investigation to identify features and assess the severity of dementia. The features were extracted from five layers of the MobileNetV2 CNN architecture—convolutional layers (‘Conv-1’), batch normalization (‘Conv-1-bn’), clipped ReLU (‘out-relu’), 2D Global Average Pooling (‘global-average-pooling2d1’), and fully connected (‘Logits’) layers. This was carried out to find the efficient features layer for dementia severity from EEGs. Feature extraction from MobileNetV2’s five layers was carried out using a decision tree (DT) and k-nearest neighbor (KNN) machine learning (ML) classifier, in conjunction with a MobileNetV2 deep learning (DL) network. The study’s findings show that the DT classifier performed best using features derived from MobileNetV2 with the 2D Global Average Pooling (global-average-pooling2d-1) layer, achieving an accuracy score of 95.9%. Second place went to the characteristics of the fully connected (Logits) layer, which achieved a score of 95.3%. The findings of this study endorse the utilization of deep processing algorithms, offering a viable approach for improving early dementia identification with high precision, hence facilitating the differentiation among NC individuals, VasD patients, and MCogImp patients.

Keywords:

vascular dementia; deep learning models; machine learning techniques; CNN; MobileNetV2; classification

1. Introduction

In terms of prevalence, vascular dementia (VasD) is second only to Alzheimer’s disease (AD) [1]. Roughly 20% of dementia cases globally are vascular dementia [2,3]. The cognitive impairment in memory known as vascular dementia (VasD) is linked to cerebrovascular disease (CVD) [4]. Vascular disease is most often linked to stroke, which is the second most prevalent vascular illness. When compared to motoric dysfunction, the impact of stroke on cognitive performance is frequently underappreciated [5]. The patient’s health difficulties and their reliance on others around them may be worsened by cognitive impairment, according to this theory. A total of 20 to 25% of stroke patients will experience delayed dementia, while 15 to 30% will acquire dementia within three months following the stroke, according to prior research [6]. As people become older, a decline in cognitive function brought on by dementia can have far-reaching consequences, such as a decline in everyday social activities and higher expenses for families, communities, and the government. Consequently, it is crucial to recognise dementia at an early stage to halt its rapid progression [6,7].

Vascular dementia is difficult to diagnose because it requires a clinical diagnosis of cognitive impairment that is severe enough to be dementia, as well as imaging evidence from the brain showing that the dementia is caused by cerebrovascular events, such as a stroke [8]. For the diagnosis of dementia following a stroke, the electroencephalogram (EEG) is one of the biomarkers that could serve as an alternative when researching brain function with dementia [9].

Early detection, severity evaluation, and dementia type differentiation have been the recent foci of EEG research on dementia [10]. The EEG has been the tool of choice for the majority of investigations that use Alzheimer’s (AD) detection or characterization [11,12]. On the other hand, research on dementia following a stroke is limited. It is important to investigate the use of EEG for the diagnosis of post-stroke dementia because the incidence of stroke is rising annually and is followed by the risk of dementia. Normative participants and stroke patients with cognitive impairment had their EEG signals characterized in a recent study. Compared to healthy subjects, those with cognitive impairment following a stroke had distinct power spectrum features [13]. Additional research on the use of EEG for the diagnosis of post-stroke dementia has been investigated by evaluating the signal complexity by applying entropy measures of EEG dynamics as one of the complexity degrees [14]. It is believed that a cognitive function involving a change in signal pattern in response to a stimulus is closely related to the layer of EEG spectral dynamics. Cognitively impaired participants had fewer high relative power layers and an increased number of low relative power layers compared to healthy controls [13].

Researchers have looked into employing several machine learning (ML) algorithms to classify EEGs in people with cognitive problems [13]. However, standard ML methods have a hard time dealing with high-dimensional data since they depend on features that are clearly specified. Deep learning (DL), on the other hand, is a revolutionary new development in ML that fixes these problems. DL can learn and find latent discriminative features directly from raw data using end-to-end learning, which gives it a big edge over traditional approaches [15,16,17].

Because people with dementia typically demonstrate EEG waves that show less non-linear behavioral dynamics and linear connection in the cortex of the brain, this study could provide useful information about the disease. Both the complexity and the functional linkages can be reduced by incorporating these aspects.

Additionally, age and psychopathic illness are known to significant in relation to changing EEG frequency profiles, which are important for explaining brain activity in neurological and cognitive studies. The process of aging is associated with a change in EEG power, including a reduction in alpha (8–13 Hz) and an increase in Delta (1–4 Hz) and Thea (4–8 Hz) activity, reflecting converted cortical excitability and connection [18]. Similarly, psychiatry diseases, such as depression or anxiety, which are normal comorbidities in dementia, can change the EEG profile by increasing the slow wave activity or by reducing beta (13–30 Hz) coherence [19]. Moreover, the coherence of the cerebral EEG and muscular electromyogram (EMG) signal, as well as the extent to which the activity of EEG-EMG can be synchronized, is a significant method of studying sensorimotor integration and its variation during neurological disorders such as dementia. The studies cited by the reviewer form a good foundation to this study. James et al. have demonstrated that EEG-EMG coherence and cumulant changes over time, as well as corticospinal oscillations in the cortex, reduce with aging [20]. Mima and Hallett have provided a detailed review of corticomuscular coherence and its importance in motor control; this could explain the interpretation of EEG patterns [21]. Moreover, the physiological principles and clinical implications of our EEG-EMG frequency analysis have been elucidated by Grosse et al., thus justifying the relevance of using a high frequency range in identifying motor-related aberrations [22]. Liu et al. have provided another evaluation of corticomuscular coherence applications, but focus on their possible use in monitoring neurodegenerative diseases, which is consistent with our aims of dementia classification [23]. Tuncel et al. have explored the concept of time–frequency coherence in the state of tiredness and offer a perspective into the interactions between dynamic signals that can enhance our preprocessing resiliency [24].

Dementia patients’ non-linear EEG dynamics must be investigated if this problem is to be better understood. In the early stages of dementia, the EEG may appear normal and have the same rhythm as healthy individuals of the same age. To assess the influence of dementia on the brain and to understand the progression of the disease, it is necessary to analyse and interpret EEG recordings obtained from individuals with dementia using signal analysis techniques. In this study, MobileNetV2 was employed to extract features from its CNN architecture. Specifically, we looked for the best features to use for EEG-based dementia severity assessment in the following five layers: convolutional (‘Conv-1’), batch normalisation (‘Conv-1-bn’), clipped ReLU (‘out-relu’), 2D Global Average Pooling (‘global-average-pooling2d1’), and fully connected (‘Logits’) layers. Feature extraction from MobileNetV2’s five layers was carried out using a DT and KNN ML classifier, in conjunction with a MobileNetV2 DL network. To better distinguish between NC people, VasD patients, and MCogImp patients, these findings provide credence to the utilisation of deep processing algorithms.

2. Materials and Methods

To help doctors figure out what is wrong with people who have had a stroke, the recorded EEG goes through several steps that separate the signals of dementia patients. Figure 1 demonstrates the process flow diagram of the suggested strategy.

2.1. Participants and EEG Signal Acquisition

There were 35 people in this study, divided into three groups: 15 healthy normal controls (NC; 8F/7M; mean age 60.06 ± 5.21 years), 15 patients with post-stroke mild cognitive impairment (MCogImp; 10F/5M; mean age 60.26 ± 7.77 years), and 5 patients with vascular dementia (VD; 2F/3M; mean age 64.6 ± 4.8 years). Patients with MCogImp and patients with VasD were recruited from Pusat Perubatan Universiti Kebangsaan Malaysia (

P P U K M

). People in the NC group had never had a mental illness before. The

P P U K M

Human Ethics Committee granted the study protocols with their stamp of approval, and all participants signed a paper saying they understood what they were signing up for. The Nicolet One (V32) system was used to capture EEG activity while participants performed an auditory working memory task [25].

2.2. Preprocessing Steps

Conventional Filtering and Wavelet Denoising
In the first step, each channel of EEG-based working memory datasets was tackled with conventional filters, including a notch-filter at 50 Hz and a 0.1–70 Hz band-pass-filter to remove high-frequency noise, such as muscle-related EMG signals typically exceeding 20–50 Hz [26,27]. Additionally, the discrete wavelet transform (DWT) was an additional preprocessing procedure that was used to enhance data quality and to reduce EMG contamination in EEG signals [27].
Spectrogram Images
The signals from each of the denoised EEG electrodes were split into 6 segments of 10 s each. Each epoch has $N = 2560$ samples because $f s = 256$ Hz. Therefore, each problem that needed to be examined was saved on a computer and dealt with separately. The power spectral densities ( $P S D s$ ) were figured out from these segments. A periodogram, which determined the frequency distribution of the EEG data, is a good way to find $P S D s$ [28]. The Fourier transform of the autocorrelation function is a nonparametric evaluation of $P S D s$ . For example, Equation (1) shows the periodogram of a signal $x_{l}$ with a length of L [28], as follows:

$P S D (f) = \frac{T_{s}}{L} {|\sum_{l = 0}^{L - 1} x_{l} e^{- 2 π i f l}|}^{2}$

(1)

where $- \frac{1}{2 T_{s}} < f \leq \frac{1}{2 T_{s}}$ and $\frac{1}{2 T_{s}}$ is the sampling period. The following modified periodogram is obtained by multiplying the input time series by the window function $w_{l}$ , as in Equation (2):

$\hat{P S D} (f) = \frac{T_{s}}{L} {|\sum_{l = 0}^{L - 1} w_{l} x_{l} e^{- 2 π i f l}|}^{2}$

(2)

The spectrogram function was used to figure out the $P S D s$ of the $x^{t h}$ EEG time series for this study. This is because the modified periodogram reduces the spectral leakage of the standard periodogram and softens the edges of the signal. It has a high resolution and can be used to analyse biomedical signs [29].
The PSD EEG images were turned into three-dimensional images and were resized into $224 \times 224$ pixels to fit the CNN MobileNetV2 input design that is used to automatically extract and sort features.

2.3. Deep Feature Extraction with MobileNetV2 Layers

This study employs MobileNetV2 as a feature extractor network to compare the performance of different MobileNetV2 layers for feature identification in order to determine the most appropriate layer for determining the severity of dementia from an EEG dataset. It should be noted that the optimal feature layer selection can vary depending on the specific characteristics of the EEG dataset and the complexity of the classification task.

This classification necessitates differentiating among three categories: NC individuals, MCogImp, and VasD. The input layer MobileNetV2 decreases the image size from 224 by 224 to 14 by 14, while simultaneously increasing the filter response depth from 3 to 512 [30,31]. The last fully connected layer classes have to be reduced from the standard 1000 to 3 classes in order to meet this study classification structure. Hence, MobileNetV2 transfer learning was utilised in this work [32]. To meet the number of classes needed to discriminate dementia severity, three new layers were added:

f u l l y c o n n e c t e d l a y e r - N e w

,

s o f t m a x l a y e r - N e w

and

c l a s s i f i c a t i o n l a y e r - N e w

. The three previous layers, ‘Logits’, ‘Logits-softmax,’ and ‘ClassificationLayer-Logits’, were removed and replaced with these three. Layers for classification output, those that were completely connected, and softmax layers augment MobileNetV2’s deep CNN architecture’s several convolutional layers.

Afterwards, five feature sets were obtained from the EEG dataset using MobileNetV2 architecture; these are convolutional layers (‘Conv-1’), batch normalisation (‘Conv-1-bn’), clipped ReLU (‘out-relu’), 2D Global Average Pooling (‘global-average-pooling2d-1’), fully connected (‘Logits’), and Softmax (‘Logits-softmax’) layers that extract features from the MobileNetV2 CNN to be classified by the MobileNetV2 classification output layers (‘ClassificationLayer-Logits’) DL and ML classifier.

2.4. Classification

In total, 30% of the data are used for testing, while the remaining 70% are used for training and checking. The 70:30 ratio is a good balance between training and testing data. It allows the model to learn well while still being tested on enough data. This ratio is commonly recommended in the literature [33,34] since it maintains generalization.

The DL

s t o c h a s t i c g r a d i e n t d e s c e n t w i t h m o m e n t u m

(

s g d m

) optimizer was used. Additionally, the EEG classification model that relies on MobileNetV2 had its hyperparameters chosen using 70% of the training dataset, which included the NC, MCogImp, and VasD groups. Optimizations were made to the learning rate, batch size, and epoch count, which are critical hyperparameters. A validation frequency of 3, a mini-batch size of 64, and a learning rate of

0.001

were the final hyperparameters chosen.

The evaluation’s scope is enlarged to include the accuracy and confusion matrix in order to evaluate dementia severity prediction performance [35]. These performance measures are chosen and interpreted based on generally accepted machine learning and deep learning principles [36].

As in medical diagnostics, the confusion matrix is an important tool for analyzing classification results since it provides a granular breakdown of model performance that goes beyond total accuracy. In the context of this EEG-based dementia classification study—differentiating NC, MCogImp, and VasD using MobileNetV2 features—its significance resides in analyzing class-specific performance and is compatible with best practices advised in machine learning and deep learning studies [37,38].

First of all, the features were classified using the DL MobileNetV2 classification output layer. Afterwards, the KNN and DT ML classifiers were used.

2.4.1. Deep MobileNetV2 Classifier

Features from convolutional layers (‘Conv-1’), batch normalisation (‘Conv-1-bn’), clipped ReLU (‘out-relu’), 2D Global Average Pooling (‘global-average-pooling2d-1’), fully connected (‘Logits’), and Softmax (‘Logits-softmax’) layers were classified by MobileNetV2’s classification output layer (‘ClassificationLayer-Logits’) DL network. Figure 2 illustrates the MobileNetV2 architecture.

2.4.2. Machine Learning Classifiers

DT and KNN ML classifiers were implemented to identify the features extracted from MobileNetV2’s five layers. In KNN, the training data are obtained from a data model, making it a lazy learning method. Finding similar patterns in the training data is the main idea for classifying the test data. Euclidean distances were used to calculate similarity among data objects. Before classifying a new test sample, its distance from all samples in the training data must be calculated. The next step is to select the class of the new test sample based on the frequency with which k appears in adjacent (neighboring) training samples [39]. The key feature of KNN is the parameter k. In this study, k = 5 was used and the nearest neighbors were also considered in the calculation of their weights so that the nearest ones would have a greater influence on the final stage than those further away.

IF-THEN-ELSE procedures are implemented by decision trees (DTs), which are classifiers that rely on initial criteria or decision rules. To reach a final decision, these conditions are connected in a tree-based framework [40]. The classifier’s output classes are the “leaves” of the DT. A simple feature check is performed at each node of the tree. One possible outcome of the situation is the presence of children in the node. A DT is constructed during training to find the non-joint partitions of the output classes at each node. Typically, the Gini index and the information received act as a classifier for each node. The feature/value combination that maximizes the Gini index or minimizes the information gained is selected for each node. Furthermore, as a postprocessing step, we construct the trees.

3. Results and Discussion

The results of this research offered a comprehensive comparative analysis of the MobileNetV2 model, delivering insights into the effectiveness of the model through the utilization of evaluation metrics including precision, sensitivity, specificity, F-score, and confusion matrix.

3.1. Results of the Preprocessing

During preprocessing, all of the EEG datasets were denoised using conventional filters and WT techniques. Moreover, spectrogram images were created from the

P S D s

of the EEG dataset to meet the requirement for the MobileNetV2 inputs with a size of

224 \times 224

pixels.

Once the model’s output layers have been taken away to make the end output layer fit the needs of the task at hand, the model’s weights are added to them.

3.2. Results of Deep MobileNetV2 Feature Extraction

The MobileNetV2 model was already trained using data from EEG spectrogram images. Subsequently, five feature groups were extracted from the EEG dataset using the MobileNetV2 CNN algorithm: ‘Conv-1’ from convolutional layers, ‘Conv-1-bn’ from batch normalization, ‘out-relu’ from clipped ReLU, ‘global-average-collection2d-1’ from 2D Global Average Pooling, ‘Logits’ from full correlation, and ‘Logits-softmax’ from Softmax. The classification output layers of the MobileNetV2 CNN consisting of DL and machine learning classifiers were trained with these feature sets.

3.3. Results of Classification

MobileNetV2, as a DL classifier, was used to classify the features extracted from the last five layers of MobileNetv2. Table 1 shows the evaluation metric results from the MobileNetV2 classifier. Moreover, Figure 3 shows the confusion matrix utilising the characteristics and classifier of MobileNetV2 for cases of VasD, MCogImp, and NC.

KNN, as an ML classifier, was used to classify the features extracted from the last five layers of the MobileNetv2. Table 2 shows the evaluation metric results from the KNN machine learning classifiers. Moreover, Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8 show the confusion matrix for VasD, MCogImp, and NC using MobileNetV2 features from the five layers and the KNN classifier.

DT, as an ML classifier, was used to classify the features extracted from the last five layers of MobileNetv2. Table 3 shows the evaluation metric results from the DT machine learning classifiers. Moreover, Figure 9, Figure 10, Figure 11, Figure 12 and Figure 13 show the confusion matrix utilising the characteristics and classifier of DT for cases of VasD, MCogImp, and NC.

This study investigates the last five convolutional MobileNetV2 network architectures, including convolutional layers (‘Conv-1’), batch normalisation (‘Conv-1-bn’), clipped ReLU (‘out-relu’), 2D Global Average Pooling (‘global-average-pooling2d-1’), fully connected (‘Logits’), and Softmax (‘Logits-softmax’) layers, considering the often time- and space-dependent nature of EEG data. As a result, each layer assumes a distinct layer of abstraction in the input EEG spectrogram data. The Conv-1 layer represents the network’s original convolutional layer, whereas the Conv-bn layer is the depth-separating convolutional layer with batch normalisation. Given the frequently time- and space-dependent nature of EEG data, it is better to employ the convolutional layers (Conv-1-bn) and batch normalisation (Conv-1-bn) as feature layers. These layers capture similar low-layer properties, such as edges, textures, and simple patterns, which may be useful for EEG analysis. Moreover, MobileNetV2’s 2D Global Average Pooling (global-average-pooling2d-1) and fully linked (Logits) layers make excellent feature layers for EEG classification. These layers frequently preserve a lot of spatiotemporal information, which might be useful for capturing meaningful patterns in EEG data. Using these layers as feature layers, you can extract significant features from EEG data and export them to the classification output layers of the MobileNetV2, KNN, and DT classifiers. Using transfer learning with DT and KNN ML classifiers produced an improved classification accuracy of 95.9% and 94.7%, respectively, particularly from the 2D Global Average Pooling (‘global-average-pooling2d-1’) layer, which is considered the optimal feature extraction layer because it captures fine features like edges, textures, and simple patterns that may be suitable for EEG investigation; the MobileNetV2 DL network, as a classifier, obtained the lowest classification accuracy of 74.9%. The research results showed that the DT ML classifier, when used in conjunction with features extracted from different MobileNetV2 CNN layers, produced output from the 2D Global Average Pooling (global-average-pooling2d-1) layer with an accuracy of 95.9%, followed by 95.3% from the fully connected (Logits) layer features. These results provide support for the use of deep processing systems to enhance the differentiation between NC subjects, VasD, and MCogImp patients.

4. Conclusions

This research aimed to classify the EEGs of MCogImp and VasD patients with NC individuals while they performed a working memory task. The EEG data were subjected to preprocessing analysis including noise removal using conventional filters and the DWT, while spectrogram images were made from the EEG dataset. Our study used MobileNetV2 (a CNN) to find relevant features and rate the severity of vascular dementia. Therefore, to find the optimum feature layer for EEG-based vascular dementia severity assessment, the features were specifically extracted from five layers of the MobileNetV2 CNN architecture: convolutional layers (‘Conv-1’), batch normalisation (‘Conv-1-bn’), clipped ReLU (‘out-relu’), 2D Global Average Pooling (‘global-average-pooling2d1’), and fully connected (‘Logits’) layers. As a classifier, a MobileNetV2 DL network and DT and KNN ML classifiers were implemented to identify the features extracted from MobileNetV2’s five layers. This study’s results demonstrate that features obtained from MobileNetV2 with the 2D Global Average Pooling (global-average-pooling2d-1) layer yielded the most accurate results for the DT ML classifier, with an accuracy score of 95.9%. The fully connected (Logits) layer features came in second, with a score of 95.3%. These results provide support for the use of deep processing systems to enhance the differentiation between NC subjects, VasD, and MCogImp patients. The limitations of this study include the relatively small sample size (35 participants). The dependence on a singular working memory task may inadequately represent the varied EEG patterns linked to dementia progression, perhaps constraining the model’s relevance across numerous cognitive circumstances. Future work intends to use XGBoost, SVR, and possibly more models such as Random Forest on identical EEG spectrogram datasets, utilizing the same preprocessing workflow. Moreover, the integration of EEG-EMG coherence metrics will be utilized to improve dementia severity assessment.

Author Contributions

Conceptualization, N.K.A.-Q.; methodology, N.K.A.-Q.; investigation, S.A.A.; resources, S.H.B.M.A.; writing original draft preparation, N.K.A.-Q.; writing review and editing, S.A.A. and S.H.B.M.A.; visualization, S.A.A.; supervision, S.H.B.M.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The dataset utilised in this study can be made available upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Korczyn, A.D.; Vakhapova, V.; Grinberg, L.T. Vascular dementia. J. Neurol. Sci. 2012, 322, 2–10. [Google Scholar] [CrossRef]
Rizzi, L.; Rosset, I.; Roriz-Cruz, M. Global epidemiology of dementia: Alzheimer’s and vascular types. BioMed Res. Int. 2014, 2014, 908915. [Google Scholar] [CrossRef]
Acharya, M.; Deo, R.C.; Tao, X.; Barua, P.D.; Devi, A.; Atmakuru, A.; Tan, R.S. Deep learning techniques for automated Alzheimer’s and mild cognitive impairment disease using EEG signals: A comprehensive review of the last decade (2013–2024). Comput. Methods Programs Biomed. 2025, 259, 108506. [Google Scholar] [CrossRef] [PubMed]
Al-Qazzaz, N.K.; Ali, S.H.B.; Ahmad, S.A.; Chellappan, K.; Islam, M.; Escudero, J. Role of EEG as biomarker in the early detection and classification of dementia. Sci. World J. 2014, 2014, 906038. [Google Scholar] [CrossRef]
Mellon, L.; Brewer, L.; Hall, P.; Horgan, F.; Williams, D.; Hickey, A. Cognitive impairment six months after ischaemic stroke: A profile from the ASPIRE-S study. BMC Neurol. 2015, 15, 31–39. [Google Scholar] [CrossRef] [PubMed]
Kalaria, R.N.; Akinyemi, R.; Ihara, M. Stroke injury, cognitive impairment and vascular dementia. Biochim. Biophys. Acta—Mol. Basis Dis. 2016, 1862, 915–925. [Google Scholar] [CrossRef] [PubMed]
Song, J.; Lee, W.T.; Park, K.A.; Lee, J.E. Association between risk factors for vascular dementia and Adiponectin. BioMed Res. Int. 2014, 2014, 261672. [Google Scholar] [CrossRef]
Gorelick, P.B.; Scuteri, A.; Black, S.E.; DeCarli, C.; Greenberg, S.M.; Iadecola, C.; Launer, L.J.; Laurent, S.; Lopez, O.L.; Seshadri, S.; et al. Vascular contributions to cognitive impairment and dementia: A statement for healthcare professionals from the American Heart Association/American Stroke Association. Stroke 2011, 42, 2672–2713. [Google Scholar] [CrossRef]
Meghdadi, A.H.; Stevanovic Karic, M.; McConnell, M.; Rupp, G.; Richard, C.; Hamilton, J.; Salat, D.; Berka, C. Resting state EEG biomarkers of cognitive decline associated with Alzheimer’s disease and mild cognitive impairment. Front. Aging Neurosci. 2021, 16, e0244180. [Google Scholar] [CrossRef]
Bonanni, L.; Franciotti, R.; Nobili, F.; Kramberger, M.G.; Taylor, J.P.; Garcia-Ptacek, S.; Falasca, N.W.; Famá, F.; Cromarty, R.; E-DLB Study Group; et al. EEG markers of dementia with Lewy bodies: A multicenter cohort study. J. Alzheimer’s Dis. 2016, 54, 1649–1657. [Google Scholar] [CrossRef]
Escudero, J.; Hornero, R.; Abásolo, D.; Fernández, A. Quantitative evaluation of artifact removal in real magnetoencephalogram signals with blind source separation. Ann. Biomed. Eng. 2011, 39, 2274–2286. [Google Scholar] [CrossRef]
Siuly, S.; Alçin, Ö.F.; Wang, H.; Li, Y.; Wen, P. Exploring rhythms and channels-based EEG biomarkers for early detection of alzheimer’s disease. IEEE Trans. Emerg. Top. Comput. Intell. 2024, 8, 1609–1623. [Google Scholar] [CrossRef]
Al-Qazzaz, N.K.; Ali, S.H.B.M.; Ahmad, S.A.; Islam, M.S.; Escudero, J. Discrimination of stroke-related mild cognitive impairment and vascular dementia using EEG signal analysis. Med. Biol. Eng. Comput. 2018, 56, 137–157. [Google Scholar] [CrossRef] [PubMed]
Cataldo, A.; Criscuolo, S.; De Benedetto, E.; Masciullo, A.; Pesola, M.; Picone, J.; Schiavoni, R. EEG complexity-based algorithm using Multiscale Fuzzy Entropy: Towards a detection of Alzheimer’s disease. Measurement 2024, 225, 114040. [Google Scholar] [CrossRef]
Subramanian, N.; Elharrouss, O.; Al-Maadeed, S.; Chowdhury, M. A review of deep learning-based detection methods for COVID-19. Comput. Biol. Med. 2022, 143, 105233. [Google Scholar] [CrossRef]
Soomro, T.A.; Zheng, L.; Afifi, A.J.; Ali, A.; Soomro, S.; Yin, M.; Gao, J. Image segmentation for MR brain tumor detection using machine learning: A Review. IEEE Rev. Biomed. Eng. 2022, 16, 70–90. [Google Scholar] [CrossRef]
Marin, I.; Marasović, T.; Gotovac, S. Siamese Network for Content-Based Image Retrieval: Detection of Alzheimer’s Disease from neuroimaging data. In Proceedings of the 2022 IEEE International Conference on Software, Telecommunications and Computer Networks (SoftCOM), Split, Croatia, 22–24 September 2022; pp. 1–6. [Google Scholar]
Klimesch, W. EEG alpha and theta oscillations reflect cognitive and memory performance: A review and analysis. Brain Res. Rev. 1999, 29, 169–195. [Google Scholar] [CrossRef]
Boutros, N.N.; Arfken, C.; Galderisi, S.; Warrick, J.; Pratt, G.; Iacono, W. The status of spectral EEG abnormality as a diagnostic test for schizophrenia. Schizophr. Res. 2008, 99, 225–237. [Google Scholar] [CrossRef]
James, L.M.; Halliday, D.M.; Stephens, J.A.; Farmer, S.F. On the development of human corticospinal oscillations: Age-related changes in EEG–EMG coherence and cumulant. Eur. J. Neurosci. 2008, 27, 3369–3379. [Google Scholar] [CrossRef]
Amin, S.U.; Alsulaiman, M.; Muhammad, G.; Mekhtiche, M.A.; Hossain, M.S. Deep Learning for EEG motor imagery classification based on multi-layer CNNs feature fusion. Future Gener. Comput. Syst. 2019, 101, 542–554. [Google Scholar] [CrossRef]
Grosse, P.; Cassidy, M.J.; Brown, P. EEG–EMG, MEG–EMG and EMG–EMG frequency analysis: Physiological principles and clinical applications. Clin. Neurophysiol. 2002, 113, 1523–1531. [Google Scholar] [CrossRef]
Liu, J.; Sheng, Y.; Liu, H. Corticomuscular coherence and its applications: A review. Front. Hum. Neurosci. 2019, 13, 100. [Google Scholar] [CrossRef]
Tuncel, D.; Dizibuyuk, A.; Kiymik, M.K. Time frequency based coherence analysis between EEG and EMG activities in fatigue duration. J. Med. Syst. 2010, 34, 131–138. [Google Scholar] [CrossRef]
Al-Qazzaz, N.K.; Ali, S.; Ahmad, S.A.; Islam, M.S.; Ariff, M.I. Selection of mother wavelets thresholding methods in denoising multi-channel EEG signals during working memory task. In Proceedings of the 2014 IEEE Conference on Biomedical Engineering and Sciences (IECBES), Kuala Lumpur, Malaysia, 8–10 December 2014; pp. 214–219. [Google Scholar]
Al-Qazzaz, N.K.; Sabir, M.K.; Ali, S.H.B.M.; Ahmad, S.A.; Grammer, K. Multichannel optimization with hybrid spectral-entropy markers for gender identification enhancement of emotional-based EEGs. IEEE Access 2021, 9, 107059–107078. [Google Scholar] [CrossRef]
Makaram, N.; Gupta, S.; Pesce, M.; Bolton, J.; Stone, S.; Haehn, D.; Pomplun, M.; Papadelis, C.; Pearl, P.; Rotenberg, A.; et al. Deep learning-based visual complexity analysis of electroencephalography time-frequency images: Can it localize the epileptogenic zone in the brain? Algorithms 2023, 16, 567. [Google Scholar] [CrossRef] [PubMed]
Al-Qazzaz, N.K.; Hamid Bin Mohd Ali, S.; Ahmad, S.A.; Islam, M.S.; Escudero, J. Automatic artifact removal in EEG of normal and demented individuals using ICA–WT during working memory tasks. Sensors 2017, 17, 1326. [Google Scholar] [CrossRef] [PubMed]
Ieracitano, C.; Mammone, N.; Bramanti, A.; Hussain, A.; Morabito, F.C. A convolutional neural network approach for classification of dementia stages based on 2D-spectral representation of EEG recordings. Neurocomputing 2019, 323, 96–107. [Google Scholar] [CrossRef]
Rahman, T.; Khandakar, A.; Qiblawey, Y.; Tahir, A.; Kiranyaz, S.; Kashem, S.B.A.; Islam, M.T.; Al Maadeed, S.; Zughaier, S.M.; Khan, M.S.; et al. Exploring the effect of image enhancement techniques on COVID-19 detection using chest X-ray images. Comput. Biol. Med. 2021, 132, 104319. [Google Scholar] [CrossRef]
Chandra, S.K.; Bajpai, M.K. Fractional mesh-free linear diffusion method for image enhancement and segmentation for automatic tumor classification. Biomed. Signal Process. Control 2020, 58, 101841. [Google Scholar] [CrossRef]
Sandler, M.; Howard, A.; Zhu, M.; Zhmoginov, A.; Chen, L.C. Mobilenetv2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 4510–4520. [Google Scholar]
Rabcan, J.; Levashenko, V.; Zaitseva, E.; Kvassay, M. EEG signal classification based on fuzzy classifiers. IEEE Trans. Ind. Inform. 2021, 18, 757–766. [Google Scholar] [CrossRef]
Ganaie, M.; Kumari, A.; Malik, A.K.; Tanveer, M. EEG signal classification using improved intuitionistic fuzzy twin support vector machines. Neural Comput. Appl. 2024, 36, 163–179. [Google Scholar] [CrossRef]
Cherian, R.; Kanaga, E.G. Theoretical and methodological analysis of EEG based seizure detection and prediction: An exhaustive review. J. Neurosci. Methods 2022, 369, 109483. [Google Scholar] [CrossRef]
Lin, F.; Han, J.; Xue, T.; Lin, J.; Chen, S.; Zhu, C.; Lin, H.; Chen, X.; Lin, W.; Huang, H. Predicting cognitive impairment in outpatients with epilepsy using machine learning techniques. Sci. Rep. 2021, 11, 20002. [Google Scholar] [CrossRef] [PubMed]
Hashmi, A.; Barukab, O. Dementia classification using deep reinforcement learning for early diagnosis. Appl. Sci. 2023, 13, 1464. [Google Scholar] [CrossRef]
Ayman, U.; Zia, M.S.; Okon, O.D.; Rehman, N.U.; Meraj, T.; Ragab, A.E.; Rauf, H.T. Epileptic patient activity recognition system using extreme learning machine method. Biomedicines 2023, 11, 816. [Google Scholar] [CrossRef] [PubMed]
Rokach, L.; Maimon, O. Data Mining with Decision Trees, 2nd ed.; World Scientific: Singapore, 2014. [Google Scholar]
de Miras, J.R.; Ibáñez-Molina, A.J.; Soriano, M.F.; Iglesias-Parro, S. Schizophrenia classification using machine learning on resting state EEG signal. Biomed. Signal Process. Control 2023, 79, 104233. [Google Scholar] [CrossRef]

Figure 1. The flowchart of the proposed method.

Figure 2. The architecture of MobileNetV2.

Figure 3. The confusion matrix for Class 1 (VasD), Class 2 (MCogImp), and Class 3 (NC) using MobileNetV2 features as a classifier.

Figure 4. The confusion matrix for Class 1 (VasD), Class 2 (MCogImp), and Class 3 (NC) using MobileNetV2 features from the ‘Conv-1’ layer and the KNN classifier.

Figure 5. The confusion matrix for Class 1 (VasD), Class 2 (MCogImp), and Class 3 (NC) using MobileNetV2 features from the ‘Conv-1-bn’ layer and the KNN classifier.

Figure 6. Confusion matrix for VasD, MCogImp, and NC using MobileNetV2 features from the ‘out-relu’ layer and the KNN classifier.

Figure 7. The confusion matrix for Class 1 (VasD), Class 2 (MCogImp), and Class 3 (NC) using MobileNetV2 features from the ‘global-average-pooling2d-1’ layer and the KNN classifier.

Figure 8. The confusion matrix for Class 1 (VasD), Class 2 (MCogImp), and Class 3 (NC) using MobileNetV2 features from the ‘Logits-softmax’ layer and the KNN classifier.

Figure 9. The confusion matrix for Class 1 (VasD), Class 2 (MCogImp), and Class 3 (NC) using MobileNetV2 features from the ‘Conv-1’ layer and the DT classifier.

Figure 10. The confusion matrix for Class 1 (VasD), Class 2 (MCogImp), and Class 3 (NC) using MobileNetV2 features from the ‘Conv-1-bn’ layer and the DT classifier.

Figure 11. The confusion matrix for Class 1 (VasD), Class 2 (MCogImp), and Class 3 (NC) using MobileNetV2 features from the ‘out-relu’ layer and the DT classifier.

Figure 12. The confusion matrix for Class 1 (VasD), Class 2 (MCogImp), and Class 3 (NC) using MobileNetV2 features from the ‘global-average-pooling2d-1’ layer and the DT classifier.

Figure 13. The confusion matrix for Class 1 (VasD), Class 2 (MCogImp), and Class 3 (NC) using MobileNetV2 features from the ‘Logits-softmax’ layer and the DT classifier.

Table 1. The evaluation metric results from the MobileNet Deep learning classifier.

Evaluation Metrics	Control	MCI	VaD
precision	71.2	76.9	76.1
sensitivity	64.9	70.2	89.5
specificity	86.8	89.5	86
accuracy	74.9	74.9	74.9
F-measure	67.9	73.4	82.3

Table 2. The evaluation metric results using MobileNetV2 features and DT machine learning classifier.

KNN	Evaluation Metrics	Control	MCI	VaD
‘Conv-1’	precision	89.8	96.2	93.2
	sensitivity	93	89.5	96.5
	specificity	94.7	98.2	96.5
	accuracy	93	93	93
	F-measure	91.4	92.7	94.8
‘Conv-1-bn’	precision	89.8	96.2	93.2
	sensitivity	93	89.5	96.5
	specificity	94.7	98.2	96.5
	accuracy	93	93	93
	F-measure	91.4	92.7	94.8
‘out-relu’	precision	59.6	94.7	98.2
	sensitivity	100	80.6	80
	specificity	100	83.2	97.1
	accuracy	84.2	84.2	84.2
	F-measure	74.7	87	88.2
‘global-average-pooling2d-1’	precision	91.7	98.1	94.8
	sensitivity	96.5	91.2	96.5
	specificity	95.6	99.1	97.4
	accuracy	94.7	94.7	94.7
	F-measure	94	94.5	95.7
‘fc’	precision	94.5	93.1	96.6
	sensitivity	91.2	94.7	98.2
	specificity	97.4	96.5	98.2
	accuracy	94.2	94.2	94.2
	F-measure	92.9	93.9	97.4

Table 3. The evaluation metric results using MobileNetV2 features and KNN machine learning classifier.

DT	Evaluation Metrics	Control	MCI	VaD
‘Conv-1’	precision	93.1	96.4	94.7
	sensitivity	94.7	94.7	94.7
	specificity	96.5	98.2	97.4
	accuracy	94.7	94.7	94.7
	F-measure	93.9	95.6	94.7
‘Conv-1-bn’	precision	93.1	96.4	94.7
	sensitivity	94.7	94.7	94.7
	specificity	96.5	98.2	97.4
	accuracy	94.7	94.7	94.7
	F-measure	93.9	95.6	94.7
‘out-relu’	precision	96.4	88.7	98.1
	sensitivity	93	96.5	93
	specificity	98.2	93.9	99.1
	accuracy	94.2	94.2	94.2
	F-measure	94.6	92.4	95.5
‘global-average-pooling2d-1’	precision	94.8	96.4	96.5
	sensitivity	96.5	94.7	96.5
	specificity	97.4	98.2	98.2
	accuracy	95.9	95.9	95.9
	F-measure	95.7	95.6	96.5
‘fc’	precision	96.4	94.8	96.6
	sensitivity	93	96.5	98.2
	specificity	98.2	97.4	98.2
	accuracy	95.3	95.3	95.3
	F-measure	94.6	95.7	97.4

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Al-Qazzaz, N.K.; Ali, S.H.B.M.; Ahmad, S.A. Investigation of the MobileNetV2 Optimal Feature Extraction Layer for EEG-Based Dementia Severity Classification: A Comparative Study. Algorithms 2025, 18, 620. https://doi.org/10.3390/a18100620

AMA Style

Al-Qazzaz NK, Ali SHBM, Ahmad SA. Investigation of the MobileNetV2 Optimal Feature Extraction Layer for EEG-Based Dementia Severity Classification: A Comparative Study. Algorithms. 2025; 18(10):620. https://doi.org/10.3390/a18100620

Chicago/Turabian Style

Al-Qazzaz, Noor Kamal, Sawal Hamid Bin Mohd Ali, and Siti Anom Ahmad. 2025. "Investigation of the MobileNetV2 Optimal Feature Extraction Layer for EEG-Based Dementia Severity Classification: A Comparative Study" Algorithms 18, no. 10: 620. https://doi.org/10.3390/a18100620

APA Style

Al-Qazzaz, N. K., Ali, S. H. B. M., & Ahmad, S. A. (2025). Investigation of the MobileNetV2 Optimal Feature Extraction Layer for EEG-Based Dementia Severity Classification: A Comparative Study. Algorithms, 18(10), 620. https://doi.org/10.3390/a18100620

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Investigation of the MobileNetV2 Optimal Feature Extraction Layer for EEG-Based Dementia Severity Classification: A Comparative Study

Abstract

1. Introduction

2. Materials and Methods

2.1. Participants and EEG Signal Acquisition

2.2. Preprocessing Steps

2.3. Deep Feature Extraction with MobileNetV2 Layers

2.4. Classification

2.4.1. Deep MobileNetV2 Classifier

2.4.2. Machine Learning Classifiers

3. Results and Discussion

3.1. Results of the Preprocessing

3.2. Results of Deep MobileNetV2 Feature Extraction

3.3. Results of Classification

4. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI