Automated Classiﬁcation of Mental Arithmetic Tasks Using Recurrent Neural Network and Entropy Features Obtained from Multi-Channel EEG Signals

: The automated classiﬁcation of cognitive workload tasks based on the analysis of multichannel EEG signals is vital for human–computer interface (HCI) applications. In this paper, we propose a computerized approach for categorizing mental-arithmetic-based cognitive workload tasks using multi-channel electroencephalogram (EEG) signals. The approach evaluates various entropy features, such as the approximation entropy, sample entropy, permutation entropy, dispersion entropy, and slope entropy, from each channel of the EEG signal. These features were fed to various recurrent neural network (RNN) models, such as long-short term memory (LSTM), bidirectional LSTM (BLSTM), and gated recurrent unit (GRU), for the automated classiﬁcation of mental-arithmetic-based cognitive workload tasks. Two cognitive workload classiﬁcation strategies (bad mental arithmetic calculation (BMAC) vs. good mental arithmetic calculation (GMAC); and before mental arithmetic calculation (BFMAC) vs. during mental arithmetic calculation (DMAC)) are considered in this work. The approach was evaluated using the publicly available mental arithmetic task-based EEG database. The results reveal that our proposed approach obtained classiﬁcation accuracy values of 99.81%, 99.43%, and 99.81%, using the LSTM, BLSTM, and GRU-based RNN classiﬁers, respectively for the BMAC vs. GMAC cognitive workload classiﬁcation strategy using all entropy features and a 10-fold cross-validation (CV) technique. The slope entropy features combined with each RNN-based model obtained higher classiﬁcation accuracy compared with other entropy features for the classiﬁcation of the BMAC vs. GMAC task. We obtained the average classiﬁcation accuracy values of 99.39%, 99.44%, and 99.63% for the classiﬁcation of the BFMAC vs. DMAC tasks, using the LSTM, BLSTM, and GRU classiﬁers with all entropy features and a hold-out CV scheme. Our developed automated mental arithmetic task system is ready to be tested with more databases for real-world applications.


Introduction
The amount of mental effort performed by each person in response to certain cognitive tasks is called the cognitive workload [1]. The human brain produces different responses to various cognitive tasks, and these responses can be further investigated by analyzing the brain's electrical activity [2]. The information regarding the brain's electrical tures have been used for various applications, such as the detection of generalized and partial epileptic seizures [17][18][19], emotion recognition [20], and brain-computer interface (BCI) [21] applications.
These non-linear entropy features, such as dispersion entropy [22], slope entropy [23], and other entropy measures [24,25], have not been explored for the mental-arithmetic-based cognitive task classification using EEG signals. Recently, deep learning approaches, such as the convolutional neural network (CNN) and recurrent neural network (RNN), have been extensively used for EEG signal processing applications [26]. A RNN is a deep neural network model used for the analysis of sequential data in natural language processing (NLP) and speech recognition applications [27].
This type of network explores the long-range dependencies for the modeling of timeseries data [28]. It considers the information from the previous state and input to evaluate the output at the present time-step. The LSTM-based RNN model has been used for mental arithmetic task classification using EEG signal features [15]. The RNN models can exploit the correlations between the features of EEG signals in different time-steps for the classification of mental arithmetic tasks.
Other RNN variants, such as bidirectional LSTM (BLSTM) and gated recurrent unit (GRU) [29], have not been used for the categorization of mental arithmetic tasks, such as BFMAC vs. DMAC and BMAC vs. GMAC, respectively, using EEG signal features. The novelty of this work is to explore various state-space domain non-linear entropy features and RNNs for mental-arithmetic-based cognitive workload task classification using EEG signals. The major contributions of this work are highlighted as follows: • The slope entropy, dispersion entropy, permutation entropy, sample entropy, and approximation entropy measures were computed from each EEG channel. • The LSTM, BLSTM, and GRU-based RNN models were used to classify mental arithmetic tasks. • The classification strategies, such as BMAC vs. GMAC and BFMAC vs. DMAC, are considered in this work.
The remaining parts of this paper are organized as follows. In Section 2, a description of the multi-channel EEG database for mental-arithmetic-based cognitive workload classification is written. The proposed approach is explained in Section 3. In Section 4, the results are evaluated using the proposed approach, and a discussion on the results is presented. Finally, in Section 5, the conclusions of this paper are written.

Multi-Channel EEG Database for Mental Arithmetic Tasks
In this work, we used a public database (MIT Physionet EEG mental arithmetic task dataset) to evaluate the proposed approach [30,31]. This database contains artifact-free multi-channel EEG recordings from 36 subjects. The multi-channel EEG signals were recorded using the neurocom monopolar 23 channel data acquisition system [30]. The electrode setup to record the multi-channel EEG signals is depicted in Figure 1.
The clinical protocols, such as no clinical manifestations of cognitive impairment, no verbal and non-verbal learning disabilities, and normal vision were followed by the subjects during the EEG recordings [30]. Those subjects who were drug or alcohol addicted, and with psychiatric disorders were excluded during the recording of the EEG signals. Each subject performed an arithmetic task such as the subtraction of two numbers. In this database, multi-channel EEG recordings from each subject were comprised of 180-s resting-state EEG and 60-s mental arithmetic calculation-based cognitive state EEG data. The sampling frequency of the multi-channel EEG recordings was 500 Hz. The multi-channel EEG recordings were divided into two groups-good (G) and bad (B)-based on the arithmetic calculations performed by each subject. The GMAC class is interpreted as the subjects who performed good quality arithmetic calculations with the number of calculations per four minutes as 21 ± 7.4. Similarly, the BMAC is termed as the subjects who performed bad quality mental arithmetic calculations with the number of calculations per four minutes as 7 ± 3.6.
The annotations or the count quality of each multi-channel EEG recording are given in the database. The symbols 'B' and 'G' denote bad and good mental-arithmetic-based EEG recordings based on the number of subtractions performed by the subject. The annotations of the multi-channel EEG recordings for all subjects before performing any mental arithmetic calculations (BFMAC or rest state) and during the mental arithmetic calculations (DMAC or active state) are also given in the database. In this work, the cognitive task classification strategies, such as BMAC vs. GMAC and BFMAC vs. DMAC, are studied.

Method
In this section, we describe the proposed method for the automated classification of BFMAC vs. DMAC and BMAC vs. GMAC using multi-channel EEG signals. The flowchart for the mental arithmetic calculation-based cognitive workload classification task is depicted in Figure 2. The step-by-step procedure for the automated categorization of mental arithmetic calculation-based cognitive workload classification tasks is as follows: • Segmentation of multi-channel EEG recordings into multi-channel EEG frames. • Evaluation of the state-space domain non-linear entropy features from each multichannel EEG frame. • Classification of mental arithmetic tasks using RNN models.
We describe the detailed theory of each block of the flowchart in the following subsections.

Segmentation of Multi-Channel EEG Recordings
In this study, we considered a non-overlapping window of 2 s duration (2 × 500 = 1000 samples) for segmenting each multi-channel EEG recording. A total of 30 frames were evaluated from each multi-channel EEG recordings. The total number of multi-channel EEG frames considered in this work was 36 × 30 = 1080. Figures 3 and 4 show the EEG frames of the Fp1 and O1 channels for BMAC (Figure 3a,b)) and GMAC (Figure 3c,d) tasks as well as DMAC (Figure 4a,b) and BFMAC (Figure 4c,d) tasks. We observed significant differences in the amplitude values and temporal characteristics of EEG signals between BMAC-and GMAC-, and between BFMAC-and DMAC-based cognitive tasks.
A study [32] reported that the positive level amplitude increased during mental arithmetic tasks. The temporo-centro-parietal activity in multi-channel EEG signal increased during the mental arithmetic calculation-based active state compared to the rest state [32,33]. These physiological changes affect both the temporal and spatial characteristics of multi-channel EEG signals. Therefore, the features evaluated from the EEG signals can be used for the automated classification of mental-arithmetic-based cognitive tasks. In the following subsection, the entropy features evaluated from the multi-channel EEG frames are described.

Non-Linear Entropy Features
In this study, we computed five entropy measures, viz. the slope entropy [23], dispersion entropy [22], permutation entropy [24], sample entropy [25], and approximation entropy [34], from each EEG channel for the classification of mental-arithmetic-based cognitive workload tasks. The slope entropy was evaluated using the difference between the consecutive amplitude values of each embedded vector extracted from the signal [23].
Here, we evaluated the slope entropy from each EEG channel. The step-by-step procedure to evaluate the slope entropy for ith channel EEG signal was as follows [23]: Step-1: The ith channel EEG signal is denoted as The jth embedded vector extracted from the ith channel EEG signal is given by y i,L j = [x i (j), x i (j + 1), . . . , x i (j + L − 1)] where L is the embedded dimension. The factor, y i,L j is termed as the jth embedded vector and j = 1, 2,. . . , N − L + 1.
Step-2: The difference between the consecutive sample values of each embedded vector of ith channel EEG signal is evaluated, and the slope signal for jth embedded vector is evaluated as follows: where k = 1, 2. . . L − 1. In vector form, the slope signal jth embedded vector is denoted as follows: Step-3: Each element of the slope signal for the jth embedded vector of ith channel is mapped to negative and positive integer values, and this mapping is given as follows: η and ζ are the slope entropy parameters, and ζ > η.
Step-4: For jth embedded vector of the ith channel, the mapped pattern containing the positive and negative integer values for the EEG signal is evaluated and is denoted as π j .
Step-5: The mapped patterns for all embedded vectors are evaluated. The relative frequency (RF) vector is computed using the pattern matching concept by considering all patterns and is denoted as r i = [r i (1), r i (2), . . . , r i (T)], where T is the total number of elements in the RF vector.
Step-6: The probability evaluated using the RF vector for ith channel is given as follows [23]: Thus, the slope entropy of ith channel EEG signal is evaluated as follows: Similarly, the approximation entropy (AppEN), permutation entropy (PermEN), dispersion entropy (DisEn), and sample entropy (SampEN) features were also calculated from each EEG channel, and for the ith EEG channel, these features are denoted as AppEN i , PermEN i , DesEN i , and SampEN i , respectively. For more details on AppEN, PermEN, DesEN, and SampEN, we encourage readers to refer to [22,24,25,34].
As five entropy features were computed from each EEG channel, a total of 95 dimensional feature vectors were obtained from each multi-channel EEG frame. This 95dimensional feature vector sequence, z(t) was used as the input to the RNN-based model for classification.

Recurrent Neural Network (RNN)
In this study, we used three RNN variants-namely, LSTM, BLSTM, and GRU. The block diagram of the mental arithmetic task classification using the LSTM, BLSTM, and GRU classifiers is shown in Figure 5. The classification strategies, such as BMAC vs. GMAC and BFMAC vs. DMAC, were used in this work. The feature matrix is denoted as Z ∈ R q×s where q denotes the number of instances or time-steps and s = 95 denotes the number of features. The training and the test instances for each type of RNN classifier were selected using both hold-out and 10-fold cross-validation (CV) techniques [6,35].
For hold-out CV, we considered 60%, 10%, and 30% of the instances for training, validation, and testing, respectively, for the LSTM, BLSTM, and GRU models [35]. The numbers of instances used for each class are shown in Table 1. From this table, for the BMAC vs. GMAC classification task, there is a class imbalance problem. We considered random over-sampling during the training of each type of RNN classifier to overcome the class imbalance problem [36]. The LSTM classifier is a type of RNN [37] and has been used in different biomedical applications [28,38,39].
The LSTM layer mainly comprises of the cell, candidate value, input gate, forget gate, and output gate. The cell is considered as a memory and is used to remember the information at different time-intervals [37]. Similarly, the information flow from and to the cell is performed using the gates. The architecture of the LSTM network is shown in Figure 6. In LSTM, the output at the tth time-step was evaluated using the input z(t) and the activation g(t − 1) from the (t − 1)th time-step [40]. Here, we denote different parameters, such as memory cells as m(t), candidate values asm(t), forget gates as FG, update gates as UG, output gates as OG, and outputs at the tth level step as g(t) [37]. The mathematical expressions of LSTM for the forget gate, update gate, candidate value, memory cell, and output are given as follows: [37]: where W FG and b FG are the weight and bias values for forget gate. Similarly, W UG and the b UG are termed as the weight and the bias values for update gate. Moreover, W OU and b OU are the weight and bias values at the output gate. ' f ' is denoted as the sigmoid activation function. The operator ⊗ is termed as the Hadamard product [41].   BLSTM considers both past and future time-step information to model the current time step for RNN model [42]. It consists of both forward and backward LSTM parts. The forward LSTM mathematical expressions are same as in Equations (5)-(10). For backward LSTM, however, the mathematical expressions of forget gate, update gate, candidate value, memory cell and output gate are given as follows [42]: The GRU is the simplified version of LSTM, which consists of two gates-namely, the update gate and the reset gate [43]. The GRU model architecture is shown in Figure 7. The mathematical expressions for the reset gate (RG), update gate (UG), candidate value,m(t), and memory cell m(t) at the tth time-step are given below [43]: In this study, the classification model shown in Figure 4 comprises the input layer, RNN variant layer, fully-connected (FC) layer, softmax layer, and classification layer. The input feature vector at the tth time-step is denoted as z(t). The LSTM/BLSTM/GRU layer was used to process the input feature sequence. The number of hidden neurons considered in this layer is denoted as n h . The output obtained in the LSTM/BLSTM layer is denoted as where T is the total number of time-steps considered for the LSTM, BLSTM, and GRU layers. For the GRU layer, the output is the same as the memory, m(t).
The FC layer was used to convert the LSTM/BLSTM/GRU layer activation to a feature vector containing two features. This feature vector for the ith instance is denoted as The softmax based activation function was used in the classification layer. The kth output neuron activation for the tth time-step was evaluated as follows: where K = 2 is the number of classes. The binary cross-entropy function was used as the cost function for the LSTM, BLSTM, and GRU classifiers [35]. The training parameters used for the LSTM, BLSTM, and GRU models are shown in Table 2. The Adam optimizer was used for the evaluation of the weight and bias parameters. The performance of three RNN variants for the classification of mental arithmetic calculation tasks was evaluated using the accuracy, sensitivity, F1-score, and specificity measures [35].

Results and Discussion
In this section, we discuss the statistical analysis of the selected entropy features for BMAC vs. GMAC and BFMAC vs. DMAC classification tasks and the results of classification using RNN-based models. Student's t-test was used to evaluate the statistical significance of the entropy features of the EEG signals for the BFMAC vs. DMAC and BMAC vs. GMAC classification tasks [44]. The significant level for the t-test of the entropy features of each channel for both classification tasks was considered as 0.05.
Box-plots showing the within-class variations for the Fp1-channel EEG signal dispersion entropy, F7-channel EEG signal slope entropy, C4-channel EEG signal approximation entropy, O1-channel EEG signal sample entropy, and O1-channel EEG signal permutation entropy features for the BFMAC vs. DMAC classification task are shown in Figure 8a-e, respectively. From these plots, the median values of each entropy feature were different for the BFMAC and DMAC classes. Similarly, we show the mean and standard deviation values of different entropy features for the Fp1, F7, C4, and O1-channel EEG signals in Table 3.
For channel Fp1, the p-values of the approximation entropy, dispersion entropy, sample entropy, and slope entropy features were less than 0.05. Significant differences in the mean values of the Fp1 channel EEG signal entropy features in between the BFMAC and DMAC classes were also observed. Moreover, for the F7-channel EEG signal, the p-values of all entropy features were found to be less than 0.05.
For the C4-channel EEG signal, except for the dispersion entropy, other entropy features showed p-values of less than 0.05, and these selected features were clinically significant for the classification of BFMAC vs. DMAC tasks. For the O1-channel EEG signal, higher differences in the mean values of the sample entropy and permutation entropy were observed between the BFMAC and DMAC classes. Similarly, statistical variations were also observed for the entropy features of other EEG channels. Similarly, the box-plots of the Fp1-channel approximation entropy, Fp1-channel sample entropy, F7-channel dispersion entropy, C4-channel slope entropy, and O1-channel permutation entropy features for the BMAC and GMAC classes are depicted in Figure 9a-e, respectively. The statistical analysis of all entropy features of the Fp1, F7, C4, and O1channel EEG signals for the BMAC and GMAC classes are shown in Table 4. From the table, the Fp1-channel approximation entropy, dispersion entropy, sample entropy, and permutation entropy features had higher mean value differences between the BMAC and GMAC classes when compared to the slope entropy features.
Similarly, the F7-channel dispersion entropy, permutation entropy, and slope entropy features had p-values less than 0.05. For the C4-channel EEG signal, higher mean value differences in the slope entropy, approximation entropy, permutation entropy, and sample entropy features were observed. The p-values of these entropy features were found to be less than 0.05 when compared to the dispersion entropy features. Similarly, for the O1-channel EEG signal, the dispersion entropy, permutation entropy, and slope entropy features had p-values that were found to be less than 0.05 for the BMAC vs. GMAC classification.
The p-values of the approximation entropy and slope entropy features of the O1channel EEG signals were more than 0.05 for the BMAC vs. GMAC classification task. From the statistical analysis results, various entropy features computed from different channel EEG signals effectively captured the diagnostic information for the automatic classification of mental arithmetic cognitive tasks. The classification performance of the GRU, LSTM, and BLSTM models for the BMAC vs. GMAC classification tasks using hold-out CV is shown in Table 5. In this work, five trial-based random hold-out CVs were used, and the mean and standard deviation values of each performance measure were evaluated [45]. We observed from this table that all RNN models obtained accuracy, sensitivity, precision, specificity, and F-score values of more than 98% for the BMAC vs. GMAC classification task.
In Table 6, we show the performance of RNN classifiers using 10-fold CV-based multichannel EEG instance selection for the BMAC vs. GMAC classification task. For the GRU model, the accuracy, sensitivity, specificity, precision, and F-score values were more than 98% for each fold. The average values of all performance measures were more than 98% for each fold. The average values of all performance measures were more than 99% for the GRU model. Similarly, for the BLSTM model, the accuracy, sensitivity, and F-score values were higher than 98% in each fold. However, the specificity values for fold 3 and fold 4 were 96.77% and 97.00%, respectively. For the GRU model, the specificity value at fold 1 was 96.43% and 100% for rest of the folds using all entropy features. The classification performance obtained using the LSTM, BLSTM, and GRU models with each entropy feature and all EEG channels is shown in Table 7. The slope entropy features combined with each type of RNN model obtained an average accuracy value of more than 93% when compared to other entropy features. The dispersion entropy features coupled with each RNN model also demonstrated average accuracy values of more than 99% for the sensitivity, precision, and specificity, and the F-score values of each type of RNN model were high using slope entropy features when compared to other entropy features extracted from multi-channel EEG signals.
The classification performance of the LSTM, BLSTM, and GRU models was evaluated using the entropy features of each EEG channel, and this is shown in Table 8. The entropy features evaluated using the O2, Fz, Cz, and Pz channel EEG signals obtained average accuracy values of more than 84% using each type of RNN model when compared to the entropy features of other EEG channels. Therefore, the O2, Fz, Cz, and Pz EEG channels were found to be significant for the classification of mental arithmetic tasks.    Table 6. The classification performance obtained using RNN classifiers with the 10-fold CV approach for the automated classification of the BMAC vs. GMAC task.    Table 7. The classification performance obtained for RNN classifiers for selected entropy features using the hold out CV approach for the BMAC vs. GMAC classification task.

Model
Feature Selection Accuracy (%) Specificity (%) Precision (%) Sensitivity (%) F-Score (%) The plots of accuracy versus the number of iterations obtained using the LSTM, BLSTM, and GRU classifiers for the automated classification of the BFMAC vs. DMAC tasks are shown in Figures 10-12, respectively. Both the training and validation accuracy values were 100% for the LSTM classifier. Similarly, for the BLSTM classifier, the training and validation accuracy values obtained were 100% and 98.54% after 200 epochs. The training and validation accuracy values were more than 100% using the GRU classifier for the automated classification of the BFMAC vs. DMAC tasks.
Similar variations were also observed in plots of accuracy versus the number of iterations obtained using the LSTM, BLSTM, and GRU classifiers for the automated classification of the BMAC-vs. GMAC-based cognitive workload tasks. We also demonstrated the confusion matrices obtained using the LSTM, BLSTM, and GRU classifiers with one trial of hold-out CV for the automated classification of the BFMAC vs. DMAC and BMAC vs. GMAC tasks. In Figure 13a-c, we show the confusion matrices obtained using the LSTM, BLSTM, and GRU classifiers for the automated classification of the BFMAC vs. DMAC tasks.
The confusion matrix plots obtained using the LSTM, BLSTM, and GRU classifiers for the automated classification of both the BMAC vs. GMAC tasks are depicted in Figure 13d-f, respectively. The true positive and true negative values were high for all three RNN classifiers for the BFMAC vs. DMAC classification tasks. The LSTM and BLSTM classifiers yielded higher false-positive values compared with the GRU classifier for the automated classification of the BMAC vs. GMAC tasks. We also evaluated the performance of the LSTM, BLSTM, and GRU classifiers using subject-independent CV cases for the automated classification of the BMAC vs. GMAC tasks.
The subject-wise accuracy values using the LSTM, BLSTM, and GRU classifiers from multichannel EEG instances are shown in Figure 14a-c. From these figures, the average accuracy values obtained using subject-independent CV for the LSTM, BLSTM, and GRU classifiers were 58.70%, 55.74%, and 60.55%, respectively. The GRU classifier obtained the highest classification accuracy with subject-independent CV over other types of RNN classifiers for the classification of the BMAC vs. GMAC tasks.    The classification results evaluated using each type of RNN model with all entropy features of multi-channel EEG signals for the BFMAC vs. DMAC classification task with hold-out CV are shown in Table 9. The LSTM, BLSTM, and GRU models obtained accuracy values of more than 99% using all entropy features of the multi-channel EEG signals for the BFMAC vs. DMAC classification task. The results obtained using various types of RNN-based models with all entropy features of multi-channel EEG with 10-fold CV are shown in Table 10.
From these results, the LSTM and BLSTM models obtained accuracy, sensitivity, and specificity values of more than 97% for each fold. However, the specificity of the GRU -based classifier was less than 97% for fold 3 and fold 6. The average accuracy of each type of RNN classifier was more than 99% for the BFMAC-vs. DMAC-based cognitive workload classification task. Therefore, the entropy features computed using multi-channel EEG demonstrated higher classification performance for the classification of cognitive workload tasks.  We compared the classification results obtained using our approach with existing methods to classify mental arithmetic calculation-based cognitive tasks using multi-channel EEG signals obtained from the same database. The comparison results are shown in Table 11. The existing methods used features, such as the mean amplitude, variance, Shannon entropy, energy, other statistical features, and various supervised learning-based classifiers, such as SVM, LSTM, and decision trees, for the BFMAC vs. DMAC based cognitive task classification scheme.
The L2-norm, mean amplitude, energy, and Shannon entropy features combined with the decision tree classifier obtained a classification accuracy of 95.80%. The statistical features of multi-channel EEG signals coupled with the decision tree model achieved an accuracy of 91.67%, which is less than the classification accuracy reported in [13]. Similarly, the SVM classifier combined with the variance, energy, and Shannon entropy features obtained an accuracy of 98.60% for the automated classification of the BFMAC vs. DMAC task. The performance of the SVM classifier depended on the proper selection of the kernel functions, kernel parameters, and number of iterations [46].
Similarly, the training parameters of the decision tree classifier were the depth of the tree, the number of times the split occurred in the decision tree, and the split criteria or information gain, respectively. The optimal training parameters of both the SVM and decision tree classifiers were selected using a grid-search in the nested cross-validation domain [4,47].
Each RNN-based classifier in the proposed approach yielded superior classification performance compared with the existing methods for the automated classification of BFMAC-vs. DMAC-based cognitive classification tasks using multi-channel EEG signals. The LSTM, BLSTM, and GRU models successfully quantified the dependencies in the entropy features of multi-channel EEG signals, which further helped to create a boundary for the classification of BFMAC-and DMAC-based cognitive tasks. The advantages of this mental-arithmetic-based cognitive classification task are given as follows: • Various entropy features computed from various EEG channels were used for the classification of mental arithmetic tasks. • The entropy features from the O2, Fz, Cz, and Pz channel EEG signals demonstrated higher classification accuracy using RNN-based classifiers. • The slope entropy features combined with each type of RNN-based classifier obtained higher classification accuracy over the other entropy features. • The proposed approach obtained the highest classification accuracy at a 99% classification accuracy for the BFMAC vs. DMAC and BMAC vs. GMAC cognitive workload classification tasks. In this work, we used the multi-channel EEG signals from 36 subjects to evaluate the proposed approach of automated classification of BMAC vs. GMAC and BFMAC vs. DMAC arithmetic tasks. Multi-lead EEG signals from more subjects are needed to develop accurate and robust automated classifications of mental-arithmetic-based cognitive workload tasks. Other entropy features, such as the distribution entropy [48], and bubble entropy [49] features obtained from multi-channel EEG signals, can be used for the automated classification of mental-arithmetic-based cognitive tasks.
In this work, the RNN-based models were used for the classification. In the future, we intend to use convolutional autoencoder [50], LSTM-autoencoder [51], convolutional neural network (CNN) [35,52] and CNN-RNN [53]-based deep learning models for feature extraction and classification tasks using a large database with multi-channel EEG signals.

Conclusions
An automated approach for the classification of mental arithmetic calculation-based cognitive tasks using various entropy features obtained from multi-channel EEG signals is proposed in this paper. The state-space domain entropy measures, such as the sample entropy, approximation entropy, dispersion entropy, permutation entropy, and slope entropy features, were computed from multi-channel EEG signals. We used recurrent neural network (RNN)-based models, such as long short-term memory (LSTM), bidirectional LSTM (BLSTM), and gated recurrent unit (GRU), as classifiers to perform cognitive task classification schemes, such as bad mental arithmetic calculation (BMAC) vs. good mental arithmetic calculation (GMAC), and before mental arithmetic calculation (BFMAC) vs. during mental arithmetic calculation (DMAC), respectively.
We obtained classification accuracies of 99.88%, 99.43%, and 99.81% using LSTM, BLSTM, and GRU-based RNN models for the automated classification of the BMAC vs. GMAC classification task. Our proposed approach demonstrated a classification accuracy of more than 99% using all RNN-based models for the automated classification of BFMAC vs. DMAC tasks. The slope entropy features coupled with each type of RNN model obtained the highest classification accuracy for both BMAC vs. GMAC and BFMAC vs. DMAC automated cognitive classification tasks. In the future, our proposed approach can be tested with multi-channel EEG signals to classify more types of mental-arithmetic-based cognitive tasks for brain-computer interface (BCI) applications.