Selection of the Minimum Number of EEG Sensors to Guarantee Biometric Identification of Individuals

Ortega-Rodríguez, Jordan; Gómez-González, José Francisco; Pereda, Ernesto

doi:10.3390/s23094239

Open AccessArticle

Selection of the Minimum Number of EEG Sensors to Guarantee Biometric Identification of Individuals

by

Jordan Ortega-Rodríguez

^1,2

,

José Francisco Gómez-González

^1,*

and

Ernesto Pereda

¹

Department of Industrial Engineering, University of La Laguna, 38200 San Cristóbal de La Laguna, Spain

²

IACTEC Medical Technology Group, Instituto de Astrofísica de Canarias (IAC), 38320 San Cristóbal de La Laguna, Spain

^*

Author to whom correspondence should be addressed.

Sensors 2023, 23(9), 4239; https://doi.org/10.3390/s23094239

Submission received: 24 February 2023 / Revised: 12 April 2023 / Accepted: 21 April 2023 / Published: 24 April 2023

(This article belongs to the Special Issue Sensor Technologies for Human Health Monitoring)

Download

Browse Figures

Review Reports Versions Notes

Abstract

Biometric identification uses person recognition techniques based on the extraction of some of their physical or biological properties, which make it possible to characterize and differentiate one person from another and provide irreplaceable and critical information that is suitable for application in security systems. The extraction of information from the electrical biosignal of the human brain has received a great deal of attention in recent years. Analysis of EEG signals has been widely used over the last century in medicine and as a basis for brain–machine interfaces (BMIs). In addition, the application of EEG signals for biometric recognition has recently been demonstrated. In this context, EEG-based biometric systems are often considered in two different applications: identification (one-to-many classification) and authentication (one-to-one or true/false classification). In this article, we establish a methodology for selecting and reducing the minimum number of EEG sensors necessary to carry out effective biometric identification of individuals. Two methodologies were applied, one based on principal component analysis and the other on the Wilcoxon signed-rank test in order to reduce the number of electrodes. This allowed us to identify, according to the methodology used, the areas of the cerebral cortex that would allow selection of the minimum number of electrodes necessary for the identification of individuals. The methodologies were applied to two databases, one with 13 people with self-collected recordings using low-cost EEG equipment, EMOTIV EPOC+, and another publicly available database with recordings from 109 people provided by the PhysioNet BCI.

Keywords:

EEG; biometrics; brain–computer interface (BCI); support vector machine (SVM); phase locking value (PLV); asymmetry index (AI)

1. Introduction

The concept of biometrics, which comes from the words bio (life) and metrics (measurement), consists of those techniques for individual identification of people based on their physical or biological traits [1,2], which result in unique and irreplicable information. It is therefore currently the object of study for security systems. There are some commonly used biometric traits, such as fingerprints, DNA, or facial recognition, among others; however, over the last decade, it has been discovered that the analysis of electrical brain signals such as EMG or MEG (magnetoencephalography) as an alternative, is an extremely useful and unforgeable individual identification and recognition tool.

An interesting concept in this field is vitality detection, whose objective is an actual measurement of the biometric sample taken from the legitimate and living individual at the time and place of authentication. It improves the reliability of a biometric system by allowing the system to resist artefacts and ensure that no non-living or fake samples are accepted. Although current biometric systems use people’s physiological information for authentication, these measurements hardly detect their vitality. However, although they appear secure, it has been shown that a biometric system can be counterfeited with artefact samples; for example, the fingerprint system can be counterfeited with an artificial finger prepared from gelatin, silicon, latex, or Play-Doh [3].

There are currently numerous and diverse personal identification techniques through the analysis of biometric features of the neurophysiology of the human body, such as the eye retina, fingerprints, or facial recognition, among others [4]. Although today the use of common analysis processes for these traits is widespread, many of them require relatively high-cost (both economically and computationally) hardware and software systems. One of the most widespread biometric identifiers, used above all in forensic science, is the fingerprint, although there are currently techniques that allow its falsification, which is why the possibility of an individual’s information being stolen by falsifying their biometric features in some way that manages to deceive the security system in charge of protecting the said information is a fact that poses challenges to fields of research in this area [5,6].

In recent years, it has been proposed to measure the electrical activity of the superficial human brain for biometric recognition uses such as identification (one-to-many classification) and authentication (one-to-one classification) of individuals [7,8,9,10]. One way to measure the activity of the cerebral cortex noninvasively is the electroencephalogram (EEG). The EEG has been widely used in medicine [11,12] and non-medical applications, such as in the development of brain–machine interfaces (BMIs) [13,14]. Unlike other biometric measurements, EEG-based biometric measurements of an individual are difficult to falsify, and it can also be guaranteed that the identified person is alive. However, it should be noted that EEG signals exhibit the complexity that can be influenced by the passage of time due to various factors such as noise, drowsiness, changes in electrode conditions and the current mental state of each individual at any given moment [15]. This fact can make it difficult to obtain the same EEG signal twice for the same person, leading to inconsistencies in biometric identification. In this respect, biometric identification may be affected by the non-stationarity of the EEG recordings [16]. This inherent nature of the signal, caused by shifts in the covariance of power features, led to a decrease in the accuracy of the BCI model. To address this issue, proposals such as spatial filtering and stationarity subspace analysis (SSA) have been put forward to reduce non-stationarity [17]. Proper electrode placement is also critical in EEG recordings to eliminate difficulties related to feature covariance shifts. Changes in the covariance of EEG features due to different electrode positions can result in differences in the recorded signals, making it difficult to identify individuals accurately.

Concerning the selection of features that improve the performance of biometric identification systems, studies have recently been published suggesting that the subtraction of information related to functional connectivity between different regions of the brain is a feature that can potentially improve pattern classification tasks in EEG signals. Several studies have demonstrated that features derived from functional connectivity can significantly enhance the performance of EEG signal classification for biometric recognition systems [18,19]. Incorporating such features can improve the robustness of biometric recognition systems by considering the interdependence of EEG channels [20]. One of the most effective methods for achieving this objective includes the study of phase synchronization [21,22].

In this context, using an eyes-closed resting paradigm, Campisi et al. examined in 2011 the contribution to subject discrimination of different areas of the brain and frequency bands [23]. The results showed a decrease in classification accuracy when making use of information extracted from acquisition channels located over frontal brain areas compared to those located in occipital and temporal brain areas. This decrease was more pronounced when high-frequency EEG rhythms were filtered out. In particular, the best result was obtained with the T7–Cz–T8 electrode arrangement and a low-pass filter with a cut-off frequency of 33 Hz when an autoregression model was employed, and an accuracy rate of 96.08% was obtained with a database of 48 subjects.

There are a variety of analytical tools in the literature available for measuring the statistical interdependence between brain electrical signals. These are based on various mathematical principles that are implemented in time and frequency domains, capturing linear or nonlinear changes, such as correlation, phase coherence, and Granger causality [24,25,26,27,28]. These tools allow the estimation of information extracted from a phenomenon known as functional brain connectivity, which is related to the measurement of temporal communication values of neuronal activity between different areas of the brain that are anatomically separated [29,30]. In this regard, Daria la Rocca et al. proposed in 2014 a novel approach involving the extraction of information from functional connectivity on a database of 108 subjects in the resting state with eyes closed (REC) and eyes open (REO) recording paradigms [31,32].

More recently, in 2016, Douglas Rodrígues et al. presented a paper addressing the problem of reducing the number of acquisition sensors required while still being able to maintain competent performance [33]. In this case, a binary version of the Flower Pollination Algorithm with different transfer functions was evaluated to select the best subset of channels that maximizes accuracy. This optimization problem was carried out using the classifier. The experimental results obtained indicated that the proposed model was able to make use of less than half the number of sensors while maintaining recognition rates of up to 87%. In the same year, Toshiaki Koike-Akino et al. also analysed brain waves acquired from a commercial EEG device to investigate its user identification and authentication capabilities [34]. First, they showed the statistical significance of the P300 component in event-related potential (ERP) data acquired through 14 EEG channels on a sample of 25 subjects. They then analysed the application of a variety of machine learning techniques, making comparisons in the use of several of them in terms of subject identification performance, using dimensional reduction techniques on the signal samples before the classification stage. The experimental results of this study showed an identification accuracy of 72% when using a single 800 ms ERP. Furthermore, they showed that the biometric identification accuracy of individuals can be significantly improved to 96.7% accuracy by jointly classifying multiple epochs.

This work aims to achieve a ranking of those electrodes located on the surface of the scalp, and therefore of the corresponding brain regions, that provide the most relevant information for an EEG-based biometric recognition system. For this purpose, it is necessary to determine the order of relevance of the features extracted from the EEG recordings by applying principal component analysis and the Wilcoxon signed-rank test. Features extracted are the power spectrum, asymmetry index and information related to the functional connectivity of the brain by calculating the phase-lock value.

2. Materials and Methods

The steps followed for the identification of individuals using EEG-based biometric measurements are shown in Figure 1.

2.1. Experimental Procedure

The evaluation of the proposed method was conducted on two distinct datasets. Dataset I was the primary dataset and consisted of a self-collected dataset using an inexpensive EEG acquisition device. Moreover, the largest sized available EEG dataset in the related literature, which we will refer to as dataset II, was utilized to contribute a greater degree of robustness to the validation of the obtained results. Dataset II was a publicly available collection of EEG recordings provided by the PhysioNet BCI [35]. The signal acquisition protocol was similar in both cases.

In the case of dataset I, EEG signals from thirteen volunteer healthy right-handed subjects (aged 18–51 years) with no motor pathology were recorded in a typical office environment. The experimental procedure has been described by Ortega et al. [7].

Seated in front of a computer screen with a black background on which instructions to be followed are displayed, each volunteer performed a specific mental task. This task consisted of performing a motor imagery action of squeezing a flexible object with the right hand [36]. For each subject, EEG signals were recorded in the basal state for 20 s and a 10 s transition period; finally, the main action (motor imagery action) was performed for another 20 s. EEG recordings were obtained from each participant in a single session of four repetitions, resulting in a total recording time of 80 s per participant. The study was approved by the ethical committee of the University of La Laguna (registration number: CEIBA2020-0405).

Dataset II consisted of EEG recordings from 109 healthy individuals obtained from the publicly available PhysioNet BCI database. In that case, each participant performed four different mental tasks involving eye, fist, and foot movements, with each task being repeated three times for a duration of two minutes per recording. During the tasks, a target was presented on the monitor’s right or left side to cue the participants to perform the corresponding action until the target disappeared. The EEG recordings of the motor imagery tasks and the sections involving right-hand movements were selected from the dataset. Three two-minute EEG recordings per participant were chosen for a total of 327 EEG records.

2.2. Data Acquisition

EEG recordings in dataset I were performed with the Emotiv Epoc+ (Emotiv Inc., San Francisco, CA, USA) portable commercial electroencephalograph. This device was suggested in [26] as the best low-cost EEG device in terms of versatility. This has 14 sensor electrodes (AF3, F7, F3, FC5, T7, P7, O1, O2, P8, T8, FC6, F4, F8 and AF4) with saline-soaked felt pads. The electrical reference point CMS (Common Mode Sense) is located at P3 or right mastoid (active electrode), and the noise cancellation electrode DRL (driven right leg) is located at P4 or left mastoid (passive electrode). The electrodes were placed on the scalp following the international 10–20 system. The EEG signals were transmitted to the computer with a 128 Hz sampling frequency using a wireless Bluetooth dongle and stored in the European Data Format (EDF).

Dataset II was built with EEG recordings using the BCI2000 (Laboratory of Nervous System Disorders, Wadsworth Center, New York State Department of Health, Albany, NY, USA) as the brain–computer interface system [37] with 64 electrodes. In that case, the electrodes were placed in agreement with the international 10–10 system. The EEG signals were collected with a 160 Hz sampling frequency and stored in EDF+ format.

2.3. Data Preprocessing

Preprocessing of the EEG signals was performed using FieldTrip, version 20230118 [38], a freely available toolbox for MATLAB^®, The MathWorks, Inc. The chosen environment for this study was Matlab R2022b. The first step consisted of applying a baseline correction based on the mean average voltage to the EEG signals. Next, the signals were filtered using a bandpass filter (FIR) from 5 to 40 Hz to reduce noise. Finally, FieldTrip functions specifically designed to remove other artefacts, such as eye movements and muscle activation, were used [7,39].

Each EEG recording was segmented into epochs of 2 s duration and filtered into the beta frequency band (13–30 Hz). According to the related literature, the beta frequency band is considered to provide particularly valuable information in mental tasks involving motor imagery and motor action. Therefore, this frequency band should be carefully considered in any biometric identification application that involves such mental tasks [40].

2.4. Feature Extraction

For the proposed biometric identification model, three sets of features were extracted independently from each epoch of EEG signals in the beta frequency band: power spectrum, asymmetry index, and phase-locking value.

The power spectrum (PS) was calculated with the fast Fourier transform using multiple tappers from discrete spheroidal sequences [41].

The asymmetry index (AI) [42] was calculated as the Napierian logarithm of the fraction between the spectral power values of the signal acquired by the corresponding pairs of electrodes of each cerebral hemisphere (PS_chleft, PS_chright).

A s y m m e t r y I d e x = l n \frac{{P S}_{c h l e f t}}{{P S}_{c h r i g h t}}

(1)

Information related to the functional connectivity of the brain was extracted by calculating the phase lock value (PLV) using the implementation described in [43].

For a given number of epochs (N) and a difference between the instantaneous phase of the two EEG signals, θ (t, n), at a specific time (t) and epoch (n), the PLV is defined as

P L V (t) = \frac{1}{N} |\sum_{n = 1}^{N} e^{i θ (t, n)}|

(2)

This calculation describes the average of the absolute values of the phase difference between the two signals for each epoch and can identify transient phase lock values independently of the signal amplitude [44].

Hence, in the case of dataset I, every subject was represented by 112 features for every epoch in the beta frequency band. These features included 14 PS features, 7 AI features, and 91 PLV features. Similarly, for dataset II, each subject was characterized by 2107 features from the beta frequency band, comprising 64 PS features, 27 AI features, and 2016 PLV features. The dimension of the feature tables, which includes the subject’s label column, was 520 × 112 and 2289 × 2108 for dataset I and II, respectively, when using all available electrodes.

2.5. Feature Selection

In this study, two different techniques were independently used and compared as feature selection methods: principal component analysis (PCA) and the Wilcoxon signed-rank test.

Principal component analysis (PCA) is a statistical technique that allows the complexity of sample spaces with high spatial dimensionality to be reduced while preserving their principal information [45,46,47,48]. PCA can then be used to identify the most important features in the dataset and reduce their dimensionality while preserving the most important information. It is an unsupervised learning algorithm that studies the relationship between the variables that make up a data set to identify subgroups in which the data variation is maximum. To do this, it performs the calculation of geometric projections of the source data on lower-dimensional directing predictors called principal components (PCs), which are linear combinations of the original variables. The basic idea behind PCA is therefore to find a new set of variables (PC) that capture the most important information in the data. These new variables are linear combinations of the original variables and are chosen such that they are uncorrelated and ordered by the amount of variance they explain in the data.

The Wilcoxon signed-rank test is defined as a non-parametric statistical test that allows determining the correlation between variables of a pair of independent samples that do not follow a normal distribution; that is, between two distinct sets of items where the values of one sample do not reveal information about the values of the other [49]. Through this test, it is possible to determine the p value between the measurable characteristics on different data sets that allow the computation of the degree of correlation between them. Thus, it is determined which characteristics stand out for providing more decisive information.

In this context, each method was independently used and evaluated for the discovery and ordering of those variable characteristics of a data set that provide more significant information, given that they present a lower degree of correlation than the rest.

These two techniques are widely used in problems of dimensionality reduction in multivariate models through the selection of characteristics to simplify their complexity. Restrictive dimensionality reduction methods are of great interest in problems such as the selection of features with the highest extractable significance from an EEG for various applications, including the biometric identification of individuals. However, beyond this traditional use, they not only allow the simplification of the model but also facilitate the selection of those data acquisition channels that appear more frequently in these selected features and, consequently, make it possible to study the selection of the most relevant brain regions for each EEG application.

2.6. Channel Selection

After applying the proposed feature selection techniques, the EEG channels most commonly used by these selected features were determined. The goal was then to reduce the number of EEG channels required to achieve the desired level of accuracy. Accordingly, the most relevant channels were analysed in the self-collected dataset I. Therefore, the superficial regions of the cerebral cortex corresponding to the location of these favourite electrodes located on the scalp were sorted according to the relevance contained in the information provided by each one.

From the proposed feature selection methods, the corresponding reduced arrays were computed in which the features of each group—PS, AI or PLV—were ordered from highest to lowest relevance and, consequently, the most important channels to characterize the different EEG signals of each individual. For this purpose, the order of each one was established by assigning a corresponding score calculated as follows:

{S c o r e}_{c h} = \sum_{i = 1}^{m_{i}} W_{{c h}_{i j}},

(3)

where

W_{{c h}_{i j}} = k_{j} \frac{P_{i}}{f_{j}}

(4)

Considering

W_{{c h}_{i j}}

as the weight value of each EEG channel (

c h

) as a function of a dimensionless constant k_j, whose value in each case was established with the contribution of its corresponding feature group (PS, AI or PLV) by itself to the classification accuracy, P_i is the number of ordered positions occupied by each feature in the feature array by PCA or the Wilcoxon test, f_j is the total number of features in each corresponding group and m_i is the total number of weights for each channel ch_i of the EEG. The selected values of k_j for the different feature groups were k_PS = 2 for spectral powers, k_AI = 1 for asymmetry indices and k_PLV = 1.2 for phase-locked values.

The PCA algorithm assigned a certain weight value to each feature in the different principal components. For each different feature, the value of its corresponding weight in all the main components extracted has been extracted to establish an order of relevance. After applying Equations (3) and (4) considering the sorted features that make up the first principal component of each group, the EEG channels were sorted from the highest to the lowest obtained score.

When using the Wilcoxon signed-rank test to sort the channels by the p-value of their characteristics involved, a reduced array of selected features was extracted. Unlike the PCA method, the Wilcoxon test can be performed only on paired data sets; that is, between the data of only two people at a time. For this reason, it was necessary to perform the test on 77 combinations of pairs of subjects for 13 subjects. Considering all the tests that sorted the features according to their p-value and the corresponding group of features, they were reordered by their statistical mode or the number of times they were repeated in the features array ordered by the Wilcoxon test among all the tests.

A plot of the method comparison between the PCA and the Wilcoxon test is depicted in Figure 2. The evaluation of the use of PCA or Wilcoxon signed-rank test for EEG channel selection was made by determining which of the two techniques provided a channel ranking that gave the best classification accuracy for biometric identification while using the least number of channels in dataset I (13 subjects). Finally, the corresponding extracted surface regions of the cerebral cortex were also evaluated on dataset II (109 subjects).

2.7. Classification. Support Vector Machine

In the present study, Support Vector Machines (SVM), from the MATLAB Classification Learner toolbox (MATLAB^®, MathWorks, Inc., Portola Valley, CA, USA) with a Gaussian Radial Basis Function (RBF) kernel, were used as the classification algorithm. SVM is one of the most widely used algorithms owing to its simplicity and the excellent results it has provided [50,51,52].

An RBF kernel SVM is a type of SVM that uses a nonlinear kernel function to map the input data into a higher-dimensional space where it becomes linearly separable by a hyperplane. The solution of the classification problem is

f (x) = C (\sum_{i}^{N} {α_{i} y_{i} k (x}_{i}, y_{i}) + b)

(5)

where x_i is the input vector (

x \in R^{N}

), y_i is the class label (y ∈ {−1, +1}), α_i is a set of Lagrange multipliers needed to solve the constrained optimization problem (

0 \leq α_{i} \leq C

, C is the box constraint),

b

stands for the bias and k is the RBF kernel defined as the exponential of the squared distance between two points in the feature space, which can be given by

k (x_{i}, y_{i}) = e^{(- γ {‖x_{i} {- y}_{i}‖}^{2})}

(6)

where γ is the kernel scale. The parameters C and γ were fit to have the best classification precision.

The feature tables extracted from EEG dataset I and II, which contained the PS, AI, and PLV features measured in the beta frequency band, were split into training and validation sets. This split was performed using a ten-fold cross-validation technique, which is commonly used in machine learning to evaluate the performance of a model while preventing overfitting. In cross-validation, the data is divided into k equal parts, and the model is trained k times, with each part serving as the validation set once. The average performance across all the k-folds is then reported as the overall performance of the model. This approach helps to ensure that the model generalizes well to unseen data and is not overly influenced by noise or outliers in the training set. In this way, the inputs of the classifier are the feature data and the output is the label of the corresponding subject.

2.8. Computation Setup

The calculations involved in the present study were performed on a computer with an AMD Ryzen 7 3800X (Advanced Micro Devices, Inc., Santa Clara, CA, USA) processor with 8 cores and 16 threads at 4.5 GHz, an Nvidia RTX 2060 (NVIDIA Corp., Santa Clara, CA, USA) graphics card with 6 Gb of memory at 1.7 GHz, and four 4 × 16 Gb (64 Gb) RAM modules with a CAS latency of 16 at 3.2 GHz.

3. Results

In this section, we provide the classification performance obtained from the two proposed feature selection methods (PCA and Wilcoxon signed-rank test) applied to EEG datasets I and II. The effectiveness of the selected methods was evaluated in terms of reducing the number of EEG channels to a minimum while maintaining a desirable level of precision in the biometric identification system. The cerebral cortex regions corresponding to the selected electrodes were determined based on their locations that guaranteed accurate biometric identification in dataset I, using each of the proposed methods. These regions were then evaluated in dataset II using the corresponding electrodes that are approximately located within those regions. The results obtained using the proposed methods are presented in Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7 and Table 8. As an initial benchmark, Table 3 shows that by using all 14 available electrodes in dataset I, 100% of classification accuracy was achieved to identify the 13 participants. Similarly, in the case of dataset II, Table 4 shows that 99.9% accuracy was achieved by utilizing all the 64 available electrodes to identify 109 participants.

3.1. Channel Selection Using PCA

After applying the PCA method to the feature table extracted from dataset I, the first component of each group of features was selected to carry out the ordering of the acquisition channels based on the score obtained. In Figure 3, the ratio of conservation of original data is represented according to the number of main components extracted for the groups of features PS, AI and PLV, correspondingly. The results showed that to keep 90% of the original information, it was necessary to extract three principal components for the PS set, eight for the AI set, and 48 for the PLV set. In other words, when using only the first principal component, the percentages of conservation of the original information on the different groups of characteristics were 71% for PS, 54% for AI and 39% for PLV.

Table 1 shows an example of calculating the weights

W_{{c h}_{i j}}

and the score for the case of the T8 channel after using the PCA method for feature ordering. Likewise, Table 2 shows the order obtained from all the channels based on the score calculated after having used the PCA on each group of features.

Once the EEG acquisition channels are sorted by relevance, the performance of the Gaussian kernel SVM classifier is shown in Table 3. The metric results of precision, sensitivity (recall), F1-score, and MCC (Mathew’s correlation coefficient), as well as the number of features used as a function of the number of acquisition channels chosen by the PCA method, are shown. The precision results for biometric identification were maintained with values above 82% using a minimum of three channels and above 90% using a minimum of four channels. The computation time for the PCA method was 36.2 ms. From these results, it stands out that when features were selected in the training of the classifiers that use three channels or less, their performance decreased drastically.

The representation of the evolution of the degree of precision as a function of the number of channels shown in Figure 4a reveals a clear and expected trend towards better performance, in terms of higher accuracy values and lower standard deviations, as the number of EEG channels used increases.

Table 1. Example of calculation of weights and scores of an EEG channel for its ordering after using the PCA method.

$P_{i}$	Features
$P_{i}$	$PS (k_{P S}$ = 2)	$AI (k_{A I}$ = 1)	$PLV (k_{P L V}$ = 1.2)
91			O1–T8
…			…
14	AF3		F7–T7
13	F8		P8–AF4
12	FC6		F3–O2
11	T8		T7–P7
10	T7		F3–P8
9	AF4		AF3–AF4
8	FC5		AF3–F3
7	F7	AF3–AF4	F7–AF4
6	F4	F3–F4	F3–P7
5	P7	T7–T8	F3–F4
4	F3	FC5–FC6	F3–T7
3	O2	O1–O2	F4–AF4
2	O1	F7–F8	F3–AF4
1	P8	P7–P8	F3–FC5
Weight for T8	${W_{{T 8}_{P S 1}} = 2 \frac{11}{14}}$	${W_{{T 8}_{A I 1}} = 1 \frac{5}{7}}$	{ $W_{{T 8}_{P L V 1}} = 1.2 \frac{91}{91}$ , $W_{{T 8}_{P L V 2}} = 1.2 \frac{86}{91}$ , …, $W_{{T 8}_{P L V 67}} = 1.2 \frac{24}{91}$ }
Score for T8	$W_{{T 8}_{P S 1}}$ $W_{{T 8}_{A I 1}} +$ $W_{{T 8}_{P L V 1}} +$ $W_{{T 8}_{P L V 2}} +$ $\dots + W_{{T 8}_{P L V 67}} = 13.61$

Table 2. EEG channels ordered by relevance using the principal component analysis method.

Order	Channel	Score
1	T8	13.61
2	F8	13.52
3	FC6	12.39
4	FC5	11.68
5	O1	11.40
6	O2	10.67
7	F4	10.2
8	F7	8.99
9	AF3	8.53
10	P7	8.51
11	T7	8.38
12	P8	7.18
13	AF4	5.35
14	F3	3.68

Table 3. Results of classification metrics based on the number of EEG channels used through the application of PCA.

Set of Channels	Set of Features	Precision (%)	Recall (%)	F1-Score (%)	MCC (%)
14	112	100	100	100	100
13	98	$99.62 \pm$ 1.39	$99.62 \pm$ 0.92	$99.61 \pm$ 0.80	$99.58 \pm$ 0.86
12	83	$99.42 \pm$ 1.50	$99.43 \pm$ 1.08	$99.42 \pm$ 0.98	$99.38 \pm$ 1.05
11	70	$99.04 \pm$ 1.63	$99.09 \pm$ 2.04	$99.04 \pm$ 1.14	$98.96 \pm$ 1.22
10	58	$98.65 \pm$ 2.42	$98.66 \pm$ 1.92	$98.65 \pm$ 1.96	$98.54 \pm$ 2.11
9	48	$97.88 \pm$ 2.00	$97.93 \pm$ 2.20	$97.89 \pm$ 1.56	$97.71 \pm$ 1.68
8	39	$97.12 \pm$ 2.47	$97.23 \pm$ 3.27	$97.13 \pm$ 2.02	$96.88 \pm$ 2.17
7	30	$95.77 \pm$ 2.77	$95.90 \pm$ 3.45	$95.78 \pm$ 2.13	$95.43 \pm$ 2.28
6	23	$95.00 \pm$ 3.82	$95.07 \pm$ 3.94	$95.00 \pm$ 3.41	$94.59 \pm$ 3.69
5	16	$91.73 \pm$ 8.13	$91.88 \pm$ 6.44	$91.67 \pm$ 6.46	$91.07 \pm$ 6.93
4	11	$90.00 \pm$ 11.68	$89.93 \pm$ 8.10	$89.76 \pm$ 9.49	$89.20 \pm$ 10.00
3	6	$82.88 \pm$ 13.02	$82.86 \pm$ 10.42	$82.71 \pm$ 11.18	$81.49 \pm$ 12.06
2	3	$66.73 \pm$ 21.52	$68.35 \pm$ 20.14	$66.55 \pm$ 19.85	$64.11 \pm$ 21.12
1	1	$40.38 \pm$ 28.95	$46.91 \pm$ 29.53	$36.00 \pm$ 22.64	$35.91 \pm$ 20.57

Figure 5 shows the topographical maps estimating the relevance of different cerebral cortex areas underlying the surface location of the channels selected by PCA, considering their corresponding classification precision result. As the number of electrodes used decreases, there is a trend for their concentration in the locations close to the parietal and right temporal lobes.

To further study the relevance of these specific cerebral cortex areas, the locations of the four electrodes (T8, F8, FC6, and FC5) selected by PCA that guaranteed an accuracy rate above 90% in dataset I were compared to corresponding channels in dataset II. These channels included AF8, F8, F6, F4, FT8, FC6, FC4, T8, CP6, C4, C6, TP8, FC5, FC3, C5, and C3. Table 4 complements these findings by reporting the classification metrics results of biometric identification achieved with dataset I and II using the corresponding set of channels located in the above mentioned cerebral cortex regions of interest in Figure 5. In addition, the results obtained when using all available channels in dataset II (64 channels) are also presented for comparison.

Table 4. Classification results for dataset I and II by using the channels located on the cerebral cortex regions of interest extracted by PCA.

Dataset	Set of Channels	Precision (%)	Recall (%)	F1-Score (%)	MCC (%)
I (13 subj.)	4	$90.00 \pm$ 11.68	$89.93 \pm$ 8.10	$89.76 \pm$ 9.49	$89.20 \pm$ 10.00
II (109 subj.)	16	$99.22 \pm$ 1.84	$99.20 \pm$ 2.02	$99.19 \pm$ 1.51	$99.18 \pm$ 1.51
II (109 subj.)	64	$99.91 \pm$ 0.64	$99.92 \pm$ 0.61	$99.91 \pm$ 0.45	$99.91 \pm$ 0.52

3.2. Channel Selection by Wilcoxon Signed-Rank Test

In this section, the results obtained by using the Wilcoxon signed-rank test as a method for ordering features from the table of features of the database I and the corresponding order of electrodes are shown. Table 5 illustrates an example of calculating the weights and score for the case of the AF4 channel, and Table 6 shows the order of the channels according to their calculated score. The computation time for the Wilcoxon signed-rank test method was 7.37 ms.

Once the EEG channels have been sorted, Table 7 displays the corresponding results of the classification metrics. In this case, as with the PCA method, high classification precision values are obtained using more than three channels with 80% and greater than 90% using four channels, although with a higher standard deviation and worse performance when the number of channels used is between 1 and 2 than in the case of the PCA method.

In the case of using the Wilcoxon test method, the representation of the evolution of the degree of accuracy as a function of the number of channels shown in Figure 4b also reveals, as occurred with the application of the PCA method, a clear and expected trend towards better performance, in terms of higher accuracy value and lower standard deviation, as the number of EEG channels used increases.

Figure 6 shows a graphical representation of the topographic evolution of precision, showing a tendency to concentrate them in the areas closest to the left frontal lobe as the number of electrodes used decreases.

The locations of the four electrodes (AF3, F3, F7, and FC5) selected by the Wilcoxon signed-rank test, which ensured an accuracy rate of over 90% in dataset I, were compared to the corresponding channels in dataset II that approximately match the corresponding location of those regions of the cerebral cortex. The considered channels for that case were F7, F5, F3, F1, Fz, AF7, AF3, AFz, FP1, FPz, FT7, FC5, FC3, FC1, C5, and C3. Table 8 displays the classification metrics results of biometric identification achieved with dataset I and II using the corresponding set of channels located in the left frontal lobe, as shown in Figure 6.

Table 5. Example of the calculation of weights and scores of an EEG channel for its ordering after using the Wilcoxon signed-rank test method.

$P_{i}$	Features
$P_{i}$	$PS (k_{P S}$ = 2)	$AI (k_{A I}$ = 1)	$PLV (k_{P L V}$ = 1.2)
91			AF–3F3
…			…
14			F3–P8
13			T7–F8
12	AF3		AF3–O1
11	F3		F7–AF4
10	FC6		F7–FC6
9	AF4		F3–FC5
8	AF4	AF3–AF4	P7–AF4
7	FC6	F7–F8	T7–T8
6	F8	F3–F4	P7–F8
5	AF4	FC5–FC6	P8–AF4
4	O2	T7–T8	F7–FC6
3	O2	T7–T8	F7–O2
2	T7	O1–O2	F7–O1
1	O1	O1–O2	P7–T8
Weight for AF4	{ $W_{{A F 4}_{P S 1}} = 2 \frac{9}{12}$ , $W_{{A F 4}_{P S 2}} = 2 \frac{8}{12}$ , …, $W_{{A F 4}_{P S 12}} = 2 \frac{5}{12}$ };	${W_{{A F 4}_{A I 1}} = 1 \frac{8}{8}}$ ;	{ $W_{{A F 4}_{P L V 1}} = 1.2 \frac{81}{91}$ , $W_{{A F 4}_{P L V 2}} = 1.2 \frac{86}{91}$ , …, $W_{{A F 4}_{P L V 95}} = 1.2 \frac{5}{91}$ }
Score for AF4	$W_{{A F 4}_{P S 1}} +$ $\dots + W_{{A F 4}_{P S 12}} + W_{{A F 4}_{A I 1}} +$ $W_{{A F 4}_{P L V 1}} +$ $W_{{A F 4}_{P L V 2}} +$ $\dots + W_{{A F 4}_{P L V 95}} = 9.74$

Table 6. EEG channels ordered by relevance using the Wilcoxon signed-rank test method.

Order	Channel	Score
1	AF3	14.58
2	F3	13.79
3	F7	11.50
4	FC5	11.07
5	T7	10.20
6	F4	9.74
7	AF4	9.64
8	P7	9.55
9	T8	8.24
10	FC6	8.10
11	F8	8.03
12	O2	6.90
13	O1	6.01
14	P8	4.58

Table 7. Results of classification metrics based on the number of EEG channels used through the application of the Wilcoxon signed-rank test method.

Set of Channels	Set of Features	Precision (%)	Recall (%)	F1-Score (%)	MCC (%)
14	112	100	100	100	100
13	98	$99.51 \pm$ 1.08	99.44 ± 1.07	99.42 ± 0.65	99.38 ± 0.70
12	83	$99.40 \pm$ 1.43	99.44 ± 1.07	99.42 ± 0.84	99.38 ± 0.89
11	70	$99.23 \pm$ 1.58	99.26 ± 1.51	99.23 ± 0.96	99.17 ± 1.03
10	58	$99.23 \pm$ 1.20	99.24 ± 1.18	99.23 ± 0.81	99.17 ± 0.88
9	48	$99.23 \pm$ 1.22	99.25 ± 1.52	99.23 ± 1.08	99.17 ± 1.17
8	39	$97.69 \pm$ 2.47	97.73 ± 2.31	97.69 ± 1.75	97.50 ± 1.89
7	30	$96.92 \pm$ $4$ .10	97.01 ± 3.67	96.92 ± 3.22	96.67 ± 3.47
6	23	$95.19 \pm$ $4$ .73	95.31 ± 4.78	95.20 ± 4.20	94.80 ± 4.52
5	16	$92.31 \pm$ 5.54	92.56 ± 7.04	92.36 ± 5.66	91.68 ± 6.14
4	11	90.58 $\pm$ 6.05	90.81 ± 6.71	90.60 ± 5.69	89.81 ± 6.15
3	6	$80.58 \pm$ 11.02	80.88 ± 11.10	80.57 ± 10.48	78.98 ± 11.33
2	3	$66.38 \pm$ 18.51	59.63 ± 15.88	59.55 ± 16.93	57.17 ± 17.71
1	1	$42.31 \pm$ 26.11	38.83 ± 24.88	38.63 ± 23.66	37.98 ± 23.25

Table 8. Classification results for dataset I and II by using the channels located on the cerebral cortex regions of interest extracted by Wilcoxon signed-rank test.

Dataset	Set of Channels	Precision (%)	Recall (%)	F1-Score (%)	MCC (%)
I (13 subj.)	4	$90.58 \pm$ 6.05	$90.81 \pm$ 6.71	$90.60 \pm$ 5.69	$89.81 \pm$ 6.15
II (109 subj.)	16	$99.52 \pm$ 1.41	$99.51 \pm$ 1.45	$99.51 \pm$ 1.12	$99.50 \pm$ 1.02
II (109 subj.)	64	$99.91 \pm$ 0.64	$99.92 \pm$ 0.61	$99.91 \pm$ 0.45	$99.91 \pm$ 0.52

4. Discussion

In the present study, we proposed a method for the calculation and establishment of an ordering of EEG channels based on the degree of relevance provided by the features in which they are involved. The applied feature ordering techniques, namely, PCA and the Wilcoxon signed-rank test, were independently compared for this purpose. Based on the use of both techniques, the obtained results indicated that a minimum of four EEG electrodes is recommended to achieve sufficient biometric identification accuracy on the self-collected dataset of 13 individuals using a non-expensive EEG device. This resulted in a 75% reduction with respect to the initial used quantity of electrodes, from 16 to 4, by selecting the electrodes located in the corresponding cerebral cortex regions of interest.

Regarding the location of the selected channels, as the number of electrodes used in both feature selection cases (PCA or Wilcoxon test) was reduced, the locations of the electrodes were concentrated in opposite lobe hemisphere areas. In this context, these extracted regions of interest were different depending on the employed feature selection method. We have found that when using PCA, the electrode placement concentrated on the right parietal and temporal lobes, whereas in the case of using Wilcoxon signed-rank test, the regions of interest were found near the left frontal lobe. Furthermore, the evaluation of biometric identification performance on a dataset of 109 individuals, considering the influence of these regions, revealed that the classification accuracy remained desirable and consistent, even with a significant increase in the number of subjects. The reduction in electrode usage was also 75%, as in the previous case, where the 64 initial electrodes were reduced to 16, which were localized in the aforementioned cerebral cortex areas of interest.

By contrast to alternative approaches, some of the most recent studies in the related literature have employed various electrode selection techniques for the problem of biometric identification of individuals based on EEG. Among these techniques, those related to the use of genetic algorithms (GA) stand out as prominent, as they can search a large search space to find the best possible solution to the problem.

In this context, a genetic algorithm-based method for reducing the number of EEG electrodes to those providing maximum information for identification and eliminating redundant electrodes with maximum accuracy is presented in [53]. The study utilized the Physiontet BCI dataset, which was also used in the present study, consisting of recordings from 109 subjects and 64 EEG acquisition channels, as well as a self-collected dataset comprising recordings from 30 individuals and 14 channels. The study focused on the acquisition protocols of eyes open (EC), eyes closed (EO), and relaxation and concentration. The application of the genetic algorithm resulted in a reduction in the number of electrodes to approximately 9 to 12 channels. Using the beta frequency band to train a Fine Gaussian kernel SVM classifier, the accuracy rates ranged from 94% to 98.9%. The selected electrodes were located in the frontal, central, and parietal lobes. These regions of the superficial cerebral cortex, which have been demonstrated to be crucial in biometric identification of individuals, coincide with those identified by the proposed method. However, our method yields slightly better identification results with the use of 9 to 12 channels than the proposed in [53], achieving an accuracy rate of up to 99.4%.

Meanwhile, in [54], the non-dominated sorting genetic algorithm (NSGA) was utilized to address the multi-objective optimization problem of decreasing the number of EEG channels while maximizing the accuracy of multi-class classification, increasing the number of accepted subjects with access, and maximizing the number of intruders rejected. The authors tested their method on a dataset composed of event-related potentials (ERPs) recorded from 26 subjects using 56 EEG channels. The authors extracted features related to signal energy and fractal dimension using empirical mode decomposition (EMD) for each channel. By employing the NSGA algorithm, they were able to select seven channels that resulted in a subject identification accuracy of up to 98% when using an RBF-SVM classifier. Compared to our proposed method in the present study, as shown in the Section 3, the method was able to achieve better accuracy results and a higher proportion of reduction in the number of electrodes used when using the same classifier (RBF-SVM). It should be noted, however, that the EEG signal acquisition strategies studied in relation to mental tasks were different. Regarding feature extraction, one of the most crucial stages in classification procedures, our method not only used information from each electrode separately on the EEG signal, but also extracted information about the interrelation between them through functional connectivity, demonstrating that it is a feature that enhances biometric identification based on EEG.

Recently, in [19], the authors applied the GA to two separate datasets. The first dataset comprised EEG recordings from 21 volunteers using 19 EEG channels with audio-evoked responses as EEG recordings. In the second dataset, the authors initially selected 19 channels of interest with the motor action and motor imagery acquisition protocols. This dataset was the Physionet BCI dataset, consisting of 109 subjects. In both cases, coherence (COH) was used as the functional connectivity metric feature that yielded the best results, and a convolutional neural network (CNN) was employed as the classifier. In the 109 subjects’ dataset, the number of electrodes was reduced from 19 to 15, and the proposed method’s performance was only slightly affected, achieving 97.74% accuracy in the motor imagery protocol. In the dataset collected by the authors themselves, the number of electrodes was reduced from 19 to 11, achieving an accuracy rate of 99.56%. Compared to the results obtained using the database of 109 subjects and motor imagery mental tasks, it should be noted that similar identification accuracy and degree of reduction in the number of electrodes were achieved with the proposed method. However, the regions of the cerebral cortex where the selected electrodes are located differ. The results of the proposed method highlight the influence of the right parietal and temporal and left frontal lobes, while those achieved in [19] also include the occipital lobes.

One alternative approach to the previous literature on optimizing the number of EEG channels for biometric identification was proposed in [55]. The authors used a frequently occurring maximum power algorithm to achieve this goal on two databases, using resting-state acquisition strategy: the previously mentioned Physionet database and a self-collected dataset of 16 subjects. In both datasets, they achieved a reduction from 64 to 20 channels with an equal error rate (ERR) value of 0.0039. The selected 20 channels were predominantly located in the left hemisphere’s frontal, fronto-temporal, and fronto-central lobes. Although the acquisition strategy used in [55] (resting state) differs from the one used in the present study for the same database (motor imagery), the brain regions identified are similar to those obtained with the proposed method in this study. Moreover, the achieved trade-off between the minimum number of electrodes and the maximum possible identification accuracy was comparable between both methods.

In recent years, several studies have employed metaheuristic optimization algorithms, including the approach presented in [56]. In this work, the authors proposed a methodology based on the Flower Pollination Algorithm (FPA) and β-Hill Climbing optimizer, named FPA β-hc, to select the EEG channels that provide the most relevant information for biometric identification. Two techniques have been utilized to extract features from each individual EEG channel: Wavelet Transform and Auto-regressive (AR) models. The method was tested on the above mentioned Physionet dataset using EEG motor imagery as the acquisition protocol strategy. The results showed that the proposed method achieved 100% accuracy in identifying the 109 subjects by reducing the number of electrodes to 35. Regarding the degree of reduction in the number of electrodes and the achieved identification accuracy, it should be noted that both this method and the one proposed in the present study achieved a sufficient level of accuracy. However, the number of electrodes required in the proposed method was significantly lower (16 electrodes). Furthermore, the proposed method was successfully tested on a database with signals acquired from a low-cost EEG device, where it was highlighted that only four electrodes were needed to achieve acceptable identification results.

5. Conclusions

A comparative analysis of two dimensionality reduction techniques (PCA and Wilcoxon signed-rank test) using an automatic classification algorithm for the biometric identification of individuals based on their particular patterns of brain nerve activity has been shown. In this context, it has been demonstrated that the techniques applied are feasible for the study of the optimal number of EEG signal acquisition electrodes needed to obtain a sufficient degree of accuracy in classification. Furthermore, based on this information, it has been shown which electrode location areas provide the most relevant information to the system for the mental task that has been carried out to enable their identification. In this way, it has been determined that these areas are close to the frontotemporal areas when the subjects were performing the motor imagery mental task, although a greater concentration has been seen on the left or right side depending on the dimensional reduction technique applied.

Using the results obtained in both methods, it has been demonstrated that it is possible to maintain sufficient accuracy ratios by using at least four acquisition sensors with a low-cost EEG device.

Author Contributions

Conceptualization, J.O.-R., J.F.G.-G. and E.P.; methodology, J.O.-R.; software, J.O.-R.; validation, J.O.-R.; formal analysis, J.O.-R., J.F.G.-G. and E.P.; resources, J.F.G.-G. and E.P.; writing—original draft preparation, J.O.-R., J.F.G.-G. and E.P.; writing—review and editing, J.O.-R., J.F.G.-G. and E.P.; funding acquisition, J.F.G.-G. and E.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded in part by Consejería de Economía, Industria, Comercio y Conocimiento of the Canary Islands Government (Spain) and European Regional Development Fund (ERDF), grant number ProID2017010100, MINECO grant number TEC2016-80063-C3-2-R, and the project MACBIOIDI2 MAC2/1.1b/352, within the INTERREG Program, funded by the European Regional Development Fund (ERDF).

Institutional Review Board Statement

The study was approved by the ethical committee of the University of La Laguna (registration number: CEIBA2020-0405).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Not applicable.

Acknowledgments

Jordan Ortega-Rodríguez received a fellowship from Agencia Canaria de Investigación, Innovación y Sociedad de la Información (ACIISI) from the Canary Islands Government (Spain).

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

Thorpe, J.; van Oorschot, P.C.; Somayaji, A. Pass-Thoughts: Authenticating with Our Minds. In Proceedings of the New Security Paradigms Workshop, Schloss Dagstuhl, Germany, 19–22 September 2006. [Google Scholar]
Nandakumar, K.; Jain, A.K.; Nagar, A. Biometric Template Security. EURASIP J. Adv. Signal Process. 2008, 2008, 113. [Google Scholar] [CrossRef]
Putte, T.; Keuning, J. Biometrical Fingerprint Recognition: Don’t Get Your Fingers Burned. In Smart Card Research and Advanced Applications; Springer International Publishing: Berlin/Heidelberg, Germany, 2000. [Google Scholar]
Singh, Y.N.; Singh, S.K. Vitality Detection from Biometrics: State-of-the-Art. In Proceedings of the 2011 World Congress on Information and Communication Technologies (WICT), Dijon, France, 21–23 June 2011. [Google Scholar]
Galbally, J.; Marcel, S.; Fierrez, J. Biometric Antispoofing Methods: A Survey in Face Recognition. IEEE Access 2014, 2, 1530–1552. [Google Scholar] [CrossRef]
Gupta, P.; Behera, S.; Vatsa, M.; Singh, R. On Iris Spoofing Using Print Attack. In Proceedings of the International Conference on Pattern Recognition, Stockholm, Sweden, 24–28 August 2014. [Google Scholar]
Ortega-Rodríguez, J.; Martín-Chinea, K.; Gómez-González, J.F.; Pereda, E. Brainprint Based on Functional Connectivity and Asymmetry Indices of Brain Regions: A Case Study of Biometric Person Identification with Non-expensive Electroencephalogram Headsets. IET Biom. 2023. [Google Scholar] [CrossRef]
Paranjape, R.B.; Mahovsky, J.; Benedicenti, L.; Koles, Z. The Electroencephalogram as a Biometric. Can. Conf. Electr. Comput. Eng. 2001, 2, 1363–1366. [Google Scholar] [CrossRef]
Palaniappan, R.; Mandic, D.P. EEG Based Biometric Framework for Automatic Identity Verification. J. VLSI Signal Process. Syst. Signal Image Video Technol. 2007, 49, 243–250. [Google Scholar] [CrossRef]
Del Pozo-Banos, M.; Alonso, J.B.; Ticay-Rivas, J.R.; Travieso, C.M. Electroencephalogram Subject Identification: A Review. Expert Syst. Appl. 2014, 41, 6537–6554. [Google Scholar] [CrossRef]
Sirvent Blasco, J.L.; Iáñez, E.; Úbeda, A.; Azorín, J.M. Visual Evoked Potential-Based Brain-Machine Interface Applications to Assist Disabled People. Expert Syst. Appl. 2012, 39, 7908–7918. [Google Scholar] [CrossRef]
McFarland, D.J.; Wolpaw, J.R. Brain-Computer Interfaces for Communication and Control. Commun. ACM 2011, 54, 60–66. [Google Scholar] [CrossRef]
Vaid, S.; Singh, P.; Kaur, C. EEG Signal Analysis for BCI Interface: A Review. In Proceedings of the International Conference on Advanced Computing and Communication Technologies (ACCT), Haryana, India, 21–22 February 2015. [Google Scholar]
Collura, T.F. History and Evolution of Electroencephalographic Instruments and Techniques. J. Clin. Neurophysiol. 1993, 10, 476–504. [Google Scholar] [CrossRef]
Roohi-Azizi, M.; Azimi, L.; Heysieattalab, S.; Aamidfar, M. Changes of the Brain’s Bioelectrical Activity in Cognition, Consciousness, and Some Mental Disorders. Med. J. Islam. Repub. Iran 2017, 31, 307–312. [Google Scholar] [CrossRef] [PubMed]
Klonowski, W. Everything You Wanted to Ask about EEG but Were Afraid to Get the Right Answer. Nonlinear Biomed. Phys. 2009, 3, 2. [Google Scholar] [CrossRef]
Miladinović, A.; Ajčević, M.; Jarmolowska, J.; Marusic, U.; Colussi, M.; Silveri, G.; Battaglini, P.P.; Accardo, A. Effect of Power Feature Covariance Shift on BCI Spatial-Filtering Techniques: A Comparative Study. Comput. Methods Programs Biomed. 2021, 198, 105808. [Google Scholar] [CrossRef]
Kong, W.; Wang, L.; Xu, S.; Babiloni, F.; Chen, H. EEG Fingerprints: Phase Synchronization of EEG Signals as Biomarker for Subject Identification. IEEE Access 2019, 7, 121165–121173. [Google Scholar] [CrossRef]
Ashenaei, R.; Asghar Beheshti, A.; Yousefi Rezaii, T. Stable EEG-Based Biometric System Using Functional Connectivity Based on Time-Frequency Features with Optimal Channels. Biomed. Signal Process. Control. 2022, 77, 103790. [Google Scholar] [CrossRef]
Wang, M.; El-Fiqi, H.; Hu, J.; Abbass, H.A. Convolutional Neural Networks Using Dynamic Functional Connectivity for EEG-Based Person Identification in Diverse Human States. IEEE Trans. Inf. Forensics Secur. 2019, 14, 3359–3372. [Google Scholar] [CrossRef]
Hu, J.; Mu, Z.; Wang, J. Phase Locking Analysis of Motor Imagery in Brain-Computer Interface. In Proceedings of the 2008 International Conference on BioMedical Engineering and Informatics, Sanya, China, 28–30 May 2008; IEEE: New York, NY, USA, 2008; Volume 2, pp. 478–481. [Google Scholar]
Caramia, N.; Lotte, F.; Ramat, S. Optimizing Spatial Filter Pairs for EEG Classification Based on Phase-Synchronization. In Proceedings of the ICASSP, IEEE International Conference on Acoustics, Speech and Signal Proceedings, Florence, Italy, 4–9 May 2014; pp. 2049–2053. [Google Scholar] [CrossRef]
Campisi, P.; Scarano, G.; Babiloni, F.; DeVico Fallani, F.; Colonnese, S.; Maiorana, E.; Forastiere, L. Brain Waves Based User Recognition Using the “Eyes Closed Resting Conditions” Protocol. In Proceedings of the 2011 IEEE International Workshop on Information Forensics and Security (WIFS), Iguacu Falls, Brazil, 29 November–2 December 2011. [Google Scholar]
David, O.; Cosmelli, D.; Friston, K.J. Evaluation of Different Measures of Functional Connectivity Using a Neural Mass Model. Neuroimage 2004, 21, 659–673. [Google Scholar] [CrossRef]
Billinger, M.; Brunner, C.; Müller-Putz, G.R. Single-Trial Connectivity Estimation for Classification of Motor Imagery Data. J. Neural Eng. 2013, 10, 046006. [Google Scholar] [CrossRef] [PubMed]
Song, Y.; Zhang, J. Automatic Recognition of Epileptic EEG Patterns via Extreme Learning Machine and Multiresolution Feature Extraction. Expert Syst. Appl. 2013, 40, 5477–5489. [Google Scholar] [CrossRef]
Sabeti, M.; Katebi, S.D.; Boostani, R.; Price, G.W. A New Approach for EEG Signal Classification of Schizophrenic and Control Participants. Expert Syst. Appl. 2011, 38, 2063–2071. [Google Scholar] [CrossRef]
Mao, C.; Hu, B.; Wang, M.; Moore, P. EEG-Based Biometric Identification Using Local Probability Centers. In Proceedings of the International Joint Conference on Neural Networks, Killarney, Ireland, 12–17 July 2015. [Google Scholar]
Friston, K.J. Functional and Effective Connectivity: A Review. Brain Connect. 2011, 1, 13–36. [Google Scholar] [CrossRef]
Blinowska, K.J.; Rakowski, F.; Kaminski, M.; de Vico Fallani, F.; del Percio, C.; Lizio, R.; Babiloni, C. Functional and Effective Brain Connectivity for Discrimination between Alzheimer’s Patients and Healthy Individuals: A Study on Resting State EEG Rhythms. Clin. Neurophysiol. 2017, 128, 667–680. [Google Scholar] [CrossRef]
Campisi, P.; La Rocca, D. Brain Waves for Automatic Biometric-Based User Recognition. IEEE Trans. Inf. Forensics Secur. 2014, 9, 782–800. [Google Scholar] [CrossRef]
La Rocca, D.; Campisi, P.; Vegso, B.; Cserti, P.; Kozmann, G.; Babiloni, F.; de Vico Fallani, F. Human Brain Distinctiveness Based on EEG Spectral Coherence Connectivity. IEEE Trans. Biomed. Eng. 2014, 61, 2406–2412. [Google Scholar] [CrossRef] [PubMed]
Rodrigues, D.; Silva, G.F.A.; Papa, J.P.; Marana, A.N.; Yang, X.S. EEG-Based Person Identification through Binary Flower Pollination Algorithm. Expert Syst. Appl. 2016, 62, 81–90. [Google Scholar] [CrossRef]
Koike-Akino, T.; Mahajan, R.; Marks, T.K.; Wang, Y.; Watanabe, S.; Tuzel, O.; Orlik, P. High-Accuracy User Identification Using EEG Biometrics. In Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBS), Orlando, FL, USA, 16–20 August 2016. [Google Scholar]
Goldberger, A.L.; Amaral, L.A.N.; Glass, L.; Hausdorff, J.M.; Ivanov, P.C.; Mark, R.G.; Mietus, J.E.; Moody, G.B.; Peng, C.; Stanley, H.E. PhysioBank, PhysioToolkit, and PhysioNet. Circulation 2000, 101, E215–E220. [Google Scholar] [CrossRef] [PubMed]
Kong, W.; Zhou, Z.; Jiang, B.; Babiloni, F.; Borghini, G. Assessment of Driving Fatigue Based on Intra/Inter-Region Phase Synchronization. Neurocomputing 2017, 219, 474–482. [Google Scholar] [CrossRef]
Schalk, G.; McFarland, D.J.; Hinterberger, T.; Birbaumer, N.; Wolpaw, J.R. BCI2000: A General-Purpose Brain-Computer Interface (BCI) System. IEEE Trans. Biomed. Eng. 2004, 51, 1034–1043. [Google Scholar] [CrossRef] [PubMed]
Oostenveld, R.; Fries, P.; Maris, E.; Schoffelen, J.M. FieldTrip: Open Source Software for Advanced Analysis of MEG, EEG, and Invasive Electrophysiological Data. Comput. Intell. Neurosci. 2011, 2011, 156869. [Google Scholar] [CrossRef] [PubMed]
Martín-Chinea, K.; Ortega, J.; Gómez-González, J.F.; Pereda, E.; Toledo, J.; Acosta, L. Effect of Time Windows in LSTM Networks for EEG-Based BCIs. Cogn. Neurodyn. 2022, 17, 385–398. [Google Scholar] [CrossRef]
McFarland, D.J.; Miner, L.A.; Vaughan, T.M.; Wolpaw, J.R. Mu and Beta Rhythm Topographies during Motor Imagery and Actual Movements. Brain Topogr. 2000, 12, 177–186. [Google Scholar] [CrossRef] [PubMed]
Thomson, D.J. Spectrum Estimation and Harmonic Analysis. Proc. IEEE 1982, 70, 1055–1096. [Google Scholar] [CrossRef]
Bai, O.; Mari, Z.; Vorbach, S.; Hallett, M. Asymmetric Spatiotemporal Patterns of Event-Related Desynchronization Preceding Voluntary Sequential Finger Movements: A High-Resolution EEG Study. Clin. Neurophysiol. 2005, 116, 1213–1221. [Google Scholar] [CrossRef]
García-Prieto, J.; Bajo, R.; Pereda, E. Efficient Computation of Functional Brain Networks: Toward Real-Time Functional Connectivity. Front. Neuroinformatics 2017, 11, 8. [Google Scholar] [CrossRef]
Fraschini, M.; Pani, S.M.; Didaci, L.; Marcialis, G.L. Robustness of Functional Connectivity Metrics for EEG-Based Personal Identification over Task-Induced Intra-Class and Inter-Class Variations. Pattern Recognit. Lett. 2019, 125, 49–54. [Google Scholar] [CrossRef]
Tǎutan, A.M.; Rossi, A.C.; de Francisco, R.; Ionescu, B. Dimensionality Reduction for EEG-Based Sleep Stage Detection: Comparison of Autoencoders, Principal Component Analysis and Factor Analysis. Biomed. Tech. 2021, 66, 125–136. [Google Scholar] [CrossRef]
Artoni, F.; Delorme, A.; Makeig, S. Applying Dimension Reduction to EEG Data by Principal Component Analysis Reduces the Quality of Its Subsequent Independent Component Decomposition. Neuroimage 2018, 175, 176–187. [Google Scholar] [CrossRef] [PubMed]
Shi, L.C.; Duan, R.N.; Lu, B.L. A Robust Principal Component Analysis Algorithm for EEG-Based Vigilance Estimation. In Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBS), Osaka, Japan, 3–7 July 2013. [Google Scholar]
Abdi, H.; Williams, L.J. Principal Component Analysis. Wiley Interdiscip. Rev. Comput. Stat. 2010, 2, 433–459. [Google Scholar] [CrossRef]
Wilcoxon, F. Individual Comparisons by Ranking Methods. Biom. Bull. 1945, 1, 80–83. [Google Scholar] [CrossRef]
Wang, X.W.; Nie, D.; Lu, B.L. EEG-Based Emotion Recognition Using Frequency Domain Features and Support Vector Machines. In Neural Information Processing; Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Springer: Berlin/Heidelberg, Germany, 2011; Volume 7062. [Google Scholar]
Luo, J.; Gao, X.; Zhu, X.; Wang, B.; Lu, N.; Wang, J. Motor Imagery EEG Classification Based on Ensemble Support Vector Learning. Comput. Methods Programs Biomed. 2020, 193, 105464. [Google Scholar] [CrossRef] [PubMed]
Sercan Bayram, K.; Kızrak, M.A. Classification of EEG Signals by Using Support Vector Machines. In Proceedings of the 2013 IEEE INISTA, Albena, Bulgaria, 19–21 June 2013. [Google Scholar] [CrossRef]
Albasri, A.; Abdali-Mohammadi, F.; Fathi, A. EEG Electrode Selection for Person Identification Thru a Genetic-Algorithm Method. J. Med. Syst. 2019, 43, 297. [Google Scholar] [CrossRef]
Moctezuma, L.A.; Molinas, M. Multi-Objective Optimization for EEG Channel Selection and Accurate Intruder Detection in an EEG-Based Subject Identification System. Sci. Rep. 2020, 10, 5850. [Google Scholar] [CrossRef] [PubMed]
Monsy, J.C.; Vinod, A.P. EEG-based Biometric Identification Using Frequency-weighted Power Feature. IET Biom. 2020, 9, 251–258. [Google Scholar] [CrossRef]
Alyasseri, Z.A.A.; Alomari, O.A.; Papa, J.P.; Al-Betar, M.A.; Abdulkareem, K.H.; Mohammed, M.A.; Kadry, S.; Thinnukool, O.; Khuwuthyakorn, P. EEG Channel Selection Based User Identification via Improved Flower Pollination Algorithm. Sensors 2022, 22, 2092. [Google Scholar] [CrossRef] [PubMed]

Figure 1. The steps followed for the identification of individuals using EEG-based biometric measurements.

Figure 2. Flowchart of the EEG channel selection procedure using the Wilcoxon signed-rank test or PCA method.

Figure 3. Ratio of conservation of the original data as a function of the number of principal components extracted for each group of characteristics by means of PCA.

Figure 4. Precision obtained by applying PCA (a) and Wilcoxon signed-rank test (b) as a function of the number of acquisition channels.

Figure 5. Topographic representation of the precision obtained as a function of the number of channels used and their corresponding location from the PCA application.

Figure 6. Topographic representation of the precision obtained as a function of the number of channels used and their corresponding location from the application of the Wilcoxon signed-rank test.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ortega-Rodríguez, J.; Gómez-González, J.F.; Pereda, E. Selection of the Minimum Number of EEG Sensors to Guarantee Biometric Identification of Individuals. Sensors 2023, 23, 4239. https://doi.org/10.3390/s23094239

AMA Style

Ortega-Rodríguez J, Gómez-González JF, Pereda E. Selection of the Minimum Number of EEG Sensors to Guarantee Biometric Identification of Individuals. Sensors. 2023; 23(9):4239. https://doi.org/10.3390/s23094239

Chicago/Turabian Style

Ortega-Rodríguez, Jordan, José Francisco Gómez-González, and Ernesto Pereda. 2023. "Selection of the Minimum Number of EEG Sensors to Guarantee Biometric Identification of Individuals" Sensors 23, no. 9: 4239. https://doi.org/10.3390/s23094239

APA Style

Ortega-Rodríguez, J., Gómez-González, J. F., & Pereda, E. (2023). Selection of the Minimum Number of EEG Sensors to Guarantee Biometric Identification of Individuals. Sensors, 23(9), 4239. https://doi.org/10.3390/s23094239

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Selection of the Minimum Number of EEG Sensors to Guarantee Biometric Identification of Individuals

Abstract

1. Introduction

2. Materials and Methods

2.1. Experimental Procedure

2.2. Data Acquisition

2.3. Data Preprocessing

2.4. Feature Extraction

2.5. Feature Selection

2.6. Channel Selection

2.7. Classification. Support Vector Machine

2.8. Computation Setup

3. Results

3.1. Channel Selection Using PCA

3.2. Channel Selection by Wilcoxon Signed-Rank Test

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI