From Acquisition to Validation: Methodological Dependencies and Reproducibility in EEG-Based Alzheimer’s Disease Detection
Abstract
1. Introduction
- Cohort: Sample size, demographics, diagnostic criteria, inclusion/exclusion criteria, and study setting (single-center or multi-center).
- EEG acquisition and quality control: Device, channel number, montage/reference, sampling frequency, recording paradigm, duration, and quality control procedures including quantitative quality metrics where available.
- Preprocessing: Filtering parameters (type, cutoffs, order), re-referencing scheme, segmentation (epoch length and overlap), artifact handling method and parameters, downsampling procedure, and whether data-dependent parameters were determined within training folds or globally.
- Feature representation: Feature domains, key parameter settings, and feature-selection strategy, including whether selection was nested within cross-validation; distinguish spectral, connectivity, and complexity features as appropriate.
- Modeling and validation: Prediction unit (segment- or subject-level), segment aggregation strategy if applicable, model type, hyperparameter tuning procedure, cross-validation design (nested or non-nested), and leakage prevention measures adopted.
- Evaluation and reproducibility: Class balance, performance metrics beyond accuracy (area under the curve, balanced accuracy, F1 score), uncertainty estimates (confidence intervals or fold-wise variability), external validation status, and code/data availability.
2. The Methodological Chain Framework
3. Data Acquisition
3.1. Paradigm-Dependent Signal Variability
3.2. Recording Configuration
3.3. Sampling Frequency
3.4. Quality Control
3.5. Summary
4. EEG Preprocessing in AD Studies
4.1. Referencing
4.2. Filtering
4.3. Artifact Attenuation
4.4. Segmentation and Epoch Length
4.5. Pipeline Integrity
4.6. Summary
5. Feature Representation and Modeling: An Interdependent Design Space
5.1. Low-Dimensional Representations and Conventional Machine Learning
5.2. High-Dimensional Structured Representations and Deep Learning
5.3. Connectivity-Based Representations
5.4. Graph-Based Representations
5.5. End-to-End Representation Learning Under Data Constraints
5.6. Synthesis: Representation, Validation, and the Methodological Chain
6. Validation and Reliability in EEG-Based AD Studies
6.1. Subject-Level Versus Segment-Level Evaluation
6.2. Cross-Validation and Nested Design
6.3. Data Leakage and Pipeline Integrity
6.4. External Validation and Generalizability
6.5. Dataset Size and Statistical Reliability
6.6. Methodological Quality Map of Representative Studies
7. Limitations and Future Directions
8. Conclusions
Funding
Data Availability Statement
Conflicts of Interest
References
- Alzheimer’s Association. 2025 Alzheimer’s disease facts and figures. Alzheimers Dement. 2025, 21, e70235. [Google Scholar]
- Blennow, K.; de Leon, M.J.; Zetterberg, H. Alzheimer’s disease. Lancet 2006, 368, 387–403. [Google Scholar] [CrossRef]
- Jack, C.R., Jr.; Andrews, J.S.; Beach, T.G.; Buracchio, T.; Dunn, B.; Graf, A.; Hansson, O.; Ho, C.; Jagust, W.; McDade, E.; et al. Revised criteria for diagnosis and staging of Alzheimer’s disease: Alzheimer’s Association Workgroup. Alzheimers Dement. 2024, 20, 5143–5169. [Google Scholar] [CrossRef]
- Siuly, S.; Alçin, Ö.F.; Kabir, E.; Şengür, A.; Wang, H.; Zhang, Y.; Whittaker, F. A new framework for automatic detection of patients with mild cognitive impairment using resting-state EEG signals. IEEE Trans. Neural Syst. Rehabil. Eng. 2020, 28, 1966–1976. [Google Scholar] [CrossRef]
- Trinh, T.T.; Liu, Y.H.; Wu, C.T.; Peng, W.H.; Hou, C.L.; Weng, C.H.; Lee, C.Y. PLI-based connectivity in resting-EEG is a robust and generalizable feature for detecting MCI and AD: A validation on a diverse multisite clinical dataset. In Proceedings of the 2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Sydney, Australia, 24–27 July 2023; IEEE: New York, NY, USA, 2023; pp. 1–6. [Google Scholar]
- Lin, Y.; Shi, X.; Mu, J.; Ren, H.; Jiang, X.; Zhu, L.; Cai, X.; Lian, C.; Pei, Z.; Zhang, Y.; et al. Uncovering stage-specific neural and molecular progression in Alzheimer’s disease: Implications for early screening. Alzheimers Dement. 2025, 21, e70182. [Google Scholar] [CrossRef]
- Young, V.M.; Gates, S.; Garcia, L.Y.; Salardini, A. Data Leakage in Deep Learning for Alzheimer’s Disease Diagnosis: A Scoping Review of Methodological Rigor and Performance Inflation. Diagnostics 2025, 15, 2348. [Google Scholar] [CrossRef]
- Brookshire, G.; Kasper, J.; Blauch, N.M.; Wu, Y.C.; Glatt, R.; Merrill, D.A.; Gerrol, S.; Yoder, K.J.; Quirk, C.; Lucero, C. Data leakage in deep learning studies of translational EEG. Front. Neurosci. 2024, 18, 1373515. [Google Scholar] [CrossRef] [PubMed]
- Vecchio, F.; Babiloni, C.; Lizio, R.; Fallani, F.D.V.; Blinowska, K.; Verrienti, G.; Frisoni, G.; Rossini, P.M. Resting state cortical EEG rhythms in Alzheimer’s disease: Toward EEG markers for clinical applications: A review. Suppl. Clin. Neurophysiol. 2013, 62, 223–236. [Google Scholar] [PubMed]
- Adebisi, A.T.; Veluvolu, K.C. Brain network analysis for the discrimination of dementia disorders using electrophysiology signals: A systematic review. Front. Aging Neurosci. 2023, 15, 1039496. [Google Scholar] [CrossRef] [PubMed]
- Cassani, R.; Estarellas, M.; San-Martin, R.; Fraga, F.J.; Falk, T.H. Systematic review on resting-state EEG for Alzheimer’s disease diagnosis and progression assessment. Dis. Markers 2018, 2018, 5174815. [Google Scholar] [CrossRef]
- Paitel, E.R.; Otteman, C.B.; Polking, M.C.; Licht, H.J.; Nielson, K.A. Functional and effective EEG connectivity patterns in Alzheimer’s disease and mild cognitive impairment: A systematic review. Front. Aging Neurosci. 2025, 17, 1496235. [Google Scholar] [CrossRef]
- Ouchani, M.; Gharibzadeh, S.; Jamshidi, M.; Amini, M. A review of methods of diagnosis and complexity analysis of Alzheimer’s disease using EEG signals. BioMed Res. Int. 2021, 2021, 5425569. [Google Scholar] [CrossRef] [PubMed]
- Perez-Valero, E.; Lopez-Gordo, M.A.; Morillas, C.; Pelayo, F.; Vaquero-Blasco, M.A. A review of automated techniques for assisting the early detection of Alzheimer’s disease with a focus on EEG. J. Alzheimer’s Dis. 2021, 80, 1363–1376. [Google Scholar] [CrossRef]
- Costanzo, M.; Cutrona, C.; Leodori, G.; Malimpensa, L.; D’antonio, F.; Conte, A.; Belvisi, D. Exploring easily accessible neurophysiological biomarkers for predicting Alzheimer’s disease progression: A systematic review. Alzheimers Res. Ther. 2024, 16, 244. [Google Scholar] [CrossRef] [PubMed]
- Yuan, Y.; Zhao, Y. The role of quantitative EEG biomarkers in Alzheimer’s disease and mild cognitive impairment: Applications and insights. Front. Aging Neurosci. 2025, 17, 1522552. [Google Scholar] [CrossRef]
- Dauwels, J.; Vialatte, F.; Cichocki, A. Diagnosis of Alzheimer’s disease from EEG signals: Where are we standing? Curr. Alzheimer Res. 2010, 7, 487–505. [Google Scholar] [CrossRef]
- Babiloni, C.; Arakaki, X.; Azami, H.; Bennys, K.; Blinowska, K.; Bonanni, L.; Bujan, A.; Carrillo, M.C.; Cichocki, A.; de Frutos-Lucas, J.; et al. Measures of resting state EEG rhythms for clinical trials in Alzheimer’s disease: Recommendations of an expert panel. Alzheimers Dement. 2021, 17, 1528–1553. [Google Scholar] [CrossRef]
- Jeong, J. EEG dynamics in patients with Alzheimer’s disease. Clin. Neurophysiol. 2004, 115, 1490–1505. [Google Scholar] [CrossRef] [PubMed]
- Yamasaki, T.; Horie, S.; Ohyagi, Y.; Tanaka, E.; Nakamura, N.; Goto, Y.; Kanba, S.; Kira, J.i.; Tobimatsu, S. A potential VEP biomarker for mild cognitive impairment: Evidence from selective visual deficit of higher-level dorsal pathway. J. Alzheimer’s Dis. 2016, 53, 661–676. [Google Scholar] [CrossRef]
- Wu, S.Z.; Masurkar, A.V.; Balcer, L.J. Afferent and efferent visual markers of Alzheimer’s disease: A review and update in early stage disease. Front. Aging Neurosci. 2020, 12, 572337. [Google Scholar] [CrossRef]
- Ruiz-Gómez, S.J.; Hornero, R.; Poza, J.; Maturana-Candelas, A.; Pinto, N.; Gomez, C. Computational modeling of the effects of EEG volume conduction on functional connectivity metrics. Application to Alzheimer’s disease continuum. J. Neural Eng. 2019, 16, 066019. [Google Scholar] [CrossRef]
- Mehra, C.; Beyh, A.; Laiou, P.; Garces, P.; Jones, E.J.; Mason, L.; Buitelaar, J.; Johnson, M.H.; Murphy, D.; Loth, E.; et al. Zero-phase-delay synchrony between interacting neural populations: Implications for functional connectivity-derived biomarkers. Imaging Neurosci. 2025, 3, IMAG.a.985. [Google Scholar] [CrossRef]
- Briels, C.T.; Schoonhoven, D.N.; Stam, C.J.; de Waal, H.; Scheltens, P.; Gouw, A.A. Reproducibility of EEG functional connectivity in Alzheimer’s disease. Alzheimers Res. Ther. 2020, 12, 68. [Google Scholar] [CrossRef]
- Radüntz, T. Signal quality evaluation of emerging EEG devices. Front. Physiol. 2018, 9, 98. [Google Scholar] [CrossRef]
- Griesmaier, E.; Neubauer, V.; Ralser, E.; Trawöger, R.; Kiechl-Kohlendorfer, U.; Keller, M. Need for quality control for aEEG monitoring of the preterm infant: A 2-year experience. Acta Paediatr. 2011, 100, 1079–1083. [Google Scholar] [CrossRef] [PubMed]
- Webb, S.J.; Shic, F.; Murias, M.; Sugar, C.A.; Naples, A.J.; Barney, E.; Borland, H.; Hellemann, G.; Johnson, S.; Kim, M.; et al. Biomarker acquisition and quality control for multi-site studies: The autism biomarkers consortium for clinical trials. Front. Integr. Neurosci. 2020, 13, 71. [Google Scholar] [CrossRef]
- Esteban, O.; Birman, D.; Schaer, M.; Koyejo, O.O.; Poldrack, R.A.; Gorgolewski, K.J. MRIQC: Advancing the automatic prediction of image quality in MRI from unseen sites. PLoS ONE 2017, 12, e0184661. [Google Scholar] [CrossRef] [PubMed]
- Hu, B.; Peng, H.; Zhao, Q.; Hu, B.; Majoe, D.; Zheng, F.; Moore, P. Signal quality assessment model for wearable EEG sensor on prediction of mental stress. IEEE Trans. Nanobiosci. 2015, 14, 553–561. [Google Scholar]
- Fickling, S.D.; Liu, C.C.; D’Arcy, R.C.; Hajra, S.G.; Song, X. Good data? The EEG quality index for automated assessment of signal quality. In Proceedings of the 2019 IEEE 10th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), Vancouver, BC, Canada, 17–19 October 2019; IEEE: New York, NY, USA, 2019; pp. 219–229. [Google Scholar]
- Ferree, T.C.; Luu, P.; Russell, G.S.; Tucker, D.M. Scalp electrode impedance, infection risk, and EEG data quality. Clin. Neurophysiol. 2001, 112, 536–544. [Google Scholar] [CrossRef] [PubMed]
- Al-Qazzaz, N.K.; Ali, S.H.B.M.; Ahmad, S.A.; Chellappan, K.; Islam, M.S.; Escudero, J. Role of EEG as biomarker in the early detection and classification of dementia. Sci. World J. 2014, 2014, 906038. [Google Scholar] [CrossRef]
- Zheng, X.; Wang, B.; Liu, H.; Wu, W.; Sun, J.; Fang, W.; Jiang, R.; Hu, Y.; Jin, C.; Wei, X.; et al. Diagnosis of Alzheimer’s disease via resting-state EEG: Integration of spectrum, complexity, and synchronization signal features. Front. Aging Neurosci. 2023, 15, 1288295. [Google Scholar] [CrossRef] [PubMed]
- Kessler, R.; Enge, A.; Skeide, M.A. How EEG preprocessing shapes decoding performance. Commun. Biol. 2025, 8, 1039. [Google Scholar] [CrossRef]
- Fraga, F.J.; Falk, T.H.; Kanda, P.A.; Anghinah, R. Characterizing Alzheimer’s disease severity via resting-awake EEG amplitude modulation analysis. PLoS ONE 2013, 8, e72240. [Google Scholar] [CrossRef] [PubMed]
- Yang, X.; Fan, Z.; Li, Z.; Zhou, J. Resting-state EEG microstate features for Alzheimer’s disease classification. PLoS ONE 2024, 19, e0311958. [Google Scholar] [CrossRef]
- De Haan, W.; Pijnenburg, Y.A.; Strijers, R.L.; van der Made, Y.; van der Flier, W.M.; Scheltens, P.; Stam, C.J. Functional neural network analysis in frontotemporal dementia and Alzheimer’s disease using EEG and graph theory. BMC Neurosci. 2009, 10, 101. [Google Scholar] [CrossRef]
- Rostamikia, M.; Sarbaz, Y.; Makouei, S. EEG-based classification of Alzheimer’s disease and frontotemporal dementia: A comprehensive analysis of discriminative features. Cogn. Neurodyn. 2024, 18, 3447–3462. [Google Scholar] [CrossRef]
- Pion-Tonachini, L.; Kreutz-Delgado, K.; Makeig, S. ICLabel: An automated electroencephalographic independent component classifier, dataset, and website. NeuroImage 2019, 198, 181–197. [Google Scholar] [CrossRef]
- Mognon, A.; Jovicich, J.; Bruzzone, L.; Buiatti, M. ADJUST: An automatic EEG artifact detector based on the joint use of spatial and temporal features. Psychophysiology 2011, 48, 229–240. [Google Scholar] [CrossRef]
- Chang, C.Y.; Hsu, S.H.; Pion-Tonachini, L.; Jung, T.P. Evaluation of artifact subspace reconstruction for automatic artifact components removal in multi-channel EEG recordings. IEEE Trans. Biomed. Eng. 2019, 67, 1114–1121. [Google Scholar] [CrossRef]
- Grobbelaar, M.; Phadikar, S.; Ghaderpour, E.; Struck, A.F.; Sinha, N.; Ghosh, R.; Ahmed, M.Z.I. A survey on denoising techniques of electroencephalogram signals using wavelet transform. Signals 2022, 3, 577–586. [Google Scholar] [CrossRef]
- Delorme, A. EEG is better left alone. Sci. Rep. 2023, 13, 2372. [Google Scholar] [CrossRef] [PubMed]
- Ronca, V.; Flumeri, G.D.; Giorgi, A.; Vozzi, A.; Capotorto, R.; Germano, D.; Sciaraffa, N.; Borghini, G.; Babiloni, F.; Aricò, P. o-CLEAN: A novel multi-stage algorithm for the ocular artifacts’ correction from EEG data in out-of-the-lab applications. J. Neural Eng. 2024, 21, 056023. [Google Scholar] [CrossRef] [PubMed]
- Pellegrini, E.; Anschuetz, M.; Sander, C.; Wirths, J.; Hegerl, U.; Couvy-Duchesne, B. How preprocessing shapes EEG-based clinical classification: A systematic evaluation framework. J. Neural Eng. 2024, 21, 056007. [Google Scholar]
- Shamsi, H. Alzheimer’s diagnosis from EEG with reliable probabilities: Subject-wise, leakage-free evaluation and isotonic calibration. J. Eng. Appl. Sci. 2025, 72, 226. [Google Scholar] [CrossRef]
- Chen, Y.; Wang, H.; Zhang, D.; Zhang, L.; Tao, L. Multi-feature fusion learning for Alzheimer’s disease prediction using EEG signals in resting state. Front. Neurosci. 2023, 17, 1272834. [Google Scholar] [CrossRef]
- Lal, U.; Chikkankod, A.V.; Longo, L. A comparative study on feature extraction techniques for the discrimination of frontotemporal dementia and Alzheimer’s disease with electroencephalography in resting-state adults. Brain Sci. 2024, 14, 335. [Google Scholar] [CrossRef]
- Huggins, C.J.; Escudero, J.; Parra, M.A.; Scally, B.; Anghinah, R.; Vitória Lacerda De Araújo, A.; Basile, L.F.; Abasolo, D. Deep learning of resting-state electroencephalogram signals for three-class classification of Alzheimer’s disease, mild cognitive impairment and healthy ageing. J. Neural Eng. 2021, 18, 046087. [Google Scholar] [CrossRef]
- Jiang, R.; Zheng, X.; Sun, J.; Chen, L.; Xu, G.; Zhang, R. Classification for Alzheimer’s disease and frontotemporal dementia via resting-state electroencephalography-based coherence and convolutional neural network. Cogn. Neurodyn. 2025, 19, 46. [Google Scholar] [CrossRef] [PubMed]
- Hallquist, M.N.; Hillary, F.G. Graph theory approaches to functional network organization in brain disorders: A critique for a brave new small-world. Netw. Neurosci. 2018, 3, 1–26. [Google Scholar] [CrossRef]
- Duan, F.; Huang, Z.; Sun, Z.; Zhang, Y.; Zhao, Q.; Cichocki, A.; Yang, Z.; Sole-Casals, J. Topological network analysis of early Alzheimer’s disease based on resting-state EEG. IEEE Trans. Neural Syst. Rehabil. Eng. 2020, 28, 2164–2172. [Google Scholar] [CrossRef]
- Klepl, D.; He, F.; Wu, M.; Blackburn, D.J.; Sarrigiannis, P. EEG-based graph neural network classification of Alzheimer’s disease: An empirical evaluation of functional connectivity methods. IEEE Trans. Neural Syst. Rehabil. Eng. 2022, 30, 2651–2660. [Google Scholar] [CrossRef]
- Sharma, R.; Meena, H.K. Graph based novel features for detection of Alzheimer’s disease using EEG signals. Biomed. Signal Process. Control 2025, 103, 107380. [Google Scholar] [CrossRef]
- Zheng, H.; Xiao, H.; Zhang, Y.; Jia, H.; Ma, X.; Gan, Y. Time-Frequency functional connectivity alterations in Alzheimer’s disease and frontotemporal dementia: An EEG analysis using machine learning. Clin. Neurophysiol. 2025, 170, 110–119. [Google Scholar] [CrossRef] [PubMed]
- Shan, X.; Cao, J.; Huo, S.; Chen, L.; Sarrigiannis, P.G.; Zhao, Y. Spatial–temporal graph convolutional network for Alzheimer classification based on brain functional connectivity imaging of electroencephalogram. Hum. Brain Mapp. 2022, 43, 5194–5209. [Google Scholar] [CrossRef]
- Parvandeh, S.; Yeh, H.W.; Paulus, M.P.; McKinney, B.A. Consensus features nested cross-validation. Bioinformatics 2020, 36, 3093–3098. [Google Scholar] [CrossRef]
- Miltiadous, A.; Tzimourta, K.D.; Afrantou, T.; Ioannidis, P.; Grigoriadis, N.; Tsalikakis, D.G.; Angelidis, P.; Tsipouras, M.G.; Glavas, E.; Giannakeas, N.; et al. A dataset of scalp EEG recordings of Alzheimer’s disease, frontotemporal dementia and healthy subjects from routine EEG. Data 2023, 8, 95. [Google Scholar] [CrossRef]
- Nour, M.; Senturk, U.; Polat, K. A novel hybrid model in the diagnosis and classification of Alzheimer’s disease using EEG signals: Deep ensemble learning (DEL) approach. Biomed. Signal Process. Control 2024, 89, 105751. [Google Scholar] [CrossRef]
- Devnath, L.; Summons, P.; Luo, S.; Wang, D.; Shaukat, K.; Hameed, I.A.; Aljuaid, H. Computer-aided diagnosis of coal workers’ pneumoconiosis in chest x-ray radiographs using machine learning: A systematic literature review. Int. J. Environ. Res. Public Health 2022, 19, 6439. [Google Scholar] [CrossRef]

| Typical Range | Common Failure | Implication for Reproducibility | Recommendation |
|---|---|---|---|
| Paradigm: shapes observable signal content | |||
| REST 5–20 min; ERPs [11,17] | Recording duration, eyes condition, and vigilance state inconsistently reported | Spectral estimates may differ across studies due to uncontrolled conditions, even under identical paradigm labels | Report paradigm, eyes condition, and vigilance monitoring; justify pooling across paradigms |
| Channel density: shapes spatial resolution of network estimation | |||
| 19–25 ch (clinical); 32–64 ch or ≥128 ch (research) [11,18] | Low-density recordings used for connectivity analysis without justification | Sparse layouts may introduce spatial aliasing and volume conduction bias; construct validity of connectivity biomarkers may be weakened [22] | Justify channel density relative to intended analysis; 19 ch adequate for spectral features [4]; ≥32 ch preferable for connectivity [5] |
| Sampling rate: shapes frequency range and phase precision | |||
| 128–2048 Hz [11,17] | Downsampling procedure unreported | Inconsistent rates reduce comparability, especially for phase-sensitive measures | Report acquisition and analysis rates; apply anti-aliasing filter before downsampling |
| Recording duration: shapes the amount of artifact-free data available | |||
| 2–30 min (REST) [11,14] | Retained artifact-free duration omitted | Limited artifact-free data may reduce the stability of spectral and connectivity estimates | Report retained artifact-free duration; justify adequacy for the intended analysis |
| Quality control: important prerequisite for valid inference | |||
| Rarely formalized; impedance criteria vary [31] | QC criteria absent; rejection rate unreported | Device noise and poor signal quality may mimic or obscure disease-related differences | Report device, impedance criteria, QC procedures, and proportion of data excluded |
| Common Practice | Implication for Reproducibility | Recommendation |
|---|---|---|
| Re-referencing: choice of reference scheme affects connectivity estimates more than spectral measures | ||
| Linked mastoid or average reference [35,36] | PLV, PLI, and coherence are strongly sensitive to reference choice; mastoid reference introduces volume conduction that can alter inter-electrode coupling [22]; spectral power is generally less affected | Use a consistent reference scheme; report as a primary preprocessing parameter |
| High-pass filtering: cutoff choice affects -band biomarkers and phase-sensitive measures | ||
| Variable high-pass cutoffs, typically 0.5–2 Hz | Cutoffs above 1 Hz progressively attenuate low- activity [11,19]; filter type and order introduce phase distortion that can affect PLV and PLI estimates [23] | Use HP ≤ 0.5 Hz when -band is a primary outcome; report filter type, order, and exact cutoffs |
| Artifact attenuation: rejection criteria influence the balance between biomarker preservation and artifact removal | ||
| ICA + ICLabel [39]/ADJUST [40]; ASR [41]; o-CLEAN [44] | Overly aggressive rejection may misclassify disease-related components as artifacts, reducing statistical sensitivity [43]; pipelines with different rejection thresholds are difficult to compare | Apply predefined, conservative rejection criteria; report components or segments removed per subject |
| Epoch length and segmentation: determines frequency resolution and statistical independence of samples | ||
| Fixed-length epochs, commonly 1–30 s | Frequency resolution scales as ; short epochs reduce -band resolution and yield unstable connectivity matrices [24]; epochs from the same subject are statistically dependent and should not be treated as independent samples | Report epoch length, overlap, and count; avoid treating epoch count as subject-level sample size |
| Global parameter selection: fitting parameters before splitting allows test set information to influence training | ||
| Normalization, scaling, thresholds, or feature selection applied before data splitting | Data-dependent parameters estimated from the full dataset incorporate test set information into preprocessing or feature selection, inflating performance estimates [7,46] | Fit all data-dependent parameters within training folds; document which steps were global or within-fold |
| Preprocessing Sensitivity | Sample-Size Sensitivity Indicative | Typical Models | Primary Leakage Risk |
|---|---|---|---|
| Low-dimensional spectral/complexity [11,19,33] EEG slowing (, ); band power, PAF, entropy (ApEn, SampEn), Hjorth | |||
| Lower: robust to moderate reference and filter variation | Lower (∼30–50); compact feature space relatively stable in small cohorts | SVM, RF, kNN, LR | Global normalization before split |
| High-dimensional structured [33,50] Multi-scale abnormalities; spectrograms, wavelet coefficients, multichannel tensors | |||
| Moderate: filter cutoff shapes spectrogram structure | High; segment inflation does not substitute for subject diversity | CNN, CNN-LSTM | Segment-level split; global standardization |
| Connectivity matrices [10,12,55] Disrupted long-range cortico-cortical communication; coherence, PLV, PLI, wPLI, AEC | |||
| High: reference, filter phase, and epoch length alter estimates [22] | Moderate–high (∼50–100); depends on epoch number and metric | SVM, RF, CNN (matrix) | Reference/filter inconsistency; global FC thresholding |
| Graph-based topology [37,52,54] Altered network integration; clustering, efficiency, modularity, small-worldness | |||
| High: inherits connectivity uncertainty; sensitive to graph construction choices [51] | High (∼100 or more); depends on thresholding strategy | Graph kernels, GNN | Post hoc threshold selection; variable graph density across studies |
| End-to-end learned [49,53,56] Implicit hierarchical features; raw or minimally processed EEG | |||
| Moderate–high: upstream filtering conditions learnable content | Very high; rarely satisfied by current EEG–AD datasets | CNN, RNN, transformers | Segment-level split; subject-identity leakage via oversampling |
| Leakage Type (Stage) | Mechanism | Remediation |
|---|---|---|
| Label-informed preprocessing (Preproc) | Artifact rejection or cleaning criteria differ by diagnostic group; preprocessing implicitly encodes class labels | Apply identical, predefined preprocessing blinded to diagnosis |
| Shared preprocessing parameters (Preproc) | Artifact thresholds, cleaning rules, or preprocessing parameters tuned using the full dataset before splitting | Tune data-dependent parameters within training folds; apply fixed rules to test data |
| Global normalization (Preproc) | Mean/SD computed from full dataset, including test subjects; test set distribution informs training normalization | Fit normalization parameters within each training fold |
| Pre-split oversampling (Preproc) | SMOTE applied to full dataset; synthetic samples constructed using test-subject feature distributions | Apply oversampling exclusively within training folds |
| Global feature selection (Feature) | Feature ranking based on full dataset statistics; test set labels implicitly inform feature selection | Nest feature selection inside CV folds |
| Overlapping/adjacent epochs (Segmentation) | Temporally adjacent epochs from the same session split across folds; temporal autocorrelation exploited | Use non-overlapping epochs or temporal blocking in CV |
| Segment-level splitting (Validation) | Epochs from same subject distributed across train and test sets; intra-subject similarity exploited instead of disease signal | Split at subject level; report N subjects as effective sample size |
| Study | Dataset/N | Acquisition | Preprocessing | Segmentation | Feature | Model | Validation | External Val. |
|---|---|---|---|---|---|---|---|---|
| Trinh et al. 2023 [5] | Proprietary; 6-site; 150 (50 AD/50 MCI/50 HC) | 32-ch; 6 sites; 500 Hz; eyes-open REST | Ref: right mastoid; filtering NR; ICA + ADJUST; within-fold norm NR | Non-overlapping 3-s epochs | PLI (5 bands) | LDA + SFS; Acc = 82.50% (train)/75.00% (test) | Subject LOPO-CV; independent test N = 30 | Multi-site test set |
| Siuly et al. 2020 [4] | Proprietary; 27 (11 MCI/16 HC) | 19-ch; 256 Hz; eyes-closed REST; >30 min | SWT denoising (0.5–32 Hz); ref NR; within-fold norm NR | Non-overlapping 2-s epochs | AR (4th order) + PE + histogram (PAA) | ELM/SVM/KNN; best: ELM Acc = 98.78%, AUC = 0.98 | Subject-wise 10-fold CV + LOPO-CV | No external set |
| Nour et al. 2024 [59] | Public; Florida State Univ. datasets; 140 (104 AD/36 HC) | 19-ch; 128 Hz; 8-s segments; eyes condition partly inherited/not fully harmonized | Dataset-level artifact cleaning; Butterworth 0.5–45 Hz; band filtering into delta–gamma; ref and within-fold norm NR | 1-s epochs; 128 × 19 arrays; total 1120 epochs | Raw/filtered 2D EEG arrays; no handcrafted feature extraction; band-specific inputs | DEL ensemble of five 2D-CNNs; weighted averaging; Acc = 97.9% | epoch-level 5-fold CV; subject-wise split NR | No external set |
| Huggins et al. 2021 [49] | Proprietary; 141 age-matched (52 AD/37 MCI/52 HA) | 21-ch; 200 Hz; eyes-closed REST; ∼10 min (Braintech 3.0) | 1–60 Hz band-pass FIR; notch 21 & 42 Hz; ICA + MARA; within-fold norm NR | Non-overlapping 5-s epochs | CWT scalograms (Morse wavelet) → tiled topographic RGB images | AlexNet-based CNN; Acc = 98.9% | epoch-level 10-fold CV; subject-wise split NR | No external set |
| The following two studies share the same acquisition (OpenNeuro ds004504 [58]): 19-ch, 500 Hz, eyes-closed REST, ∼12–14 min. | ||||||||
| Shamsi 2025 [46] | 59 post-QC (31 AD/28 HC) | — | Resample 128 Hz; 0.5–45 Hz band-pass; notch 50 Hz; avg ref; within-fold z-norm | 8-s epochs, 4-s overlap | WST + ROI pooling | Regularized logistic ensemble; AUC = 0.930 | Subject-wise 5-fold GroupKFold | No external set |
| Zheng et al. 2023 [33] | 65 (36 AD/29 HC) | — | A1–A2 ref; 0.5–45 Hz Butterworth; ICA; ASR; within-fold norm NR | 4-s epochs, 50% overlap | Time-domain stats + RBP (5 bands) + entropy + graph sync metrics | Decision Tree/RF/SVM; best: RF Acc = 95.86% | Subject LOPO-CV | No external set |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Wang, R.; Sugi, T.; Yamasaki, T. From Acquisition to Validation: Methodological Dependencies and Reproducibility in EEG-Based Alzheimer’s Disease Detection. Technologies 2026, 14, 301. https://doi.org/10.3390/technologies14050301
Wang R, Sugi T, Yamasaki T. From Acquisition to Validation: Methodological Dependencies and Reproducibility in EEG-Based Alzheimer’s Disease Detection. Technologies. 2026; 14(5):301. https://doi.org/10.3390/technologies14050301
Chicago/Turabian StyleWang, Ruimin, Takenao Sugi, and Takao Yamasaki. 2026. "From Acquisition to Validation: Methodological Dependencies and Reproducibility in EEG-Based Alzheimer’s Disease Detection" Technologies 14, no. 5: 301. https://doi.org/10.3390/technologies14050301
APA StyleWang, R., Sugi, T., & Yamasaki, T. (2026). From Acquisition to Validation: Methodological Dependencies and Reproducibility in EEG-Based Alzheimer’s Disease Detection. Technologies, 14(5), 301. https://doi.org/10.3390/technologies14050301

