Author Contributions
Conceptualization, K.A.; methodology, K.A.; software, K.A.; validation, K.A.; formal analysis, K.A.; investigation, K.A.; resources, K.A.; data curation, K.A.; writing—original draft preparation, K.A.; writing—review and editing, M.E.B. and C.R.; visualization, K.A.; supervision, M.E.B. and C.R.; project administration, M.E.B. and C.R. All authors have read and agreed to the published version of the manuscript.
Figure 1.
Overall system architecture of the proposed EEG-based seizure prediction framework, illustrating data acquisition, preprocessing, feature extraction, model training, personalization strategies, and evaluation.
Figure 1.
Overall system architecture of the proposed EEG-based seizure prediction framework, illustrating data acquisition, preprocessing, feature extraction, model training, personalization strategies, and evaluation.
Figure 2.
Distribution of seizure frequency (left) and seizure duration (right) across the 161-patient cohort. The left panel shows seizure counts per patient via boxplot and strip plot; the right panel presents event duration on a logarithmic scale using a combined violin and boxplot visualization.
Figure 2.
Distribution of seizure frequency (left) and seizure duration (right) across the 161-patient cohort. The left panel shows seizure counts per patient via boxplot and strip plot; the right panel presents event duration on a logarithmic scale using a combined violin and boxplot visualization.
Figure 3.
Definition of temporal segments for classification. The diagram illustrates the distinction between the ictal phase and the pre-ictal phase.
Figure 3.
Definition of temporal segments for classification. The diagram illustrates the distinction between the ictal phase and the pre-ictal phase.
Figure 4.
Pre-ictal Multi-Horizon Data Extraction Strategy chart showing 5 min segments extracted from a recording leading to a seizure onset.
Figure 4.
Pre-ictal Multi-Horizon Data Extraction Strategy chart showing 5 min segments extracted from a recording leading to a seizure onset.
Figure 5.
Schematic diagram of the baseline CNN-LSTM architecture showing the input layer, three convolutional blocks with batch normalization and max pooling, two bidirectional LSTM layers with dropout, and the fully connected classification module.
Figure 5.
Schematic diagram of the baseline CNN-LSTM architecture showing the input layer, three convolutional blocks with batch normalization and max pooling, two bidirectional LSTM layers with dropout, and the fully connected classification module.
Figure 6.
Multi-stage optimization pipeline for EEG seizure prediction, including baseline benchmarking, architecture simplification, input and feature reduction, and deployment-oriented optimization through pruning and quantization for edge devices.
Figure 6.
Multi-stage optimization pipeline for EEG seizure prediction, including baseline benchmarking, architecture simplification, input and feature reduction, and deployment-oriented optimization through pruning and quantization for edge devices.
Figure 7.
Dual-panel performance analysis for the top five models at the 5 min prediction horizon. (a) ROC curves showing discriminative capability, with the CNN-LSTM Hybrid achieving the highest AUC (0.973). (b) Sensitivity, specificity, and F1-score comparison highlighting architecture-level trade-offs between seizure detection and false alarm control.
Figure 7.
Dual-panel performance analysis for the top five models at the 5 min prediction horizon. (a) ROC curves showing discriminative capability, with the CNN-LSTM Hybrid achieving the highest AUC (0.973). (b) Sensitivity, specificity, and F1-score comparison highlighting architecture-level trade-offs between seizure detection and false alarm control.
Figure 9.
ROC curves and metric comparison for top models at the 60 min prediction horizon. DeepConvNet (all-feature) achieves the highest AUC (0.975), with sensitivity, specificity, and F1-score remaining well-balanced across architectures.
Figure 9.
ROC curves and metric comparison for top models at the 60 min prediction horizon. DeepConvNet (all-feature) achieves the highest AUC (0.975), with sensitivity, specificity, and F1-score remaining well-balanced across architectures.
Figure 10.
Patient-wise accuracy and sensitivity of the generalized CNN–LSTM model evaluated on the 30 blind patients at the 5 min prediction horizon.
Figure 10.
Patient-wise accuracy and sensitivity of the generalized CNN–LSTM model evaluated on the 30 blind patients at the 5 min prediction horizon.
Figure 11.
Comparison of generalized model performance on the test set and blind patient cohort, showing accuracy, sensitivity, and number of undetected seizures at the 5 min prediction horizon.
Figure 11.
Comparison of generalized model performance on the test set and blind patient cohort, showing accuracy, sensitivity, and number of undetected seizures at the 5 min prediction horizon.
Figure 12.
Patient-wise evolution of accuracy, sensitivity, and average undetected seizures across the five stages of personalization for all 30 patients. The results demonstrate progressive performance stabilization, improved patient-specific adaptation, and enhanced seizure coverage through successive personalization stages.
Figure 12.
Patient-wise evolution of accuracy, sensitivity, and average undetected seizures across the five stages of personalization for all 30 patients. The results demonstrate progressive performance stabilization, improved patient-specific adaptation, and enhanced seizure coverage through successive personalization stages.
Figure 13.
Distribution of Seizure Morphology Similarity. Dynamic Time Warping (DTW) distances were calculated for 130 patients against a gold-standard reference. Vertical markers at n = 30 and n = 70.
Figure 13.
Distribution of Seizure Morphology Similarity. Dynamic Time Warping (DTW) distances were calculated for 130 patients against a gold-standard reference. Vertical markers at n = 30 and n = 70.
Figure 14.
Performance comparison between similarity-based personalization strategies using Top-30 and Top-70 patient subsets, illustrating the impact of similarity pool size on accuracy and sensitivity.
Figure 14.
Performance comparison between similarity-based personalization strategies using Top-30 and Top-70 patient subsets, illustrating the impact of similarity pool size on accuracy and sensitivity.
Figure 15.
Spatial distribution of correlation with the labeled seizure. The topographic map displays the intensity of correlation across the scalp, with peak correlations highlighted by electrode labels. Interpolated using cubic spline methods to ensure a smooth anatomical gradient, the color scale represents the strength of association.
Figure 15.
Spatial distribution of correlation with the labeled seizure. The topographic map displays the intensity of correlation across the scalp, with peak correlations highlighted by electrode labels. Interpolated using cubic spline methods to ensure a smooth anatomical gradient, the color scale represents the strength of association.
Figure 16.
Relationship between the number of retained EEG channels and model performance, showing changes in accuracy and sensitivity as electrode count is progressively reduced.
Figure 16.
Relationship between the number of retained EEG channels and model performance, showing changes in accuracy and sensitivity as electrode count is progressively reduced.
Table 1.
Recent EEG-Based Seizure Prediction Studies.
Table 1.
Recent EEG-Based Seizure Prediction Studies.
| Study (Year) | Dataset | Model Type | Prediction Horizon | Key Limitation |
|---|
| Liu et al. (2024) [14] | CHB-MIT | Pseudo-3D CNN + BiConvLSTM3D | Short/unspecified | Focus on spatial features, limited long-horizon analysis |
| Esmaeilpour et al. (2024) [21] | CHB-MIT | CNN + Ensemble Classifier | Minutes (5 min implied) | No cross-subject adaptation reported |
| Yuan et al. (2024) [24] | CHB-MIT | Hybrid DenseNet-ViT with Attention | Minutes (preictal windows typical) | No personalization; global model |
| Sadeghi Khansari et al. (2025) [25] | CHB-MIT | DWT + Deep Learning (FNN) | Not explicitly long horizon | High performance but no personalization focus |
| Upadhyay et al. (2025) [26] | Public EEG | Explainable Hybrid DNN | Unspecified | Seizure vs. non-seizure classification not long horizon |
| Li et al. (2025) [27] | CHB-MIT & MSSM | Spatio-Temporal Attention Network | 15–45 min | Still relatively short horizons |
Table 2.
Handcrafted-features extracted from EEG.
Table 2.
Handcrafted-features extracted from EEG.
| Feature Category | Feature | Description |
|---|
| Statistical | Mean | Average signal amplitude |
| Standard Deviation | Signal variability |
| Variance | Power dispersion |
| Skewness | Signal asymmetry |
| Kurtosis | Peakness and tail behavior |
| Zero Crossing Rate | Temporal oscillation rate |
| Hjorth Parameters | Activity | Signal power |
| Mobility | Mean frequency estimate |
| Complexity | Signal shape variation |
| Spectral | Band Power (δ, θ, α, β, γ) | Energy in standard EEG bands |
| Relative Band Power | Normalized spectral contribution |
| Spectral Entropy | Frequency distribution randomness |
| Time–Frequency | Wavelet Coefficients | Multi-resolution signal representation |
| Wavelet Energy | Energy at different scales |
| Spatial (when applicable) | Common Spatial Pattern (CSP) | Discriminative spatial filtering |
Table 3.
Incremental personalization stages for a single patient.
Table 3.
Incremental personalization stages for a single patient.
| Personalization Stage | Patient-Specific Input Data | Test Data |
|---|
| Minimal | 10% | 90% |
| Low-resource | 25% | 75% |
| Moderate | 50% | 50% |
| Maximized | 75% | 25% |
Table 4.
Generalized model performance at 5 min prediction horizon.
Table 4.
Generalized model performance at 5 min prediction horizon.
| Metric | Value |
|---|
| Accuracy | 96.30% ± 0.41% |
| Sensitivity | 91.62% ± 0.27% |
| Specificity | 97.47% ± 0.35% |
| F1-Score | 93.9% ± 0.38% |
| False Positive/Hour | 0.25 ± 0.04 |
| Undetected Seizures | 4 |
Table 5.
Performance comparison across reduced EEG channel configurations. Channel reduction from 19 to 10 electrodes maintains accuracy above 90%, while aggressive reduction below 8 channels results in significant performance degradation.
Table 5.
Performance comparison across reduced EEG channel configurations. Channel reduction from 19 to 10 electrodes maintains accuracy above 90%, while aggressive reduction below 8 channels results in significant performance degradation.
| Inputs | Accuracy | Sensitivity | Undetected Seizures | Pre-Process Time | Inference Time (s) |
|---|
| 19 EEG + feature | 0.96 ± 0.83 | 0.91 ± 0.94 | 4 | 0.002 | 0.00005 |
| 15 EEG + feature | 0.93 ± 1.01 | 0.90 ± 1.03 | 5 | 0.002 | 0.00005 |
| 10 EEG + feature | 0.89 ± 0.99 | 0.87 ± 2.05 | 7 | 0.002 | 0.00005 |
| 9 EEG + feature | 0.90 ± 1.04 | 0.88 ± 1.84 | 15 | 0.002 | 0.00005 |
| 8 EEG + feature | 0.91 ± 0.98 | 0.89 ± 1.37 | 16 | 0.002 | 0.00005 |
| 7 EEG + feature | 0.88 ± 1.73 | 0.82 ± 3.10 | 11 | 0.002 | 0.00005 |
| 6 EEG + feature | 0.86 ± 0.91 | 0.86 ± 1.41 | 14 | 0.002 | 0.00005 |
| 5 EEG + feature | 0.88 ± 1.22 | 0.81 ± 2.64 | 24 | 0.002 | 0.00005 |
| 4 EEG + feature | 0.83 ± 3.01 | 0.68 ± 3.81 | 61 | 0.002 | 0.00005 |
| 3 EEG + feature | 0.70 ± 3.48 | 0.66 ± 4.24 | 73 | 0.002 | 0.00005 |
Table 6.
Model Compression Performance Comparison.
Table 6.
Model Compression Performance Comparison.
| Model | Accuracy | Sensitivity | Undetected Seizures | Total Parameters | Training Time (Min) | Network Size (MB) |
|---|
| AlexNet | 96% ± 0.83% | 91% ± 0.94% | 4 | 63,403,512 | 268 | 227.64 |
| Compressed AlexNet | 95% ± 0.87% | 92% ± 1.03% | 5 | 57,157,209 | 261 | 212.35 |
| Mobilenet v2 | 94% ± 1.22% | 90% ± 1.37% | 5 | 3,521,928 | 175 | 14.24 |
| Compressed MobileNet v2 | 94% ± 1.66% | 91% ± 1.08% | 7 | 3,105,422 | 170 | 13.5 |
| Squeeze net | 95% ± 1.94% | 90% ± 2.75% | 8 | 1,235,496 | 138 | 5.02 |
| Compressed squeezenet | 93% | 91% | 17 | 1,210,843 | 136 | 4.7 |
Table 7.
Performance Comparison with Recent State-of-the-Art EEG Seizure Prediction Methods.
Table 7.
Performance Comparison with Recent State-of-the-Art EEG Seizure Prediction Methods.
| Study | Year | Dataset | Model | Horizon (Min) | Acc % | Sens % |
|---|
| This Work | 2026 | EPILEPSIAE (161 patients) | CNN-LSTM | 5 | 96.3 ± 2.8 | 91.6 ± 2.1 |
| This Work (Optimized) | 2026 | EPILEPSIAE (161 patients) | CNN-LSTM | 5 | 89.7 ± 3.7 | 86.7 ± 2.6 |
| This Work (generalized) | 2026 | EPILEPSIAE (161 patients) | CNN-LSTM | 60 | 91.3 ± 1.9 | 83.0 ± 4.2 |
| Pontes et al. [41] | 2024 | EPILEPSIAE (37 patients) | SVM | 50 | NA | 75.0 ± 33 |
| Batista et al. [28] | 2024 | EPILEPSIAE (37 patients) | SVM | 55 | NA | 49.0 |
| Jiang et al. [29] | 2023 | Siena Scalp EEG | PAC feature extraction + RandomForest | 5–15 | 85.71% | NA |
| Lopes et al. [42] | 2024 | EPILEPSIAE (41 patients) | DCAE + BiLSTM | 40 | NA | 0.16 ± 0.23 |