1. Introduction
According to the World Health Organization (WHO), depression is one of the most common mental disorders worldwide [
1]. It significantly influences both personal well-being and social functioning, highlighting the urgent need for effective detection and treatment strategies. Depression must be treated in a timely manner because it can cause self-harm or even suicide. The WHO reports that suicide ranks as the fourth most prevalent cause of mortality among teenagers and young adults globally [
1]. Although depression is a major risk factor for suicide, effective treatment is essential not only to reduce suicidal ideation but also to improve overall quality of life and social functioning.
Treatment of depression is mitigated via selective serotonin reuptake inhibitors (SSRI) antidepressants or magnetic pulses such as repetitive transcranial magnetic stimulation (rTMS), depending on the level of depression. However, the success rates of these depression therapies are not guaranteed. The reason is that the response of a patient’s brain changes from one patient to another. This variability in brain response across individuals highlights the inherently nonlinear dynamics of the brain, which operates as a complex and unpredictable system. If a depression therapy fails for a depressed patient, there might be an increase in the suicide risk for that patient. Time is a significant factor in patients with depression. Proposing a robust and reliable method to predict the outcome of therapies for patients with depression before starting the treatment helps to enhance the quality of life of patients with depression.
One of the difficulties facing medical teams is deciding on the optimum level of a depression treatment plan. Psychiatrists use their experiences and a trial and error strategy to select the best depression therapy for every individual case. In medical terminology, a patient who gets well after therapy is called a responder (R) to therapy and, likewise, a non-responder (NR) is a patient who does not get well after the therapy.
Predicting depression in R and NR patients has been the subject of several studies. Blood tests [
2], medical imaging [
3], DNA [
4], and socioeconomic background [
5] are some of the modalities that form the basis of these studies. These days, one of the most widely used neuroimaging method in healthcare is EEG. Compared with other modalities, recording a patient’s EEG offers several advantages including real-time, cost-effectiveness, and widespread availability in hospitals and local clinics.
According to some studies, EEG signals may be utilized as biomarker to forecast how well patients respond to treatments. This inspired us to create a CAD system that would help mental health professionals decide on the optimal treatment plan for individuals with depression. In the literature, the results of depression treatment have been predicted using the study of EEG data. According to the methodology employed, earlier research may be divided into four groups: time-domain, frequency-domain, time–frequency-domain, and convolutional neural network (CNN)-based approaches.
In time-domain-based methods, the information is extracted from the EEG signals in the time domain, whereas in frequency-domain-based methods, the EEG signals are transferred into the frequency domain and then the frequency spectrum of the EEG signals is used to extract the information. Similarly, in time–frequency-domain-based methods, the information is extracted from the EEG signals after decomposing the EEG signals into subbands. On the other hand, we have CNN-based methods in which the information is extracted from the EEG signals via successive filtering and max pooling operators. In prior studies, most CNN-driven architectures designed to forecast treatment response in depressive disorders via image-based representations of EEG signals—generated through methods such as time-frequency analysis and connectivity estimation.
In the following, we review some remarkable studies in the literature that were conducted to predict the outcomes of depression therapies for patients via EEG signals.
Several studies have explored EEG-based prediction of treatment responses in depression by employing diverse signal processing and machine learning techniques. In the time-frequency domain, transformations such as STFT, EMD, and wavelet-based decomposition have been applied to extract relevant EEG subband features for classification, with some approaches have reported over 90% prediction accuracy for SSRI therapy responders [
6]. Similarly, rTMS-related studies have analyzed EEG signals via EMD to derive intrinsic mode functions, from which entropy-based features such as permutation entropy were extracted. Notably, increased entropy values in specific IMFs (especially from the frontal region) were associated with positive treatment outcomes [
7].
In other efforts focusing on nonlinear dynamics, researchers have computed measures such as spectral coherence, mutual information, and hemispheric power ratios to build predictive models for SSRI efficacy [
8]. Feature selection methods such as the Fisher discriminant ratios were paired with probabilistic classifiers to achieve high classification performance. Additionally, studies using time-domain and entropy-based EEG features, often in combination with clinical variables, have shown promise. For instance, microstate analysis, multiscale entropy, and spectral patterns derived from both electrodes and sources domains were utilized alongside machine learning models such as SVMs to predict treatment outcomes in small clinical cohorts [
9].
In a study examining EEG channels individually [
10], the F8 channel was identified as particularly useful, achieving an accuracy of 80% in classifying patient responses to rTMS therapy. Another research effort utilized a time-domain method, which extracts both linear and nonlinear EEG signal features from individual channels. These features were then filtered through the minimal-redundancy–maximal-relevance approach for feature selection before classification with a K-nearest neighbors (KNN) algorithm. Using the F8 channel data, this method also achieves a classification accuracy of 80% in predicting rTMS therapy outcomes, with classifier validation carried out through leave-one-out cross-validation.
In this time-domain-based study [
11], EEG connectivity features were analyzed in patients receiving rTMS, and machine learning models such as SVMs were applied to classify the responders. The initial results showed promising levels of sensitivity and specificity in a small cohort, a subsequent validation using a substantially larger sample size failed to confirm these outcomes. Specifically, theta-band connectivity, once considered to be a strong predictor of therapeutic response within the early stages of treatment, did not generalize across the expanded dataset.
In a study using a time-domain methodology [
12], researchers applied a machine learning framework to predict responses to SSRI therapy among 51 patients diagnosed with depression. The study considered three distinct sets of data: demographic information, resting-state EEG signals, and source-localized current densities. Initially, irrelevant features were eliminated via a random forest based feature selection process, followed by dimensionality reduction with principal component analysis (PCA). The selected features were then classified using several machine learning classifiers. Among these, the random forest classifier achieved the highest accuracy, with an 88% success rate in predicting SSRI therapy outcomes.
Another study using frequency-domain EEG analysis [
13] extracted both total and normalized spectral energy measures from different brain areas, particularly the posterior and anterior regions. Additionally, ratios between frequency bands, including those involving beta and alpha rhythms, were computed. These spectral features were used as inputs to a gradient boosting decision tree classifier to estimate SSRIs treatment response. Among all the extracted features, the low-frequency power observed at the occipital electrodes (O1 and O2) were found to be the most informative.
In one study applying a time-domain approach [
14], researchers explored EEG functional connectivity by analyzing correlations within the alpha frequency band. These alpha spectral correlation measures demonstrated promising predictive power for patient responses to rTMS therapy. When these features were used as inputs for an ElasticNet machine learning algorithm, the model achieved an area under the curve (AUC) of 0.91, effectively predicting treatment outcomes. Another investigation [
15] employed a different time-domain strategy by assessing the stability of EEG signals through frontal alpha asymmetry (FAA). This study indicated that FAA measures remained stable across repeated evaluations and were unaffected by the specific type of antidepressant therapy administered.
In a referenced study [
16], a hybrid approach combining time-domain and frequency-domain analysis was used to extract diverse EEG signal characteristics. These included complexity measures such as Lempel-Ziv and Katz fractal dimensions, statistical and spectral features such as correlation dimension, power spectral density, and bispectrum-derived metrics, along with activity-based indicators from the frontal and prefrontal regions and their aggregated forms. To refine the input space, the minimal-redundancy–maximal-relevance (mRMR) algorithm was employed for feature selection. The resulting feature set was then evaluated via a K-nearest neighbors (KNN) classifier within a leave-one-out cross-validation framework. This methodology achieved an accuracy of 93.5%, sensitivity of 91.3%, and specificity of 91.3% in predicting therapeutic response to rTMS.
In one study [
17], researchers introduced a polygenic EEG approach that utilized functional connectivity features derived through fast independent component analysis (ICA) to forecast patient responses to rTMS therapy. These connectivity metrics were subsequently classified using discriminant analysis (DA), resulting in an area under the curve (AUC) of 0.735.
A study conducted by Ivan et al. [
18] utilized a frequency-domain framework to analyze EEG-derived indicators—namely paroxysmal activity, alpha peak frequency, and frontal alpha asymmetry (FAA)—to assess their predictive value in determining patient responsiveness to antidepressant drugs. The researchers inferred that leveraging EEG biomarkers previously correlated with depressive symptoms could aid in personalizing treatment strategies effectively.
Additionally, in a study using a time-domain-based approach [
19], researchers developed a sparse EEG latent space regression (SELSER) model. This model employed EEG signatures as input features to predict therapeutic outcomes for depression. The performance of the SELSER model was assessed by calculating the relative mean square error (RMSE), achieving a value of 5.68 in predicting treatment outcomes.
An international trial known as iSPOT-D [
20], which focused on optimizing therapeutic approaches for individuals with depression, indicated that the use of sertraline hydrochloride could help stabilize abnormalities in the EEG alpha-peak activity. This study reported a clinical improvement rate of around 74Another investigation [
17] proposed a polygenic EEG-based approach utilizing fast independent component analysis (ICA) to derive functional connectivity measures. These connectivity measures were subsequently analyzed through discriminant analysis (DA), achieving an AUC value of 0.735 for predicting rTMS treatment outcomes.
Another investigation [
21] utilized low-resolution brain electromagnetic tomography (LORETA) to assess functional brain connectivity and compute effective coherence across different cortical areas. The study examined how coherence metrics in the alpha and theta frequency ranges could predict responses to antidepressant treatment. Functional connectivity in the alpha band was examined between two key brain areas: the rostral portion of the anterior cingulate cortex (rACC) and the anterior region of the insular cortex (aINS). This analysis yielded a sensitivity of 82% and a specificity of 86%, highlights their predictive relevance for therapeutic outcomes.
In a CNN-based approach [
22], connectivity-based images were created from EEG channels for distinct frequency bands, including delta, theta, alpha, and beta. These images were subsequently analyzed via four sequentially arranged pre-trained transfer learning (TL) models integrated with a bi-directional long short-term memory (Bi-LSTM) network. This methodology resulted in a high classification accuracy (ACC) of 98.33%.
Another CNN-related investigation [
23] similarly generated EEG connectivity images, which were then input into a hybrid network combining the pre-trained Xception model with a Bio-LSTM architecture. A subject-wise validation approach, specifically a leave-one-out strategy, was employed. This method resulted in an impressive predictive accuracy of 98.86% for classifying responses to rTMS treatment.
In another convolutional neural network (CNN)-based investigation, Yerguzel et al. [
24] evaluated the use of neural models by applying Cordance features extracted from EEG recordings captured during a resting-state protocol. The team implemented a relatively simple neural network consisting of 10 computational layers and achieved a classification accuracy of 89.09% for identifying responses to rTMS treatment.
In another CNN-focused research [
25], EEG signals were transformed into images through continuous wavelet transform (CWT), which were then classified using an ensemble model built upon five pre-trained TL architectures. By employing a voting approach across these models, a classification accuracy of 96.55% was reported in predicting SSRI therapy outcomes.
Moreover, a different CNN approach [
26] employed time-frequency representations of EEG data generated via CWT. These representations were analyzed by the EfficientNetB0-Bio-LSTM architecture, further enhanced with an attention mechanism. The method achieved a high accuracy rate of 97.1% for predicting rTMS therapy outcomes.
Another CNN-based model [
27] took EEG signals as direct inputs without initial transformation. These raw signals were processed through a hybrid TL approach combined with a Bio-LSTM framework, reaching impressive classification accuracy of 98.84% in predicting patient responses to SSRI treatment.
In a deep learning-based approach [
28], multiple transfer learning models—such as InceptionResNetV2-Bio-LSTM and its variants—were used to derive EEG features. These models were aggregated via a majority voting ensemble, where the influence of each model was fine-tuned via heuristic optimization techniques like differential evolution, resulting in classification accuracies exceeding 98% in some cases. A related study [
28] transformed brainwave interactions into connectivity images, which were fed into a sequence of pre-trained convolutional networks including the EfficientNet, DenseNet, and VGG architectures. DenseNet and VGG models demonstrated strong baseline performance, and ensemble-based optimization further increased the predictive accuracy to over 92%. Summary of the studies in the literature is presented in
Table 1.
EEG signals exhibit complex dynamic behaviors in high-dimensional space, making them effective in capturing the brain’s nonlinear characteristics. In this study, we avoid labeling the brain’s dynamics as strictly “chaotic” or “orderly.” Instead, we characterize variations in EEG activity using nonlinear dynamic features that reflect signal complexity over time.
However, only a limited number of studies have explored computer-aided systems specifically designed to predict treatment response in patients with depressive disorders. This is largely because of the computational demands involved in mapping brain signals into higher-dimensional feature representations. Consequently, identifying informative EEG patterns and reducing dimensionality becomes crucial. Sophisticated techniques for signal decomposition and selection are typically employed to isolate the most informative features for accurate classification [
29,
30].
Table 1.
Summary of prior studies compared with the proposed method.
Table 1.
Summary of prior studies compared with the proposed method.
Study | Analytical Techniques and Models | Therapy | Sample Size (R vs. NR) | Accuracy | Sensitivity | Specificity |
---|
Shahabi et al., 2023 [28] | Transfer learning ensemble using Bio-LSTM | rTMS | 23 vs. 23 | 98.51% | 98.64% | 98.36% |
Khodayari-Rostamabad et al., 2013 [8] | Nonlinear EEG features + Fisher ratio + MFA classifier | SSRI | 11 vs. 11 | 87.40% | 94.90% | 80.90% |
Minami et al., 2022 [21] | Brain source connectivity and coherence + ROC analysis | SSRI | 12 vs. 18 | - | 82.00% | 86.00% |
Jaworska et al., 2019 [12] | Demographic & EEG + PCA + RF classifier | SSRI | 27 vs. 24 | 88.00% | 77.00% | 99.00% |
Mumtaz et al., 2017 [6] | EEG time-frequency analysis + ROC curve + logistic regression | SSRI | 16 vs. 18 | 91.60% | 90.00% | 90.00% |
Shahabi et al., 2023 [26] | Time-frequency domain (CWT) + transfer learning + Bio-LSTM | rTMS | 23 vs. 23 | 97.1% | 97.3% | 97.0% |
Zhdanov et al., 2020 [9] | EEG + clinical variables + statistical filtering + SVM | SSRI | 155 vs. 67 | 82.40% | 72.90% | 85.50% |
Shahabi et al., 2022 [27] | Deep TL models fused with Bio-LSTM | SSRI | 12 vs. 18 | 98.84% | 97.80% | 99.60% |
Shahabi et al., 2023 [23] | Channel connectivity image + ensemble TL + LSTM | rTMS | 23 vs. 23 | 99.32% | - | 98.34% |
Mirjebreili et al., 2024 [22] | EEG rhythms + channel connectivity + sequential TL + Bio-LSTM | SSRI | 12 vs. 18 | 98.33% | - | - |
Shahabi et al., 2021 [25] | Time-frequency plots using CWT + TL ensemble voting | SSRI | 12 vs. 18 | 96.55% | 96.01% | 96.95% |
Shahabi et al., 2023 [31] | Channel-wise connectivity features + ensemble TL models | rTMS | 34 depressed | 92.28% | - | - |
Proposed (Ours) | APM-based 2D signal mapping + BPFSL feature encoding + NCA + FFNN | SSRI | 12 vs. 18 | 98.06% | 98.22% | 97.97% |
rTMS | 9 vs. 6 | 100% | 100% | 100% |
rTMS | 23 vs. 23 | 97.19% | 97.61% | 96.80% |
To avoid overinterpreting the electrophysiological basis of EEG dynamics, we have refrained from characterizing brain activity as “chaotic”. Instead, our framework treats EEG complexity as a reflection of underlying neurodynamic variability rather than assigning metaphysical descriptors.
However, there is not much studies in designing a CAD system to predict the outcomes of depression therapies. The reason for this limitation is posed by the computationally intensive steps of traditional techniques required to project signals onto higher-dimensional space.
In addition, extracting meaningful information from data is fundamental for identifying unique patterns and minimizing dimensionality for clearer data representation [
29]. To enhance classification performance, both feature extraction methods and selection algorithms are employed to isolate the most informative attributes from the dataset [
30]. Prior research on EEG-based prediction techniques can be broadly grouped into three categories: frequency-domain, time-domain, and time–frequency domain approaches.
Frequency-based methods often rely on spectral analysis tools such as the Fourier transform to quantify brain signal rhythms [
8,
13]. In contrast, time-frequency approaches analyze sub-band components via techniques such as wavelet transforms or empirical decomposition [
6,
25]. While these methods offer rich temporal and spectral insights, they tend to suffer from limitations such as high computational cost, difficulties with non-stationary signals, and sensitivity to noise or improper parameter selection—such as wavelet type and window size.
On the other hand, time-domain approaches directly extract statistical descriptors and structural patterns from the raw EEG signal, often using binary-based and determinant measures [
8,
9,
12]. While models developed using such features typically achieve approximately 80% accuracy, these methods are simpler and more interpretable [
8,
9,
12,
13,
17,
19,
21]. However, there remains a need to design time-domain strategies that can extract more refined and discriminative signal characteristics to enhance the predictive performance of machine learning models.
A critical analysis of existing research reveals several noteworthy shortcomings in prior studies:
Traditional EEG features often fall short of 90% accuracy, mainly because they struggle to represent the brain’s inherently nonlinear characteristics.
Clinicians lack intuitive visualization tools to interpret disordered neural patterns observed in depression, making real-time monitoring difficult.
The majority of the previous deep learning studies are complex and demand significant computational resources.
All previous studies proposed a single therapy rather than a model for multiple depression therapies.
Previous studies have shown weaknesses in preprocessing EEG signals to enhance the performance of the proposed model.
For chaotic time series EEG signals, sine, and cosine functions are plotted against each other can reveal a strange attractor in a two-dimensional (2D) space, which is a key feature of chaotic systems. These attractors are usually fractal in nature, and their structure provides insights into the system’s dynamics. This fact inspired us to propose a method called the amplitude-polar map (APM) technique to plot the EEG signal in 2D space.
Furthermore, we introduce a new feature engineering method—Binary Pattern of Five Successive Lines (BPFSL)—which is derived from our APM approach. This technique is designed to capture the system’s nonlinear behavior and complexity while retaining its immediate dynamic characteristics. We then fuse resulting features of BPFSL of all EEG channels. This fusion process results in a generalized pattern that captures the nonlinear relationships between the EEG signal channels and the instantaneous dynamics of multi-channel EEG data.
This work presents a CAD system to predict the best course of depression therapy. The multi-channel EEG signals are de-noised via a multi-scale principal component analysis (MSPCA), and the EEG signals are plotted in 2D space via APM technique. The nonlinearity and complex behavior of the EEG shapes in 2D space are utilized to extract distinctive features via our proposed BPFSL technique. The Neighborhood Component Analysis (NCA) is implemented to select the best group of features. These features are used to propose a classifier via a feedforward neural network (FFNN) with a 10-fold CV strategy. Two binary classification tasks are defined: R to SSRI vs. NR to SSRI and R to rTMS vs. NR to rTMS. Our contributions to predict the best course for depression therapy are represented as follows:
We propose a novel technique to represent the complexity of EEG signals in a 2D space.
We propose a novel feature engineering technique in the time domain on the basis of the APM shape on 2D space, called BPFSL, to extract distinctive features of EEG signals.
Design and validation of a reliable CAD system for recommending the best course of depression therapy.
Compared with the previous model, we proposed the high-performance model in terms of classification ACC.
Deliver a computationally efficient CAD system.
We integrate the proposed CAD system into software to support psychiatrists’ decisions.
The present work is structured as follows:
Section 2 describes the materials.
Section 3 describes the pre-prossessing of databases. The proposed CAD system is explained in
Section 4. Results and discussion are reported in
Section 5. The paper is concluded in
Section 6.
5. Results and Discussion
In this work, a new CAD system is developed to predict the outcomes of SSRI and rTMS therapies for depression disorder. Two different databases are used to evaluate the performance of the proposed CAD system. The first database is the Mumtaz database, which contains the EEG signals of 30 depression patients who are candidates for SSRI therapy. The second database is the Atieh Hospital database, which includes the EEG signals of 46 and 15 depression patients who are candidates for rTMS therapy. In this work, two binary classification tasks are defined to evaluate the performance of the proposed CAD system: R versus NR for SSRI therapy and R versus NR for rTMS therapy.
The EEG signals are split into 15-s segments. Since the sampling frequency in the Mumtaz database is 256 Hz, the EEG signals, after being split into 15-s intervals, have dimensions of (i.e., number of channels × number of samples). On the other hand, the sampling frequency of the Atieh Hospital database is 500 Hz, and as a result, the EEG signals in the Atieh Hospital database have dimensions of .
Figure 2 shows the block diagram of the proposed CAD system for predicting the results of depression therapies. In the first step of the proposed CAD system, the EEG signals are de-noised using the MSPCA technique. Each channel of the EEG signal is subsequently plotted in 2D space using the APM technique. After that, the BPFSL technique is applied to the two-dimensional shapes and 32 features are extracted for each channel. Data fusion is then performed by concatenating the features of the 19 channels. This means that a total of 608 features are extracted from a 19-channel EEG signal (19 channels × 32 features = 608 features). In fact, the proposed CAD system reduces the dimension of the EEG data from
in the Mumtaz database and
in the Atieh Hospital database to
. This represents a dimensionality reduction of approximately 99.17% in the Mumtaz database and 95.57% in the Atieh Hospital database. In other words, the proposed CAD system, by using the APM plot aligned with the BPFSL technique, extracts the pattern of the data while simultaneously reducing the data dimension.
The features are sorted via the NCA algorithm on the basis of their weights. The FFNN architecture is used for feature classification. The features are iteratively increased from 1 to 608 based on their weights and are fed into the FFNN architecture via a 10-fold CV strategy for classification.
Figure 7 shows the classification accuracies for the sorted features as the feature vector size increases from 1 to 608. The detailed results for the proposed CAD system are reported in
Table 5. For SSRI therapy, the highest classification accuracy of 98.06% is achieved with 309 features (see
Figure 7a). For rTMS therapy, a smaller Atieh Hospital database, the best classification accuracy of 100% is achieved with 110 and 131 features (see
Figure 7b), while when the larger Atieh Hospital database is used, the best classification accuracy of 97.19% is achieved with 339 features (see
Figure 7c).
The contribution of the channels based on the selected features by NCA is illustrated in
Figure 8. From
Figure 8, the most informative EEG channels for predicting SSRI therapy outcomes are identified as F7, Fz, Fp2, P4, and Pz, whereas for rTMS therapy, the most contributive channels are O2, F4, T5, T3, Cz, and T6. These spatial patterns reflect distinct neurophysiological mechanisms involved in each treatment modality.
For SSRI therapy, the key channels are located primarily in the prefrontal and parietal regions. F7 (left ventrolateral prefrontal cortex) and Fz (midline prefrontal/cingulate) are involved in emotional regulation, decision-making, and cognitive control—functions often impaired in depression [
41]. Fp2, located over the right orbitofrontal cortex, plays a central role in reward processing and evaluating emotional salience, both of which are modulated by serotonergic systems [
42]. P4 and Pz are associated parietal electrodes with the default mode network and attentional processes [
43]. Their involvement suggests altered fronto-parietal dynamics in SSRI responders, potentially reflecting reorganization in emotion-regulation and self-referential networks, rather than direct causal modulation.
In contrast, the observed EEG patterns in rTMS-treated patients indicate broader network engagement across frontal, temporal, occipital, and central regions. F4 (right dorsolateral prefrontal cortex), the stimulation target, is critical for executive function and mood regulation [
44,
45]. O2 (right occipital lobe) is associated with visual processing and posterior alpha rhythms, possibly reflecting treatment-related shifts in arousal or sensory integration [
46]. Temporal electrodes T5 (left posterior), T3 (left mid-temporal), and T6 (right posterior temporal) correspond to areas involved in emotional memory and limbic activity, frequently associated with depressive symptomatology [
47]. Cz, located at the vertex over the sensorimotor cortex, may suggest engagement of motor-related circuits and cortico-striatal-thalamic loops, which have recently gained recognition in depression neurobiology.
Together, these results indicate that SSRI therapy primarily modulates prefrontal–parietal circuits associated with emotional regulation and self-referential thought, whereas rTMS therapy exerts its effects through broader cortical networks, including frontal, temporal, and occipital regions, reflecting its direct neuromodulatory influence and downstream connectivity changes.
To evaluate the contribution of each individual step of the proposed CAD system, the system is re-evaluated by dropping one specific step at a time, and the performance is checked. This means that the CAD system is rerun while excluding one specific step each time.
By dropping the MSPCA section of the CAD system, the performance decreases to 83.33%, 89.11%, and 76.16% in the Mumtaz, small Atieh Hospital, and big Atieh Hospital databases, respectively, resulting in decreases in performance of 14.73%, 10.89%, and 21.03%. It should be noted that by dropping the NCA section, as the feature selection technique, all 608 extracted features are fed into the FFNN architecture for classification. In this case, the classification ACC decreased from 98.06% to 95.59% for SSRI therapy. Thus, by using the NCA algorithm, the classification ACC is increased by 2.47%, while the number of features is reduced from 608 to 309 in SSRI therapy. Similarly, by dropping the NCA section in rTMS therapy, the classification ACC decreases from 100% to 98.97% for the small Atieh Hospital database and from 97.19% to 90.18% in the big Atieh Hospital database. Therefore, when the NCA algorithm is used, the classification ACC increases by 1.03% and 7.01%, while the number of features is decreased from 608 to 110 and from 608 to 339 in the small and big Atieh Hospital databases, respectively. These results highlight the impact of MSPCA and NCA algorithms as noise cancellation and feature selection techniques, respectively.
We plot the EEG signals in 2D space via the PSR [
37], SODP [
38], and CP techniques [
39], instead of the APM technique, to demonstrate their ability to decode the nonlinear and complex behavior of EEG data.
Figure 9 illustrates the 2D plot of a channel of an EEG signals using the APM technique, SODP, PSR, and CP. The classification ACC of 88.04% is obtained using PSR in SSRI therapy, which is approximately 10% lower than that of the APM technique. Similarly, when SODP is used, a classification ACC of 91.86% is achieved in SSRI therapy, which is approximately 6% lower than that of the APM technique. Using the CP technique, a classification ACC of 86.37% is obtained, which is again approximately 12% lower than that of the APM technique. This demonstrates the significant performance of the APM technique in decoding EEG data compared to traditional techniques.
SODP uses the differential operator to generate two time series,
and
, from the signal [
38]. On the other hand, CP uses sine and cosine functions to generate these two time series from the signal [
39]. The SODP shows the variation in the EEG signal without capturing the chaotic nature of the EEG signals [
38], whereas CP focuses on the nonlinear and chaotic behavior of the EEG signal by mimicking the patterns of sine and cosine [
39]. The main weakness of CP is that it does not account for amplitude variations, which contain significant information about the signal’s behavior. In contrast, the APM technique uses the variation in amplitude aligned with sine and cosine to decode the nonlinearity of the EEG signals.
The performance of the proposed CAD system is re-evaluated via the KNN classifier as well. The classification ACC decreased from 98.06% to 93.48% with SSRI therapy. Likewise, it decreased from 100% and 97.19% to 95.29% and 92.73% in rTMS therapy using the small and big Atieh Hospital databases, respectively. This demonstrates the importance of a well-tuned FFNN architecture.
Providing easy-to-use software for a CAD system has a direct impact on its utilization by clinicians in healthcare environments [
48]. Therefore, we developed software to visualize the proposed CAD system for the medical team. Since the project was conducted at the University of North Texas to predict the outcomes of depression therapy, we named the software UNT-DT. The UNT-DT utilizes multi-channel EEG signals to predict the outcomes of SSRI and rTMS therapy and recommends the best course of treatment for the patient.
The UNT-DT system categorizes patient outcomes into four distinct scenarios, depending on the combined results of rTMS and SSRI treatments. Two of these scenarios involve consistent responses to both therapies (either both positive or both negative), whereas the other two reflect differing outcomes—where one therapy is effective and the other is not. A visual representation of the UNT-DT application can be found in
Figure 10.
Table 6 offers a comparison of the proposed CAD system’s effectiveness with that of the previous research that utilized the Mumtaz database for forecasting SSRI therapy results, and with recently released research on rTMS therapy employing both the small and big Atieh Hospital datasets. The proposed CAD system achieves perfect classification accuracies of 98.06%, 100%, and 97.19% for the Mumtaz, small Atieh Hospital, and large Atieh Hospital databases, respectively.
In [
6], the EEG signals are split into several subbands which requires a complex time-frequency signal decomposition techniques. However, our proposed CAD system does not rely on any time-frequency technique before feature extraction and achieved a classification ACC of 98.06%, which is higher than the reported ACC of 91.60% in [
6] for predicting the outcomes of SSRI therapy.
In [
25], the continuous wavelet transform (CWT) was utilized to transform EEG signals into the time-frequency domain, which was subsequently expressed as images and incorporated into a transfer learning (TL) framework. This strategy obtained an accurate classification rate of 96.55%, which is 1.51% less than the accuracy obtained by our approach for predicting SSRI therapy results. The creation of image representations through continuous wavelet transform (CWT) is a far more complicated process than the direct projection of EEG signals into two-dimensional space utilising the amplitude polar map (APM) technique. Additionally, the proposed CAD framework uses a bandpass filter signal localisation (BPFSL) for feature extraction and implements a single-layer feedforward neural network (FFNN) for classification. In contrast, the methodology presented in [
25] utilised a hybrid framework that integrates five transfer learning models: DenseNet121, Xception, VGG16, InceptionResNetV2, and MobileNetV2, which consist of 121, 36, 16, 164, and 53 layers, respectively, leading to a total of 390 layers. Thus, their methodology incorporates numerous convolutional and pooling operations, resulting in significant computational requirements. The proposed CAD system exhibits enhanced efficiency, attaining high accuracy while requiring considerably less computational power.
In [
27], the feature map was extracted by applying Xception, VGG16, and DenseNet121 to the EEG signal as three pretrained TL architectures. The extracted feature map was then fed into a Bio-LSTM classifier. The proposed model in [
27] requires significant computational resources for both feature extraction and classification, whereas our proposed CAD system extracts features via the APM technique aligned with BPFSL and classifies them using a single-layer FFNN architecture. As a result, our proposed CAD system is faster due to its simplicity, compared to the model in [
27].
In a related study [
22], a bidirectional long short-term memory (Bi-LSTM) classifier was coupled with four pretrained transfer learning architectures to analyse connectivity images. With 121, 66, 48, and 18 levels, the models used DenseNet121, EfficientNet-B0, Inception-v3, and ResNet18 to produce a deep learning framework with 253 layers. Although the model proposed in [
22] reported a classification ACC of 98.33%, which is 0.25% higher than our proposed CAD system in detection of SSRI therapy results, generating images by measuring connectivity and aligning them with four successive TL architectures is more complex than our proposed CAD system. Our CAD system offers faster, simpler, and more straightforward computational steps.
In a related study [
23], connection maps were constructed from the EEG channel relationship and went through three pretrained deep learning architectures: VGG16, Xception, and EfficientNetB0. These attributes were then supplied into a bidirectional long short-term memory (Bi-LSTM) network enriched with an attention procedure for classification. In addition, an optimal majority voting mechanism was applied to combine the results of the three designs. Using a leave-one-out cross-validation method, the model attained a classification accuracy of 99.32%. The classification ACC of our proposed CAD system is 97.19%, which is 2.15% lower than that of the proposed model in [
23]. However, our CAD system has easier and fewer computational steps. Additionally, our proposed CAD system was evaluated via a 10-fold CV strategy, which provides stricter conditions than the leave-one-out CV strategy, making the reported results from our model more reliable and significant for real-world applications. In [
26], CWT is employed to convert time series EEG signals into time-frequency images. These images were input into EfficientNetB0 and VGG16. Then, the output of these two pretrained TL architectures were integrated with a Bio-LSTM network aided with an attention mechanism for classification. They noted a 97.10% classification ACC using the EfficientNetB0-Bio-LSTM for predicting the results of rTMS therapy, which is 0.09% less than the accuracy attained by our suggested approach.
To predict rTMS treatment results from EEG data, three pretrained transfer learning models—VGG16, EfficientNetB0, and InceptionResNetV2—were integrated with a Bi-LSTM network in [
28]. A weighted majority voting technique was used to build the ensemble model, and the differential evolution (DE) algorithm was used to optimise the weights. However, their technique obtained a classification accuracy of 98.51%, which is approximately 1.32% higher than our suggested system for rTMS outcome prediction, which was based on three transfer learning models, which increased architectural complexity when compared to our CAD framework.
In a different study [
31], the connection matrices were constructed across four EEG rhythms and analyzed via five pretrained transfer learning methods architectures. The outputs of these models were pooled via a weighted majority voting system. This technique had a classification accuracy of 92.28% for predicting rTMS therapeutic results; however, our suggested CAD system surpassed it with a perfect accuracy of 100% on the same dataset.
The proposed CAD system extracts the complex and chaotic patterns of EEG data by plotting the EEG signals in a 2D space and computing the BPFSL for the shape. The best features are selected via NCA and fed into a single-layer FFNN architecture. In contrast, available methods used a variety of time-frequency and connectivity-based methodologies to first create feature matrices, which were subsequently combined with transfer learning models. Therefore, the proposed CAD system is faster and involves fewer computational steps compared with previous models, which require significant computational resources due to their complexity [
6,
22,
23,
25,
26]. Additionally, for the first time in the literature, a CAD system has been developed to predict the outcomes of both SSRI and rTMS therapies for depression patients. The contributions and advantages of the proposed CAD system in the current work are as follows:
Introduction of the amplitude polar map (APM) via a cutting-edge method for two-dimensionally representing the complex dynamics of EEG data.
Proposal of the BPFSL feature to efficiently capture and define the nonlinear behaviour present in EEG recordings.
Validation of the proposed CAD framework on two different EEG datasets with varied sampling rates, proving its flexibility and insensitivity to fluctuations in sampling frequency.
While previous research has examined only one therapy, this work is the first to proposes a CAD system that can predict treatment results for both the SSRI and rTMS treatments in depression.
Design of a computationally lightweight CAD framework that incorporates an efficient method for deriving features, leveraging the structural simplicity of APM and BPFSL, alongside a tuned feedforward neural network (FFNN) architecture comprising a single intermediate layer with ten neurons. This design avoids resource-intensive approaches such as connectivity-based [
17,
21,
22] and time-frequency domain analyses [
6,
25], resulting in a fast, lightweight, and low-complexity implementation for EEG classification.
In the proposed CAD system, the features are extracted by fusing the information from all EEG channels, meaning that it uses all the information in the EEG data without losing any information (i.e., no channels are dropped), while simultaneously reducing data redundancy.
A dependable CAD framework was developed, and its performance was assessed through a 10-fold cross-validation approach to minimize potential evaluation bias.
Development of a novel user-friendly, economical, and useful tool for forecasting the results of SSRI and rTMS therapy. Its MATLAB implementation facilitates portability and user-friendliness in clinical settings, aiding mental health professionals in making decisions about patient treatment.
Notably, clinical and demographic traits have shown some promise in predicting the results of the treatment. Patients with depression between the ages of 18 and 65 years were included in this study; the mean ages of the responder (R) and non-responder (NR) groups receiving SSRI therapy were 40.7 and 41.1 years, respectively (see
Table 2). There was a modest difference in mean age between the NR and R groups, but it was not statistically significant (
p > 0.05) [
6]. Both the R and NR groups demonstrated equal gender representation, with an identical number of male and female participants (see
Table 2) [
6]. Remarkably, the R and NR groups’ respective pre-treatment BDI-II scores were 18.4 (
) and 22.8 (
) [
6]. These results imply that NR patients had somewhat more severe symptoms than their R counterparts before treatment.
Demographic information and clinical characteristics of the Atieh Hospital rTMS database are provided in
Table 4. The study involved 46 depression patients, equally divided into R and NR groups for rTMS therapy, with each group containing 23 patients. The mean age for the R group undergoing rTMS therapy is 30.87 years (
), while for the NR group, it is 39.00 years (
). While individuals in the NR group exhibited a higher average age compared to those in the R group, this age-related variation did not reach statistical significance (
p = 0.052). In terms of gender distribution, both groups have a similar composition, with 8 males and 15 females in each group, and no significant gender difference between the groups (
p-value = 0.90).
The pre-treatment BDI-II scores show a slight difference, with the R group scoring 32.5 () and the NR group scoring 28.1 (), though this difference is not statistically significant (p-value = 0.08). However, the post-treatment BDI-II scores reveal a significant difference, with the R group showing a marked improvement () compared to the NR group (), with a p-value of <0.001. Finally, the duration of depression shows no significant difference between the two groups, with the R group having a mean duration of 6.5 years () and the NR group having a mean duration of 7.9 years (), with a p-value of 0.27.
In summary, the analysis revealed no meaningful statistical variation in demographic or clinical attributes between the Mumtaz and Atieh Hospital datasets. This suggests that the CAD model developed in this study is specifically tailored to distinguishing between responder (R) and non-responder (NR) groups. Variables such as age, BDI-II scores, gender, and other clinical indicators were found to have minimal impact on the classification performance.
While the proposed method demonstrated promising results in predicting therapy outcomes using EEG data, several limitations should be acknowledged. First, the dataset used in this study, although clinically relevant, was relatively limited in size, which may affect the generalizability of the findings. Second, the use of a private database may hinder reproducibility by other researchers. Third, the feature extraction process relies on predefined signal properties, which may not capture the full complexity of brain dynamics across all patients. To address these issues, future work will involve validating the model on larger and publicly available EEG datasets to ensure broader applicability. Additionally, we plan to explore deep learning-based end-to-end frameworks that learn feature representations automatically, potentially improving classification accuracy and interpretability. Cross-site validation and real-time deployment strategies will also be investigated to evaluate the model’s clinical utility.
Although gender information was available, the dataset was not balanced across male and female participants, and the sample size per gender group was insufficient for reliable subgroup analysis. This limits the ability to draw gender-specific conclusions from the current findings. Future studies should explore gender-stratified neural response patterns to improve personalized treatment prediction.