Exploring Effects of Mental Stress with Data Augmentation and Classification Using fNIRS

M. N. Afzal Khan; Nada Zahour; Usman Tariq; Ghinwa Masri; Ismat F. Almadani; Hasan Al-Nashah

doi:10.3390/s25020428

¹ Department of Electrical Engineering, American University of Sharjah, Sharjah 26666, United Arab Emirates

² Biosciences and Bioengineering Graduate Program, American University of Sharjah, Sharjah 26666, United Arab Emirates

* Author to whom correspondence should be addressed.

Sensors 2025, 25(2), 428; https://doi.org/10.3390/s25020428

This article belongs to the Section Biomedical Sensors

Abstract

Accurately identifying and discriminating between different brain states is a major emphasis of functional brain imaging research, and various machine learning techniques play an important role in this regard. However, when working with a small number of study participants, the lack of sufficient data makes it challenging to achieve meaningful classification results. In this study, we employ a classification strategy to explore stress and its impact on spatial activation patterns and brain connectivity caused by the Stroop color–word task (SCWT). To increase our dataset and improve our results, we use data augmentation with a deep convolutional generative adversarial network (DCGAN). The study is carried out at two separate times of day (morning and afternoon) and involves 21 healthy participants. Additionally, we introduce binaural beats (BBs) stimulation to investigate its potential for stress reduction. The morning session includes a control phase with 10 SCWT trials, whereas the afternoon session is divided into three phases: stress, mitigation (with 16 Hz BB stimulation), and post-mitigation, each with 10 SCWT trials. For a comprehensive evaluation, the acquired fNIRS data are classified using a variety of machine-learning approaches. Linear discriminant analysis (LDA) showed a maximum accuracy of 60%, whereas non-augmented data classified by a convolutional neural network (CNN) provided the highest classification accuracy of 73%. Notably, after augmenting the data with DCGAN, the classification accuracy increased to 96%. In the time series data, statistically significant differences were observed before and after BB stimulation, indicating an improvement in the brain state, in line with the classification results. These findings illustrate the ability to detect changes in brain states with high accuracy using fNIRS, underline the need for larger datasets, and demonstrate that data augmentation can significantly help when brain-signal data are scarce.

Keywords:

functional near-infrared spectroscopy (fNIRS); hemodynamic response; deep convolutional generative adversarial network (DCGAN); feed-forward neural network; linear support vector machines; decision tree; restricted Boltzmann machine; convolutional neural networks; classification; binaural beats

1. Introduction

Originally developed for clinical tissue oxygenation monitoring [1], functional near-infrared spectroscopy (fNIRS) has evolved into a useful tool for functional neuroimaging studies [2,3,4]. To date, various fNIRS devices have been developed that track changes in local cerebral oxygenation by measuring variations in the concentration of deoxygenated hemoglobin (ΔHbR) and oxygenated hemoglobin (ΔHbO). fNIRS has been used to investigate a variety of brain activities, such as cognitive and motor processes [5,6,7,8].

fNIRS is being used more often in functional neuroimaging studies due to its affordability and portability [9]. Like functional magnetic resonance imaging (fMRI), fNIRS measures alterations in the brain’s hemodynamics, but it stands out for being quiet (with no operating sound). Its higher temporal resolution, freedom from space restrictions, and lack of any need for participants to lie down are also appealing features. These characteristics make fNIRS a preferred option for tracking hemodynamic changes associated with brain activity, not only in lab settings but also in more ecologically valid, real-world workspace environments.

As a useful neuroimaging tool, fNIRS has impressive potential and notable applications across multiple domains. Compared with electroencephalography, it has the advantage of having better spatial resolution and being less susceptible to noise [10]. Some example applications of fNIRS include assessing early neurodevelopment [11], cognitive discernment and perception [12], psychiatric conditions [13], language experiment research [14], as well as addressing issues related to stroke and brain damage [15]. Additionally, fNIRS plays a crucial role in clinical and network imaging [16], Brain-Computer Interfaces (BCIs) [17], and mental stress analysis [18,19].

Among the main sources of stress in an individual’s life is their workplace. Stress-related problems can arise when workers are overworked and are unable to meet unreasonable deadlines. Stress causes many problems that affect workers, such as disturbed sleep patterns, excruciating headaches, decreased concentration, and elevated absenteeism, and firms suffer large financial losses because of these ramifications. Anxiety, insomnia, recurrent injuries, and a rise in absenteeism, especially at work, are all associated with stress [20].

A multitude of studies have employed fNIRS as a primary modality for studying cortical activity with different stressors and cerebral regions. A 2018 study that examined the impact of mental stress on occupational performance used multiple virtual training instances and found significant prefrontal cortex (PFC) activation [21]. The increased activity of the PFC was then used to detect stress. The next year, the effects of psychosocial stress on cognition were examined using the Trier Social Stress Test (TSST) [22]. The results showed that, in male teenagers exposed to psychosocial stress, maintaining an increased level of physical activity did not always translate into improved inhibitory control. Additionally, other investigators employed mental arithmetic exercises as stressors, confirming the efficacy of fNIRS as a tool for early identification and measurement of mental stress [23,24]. The varied application of fNIRS across multiple stress paradigms highlights its efficacy as a neuroimaging tool for assessing and quantifying mental stress.

The need for advanced methodologies that can identify and track particular brain states has grown as a result of recent advancements in fNIRS technology, particularly in the field of brain state monitoring for biomedical applications. These applications address issues like stress and chronic diseases like dementia and mild cognitive impairment. One notable issue that arises when incorporating brain images—such as activation maps and connectivity maps—into the training and assessment of modern machine learning algorithms is that there are not enough data, which makes the models more prone to overfitting. Adopting data augmentation techniques becomes essential to lessening this challenge. Conventional augmentation techniques, such as cropping, rotating, and zooming, are frequently utilized; however, they have drawbacks, such as requiring manual intervention and being impractical for complex images such as brain activation maps [25,26].

Modern machine learning techniques, such as generative adversarial networks (GANs), provide a promising alternative to traditional augmentation methods [27]. In the context of brain imaging and machine learning applications, GANs offer a workable solution to the data scarcity dilemma by enabling the creation of realistic augmented data. Typically, conventional GAN models need a significant amount of data to be trained and used for image augmentation. However, recent noteworthy works by Toutouh et al. [28] and Zhao et al. [29] have paved the way for training GAN models even with a limited dataset size, such as a thousand images. These discoveries have greatly helped expand the use of GANs in situations where data availability is limited, which motivates the idea of augmenting fNIRS data for reliable machine learning model development. This combination of conventional and modern methods, along with improvements in GAN training on smaller datasets, provides a comprehensive approach to the reliable and efficient use of fNIRS data in the creation and assessment of advanced machine learning models.

The main contribution of the current work is to classify different brain states with higher classification accuracy with the help of a GAN. In doing so, this work aims to investigate the effects of stress on the patterns of spatial activation elicited by the challenging Stroop color–word task (SCWT). In addition, the current study aims to improve the brain state using binaural beat stimulation. We conducted morning and afternoon sessions with working individuals in anticipation of the participants experiencing stress due to the demands of the workday. Our main objective is to examine how participants’ hemodynamic responses to workplace stress change over time, determine whether these changes can be reliably detected using classification techniques, and determine how stress levels are reduced when participants are exposed to binaural beats.

2. Materials and Methods

2.1. Participants

Twenty-one healthy volunteers—thirteen men and eight women—working at the American University of Sharjah participated in this study (mean age: 29 ± 5 years). The subjects’ vision was either normal or corrected to normal. None of the participants reported having issues with hearing or seeing color. There was no history of neurological or visual impairment in any of the participants, and there was no proof of drug addiction or ongoing medication use. On the day of the experiment, the participants were instructed not to consume any alcohol, caffeine, or other drinks that would increase their energy levels. Prior to starting the experiment, every participant was given a comprehensive explanation about the study and every participant had the choice to withdraw from the experiment at any moment. Written consent forms were signed prior to the experiment. The American University of Sharjah’s Institutional Review Board granted permission for this study, which was carried out in compliance with the most recent Helsinki Declaration [30].

2.2. Task Design

This study used the SCWT as a stress-inducing paradigm. As part of the assignment, participants had to pay attention to six distinct color words that were presented in random order: “Green”, “Yellow”, “Red”, “Cyan”, “Magenta”, and “Blue”. Notably, the word displayed on the computer screen was printed in a color that was not consistent with its semantic meaning. A cognitive challenge was introduced by asking the participants to select the color of the typed word as a response, not the word itself. Participants had to select the ink color from six options that were displayed as push buttons underneath the displayed word. The background of the buttons featured a third color, and the text on the push buttons appeared in a different color to increase the complexity of the task. Rather than identifying the color of the button itself, participants were asked to identify the color written inside it.

Every question had a time limit, and if the participant did not answer in the given amount of time, a “Time is out” notification was displayed on the screen. Participants also received on-screen feedback about whether their selected option was correct. MATLAB® 2023 was used to implement the SCWT protocol, guaranteeing accuracy and consistency in task execution. Figure 1 illustrates an example of a question that participants saw on their screens.

Figure 1. An example of Stroop color–word task used as a stressor in the current study: (a) Welcome screen (b) Rest phase (c) The SCWT task in which the participant had to choose the word yellow (font color of the displayed word, i.e., BLUE) written in the cyan color box. (d) Feedback on choosing the right option.

2.3. Experimental Paradigm

The experiments were carried out at two different times of the day, i.e., morning and afternoon. There were 10 trials in a session of the experiment, each lasting 50 s (30 s task period). One experimental session, known as the control phase, was conducted in the morning, whereas three sessions were conducted in the afternoon, namely stress, mitigation, and post-mitigation. During the afternoon experiments, participants carried out the experiments in the stress phase (first phase) in a manner similar to that of the control phase. Afterward, during the mitigation phase (second phase), participants performed the SCWT tasks while simultaneously being stimulated by binaural beats. Ultimately, the third session (post-mitigation) replicated the stress or control phase by having participants complete multiple SCWT trials. A summary of the experimental paradigm is given in Figure 2.

Figure 2. Experimental paradigm.

Participants sat in a comfortable chair and were instructed to move their bodies as little as possible during the experiment. For all stimulation durations, a 20-s inter-stimulation interval was kept, along with 20-s pre- and post-rest intervals. During the rest period, a black screen was presented. Visual stimuli were displayed to the subjects on a computer screen, and they were instructed to keep their eyes open throughout the experiment.

2.4. Optode Placement

To record brain signals, the prefrontal cortex region was covered with seven detectors and eight emitters. The optode configuration on the region of interest is displayed in Figure 3. The Fpz position, defined by the International 10–20 System, was chosen as the reference point to guarantee accurate optode placement on the prefrontal cortex.

Figure 3. fNIRS optode configuration with red dots showing sources and blue showing detectors.

2.5. Data Acquisition

Brain signals were sampled at a frequency of 10.17 Hz for this study. A single-phase continuous-wave fNIRS system, NIRSport2 from NIRx Medical Technologies, Orlando, FL, USA, was used to acquire the fNIRS signals. The system used two wavelengths: 760 nm and 850 nm. Raw light intensities were converted into ΔHbO and ΔHbR using the Modified Beer-Lambert Law [31]; the conversion was carried out in NIRSlab.
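For reference, a standard textbook form of the Modified Beer-Lambert Law is sketched below. The symbols ε (extinction coefficient), d (source-detector distance), and DPF (differential pathlength factor) are not defined in this article and are introduced here only for illustration; the base-10 logarithm assumes decadic extinction coefficients, and the exact constants used by NIRSlab are not stated in the text.

```latex
\Delta OD(\lambda) = -\log_{10}\frac{I(t,\lambda)}{I_{0}(\lambda)}
  = \left[\varepsilon_{HbO}(\lambda)\,\Delta HbO + \varepsilon_{HbR}(\lambda)\,\Delta HbR\right] d \cdot DPF(\lambda)

\begin{bmatrix} \Delta HbO \\ \Delta HbR \end{bmatrix}
  = \frac{1}{d}
    \begin{bmatrix}
      \varepsilon_{HbO}(\lambda_{1}) & \varepsilon_{HbR}(\lambda_{1}) \\
      \varepsilon_{HbO}(\lambda_{2}) & \varepsilon_{HbR}(\lambda_{2})
    \end{bmatrix}^{-1}
    \begin{bmatrix}
      \Delta OD(\lambda_{1}) / DPF(\lambda_{1}) \\
      \Delta OD(\lambda_{2}) / DPF(\lambda_{2})
    \end{bmatrix},
  \qquad \lambda_{1} = 760\ \text{nm},\ \lambda_{2} = 850\ \text{nm}
```

With two wavelengths and two unknowns, the system can be inverted at every time sample, which is why a dual-wavelength device suffices to separate ΔHbO from ΔHbR.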

2.6. fNIRS Preprocessing

Several pipelines have been employed by researchers to preprocess fNIRS data [32]. After acquisition, the data (ΔHbO and ΔHbR) for the current study were preprocessed to remove noise contamination that could have impacted the signal quality. To correct for artifacts linked with subjects’ movement, the converted ΔHbO and ΔHbR signals were first subjected to principal component analysis followed by temporal derivative distribution repair [33,34]. Following motion-artifact correction, a Butterworth bandpass filter with a low-pass cutoff frequency of 0.15 Hz and a high-pass cutoff frequency of 0.01 Hz was applied to remove cardiac, respiratory, and low-frequency drift signals. Lastly, the desired hemodynamic response function (dHRF) was used in this work to identify neuronal activation. The dHRF was produced from two gamma functions, as explained in [35].
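A dHRF of this kind can be sketched as a task boxcar convolved with a difference of two gamma functions. The sketch below is a minimal illustration, not the authors' implementation: the gamma shape values (6 and 16), the undershoot ratio (1/6), the 32 s kernel length, and the assumption that the 30 s task occupies the start of each 50 s trial are commonly used defaults chosen here for illustration, and all function names are hypothetical.

```python
import math

FS = 10.17       # sampling rate in Hz (from the text)
TASK_S = 30.0    # task duration in seconds (from the text)
TRIAL_S = 50.0   # trial duration in seconds (from the text)


def gamma_pdf(t, shape, scale=1.0):
    """Gamma probability density at time t (seconds); zero for t <= 0."""
    if t <= 0:
        return 0.0
    return (t ** (shape - 1) * math.exp(-t / scale)) / (math.gamma(shape) * scale ** shape)


def canonical_hrf(t, peak_shape=6.0, under_shape=16.0, ratio=1.0 / 6.0):
    """Difference of two gamma functions (assumed default parameters)."""
    return gamma_pdf(t, peak_shape) - ratio * gamma_pdf(t, under_shape)


def desired_hrf(fs=FS, task_s=TASK_S, trial_s=TRIAL_S, hrf_s=32.0):
    """Convolve a task boxcar with the two-gamma kernel and scale the peak to 1."""
    n_trial = int(round(trial_s * fs))
    n_task = int(round(task_s * fs))
    n_hrf = int(round(hrf_s * fs))
    # Assumption: the task fills the first 30 s of the trial, rest follows.
    boxcar = [1.0 if i < n_task else 0.0 for i in range(n_trial)]
    kernel = [canonical_hrf(i / fs) for i in range(n_hrf)]
    out = []
    for n in range(n_trial):
        acc = 0.0
        # Discrete causal convolution, truncated to the trial length.
        for k in range(min(n, n_hrf - 1) + 1):
            acc += boxcar[n - k] * kernel[k]
        out.append(acc)
    peak = max(abs(v) for v in out)
    return [v / peak for v in out]


dhrf = desired_hrf()
```

The resulting list `dhrf` can then serve as the regressor against which each channel's ΔHbO time course is tested.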

2.7. Statistical Analysis

The mean of ΔHbO, t-values, and p-values were used for statistical analysis and the identification of active channels. The critical t-value (t_crt) was chosen according to the degrees of freedom of the trial period, and the significance level for the one-tailed t-test was set at 0.05. MATLAB®’s built-in robustfit function was used to calculate the t-values. A channel was considered active when the t-value was greater than t_crt and the p-value was less than 0.05.
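The activation criterion can be illustrated with a simplified sketch. Note the substitutions: the study used MATLAB's robustfit (iteratively reweighted least squares), whereas the code below uses ordinary least squares, and it approximates the t-distribution by a standard normal, which is reasonable only for the large degrees of freedom of a roughly 500-sample trial. The function names and the synthetic demo signals are hypothetical.

```python
import math
from statistics import NormalDist, fmean


def slope_t_value(y, x):
    """t-value of the slope when regressing signal y on regressor x (OLS with intercept)."""
    n = len(y)
    mx, my = fmean(x), fmean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    beta = sxy / sxx
    alpha = my - beta * mx
    sse = sum((yi - (alpha + beta * xi)) ** 2 for xi, yi in zip(x, y))
    se = math.sqrt(sse / (n - 2) / sxx)  # standard error of the slope
    return beta / se, n - 2


def is_active(y, x, alpha_level=0.05):
    """One-tailed test: active when t > t_crt and p < alpha_level."""
    t, _dof = slope_t_value(y, x)
    # Assumption: normal approximation to the t-distribution (large dof).
    t_crt = NormalDist().inv_cdf(1.0 - alpha_level)
    p = 1.0 - NormalDist().cdf(t)
    return t > t_crt and p < alpha_level


# Synthetic demo: a smooth regressor plus deterministic alternating "noise".
x_demo = [math.sin(2 * math.pi * i / 100) for i in range(200)]
noise = [0.5 * (-1) ** i for i in range(200)]
y_active = [2.0 * xi + ni for xi, ni in zip(x_demo, noise)]  # follows the regressor
y_flat = noise                                               # unrelated to the regressor

t_active, dof = slope_t_value(y_active, x_demo)
```

Here `is_active(y_active, x_demo)` evaluates to true and `is_active(y_flat, x_demo)` to false, mirroring the two-condition rule stated above.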

2.8. Data Augmentation

For data augmentation in this study, we used a deep convolutional generative adversarial network (DCGAN) [36]. GANs were initially introduced for image generation, so we first explain their basic idea with reference to image generation and then describe the adaptation to our use case. A GAN consists of a discriminator network and a generator network. The generator learns the distribution of the data and creates images from random noise inputs. Concurrently, the discriminator assesses whether an image is “real” and provides the losses for the generator and discriminator networks [37]. Figure 4 shows the general architecture of a GAN.

Figure 4. General architecture of generative adversarial network.

On the other hand, the DCGAN algorithm combines GAN with convolutional neural networks (CNN). With a discriminator network and a generator network, the DCGAN basic architecture is similar to that of GANs. The discriminator network uses convolutional and normalization layers, capped by a dense layer, to evaluate the authenticity of images, whereas the generator network uses transposed convolutional and normalization layers to convert a random noise vector into images. In the competitive training dynamic, the generator wants to produce images that are realistic, while the discriminator wants to correctly identify generated images as fraudulent [38].

The discriminator in DCGANs consists of a sequence of convolution layers with stride convolutions, each of which down-samples the input image. With each layer, the network learns the complex representation of input images for classification (fake or real). The discriminator loss is given by

D_{r} = \log (D(x))

D_{f} = \log (1 - D(G(z)))

where x is the input data, z is the noise vector, D(x) is the output of the discriminator for the real image, and D(G(z)) is the output for the fake image. The main goal of the discriminator is to minimize D(G(z)) and maximize D(x). The overall loss of discriminator is given by

D_{l} = \frac{1}{m} \sum_{i=1}^{m} \left( \log (D(x^{(i)})) + \log (1 - D(G(z^{(i)}))) \right)

In contrast, the generator comprises convolution layers that use fractional-strided convolutions or transpose convolutions, resulting in up-sampling of the input picture at each convolutional layer. As the noise moves through the layers, the network gradually increases the image size to match that of a real image. The loss function for the generator is defined as

G = \log (1 - D(G(z)))

G_{l} = \frac{1}{m} \sum_{i=1}^{m} \log (1 - D(G(z^{(i)})))

where the aim is to maximize D(G(z)). Moreover, the discriminator and the generator are continuously optimizing themselves, which can be represented as follows:

\min_{G} \max_{D} V(D, G) = E_{x \sim p_{data}(x)} [\log D(x)] + E_{z \sim p_{z}(z)} [\log (1 - D(G(z)))]

where p_{z}(z) is the distribution of the input noise, p_{data}(x) is the distribution of the real data, and E represents the expectation. Table 1 summarizes the main parameters of the DCGAN. The activation function used was the LeakyReLU function.

Table 1. Main Parameters of DCGAN.
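The batch-averaged discriminator and generator objectives defined above can be evaluated numerically with a small sketch. It operates on lists of discriminator outputs (probabilities in the open interval (0, 1)) rather than on a trained network; the function names, the example probabilities, and the LeakyReLU slope of 0.2 are illustrative assumptions, since the actual DCGAN parameters are those listed in Table 1.

```python
import math


def discriminator_loss(d_real, d_fake):
    """D_l = (1/m) * sum(log D(x_i) + log(1 - D(G(z_i)))); the discriminator maximizes this."""
    m = len(d_real)
    return sum(math.log(r) + math.log(1.0 - f) for r, f in zip(d_real, d_fake)) / m


def generator_loss(d_fake):
    """G_l = (1/m) * sum(log(1 - D(G(z_i)))); the generator minimizes this."""
    m = len(d_fake)
    return sum(math.log(1.0 - f) for f in d_fake) / m


def leaky_relu(x, slope=0.2):
    """LeakyReLU activation; the negative slope of 0.2 is an assumed value."""
    return x if x >= 0 else slope * x


# A confident discriminator: real images scored high, generated images scored low.
d_confident = discriminator_loss([0.9, 0.8], [0.1, 0.2])
# An undecided discriminator: everything scored 0.5.
d_undecided = discriminator_loss([0.5, 0.5], [0.5, 0.5])

# Generator objective when its images are rejected vs. when they fool the discriminator.
g_rejected = generator_loss([0.1, 0.2])
g_fooling = generator_loss([0.9, 0.8])
```

In this example `d_confident` is larger than `d_undecided`, and `g_fooling` is smaller than `g_rejected`, which matches the opposing goals expressed by the min-max formulation.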

2.9. Classification

After processing the data, the next task was to classify the data using two different methods: classification based on temporal features and classification using images. For the image-based classification, five different classifier types—feed-forward NN (FFNN), linear support vector machines (LSVM), decision tree (DT), restricted Boltzmann machine (RBM), and CNN—were employed. On the other hand, linear discriminant analysis (LDA) was used for the classification based on temporal features. The mean, maximum (max), slope, skewness (skew), and kurtosis (kurt) were among the temporal features that were extracted for four distinct window sizes of 0 to 5 s, 5 to 30 s, 30 to 50 s, and 0 to 50 s. The classification accuracy was computed using fivefold cross-validation.
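The windowed feature extraction described above can be sketched as follows. This is an illustration under stated assumptions: slope is taken as the least-squares slope against time, skewness and kurtosis are the moment-based (population) versions with non-excess kurtosis, and window boundaries are converted to sample indices by truncation. None of these choices is specified in the text, and the function names are hypothetical.

```python
from statistics import fmean

FS = 10.17                                        # sampling rate in Hz (from the text)
WINDOWS_S = [(0, 5), (5, 30), (30, 50), (0, 50)]  # window bounds in seconds (from the text)


def temporal_features(signal, fs=FS):
    """Return mean, max, slope, skewness, and kurtosis of one signal segment."""
    n = len(signal)
    mu = fmean(signal)
    m2 = fmean([(v - mu) ** 2 for v in signal])
    m3 = fmean([(v - mu) ** 3 for v in signal])
    m4 = fmean([(v - mu) ** 4 for v in signal])
    t = [i / fs for i in range(n)]
    tm = fmean(t)
    # Least-squares slope of the signal against time (units per second).
    slope = sum((ti - tm) * (v - mu) for ti, v in zip(t, signal)) / sum((ti - tm) ** 2 for ti in t)
    return {
        "mean": mu,
        "max": max(signal),
        "slope": slope,
        "skew": m3 / m2 ** 1.5 if m2 > 0 else 0.0,
        "kurt": m4 / m2 ** 2 if m2 > 0 else 0.0,
    }


def windowed_features(trial, fs=FS, windows=WINDOWS_S):
    """Compute the five features for each window of a single-trial time course."""
    feats = {}
    for start, stop in windows:
        segment = trial[int(start * fs):int(stop * fs)]
        feats[(start, stop)] = temporal_features(segment, fs)
    return feats


# Synthetic demo: a linear ramp standing in for one 50 s trial (508 samples).
ramp = [0.01 * i for i in range(508)]
feats = windowed_features(ramp)
```

Pairs of these features (for example, mean and slope from the same window) would then form the two-dimensional inputs to the LDA classifier.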

The data from all trials were first converted into visual representations for image-based analysis. In the current study, activation maps and connectivity maps were used as image features. Activation maps were produced using a t-test, and MATLAB’s robustfit function was used to calculate the t-values for each channel. To increase the size of the data, each trial was further divided into 5-s windows, resulting in 10 images from each trial. Functional connectivity (FC) in the brain refers to the interactions between different brain regions, characterized by the temporal correlations observed between neurophysiological activities in spatially separated areas. These interactions are represented on the connectome, which maps individual differences in brain organization and highlights the potential of connectivity-based approaches for biometric applications. In this study, Pearson’s correlation coefficients (r) were calculated using temporal data from all channels to create connectivity matrices. These matrices detail both intra- and inter-hemispheric connectivity. The matrix elements are the correlation coefficients between paired channels, with the rows and columns corresponding to the channel numbers. Once the activation maps and FC maps were acquired, CNN, FFNN, LSVM, DT, and RBM were used to classify these images. Again, fivefold cross-validation was used to obtain the average classification accuracy.
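A connectivity matrix of this kind can be sketched in a few lines. The example below computes Pearson's r between every pair of channel time courses; the function names and the three toy channels are hypothetical, and a constant channel (zero variance) would need separate handling because r is undefined for it.

```python
import math


def pearson_r(a, b):
    """Pearson correlation coefficient between two equal-length time courses."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


def connectivity_matrix(channels):
    """Symmetric matrix whose (i, j) entry is r between channel i and channel j."""
    k = len(channels)
    mat = [[1.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(i + 1, k):
            r = pearson_r(channels[i], channels[j])
            mat[i][j] = mat[j][i] = r
    return mat


# Toy data: channel 1 rises with channel 0, channel 2 falls as channel 0 rises.
toy_channels = [
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [2.0, 4.0, 6.0, 8.0, 10.0],
    [5.0, 4.0, 3.0, 2.0, 1.0],
]
fc = connectivity_matrix(toy_channels)
```

Applied to one 5-s window of all channels, the resulting matrix is what would be rendered as a single connectivity-map image.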

3. Results

3.1. Comparison of Hemodynamic Responses

The hemodynamic response function (HRF) is a common pattern for any type of activation seen in the human brain. The desired hemodynamic response function (dHRF) was used to identify the trials that exhibited activation and to characterize the shape of the HRF. After extraction, the active trials were averaged for each subject. The control phase, stress phase, mitigation phase, and post-mitigation phase were all processed using the same procedure. In each of the four cases, visual inspection verified credible activation. However, there was a discernible difference between the hemodynamic responses seen prior to and following the binaural beats session (p-value < 0.05). The averaged hemodynamic response activation patterns for each of the four cases are shown in Figure 5. The task period is indicated by the green shaded area, and the rest interval is indicated by the non-shaded area.

Figure 5. Averaged hemodynamic response achieved for all four phases.

T-tests were used to determine the statistical significance of the activation level in each of the four cases. Paired t-tests were used to compare activation in stress, mitigation, and post-mitigation sessions, and independent sample t-tests were used to compare the average activation in the control phase with the other three cases. A difference met both of the following requirements to be deemed statistically significant: (i) p-value < 0.05 and (ii) t-value > critical t-value (t_crt).

A statistically significant difference was noted in the mean activation levels between the control and stress phases (p-value < 0.001). Interestingly, during the binaural beats session, there was no statistically significant difference in the responses between the mitigation phase and control phase (p-value = 0.377). However, a significant difference was found when stress phase data were compared with post-mitigation phase data (p-value < 0.005). The activation of the brain after the binaural beats session (post-mitigation) was significantly higher (p-value < 0.005) than it was during the control phase. Overall, a significant improvement was noticed in the hemodynamic response because of the binaural beat stimulation.

3.2. Enhancement of Classification Accuracies

Figure 6 and Figure 7 show the averaged activation maps and connectivity maps, respectively. As mentioned earlier, there were two different methods used in the classification process: the first was the use of temporal features, and the second was the use of brain maps as features. In the case of classifying using time series data, all temporal features were extracted and subsequently utilized in pairs. The LDA-based classification was performed for each of the four feature extraction windows. Table 2 shows the classification accuracy obtained with each of the feature sets.

Figure 6. Averaged activation maps for each of the four phases: (a) control, (b) stress, (c) mitigation, and (d) post-mitigation.

Figure 7. Averaged connectivity maps for each of the four phases: (a) control, (b) stress, (c) mitigation, and (d) post-mitigation.

Table 2. LDA-based classification accuracies (%) obtained for four different window sizes.

Given the two-class classification, the highest classification accuracy achieved across all comparisons was 60.69%, which can be regarded as a moderate result. Furthermore, no statistically significant difference was found in the classification accuracies when comparing the three different cases, i.e., control vs. stress, control vs. mitigation, and control vs. post-mitigation. Brain activation maps and connectivity maps are widely used in both fMRI and fNIRS to find the activated areas. These maps served as features in the classification process for the current study. For a single 50-s trial, ten activation maps and ten connectivity maps were produced, each from five seconds of data. For a single subject, this procedure produced a total of 100 brain activation map images and 100 connectivity map images in each phase (10 trials, each yielding 10 maps). A total of 2100 images of each map type were collected from 21 subjects for each of the four scenarios (control, stress, mitigation, and post-mitigation). From this dataset of 2100 images per class, five different classifiers (FFNN, LSVM, DT, RBM, and CNN) were used to perform the same pairwise comparisons as in the classification based on temporal features. Table 3 shows the resulting classification accuracies obtained using the brain activation maps. Similarly, Table 4 shows the classification accuracies obtained using the connectivity maps.
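The image counts above, and the fold sizes they imply for a two-class comparison, can be checked with a short sketch. The contiguous, unshuffled fold assignment shown here is an assumption made purely for illustration: the text does not state whether folds were formed by image, trial, or subject, nor whether the data were shuffled. The function name is hypothetical.

```python
TRIALS_PER_PHASE = 10         # from the text
WINDOWS_PER_TRIAL = 50 // 5   # 50-s trial split into 5-s windows
SUBJECTS = 21                 # from the text

images_per_subject = TRIALS_PER_PHASE * WINDOWS_PER_TRIAL  # maps per subject per phase
images_per_class = images_per_subject * SUBJECTS           # maps per phase over all subjects


def kfold_indices(n_samples, k=5):
    """Yield (train, test) index lists for k contiguous folds of near-equal size."""
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size


# A two-class comparison (e.g., control vs. stress) pools two phases.
n_two_class = 2 * images_per_class
folds = list(kfold_indices(n_two_class))
```

Under these assumptions each of the five folds holds out 840 images and trains on the remaining 3360.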

Table 3. Classification accuracies obtained using activation maps (not augmented). Classifiers used: feed-forward neural network (FFNN), linear support vector machine (LSVM), decision tree (DT), restricted Boltzmann machine (RBM), and convolutional neural network (CNN).

Table 4. Classification accuracies obtained using connectivity maps (not augmented). Classifiers used: feed-forward neural network (FFNN), linear support vector machine (LSVM), decision tree (DT), restricted Boltzmann machine (RBM), and convolutional neural network (CNN).

In this case, the maximum classification accuracy achieved was 73% for control vs. post-mitigation. Lastly, the acquired 2100 images per class were used as the input dataset for augmentation with DCGAN. After almost doubling the size of the data, the augmented dataset was classified using the five classifiers mentioned earlier. The classification accuracy obtained with the augmented brain activation maps is shown in Table 5. Similarly, Table 6 shows the classification accuracies obtained using augmented connectivity maps.

Table 5. Classification accuracies obtained using augmented brain activation maps. Classifiers used: feed-forward neural network (FFNN), linear support vector machine (LSVM), decision tree (DT), restricted Boltzmann machine (RBM), and convolutional neural network (CNN).

Table 6. Classification accuracies obtained using augmented connectivity maps. Classifiers used: feed-forward neural network (FFNN), linear support vector machine (LSVM), decision tree (DT), restricted Boltzmann machine (RBM), and convolutional neural network (CNN).

4. Discussion

The purpose of this work was to investigate whether DCGAN can be used to augment data based on hemodynamic responses (i.e., ΔHbO) in the brain’s frontal cortex and thereby improve the classification/detection accuracy of brain states. The SCWT was used to elicit brain activity during the acquisition of fNIRS signals across four distinct brain states: control, stress, mitigation, and post-mitigation. Initially, temporal features were used in pairs for LDA-based classification.

Next, using brain activation maps as a feature, five classifiers were used for the classification process: CNN, FFNN, LSVM, DT, and RBM. Lastly, DCGAN was used for data augmentation, which aided in improving classification accuracy in stress detection and confirmed the efficacy of the suggested technique.

The primary objective of this study is to contribute to the advancement of classification accuracy, particularly in detecting specific brain states such as stress. One of the main objectives of fNIRS-based brain imaging is to identify abnormal brain conditions, such as stress, anxiety, dementia, Alzheimer’s, mild cognitive impairment, etc. The requirement for automated procedures in accurately detecting these mental states is of importance, and achieving higher classification accuracies leads to robust and reliable outcomes.

During the experiment, performing the SCWT under time constraints, together with the feedback participants received, significantly increased their stress levels. This was reflected by noticeable changes in the participants’ behavior, as they showed signs of increased anxiety and nervousness during the SCWT task. As the study participants were all employed individuals, the hypothesis was that the brain states in the morning (control) and afternoon (stress, mitigation, and post-mitigation) would differ significantly from one another, as shown in Figure 5. Statistical analysis of the hemodynamic response during the stress and mitigation phases showed no significant difference. Nonetheless, a statistically significant difference (p-value < 0.001) was found in the hemodynamic response for the control and post-mitigation phases. Furthermore, there was a statistically significant difference in hemodynamic response between the post-mitigation and stress phases, indicating that the brain state was enhanced by binaural beat stimulation. In Figure 5, it can be seen that the activation levels during the stress and post-mitigation phases are higher than in the control phase. This may be due to adaptation to the task.

For the classification based on temporal features, a feature set comprising five features was extracted using the hemodynamic responses in each of the four groups. Four different window sizes were used to extract these features: 0–5 s, 5–30 s, 30–50 s, and 0–50 s. The objective of using the extracted features in pairs was to distinguish the control state from the other three brain states: stress, mitigation, and post-mitigation. LDA was employed for this classification, given its widespread use as a classifier for temporal fNIRS data [39]. No significant difference was found between the classification accuracies of the three cases that were examined. This finding shows the limitation of relying solely on temporal features for the detection of specific brain states and suggests that they are not a viable option on their own.

The prefrontal cortex plays an important role in various active-memory tasks, including mental arithmetic, mental counting, and working-memory tasks, and is predominantly responsive to mental training. As people age, the functional capacity of this brain area weakens, potentially leading to devastating conditions like Alzheimer’s. The variations in mental health associated with such changes can be effectively followed through brain activation maps. Therefore, the impact of binaural beats was also verified by examining the changes in activation maps.

Each SCWT trial was divided into 5 s blocks, which were then used to make the activation maps and connectivity maps of the prefrontal cortex area of the brain. Since every trial lasted for 50 s, 10 images were generated from each trial. These maps were used to perform classification using five different classifiers: CNN, FFNN, LSVM, DT, and RBM. Out of the five classifiers, FFNN provided the lowest accuracy (52%). The statistical findings derived from examining ΔHbO signals were supported by the classification accuracy of each of the five classifiers. All the classifiers demonstrated the highest classification accuracy in the control vs. post-mitigation scenario, corresponding to a statistically significant difference in hemodynamic responses between the two groups. Additionally, CNN produced the highest classification accuracy, outperforming all other classifiers, although the accuracy obtained was only about 70%.

Lastly, DCGAN was used to augment the brain activation maps and connectivity maps, and the five classifiers mentioned earlier were used to classify the images. A significant increase in classification accuracy was observed when the augmented data were used (Tables 5 and 6). With the augmented activation maps, the highest CNN accuracy rose from 73% to 94%. CNN gave the best classification accuracies, which were 91%, 86%, and 94% for control vs. stress, control vs. mitigation, and control vs. post-mitigation, respectively. Similarly, with the augmented connectivity maps, the classification accuracy rose to 96%. DCGAN thus significantly improved the classification accuracy for fNIRS-based imaging. With the proposed method, abnormalities in brain states can be detected efficiently.
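The size of the improvement can be read directly from the reported tables. The short calculation below uses the activation-map accuracies from Table 3 (original data) and Table 5 (augmented data); the variable names are illustrative only.

```python
CASES = ("control vs. stress", "control vs. mitigation", "control vs. post-mitigation")

# Activation-map accuracies (%) as reported in Table 3 (original data)
# and Table 5 (DCGAN-augmented data).
ORIGINAL = {"CNN": (72, 68, 73), "FFNN": (57, 52, 59), "DT": (64, 60, 63),
            "LSVM": (66, 63, 67), "RBM": (62, 59, 64)}
AUGMENTED = {"CNN": (91, 86, 94), "FFNN": (62, 57, 64), "DT": (88, 85, 89),
             "LSVM": (84, 80, 86), "RBM": (76, 70, 75)}

# Gain in percentage points for each classifier and each comparison.
gains = {
    clf: tuple(aug - orig for orig, aug in zip(ORIGINAL[clf], AUGMENTED[clf]))
    for clf in ORIGINAL
}

for clf, deltas in gains.items():
    summary = ", ".join(f"{case}: +{d}" for case, d in zip(CASES, deltas))
    print(f"{clf:5s} {summary}")
```

In these figures, CNN gains 18 to 21 percentage points, DT gains the most (24 to 26 points), and FFNN gains the least (5 points in every comparison).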

The scope of the study was limited to the prefrontal cortex due to the small number of fNIRS channels available. By adding more channels to explore different brain regions, future studies can expand their scope of investigation. Moreover, hybrid EEG-fNIRS neuroimaging presents a promising way to acquire brain signals, enabling whole-brain examination and verification of the study’s hypotheses [40,41]. A primary limitation of fNIRS must also be acknowledged: inter-subject and intra-subject variability in the hemodynamic response signal [42]. This research addressed this constraint by combining information from several participants, which enabled an evaluation of the overall trend. Subsequent studies could examine high-density or bundled optode placements, with short-separation channels included, to enhance spatial resolution and improve activation map accuracy [9]. Future research may examine neuroplasticity in the hours or days following binaural beats therapy to determine any long-lasting effects. Future studies will also focus on systematically exploring and comparing different data augmentation techniques, as well as investigating the impact of varying DCGAN parameters, to further enhance the robustness and generalizability of fNIRS-based classification models. A further constraint concerns the study’s sample size, which, although comparable to earlier research, should be extended to a larger cohort. Furthermore, gender variability was not investigated in this study; one possible direction is to analyze male and female participants separately for comparison. In addition, future studies might apply electrical or magnetic stimulation to particular brain regions in order to evaluate region-specific effects [43,44].

5. Conclusions

In conclusion, this study tackles the challenge of detecting distinct brain states with higher accuracy, with an emphasis on the impact of stress during the Stroop color–word task measured through functional near-infrared spectroscopy (fNIRS). Using a variety of classification approaches, we encountered the frequently occurring constraint of limited participant numbers, which leads to lower accuracy. The introduction of data augmentation through a deep convolutional generative adversarial network proved critical in increasing classification accuracy to 96%. Notably, the same classification approaches fell short with the original dataset (without augmentation), highlighting the importance of data augmentation in improving interpretability. The investigation of binaural beat stimulation adds a further dimension to our study, suggesting possible stress reduction benefits. The results demonstrate the potential of fNIRS in detecting subtle changes in brain states, especially when combined with larger datasets obtained through data augmentation. This study adds insights to the advancement of brain state identification and opens up a promising route for advanced interpretations in the field of functional brain imaging.

Author Contributions

M.N.A.K. performed the data analysis and wrote the initial draft. N.Z. assisted in data analysis. U.T. participated in designing the study. G.M. and I.F.A. conducted the experiments. H.A.-N. supervised the entire project and finalized the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the American University of Sharjah, United Arab Emirates, under Grant FRG-23-C-E05. Publishing this paper was supported, in part, by the Open Access Program from the American University of Sharjah. This paper represents the opinions of the author(s) and does not mean to represent the position or opinions of the American University of Sharjah.

Institutional Review Board Statement

The study was conducted in accordance with the latest Helsinki Declaration after obtaining authorization from the Institutional Review Board (IRB) of the American University of Sharjah (IRB# 19-513).

Informed Consent Statement

Written informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

Acknowledgments

We extend our gratitude to all the participants who generously volunteered their time and efforts for this study.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Jöbsis, F.F. Noninvasive, Infrared Monitoring of Cerebral and Myocardial Oxygen Sufficiency and Circulatory Parameters. Science 1977, 198, 1264–1267.
2. Villringer, A.; Planck, J.; Hock, C.; Schleinkofer, L.; Dirnagl, U. Near infrared spectroscopy (NIRS): A new tool to study hemodynamic changes during activation of brain function in human adults. Neurosci. Lett. 1993, 154, 101–104.
3. Hoshi, Y.; Tamura, M. Near-Infrared Optical Detection of Sequential Brain Activation in the Prefrontal Cortex during Mental Tasks. Neuroimage 1997, 5, 292–297.
4. Chance, B.; Zhuang, Z.; UnAh, C.; Alter, C.; Lipton, L. Cognition-activated low-frequency modulation of light absorption in human brain. Proc. Natl. Acad. Sci. USA 1993, 90, 3770–3774.
5. Ferrari, M.; Quaresima, V. A brief review on the history of human functional near-infrared spectroscopy (fNIRS) development and fields of application. Neuroimage 2012, 63, 921–935.
6. Ayaz, H.; Onaral, B.; Izzetoglu, K.; Shewokis, P.A.; McKendrick, R.; Parasuraman, R. Continuous monitoring of brain dynamics with functional near infrared spectroscopy as a tool for neuroergonomic research: Empirical examples and a technological development. Front. Hum. Neurosci. 2013, 7, 871.
7. Khan, M.N.A.; Hong, K.-S. Most favorable stimulation duration in the sensorimotor cortex for fNIRS-based BCI. Biomed. Opt. Express 2021, 12, 5939–5954.
8. Khan, M.; Ghafoor, U.; Yoo, H.-R.; Hong, K.-S. Acupuncture enhances brain function in patients with mild cognitive impairment: Evidence from a functional-near infrared spectroscopy study. Neural Regen. Res. 2022, 17, 1850.
9. Yaqub, M.A.; Woo, S.-W.; Hong, K.-S. Compact, Portable, High-Density Functional Near-Infrared Spectroscopy System for Brain Imaging. IEEE Access 2020, 8, 128224–128238.
10. Naseer, N.; Hong, K.-S. fNIRS-based brain-computer interfaces: A review. Front. Hum. Neurosci. 2015, 9, 3.
11. Watanabe, H.; Shitara, Y.; Aoki, Y.; Inoue, T.; Tsuchida, S.; Takahashi, N.; Taga, G. Hemoglobin phase of oxygenation and deoxygenation in early brain development measured using fNIRS. Proc. Natl. Acad. Sci. USA 2017, 114, E1737–E1744.
12. Cutini, S.; Moro, S.B.; Bisconti, S. Functional near Infrared Optical Imaging in Cognitive Neuroscience: An Introductory Review. J. Near Infrared Spectrosc. 2012, 20, 75–92.
13. Ghafoor, U.; Lee, J.-H.; Hong, K.-S.; Park, S.-S.; Kim, J.; Yoo, H.-R. Effects of Acupuncture Therapy on MCI Patients Using Functional Near-Infrared Spectroscopy. Front. Aging Neurosci. 2019, 11, 237.
14. Wang, Z.; Zhou, Y.; Chen, L.; Gu, B.; Yi, W.; Liu, S.; Xu, M.; Qi, H.; He, F.; Ming, D. BCI Monitor Enhances Electroencephalographic and Cerebral Hemodynamic Activations During Motor Training. IEEE Trans. Neural Syst. Rehabil. Eng. 2019, 27, 780–787.
15. Petrantonakis, P.C.; Kompatsiaris, I. Single-Trial NIRS Data Classification for Brain–Computer Interfaces Using Graph Signal Processing. IEEE Trans. Neural Syst. Rehabil. Eng. 2018, 26, 1700–1709.
16. Zheng, Y.; Zhang, D.; Wang, L.; Wang, Y.; Deng, H.; Zhang, S.; Li, D.; Wang, D. Resting-State-Based Spatial Filtering for an fNIRS-Based Motor Imagery Brain-Computer Interface. IEEE Access 2019, 7, 120603–120615.
17. Tanveer, M.A.; Khan, M.J.; Qureshi, M.J.; Naseer, N.; Hong, K.-S. Enhanced Drowsiness Detection Using Deep Learning: An fNIRS Study. IEEE Access 2019, 7, 137920–137929.
18. Katmah, R.; Al-Shargie, F.; Tariq, U.; Babiloni, F.; Al-Mughairbi, F.; Al-Nashash, H. Mental Stress Management Using fNIRS Directed Connectivity and Audio Stimulation. IEEE Trans. Neural Syst. Rehabil. Eng. 2023, 31, 1086–1096.
19. Al-Shargie, F.; Katmah, R.; Tariq, U.; Babiloni, F.; Al-Mughairbi, F.; Al-Nashash, H. Stress management using fNIRS and binaural beats stimulation. Biomed. Opt. Express 2022, 13, 3552.
20. Mark, G.; Smith, A. Effects of occupational stress, job characteristics, coping, and attributional style on the mental health and job satisfaction of university employees. Anxiety Stress Coping 2012, 25, 63–78.
21. Shi, Y.; Zhu, Y.; Mehta, R.K.; Du, J. A neurophysiological approach to assess training outcome under stress: A virtual reality experiment of industrial shutdown maintenance using Functional Near-Infrared Spectroscopy (fNIRS). Adv. Eng. Inform. 2020, 46, 101153.
22. Mücke, M.; Ludyga, S.; Colledge, F.; Pühse, U.; Gerber, M. Association of Exercise with Inhibitory Control and Prefrontal Brain Activity Under Acute Psychosocial Stress. Brain Sci. 2020, 10, 439.
23. Gurel, N.Z.; Jung, H.; Hersek, S.; Inan, O.T. Fusing Near-Infrared Spectroscopy With Wearable Hemodynamic Measurements Improves Classification of Mental Stress. IEEE Sens. J. 2019, 19, 8522–8531.
24. Shirvan, R.A.; Setarehdan, S.K.; Nasrabadi, A.M. Classification of Mental Stress Levels by Analyzing fNIRS Signal Using Linear and Non-linear Features. Int. Clin. Neurosci. J. 2018, 5, 55–61.
25. Wickramaratne, S.D.; Mahmud, M.S. Conditional-GAN Based Data Augmentation for Deep Learning Task Classifier Improvement Using fNIRS Data. Front. Big Data 2021, 4, 659146.
26. Nagasawa, T.; Sato, T.; Nambu, I.; Wada, Y. fNIRS-GANs: Data augmentation using generative adversarial networks for classifying motor tasks from functional near-infrared spectroscopy. J. Neural Eng. 2020, 17, 016068.
27. Zhang, Y.; Liu, D.; Li, T.; Zhang, P.; Li, Z.; Gao, F. CGAN-rIRN: A data-augmented deep learning approach to accurate classification of mental tasks for a fNIRS-based brain-computer interface. Biomed. Opt. Express 2023, 14, 2934.
28. Toutouh, J.; O’Reilly, U.-M.; Hemberg, E. Data Dieting in GAN Training. arXiv 2020, arXiv:2004.04642.
29. Zhao, S.; Liu, Z.; Lin, J.; Zhu, J.-Y.; Han, S. Differentiable Augmentation for Data-Efficient GAN Training. arXiv 2020, arXiv:2006.10738.
30. Christie, B. Doctors revise Declaration of Helsinki. BMJ 2000, 321, 913.
31. Kocsis, L.; Herman, P.; Eke, A. The modified Beer–Lambert law revisited. Phys. Med. Biol. 2006, 51, N91–N98.
32. Gemignani, J.; Gervain, J. Comparing different pre-processing routines for infant fNIRS data. Dev. Cogn. Neurosci. 2021, 48, 100943.
33. Zhang, X.; Noah, J.A.; Hirsch, J. Separation of the global and local components in functional near-infrared spectroscopy signals using principal component spatial filtering. Neurophotonics 2016, 3, 015004.
34. Fishburn, F.A.; Ludlum, R.S.; Vaidya, C.J.; Medvedev, A.V. Temporal Derivative Distribution Repair (TDDR): A motion correction method for fNIRS. Neuroimage 2019, 184, 171–179.
35. Zafar, A.; Hong, K.-S. Neuronal Activation Detection Using Vector Phase Analysis with Dual Threshold Circles: A Functional Near-Infrared Spectroscopy Study. Int. J. Neural Syst. 2018, 28, 1850031.
36. Radford, A.; Metz, L.; Chintala, S. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks. arXiv 2015, arXiv:1511.06434.
37. Wu, Q.; Chen, Y.; Meng, J. DCGAN-Based Data Augmentation for Tomato Leaf Disease Identification. IEEE Access 2020, 8, 98716–98728.
38. Yahaya, M.S.M.; Teo, J. Data augmentation using generative adversarial networks for images and biomarkers in medicine and neuroscience. Front. Appl. Math. Stat. 2023, 9, 1162760.
39. Benerradi, J.; Clos, J.; Landowska, A.; Valstar, M.F.; Wilson, M.L. Benchmarking framework for machine learning classification from fNIRS data. Front. Neuroergon. 2023, 4, 99496.
40. Gao, Y.; Jia, B.; Houston, M.; Zhang, Y. Hybrid EEG-fNIRS Brain Computer Interface Based on Common Spatial Pattern by Using EEG-Informed General Linear Model. IEEE Trans. Instrum. Meas. 2023, 72, 4006110.
41. Wang, Z.; Yang, L.; Zhou, Y.; Chen, L.; Gu, B.; Liu, S.; Xu, M.; He, F.; Ming, D. Incorporating EEG and fNIRS Patterns to Evaluate Cortical Excitability and MI-BCI Performance During Motor Training. IEEE Trans. Neural Syst. Rehabil. Eng. 2023, 31, 2872–2882.
42. Sato, H.; Fuchino, Y.; Kiguchi, M.; Katura, T.; Maki, A.; Yoro, T.; Koizumi, H. Intersubject variability of near-infrared spectroscopy signals during sensorimotor cortex activation. J. Biomed. Opt. 2005, 10, 044001.
43. Lu, H.; Zhang, Y.; Qiu, H.; Zhang, Z.; Tan, X.; Huang, P.; Zhang, M.; Miao, D.; Zhu, X. A new perspective for evaluating the efficacy of tACS and tDCS in improving executive functions: A combined tES and fNIRS study. Hum. Brain Mapp. 2023, 45, e26559.
44. Huo, C.; Xu, G.; Xie, H.; Zhao, H.; Zhang, X.; Li, W.; Zhang, S.; Huo, J.; Li, H.; Sun, A.; et al. Effect of High-Frequency rTMS combined with bilateral arm training on brain functional network in patients with chronic stroke: An fNIRS study. Brain Res. 2023, 1809, 148357.

Figure 1. An example of the Stroop color–word task used as a stressor in the current study: (a) welcome screen; (b) rest phase; (c) the SCWT task, in which the participant had to choose the word yellow (the font color of the displayed word, i.e., BLUE) written in the cyan color box; (d) feedback on choosing the right option.

Figure 2. Experimental paradigm.

Figure 3. fNIRS optode configuration with red dots showing sources and blue showing detectors.

Figure 4. General architecture of generative adversarial network.

Figure 5. Averaged hemodynamic response achieved for all four phases.

Figure 6. Averaged activation maps for each of the four phases: (a) control, (b) stress, (c) mitigation, and (d) post-mitigation.

Figure 7. Averaged connectivity maps for each of the four phases: (a) control, (b) stress, (c) mitigation, and (d) post-mitigation.

Table 1. Main Parameters of DCGAN.

Parameters	Values
Batch size	32
Epoch	30,000
Learning rate	0.0001
Image Channels	3
Batch Normalization	0.9
Drop-Out	0.3
Strides	2
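
A stride of 2, as listed in Table 1, typically halves the feature-map side length in each discriminator convolution and doubles it in each generator transposed convolution. The sketch below applies the standard output-size formulas to show this. The kernel size of 4, the padding of 1, the 64 × 64 image size, and the four-layer depth are assumptions borrowed from common DCGAN configurations; they are not values reported in this paper.

```python
STRIDE = 2     # from Table 1
KERNEL = 4     # assumed (hypothetical)
PADDING = 1    # assumed (hypothetical)


def conv_out(size, kernel=KERNEL, stride=STRIDE, padding=PADDING):
    """Output side length of a strided convolution."""
    return (size + 2 * padding - kernel) // stride + 1


def conv_transpose_out(size, kernel=KERNEL, stride=STRIDE, padding=PADDING):
    """Output side length of a transposed convolution (no output padding)."""
    return (size - 1) * stride - 2 * padding + kernel


def trace_sizes(start, layers, step):
    """Apply `step` repeatedly and record the side length after each layer."""
    sizes = [start]
    for _ in range(layers):
        sizes.append(step(sizes[-1]))
    return sizes


# Discriminator path: an assumed 64 x 64 image shrinks layer by layer.
discriminator_sizes = trace_sizes(64, 4, conv_out)
# Generator path: an assumed 4 x 4 seed map grows back to the image size.
generator_sizes = trace_sizes(4, 4, conv_transpose_out)
```

With these assumed values, the two paths mirror each other, which is the usual reason for pairing a stride of 2 with this kernel and padding.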

Table 2. LDA-based classification accuracies (%) obtained for four different window sizes.

Window Size	Control vs. Stress	Control vs. Mitigation	Control vs. Post-Mitigation
0 to 5 s	58.84	59.06	59.04
5 to 30 s	59.53	59.60	60.04
30 to 50 s	58.58	59.50	59.29
0 to 50 s	59.56	59.81	59.96

Table 3. Classification accuracies obtained using brain activation maps (not augmented). Classifiers used: feed-forward neural network (FFNN), linear support vector machine (LSVM), decision tree (DT), restricted Boltzmann machine (RBM), and convolutional neural network (CNN).

Accuracy (%)
Classifiers	Control vs. Stress	Control vs. Mitigation	Control vs. Post-Mitigation
CNN	72	68	73
FFNN	57	52	59
DT	64	60	63
LSVM	66	63	67
RBM	62	59	64

Table 4. Classification accuracies obtained using connectivity maps (not augmented). Classifiers used: feed-forward neural network (FFNN), linear support vector machine (LSVM), decision tree (DT), restricted Boltzmann machine (RBM), and convolutional neural network (CNN).

Accuracy (%)
Classifiers	Control vs. Stress	Control vs. Mitigation	Control vs. Post-Mitigation
CNN	76	64	73
FFNN	61	52	65
DT	61	56	64
LSVM	68	64	69
RBM	63	56	66

Table 5. Classification accuracies obtained using augmented brain activation maps. Classifiers used: feed-forward neural network (FFNN), linear support vector machine (LSVM), decision tree (DT), restricted Boltzmann machine (RBM), and convolutional neural network (CNN).

Accuracy (%)
Classifiers	Control vs. Stress	Control vs. Mitigation	Control vs. Post-Mitigation
CNN	91	86	94
FFNN	62	57	64
DT	88	85	89
LSVM	84	80	86
RBM	76	70	75

Table 6. Classification accuracies obtained using augmented connectivity maps. Classifiers used: feed-forward neural network (FFNN), linear support vector machine (LSVM), decision tree (DT), restricted Boltzmann machine (RBM), and convolutional neural network (CNN).

Accuracy (%)
Classifiers	Control vs. Stress	Control vs. Mitigation	Control vs. Post-Mitigation
CNN	93	88	96
FFNN	66	61	70
DT	91	87	94
LSVM	85	82	86
RBM	74	71	77

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
