Investigating Brain Activity of Children with Autism Spectrum Disorder During STEM-Related Cognitive Tasks

Penmetsa, Harshith; Abbasi, Rahma; Yellamilli, Nagasree; Winkelman, Kimberly; Chan, Jeff; Hwang, Jaejin; Cho, Kyu Taek

doi:10.3390/info16100880

Open AccessArticle

Investigating Brain Activity of Children with Autism Spectrum Disorder During STEM-Related Cognitive Tasks

by

Harshith Penmetsa

¹,

Rahma Abbasi

²,

Nagasree Yellamilli

³,

Kimberly Winkelman

⁴,

Jeff Chan

^2,*

,

Jaejin Hwang

^3,*

and

Kyu Taek Cho

^5,*

¹

Operations Management and Information Systems, Northern Illinois University, DeKalb, IL 60115, USA

²

Department of Special and Early Education, Northern Illinois University, DeKalb, IL 60115, USA

³

Department of Industrial and Systems Engineering, Northern Illinois University, DeKalb, IL 60115, USA

⁴

Westside Children’s Therapy, DeKalb, IL 60115, USA

⁵

Department of Mechanical Engineering, Northern Illinois University, DeKalb, IL 60115, USA

^*

Authors to whom correspondence should be addressed.

Information 2025, 16(10), 880; https://doi.org/10.3390/info16100880

Submission received: 9 August 2025 / Revised: 23 September 2025 / Accepted: 8 October 2025 / Published: 10 October 2025

(This article belongs to the Special Issue AI Technology-Enhanced Learning and Teaching)

Download

Browse Figures

Versions Notes

Abstract

Children with Autism Spectrum Disorder (ASD) often experience cognitive difficulties that impact learning. This study explores the use of electroencephalogram data collected with the MUSE 2 headband during task-based cognitive sessions to understand how cognitive states in children with ASD change across three structured tasks: Shape Matching, Shape Sorting, and Number Matching. Following signal preprocessing using Independent Component Analysis (ICA), power across various frequency bands was extracted using the Welch method. These features were used to analyze cognitive states in children with ASD in comparison to typically developing (TD) peers. To capture dynamic changes in attention over time, Morlet wavelet transform was applied, revealing distinct brain signal patterns. Machine learning classifiers were then developed to accurately distinguish between ASD and TD groups using the EEG data. Models included Support Vector Machine, K-Nearest Neighbors, Random Forest, an Ensemble method, and a Neural Network. Among these, the Ensemble method achieved the highest accuracy at 0.90. Feature importance analysis was conducted to identify the most influential EEG features contributing to classification performance. Based on these findings, an ASD map was generated to visually highlight the key EEG regions associated with ASD-related cognitive patterns. These findings highlight the potential of EEG-based models to capture ASD-specific neural and attentional patterns during learning, supporting their application in developing more personalized educational approaches. However, due to the limited sample size and participant heterogeneity, these findings should be considered exploratory. Future studies with larger samples are needed to validate and generalize the results.

Keywords:

autism spectrum disorder; classification; cognitive tasks; EEG; machine learning; neural networks

1. Introduction

Autism Spectrum Disorder (ASD) affects how individuals communicate, process information, and interact with their environment. Many children with ASD face challenges maintaining attention in classroom settings and often struggle with tasks that require multi-step reasoning, interpretation of social cues, or problem-solving [1,2,3]. Even with structured support, educators and caregivers frequently find it difficult to determine the most effective learning environment for each child [2,3,4]. Executive functioning encompasses a range of cognitive abilities, including working memory, the capacity to shift attention in response to environmental changes, the ability to sustain focus on the task at hand, and the capacity to inhibit responses to irrelevant stimuli. While a meta-analysis of research shows the benefits of the direct instruction of math skills to students with ASD [5], there is a need for research on executive functioning to improve math performance.

Cognitive challenges in ASD vary significantly from one student to another, making personalized education both critical and complex. In recent years, electroencephalography (EEG) has emerged as a valuable tool for investigating brain function in individuals with ASD [6,7,8]. EEG captures real-time electrical activity from the brain and is particularly well-suited for children due to its non-invasive and portable nature. Several studies have utilized resting-state EEG to differentiate children with ASD from their typically developing (TD) peers [9,10], recording brain activity while the child remains still and unengaged in a specific task.

In contrast, task-based EEG provides insight into how brain activity shifts during specific types of engagement. For instance, Ghee et al. [4] investigated EEG synchrostates in children viewing emotional facial expressions, achieving over 94% classification accuracy between ASD and TD groups using machine learning. Peck et al. [11] studied infants at high risk for autism by recording EEG responses to different speech sounds, identifying patterns that predicted later ASD diagnoses. Bosl et al. [12] asked children with autism to watch videos eliciting happy, boring, or sad emotions, then used machine learning to infer the emotional responses.

Studies on EEG markers of attention in autism show that children with ASD often display atypical frontal alpha and beta asymmetry, both during rest and task situations, which has been linked to differences in attentional control [13,14]. In task-based measures, autistic children also show a higher theta/beta ratio, pointing to reduced sustained attention compared to typically developing peers [15]. At the same time, research in high-risk infants has identified early asymmetry differences, but these patterns do not fully match those found in diagnosed ASD, emphasizing the need to separate developmental risk indicators from findings in confirmed cases [10].

These studies suggest that the task-based EEG may provide insights into functional brain differences between ASD and TD children during active engagement. While task-evoked EEG may offer neural markers associated with the cognitive processes relevant to learning, its use in personalized educational strategies would require integration with intermediate behavioral and educational outcomes, supported by evidence that links neural activity to observable learning gains.

However, there remains a significant gap in research analyzing dynamic brain activity in children with ASD during real-world classroom tasks. In everyday learning settings, children engage in a continuous stream of visual, motor, and cognitive processes solving problems, responding to stimuli, making decisions, and interpreting instructions. Capturing brain activity during these active learning moments can yield more accurate insights into how learning unfolds in children with ASD, beyond what resting-state studies reveal.

Our study aims to fill this gap by investigating how children with ASD respond to structured learning tasks that closely resemble classroom activities. We recorded EEG data as children completed three cognitive tasks, Shape Matching, Shape Sorting, and Number Matching, each designed to engage specific cognitive domains such as visuospatial reasoning, categorization, and numerical recognition. Using the MUSE 2 (InteraXon Inc., Toronto, ON, Canada) EEG headband and analyzing the data through MNE-Python [16], we will examine patterns of brain engagement across these tasks based on the methods in our prior work [17].

The hypothesis of this study is that EEG signals can provide a reliable index for monitoring behavior in children with ASD, with responses that differ distinctly from typically developing (TD) peers, allowing for the development of highly accurate machine learning models. Specifically, children with ASD are expected to exhibit dominant delta waves, pronounced frontal asymmetry, and unusually high Theta/Alpha Ratio (TAR) and Theta/Beta Ratio (TBR). These features are predicted to become more prominent as the learning tasks increase in difficulty. Additionally, inconsistent attention is anticipated to appear in the time–frequency dynamics of the EEG signals.

Studies highlight the potential of low-cost consumer EEG devices like MUSE 2 for neuroscience research. A study showed that MUSE 2 could reliably capture broad brainwave activity (alpha, beta, theta) and frontal asymmetry patterns despite having only four electrodes [18], while another demonstrated that the MUSE 2 successfully detected event-related potentials such as the N400, with timing and amplitude comparable to research-grade systems [19].

To interpret the EEG signals, we employed both traditional machine learning models and deep neural networks. This dual approach enabled us to identify which tasks elicited greater cognitive load and which brain regions were most involved. Unlike diagnostic studies, our goal is to generate practical insights that can inform personalized educational strategies tailored to each child’s cognitive strengths and difficulties.

Understanding how the ASD brain responds to real learning activities can empower educators, therapists, and curriculum designers to create inclusive classroom environments that support effective learning, not just clinical assessment. We expect that our findings contribute to bridging the gap between neuroscience research and everyday educational practice.

Recent studies show how educational technology can be applied in real learning settings. A study used EEG-based attention classification to track students’ focus and adapt learning materials in real time, improving the learning experience [20]. Another study applied machine learning to classroom EEG data to identify cognitive patterns, helping to tailor teaching strategies to students’ needs [21]. These examples support the applied aspect of our work and show practical ways to enhance learning with technology.

2. Methodology

2.1. Participants and Data Collection

EEG data were collected from 10 children in total, including 3 diagnosed with ASD and 7 typically developing (TD) children. For balanced model training, only EEG data from 3 ASD and 3 TD participants were used in the classification tasks. The selected TD data were confirmed to have statistical properties consistent with the full TD dataset, ensuring they were representative of TD participants’ brain signals. The ASD group consisted of three male participants between 4 and 7 years of age, while the TD group included one male (4.5 years) and two females (11 and 13 years). Given the limited sample size, heterogeneity remained, and the findings should be considered exploratory. All ASD participants demonstrated minimal verbal communication and significant educational delays.

Each child completed three STEM-related cognitive tasks: Shape Matching (SM), Shape Sorting (SS), and Number Matching (NM) as shown in Figure 1. Matching and sorting shapes are an invaluable tool for fostering STEM learning in children. It helps them understand fundamental concepts such as shapes, sizes, and spatial relationships. Through this process, they learn to analyze problems and adapt their approaches, which are essential skills in STEM fields. Number matching tasks are integral to assessing young children’s foundational numerical comprehension, crucial for fostering their STEM development. In the Shape Matching (SM) task, they were asked to drag each object to the corresponding object. The Shape Sorting (SS) task required them to look at a pattern made of colored Shapes and recreate it into the dotted box by choosing matching Shapes. For the Number Matching (NM) task, the children had to find and select numbers that matched each other from a group of mixed numbers. Each task consisted of five distinct images depicting different problems, presented to each participant in a random sequence to minimize residual order effects.

As shown in Figure 2, EEG data were recorded using the MUSE 2 headband, a portable device equipped with four electrodes positioned over frontal and temporal regions (AF7, AF8, TP9, TP10). These electrodes provide scalp-level measurements of band-specific EEG activity and allow for the assessment of frontal and temporal activity differences between groups. However, they cannot resolve precise cortical sources or detailed topographic patterns; thus, inferences about specific brain regions should be made with caution. The MUSE 2 system has been validated as a low-cost, portable tool suitable for ERP and band-power research, but it remains limited to scalp-level recordings and does not support high-resolution source localization [22]. Data were sampled at 256 Hz, stored in CSV format, and included raw EEG values from all channels.

2.2. Data Preprocessing

Raw EEG signals were preprocessed to improve quality and ensure consistency across participants. First, a high-pass filter (0.4 Hz cutoff) was applied to remove low-frequency drifts, followed by notch filtering at 50/60 Hz and PSD-based checks to reduce line noise. Independent Component Analysis (ICA) was then performed on the four MUSE channels (AF7, AF8, TP9, TP10) to isolate and remove artifacts. ICA, a widely used EEG preprocessing technique, operates under the assumption that recorded signals contain both neural activity and non-neural noise. It decomposes EEG recordings into statistically independent components, which are then classified as either brain-related or artifact-related. Artifact-related components were removed, and the remaining components were recombined to reconstruct a cleaner EEG signal for analysis.

This approach is particularly effective with the MUSE system, as frontal channels capture most eye-blink activity and temporal channels detect muscle artifacts. Components were rejected if they exhibited stereotypical blink patterns, high-frequency EMG activity (>30 Hz, >3 SD), or abnormal signal segments (±100 µV excursions or flat lines > 1 s). To maintain data reliability, only participants with less than 20% rejected data were included in the final analysis.

This process is illustrated in Figure 3, where Figure 3a presents the raw EEG data collected from participants, and Figure 3b shows the EEG data after ICA-based preprocessing.

The Fourier Transform is a widely used technique for converting time-series data, such as EEG signals, into the frequency domain. However, it has notable limitations: it assumes stationarity and lacks the ability to capture how frequency content evolves over time. Since EEG signals are inherently non-stationary, applying standard Fourier analysis can obscure meaningful time varying patterns in brain activity.

To address these limitations, this study will employ two time-resolved frequency decomposition methods: the Welch method and the Morlet wavelet transform. The Welch method with 4 s windows will be first applied to segment the EEG signals into shorter, quasi-stationary windows. A Hanning window will be used to taper each segment, with 50% overlap, reducing edge artifacts and minimizing spectral leakage. The Fast Fourier Transform (FFT), which is a computationally efficient algorithm for calculating the Discrete Fourier Transform (DFT) will be then applied to each windowed segment. The resulting spectra will be averaged to produce a more stable and representative estimate of the signal’s power distribution [23,24]. The mathematical formulation of the DFT is provided in Equation (1).

X_{f} = \sum_{k = 1}^{n} x_{k} e^{- \frac{i 2 π f (k - 1)}{n}}

(1)

where n refers to the number of data points in vector

x

, and

X_{f}

is the Fourier coefficient of time series variable

x

at frequency f.

The absolute power values defined as the squared magnitudes of the EEG signal will be computed for standard frequency bands: Delta (0.4–4 Hz), Theta (4–8 Hz), Alpha (8–12 Hz), Beta (12–30 Hz), and Gamma (30 Hz to the maximum available frequency). Each band is associated with distinct cognitive and physiological states. Delta waves are primarily linked to deep sleep, but may occasionally appear during wakefulness. Theta waves emerge as consciousness drifts toward drowsiness and are often associated with creativity, emotional processing, and meditative states. Alpha waves are indicative of relaxed wakefulness and are commonly observed when an individual is calm but not focused. Beta waves are associated with active thinking, attention, and cognitive engagement. Gamma waves, sometimes referred to as fast beta waves, are linked to higher-order cognitive functions and are considered indicators of event-related synchronization in the brain [6,9].

These absolute power values provide critical insights into cognitive states and will be used as primary input features for developing machine learning classification models in this study. In addition to absolute power, relative power ratios such as Alpha/Beta, Theta/Alpha, and Theta/Beta will also be computed. These ratios offer deeper insights into cognitive processes.

The Morlet Wavelet Transform will be employed as another method to overcome the limitations of the traditional Fourier Transform for time-frequency analysis. Unlike the Fourier Transform, which uses infinite-duration sine wave kernels and therefore lacks temporal resolution, the Morlet wavelet uses a sine wave modulated by a Gaussian window, allowing for precise localization of frequency content in time. This makes it especially suitable for analyzing non-stationary EEG signals. In this study, complex Morlet wavelets will be used to extract both power and phase information from the EEG data.

c m w = A e^{- t^{2} / 2 s^{2}} e^{i 2 π f t}

(2)

A = \frac{1}{{(s \sqrt{π})}^{1 / 2}}

(3)

In Equation (2), the first term represents a Gaussian function, while the second term corresponds to a complex sine wave. Here, s denotes the standard deviation. By convolving the Morlet wavelet with the measured EEG signals, one can extract dynamic power and phase information over time. In this study, dynamic decibel-normalized power will be utilized for the analysis. For a more detailed explanation, please refer to the relevant literature [6,23,24].

2.3. Machine Learning Models

2.3.1. Traditional Machine Learning Models

Support Vector Machine (SVM)

Support Vector Machine (SVM) is a supervised learning algorithm designed to find the maximum-margin hyperplane that best separates two classes in a binary classification task. The margins are defined by the closest data points from each class, known as support vectors, and the optimal separating boundary lies midway between them, as expressed in Equation (4) [25,26]:

g (x) = {\vec{W}}_{0} x_{1} + {\vec{W}}_{1} x_{2} + b

(4)

where

x_{1}

and

x_{2}

are the inputs,

{\vec{W}}_{0}

and

{\vec{W}}_{1}

are the weight vectors, and

b

is the bias. If

g (x)

greater than 1, the point is classified as Class 1, and if

g (x)

is less than −1, then the point is classified as Class 2.

During training, the objective is to minimize

‖w‖

—the norm of the weight vector, represented by the normalized weight vectors

{\vec{W}}_{0}

and

{\vec{W}}_{1}

—to maximize the margin between the classes. To improve prediction performance, several hyperparameters are tuned, including the kernel type, the regularization parameter C, and the kernel coefficient gamma.

Different kernel functions such as linear, radial basis function (RBF), and polynomial are used to handle nonlinear data distributions through the kernel trick. This technique projects the input data into a higher-dimensional space where a linear separation is more feasible. The parameter C controls the trade-off between maximizing the margin and minimizing classification errors. A high C value prioritizes correctly classifying all training points, potentially leading to a narrower margin and overfitting. Conversely, a low C value allows for a wider margin, which can improve generalization, but may reduce training accuracy. Gamma defines the influence range of a single training instance. A low gamma means each point has a broad area of influence, promoting smoother decision boundaries. In contrast, a high gamma concentrates the influence around individual points, making the model more sensitive to local variations and increasing the risk of overfitting.

K-Nearest Neighbors (KNN)

KNN is a non-parametric classifier that assigns a class label based on the majority vote of the K nearest neighbors in the feature space, with distance commonly measured using Euclidean distance as described by Hastie et al. [25,26,27], as shown in Equation (5).

d (x, x^{'}) = \sqrt{\sum_{i = 1}^{n} {(x_{i} - x_{i}^{'})}^{2}}

(5)

where d(x, x′) is the Euclidean distance between two points x and x′, and n is the total number of dimensions (features) used in the comparison. A small value of K can make the model highly sensitive to noise and outliers, often leading to overfitting. Conversely, a larger K value yields more stable predictions, but may reduce classification accuracy. Therefore, selecting an optimal K is essential to balance bias and variance and to ensure robust model performance.

Random Forest (RF)

Random Forest is an ensemble learning method built upon multiple decision trees, which serve as its fundamental components. It is typically trained using the bagging technique, where each tree is trained on a random subset of the training data sampled with replacement. During tree construction, the algorithm selects the feature and threshold that best split the data at each node. The quality of a split is evaluated using a purity metric, such as Gini impurity (

G_{i}

) or Entropy (

H_{i}

) which measure the homogeneity of the resulting subsets:

G_{i} = 1 - \sum_{k = 1}^{n} p_{i, k}^{2}

(6)

H_{i} = - \sum_{\begin{matrix} k = 1 \\ p_{i, k} \neq 0 \end{matrix}}^{n} p_{i, k} \log_{2} (p_{i, k})

(7)

where

G_{i}

is the Gini impurity of the ith node,

p_{i, k}

is the ratio of class k instances among the training instances in the ith node.

The decision tree is trained using the Classification and Regression Tree (CART) algorithm. This algorithm splits the training data into two subsets based on a selected feature k and a corresponding threshold

t_{k}

. It systematically searches for the pair (

k

,

t_{k}

) that yields the purest child nodes, measured by a weighted impurity score. The objective is to minimize the cost function defined in Equation (8).

J (k, t_{k}) = \frac{m_{l e f t}}{m} G_{l e f t} + \frac{m_{r i g h t}}{m} G_{r i g h t}

(8)

where

G_{l e f t / r i g h t}

measures the impurity of the left/right subset,

m_{l e f t / r i g h t}

is the number of instances in the left/right subset.

Once the CART algorithm successfully splits the training set into two subsets, it recursively applies the same splitting strategy to each subset, continuing this process until the maximum tree depth is reached [25,27].

Ensemble Model

Ensemble models improve prediction accuracy by combining the outputs of multiple individual classifiers. In this study, a soft voting approach was employed, aggregating the output probabilities from SVM, KNN, and Random Forest classifiers to enhance classification performance, as illustrated in Figure 4. Soft voting typically outperforms hard voting because it assigns greater weight to predictions with higher confidence levels [25,27].

Hyperparameter Optimization

The prediction accuracy of each model depends heavily on its hyperparameters parameters that are set prior to training and are not learned from the data itself. Examples include the regularization parameter C, kernel type, and gamma for SVM; the number of neighbors K for KNN; and the number of trees and maximum depth for Random Forest. A detailed summary of these hyperparameters and their value ranges is provided in Table 1.

In this study, the GridSearchCV method from the Scikit-learn library was used to identify the optimal hyperparameter settings. This method systematically evaluates all possible combinations within a predefined grid by applying cross-validation to assess each configuration’s performance. For instance, Figure 5 illustrates the relationship between different K values in KNN and the resulting classification error. The error initially decreased, reaching its minimum at K = 3, after which it increased as K grew larger.

Cross-Validation

A model trained on a dataset may yield poor predictions if the data are unevenly distributed. To address this issue, the cross-validation method will be employed, particularly during hyperparameter optimization. Instead of simply dividing the data into fixed training and testing sets, the dataset will be split into k folds. The model will be then trained and evaluated k times, each time using a different fold as the test set and the remaining folds as the training set, as illustrated in Figure 6. The final performance score will be obtained by averaging the results from all k runs. This averaged score provides a more reliable estimate of the model’s generalization ability.

Ten-fold cross-validation (CV = 10) was performed at the participant level, with all EEG epochs from a given child kept together. Accordingly, each child’s data appeared exclusively in either the training set or the test set, never in both. This design prevents data leakage and avoids artificially inflated accuracy by ensuring the model does not encounter the same participant during both training and testing. Hyperparameters were optimized using nested cross-validation across participants, maintaining independence between model selection and evaluation. As a result, the reported performance reflects the model’s ability to generalize to previously unseen children.

2.3.2. Neural Network Models

Neural networks are a class of machine learning models inspired by the functioning of the human brain. They consist of layers of interconnected nodes, called neurons, that process data sequentially, learning to recognize patterns at each step. Neural networks are particularly effective for analyzing complex, high-dimensional data such as EEG signals, where simpler models often struggle to identify meaningful features.

The simplest neural network architecture is the perceptron, which comprises a single neuron that mimics a biological neuron’s behavior. It calculates a weighted sum of its inputs to represent the overall input strength and applies an activation function typically a step function to decide whether to produce an output. However, to capture more complex patterns, neural networks must include multiple neurons organized into layers.

A multilayer perceptron (MLP), as illustrated in Figure 7, consists of an input layer (containing the feature vector), one or more hidden layers where feature extraction occurs, and an output layer that generates predictions. Each neuron in one layer connects to neurons in the next layer via weighted connections, with weights representing the importance of each connection on the final prediction. The output, denoted

\hat{y}

(Equation (9)), is compared to the true value to calculate an error E (Equation (10)). To minimize this error, gradients are computed to measure how each weight influences the error, and weights are updated accordingly (Equation (11)). Depending on the network design, the output can be a continuous value (for regression tasks) or a probability distribution over classes (for classification tasks) [25,27].

\hat{y} = σ (\sum x_{i} \cdot w_{i} + b)

(9)

E = y - \hat{y}

(10)

w_{n e x t - s t e p} = w_{c u r r e n t} - α \frac{d E}{d w_{i}}

(11)

where

\hat{y}

is the model prediction,

σ

is an activation function,

x_{i}

is input data set,

w_{i}

is weight set,

b

is a bias,

E

is an error,

y

is a ground true value, and

α

is a learning rate.

Key hyperparameters in neural networks include the number of hidden layers, the number of neurons per layer, the activation functions, and the learning rate, among others. In this study, the final model architecture was determined by experimenting with various hyperparameter combinations and selecting the configuration that achieved the highest validation accuracy. The chosen setup consists of two hidden layers with 16 neurons each, a learning rate of 0.001, and the Adam optimizer (Adaptive Moment Estimation) for efficient gradient handling (Equation (11)). The ReLU activation function will be used in the hidden layers, while the output layer will employ a single neuron with a sigmoid activation function. To prevent overfitting, early stopping will be implemented based on validation performance.

2.4. Feature Importance Computation

To determine the contribution of each EEG feature, the permutation importance method will be utilized to the trained machine learning models. This technique evaluates feature influence by measuring how much the model’s accuracy drops when the values of that feature are randomly shuffled [25]. By disrupting its relationship with the target variable, the method captures how essential each feature was for making correct predictions. For consistency, the feature importance scores from each model will be averaged, allowing us to highlight EEG features that are consistently impactful across models. These scores will be later visualized to better understand the diagnostic relevance of each frequency band and sensor location.

3. Results & Discussion

3.1. EEG Feature Observations

3.1.1. Increased Delta Activity

Signal power was calculated across five frequency bands from the EEG data collected while participants performed three cognitive tasks: Shape Matching, Shape Sorting, and Number Matching. Fractional power values were derived by normalizing each band’s power by the total power, and these results are presented in Figure 8.

Our analysis revealed that children with autism exhibited elevated power in the delta band across all tasks. Delta waves are typically more prominent in children with slower brain development, such as those with autism. This finding is consistent with prior research; for example, Wang et al. [9] analyzed resting-state EEG data and reported increased delta activity in children with autism compared to typically developing peers. Elevated delta activity is generally associated with delayed neural maturation, aligning with our observations. Similarly, Dickinson et al. [28] found stronger delta waves in autistic children, particularly during cognitively demanding tasks. So, our results line up with theirs and support the idea that higher delta activity might be a marker of delayed or different brain development in autism.

3.1.2. Frontal Alpha/Beta Ratio Asymmetry

Alpha waves are associated with a relaxed state of awareness without focused attention or concentration, whereas beta waves correspond to active thinking and focused attention. Consequently, a lower alpha-to-beta ratio generally indicates increased attention and cognitive engagement, while a higher ratio suggests relaxation or drowsiness. In this study, the alpha-to-beta ratio was compared between the left (AF7) and right (AF8) frontal regions.

As shown in Figure 9a, during the Shape Matching task, the control group exhibited a balanced ratio at AF7 and AF8, with values of 0.64 and 0.68, respectively. In contrast, the ASD group displayed a pronounced asymmetry, with ratios of 0.56 on the left and 1.3 on the right. A similar pattern was observed during the Number Matching task (Figure 9c), where the control group’s ratios were 0.22 and 0.32, compared to 0.6 and 1.27 in the ASD group. The most severe asymmetry occurred in the Shape Sorting task (Figure 9b), with the ASD group showing a ratio of 1.2 on the left versus 3.2 on the right, while the control group remained relatively balanced at 0.4 and 0.7. This suggests that frontal asymmetry becomes more pronounced in individuals with ASD as task difficulty increases.

The topographic map in Figure 10 further illustrates this effect, revealing a localized alpha-to-beta ratio in the ASD group and a more uniform distribution in controls. This asymmetry points to reduced activation of the right frontal region, which is involved in attention and executive functions.

Such left-right imbalances in frontal brain activity known as frontal asymmetry have been reported in previous research. Van der Molen et al. [29] found that adolescents with autism exhibited stronger frontal alpha asymmetry, which was correlated with social interaction difficulties. Similarly, Wang et al. [9] observed comparable patterns in infants at high risk for autism, even before formal diagnosis. Our findings align with these studies, suggesting that frontal asymmetry may serve as an early neural marker of atypical information processing in autistic individuals during cognitive and learning tasks.

3.1.3. Theta/Alpha (TAR) and Theta/Beta (TBR) Ratios

These ratios represent the relative power of brain activity across different frequency bands. Theta, Alpha, and Beta offer valuable insights into cognitive processes and potential neurological abnormalities. Theta activity is typically linked to drowsiness and relaxation, while Alpha waves are associated with a calm but alert state. A higher Theta/Alpha Ratio (TAR) may therefore indicate increased relaxation or drowsiness. In contrast, Beta waves correspond to active thinking and focused attention; thus, a higher Theta/Beta Ratio (TBR) may suggest impairments in attention and cognitive control.

Figure 10a shows these ratios during the Shape Matching task. In the left frontal region (AF7), TAR and TBR values were similar between ASD and control groups. And, in the right frontal region (AF8), the control group exhibited slightly higher TAR and TBR—by 0.5 to 1.0—compared to the ASD group, indicating minimal differences in brain activity during this task.

In contrast, clear group differences emerged in the Shape Sorting and Number Matching tasks (Figure 11b,c). During Shape Sorting, the ASD group exhibited substantially higher TAR and TBR—by 2.4 to 4.0 in AF7 and 1.2 to 9.0 in AF8—relative to controls. Similarly, during Number Matching, TAR and TBR were elevated in the ASD group by 2.0 to 2.2 in AF7 and 2.1 to 4.7 in AF8 relative to controls.

These elevated ratios may reflect challenges in maintaining attention or a propensity for mind-wandering both commonly observed in individuals with autism. Our findings are consistent with previous research; for instance, Ghanbari et al. [28] reported significantly higher theta/beta and theta/alpha ratios in children with low-functioning autism during arithmetic tasks, which were interpreted as signs of increased cognitive effort and attentional difficulties. Our results further support the interpretation that elevated ratios in children with ASD may indicate cognitive overload during more demanding tasks like Shape Sorting.

3.2. Dynamic Attention Analysis

The Morlet wavelet transform was applied to analyze the dynamic changes in the cognitive states of participants during task engagement, using decibel-normalized power.

P_{d B} (f, t) = \log_{10} (\frac{P (f, t)}{P_{b a s e l i n e} (f)})

(12)

where

P (f, t)

is the power at frequency f and time t, computed by Morlet wavelet transform, and

P_{b a s e l i n e} (f)

is the average power at frequency f during the initial 10 s. This baseline period was visually inspected, and variance across channels was computed to ensure stability. Instruction periods were excluded, so only idle or resting activity was included.

Figure 12 illustrates how the normalized power varies dynamically across different frequencies and time intervals as participants perform the three tasks. The color scale indicates relative attention compared to the baseline period (the first 10 s): warmer colors (towards red) indicate increased attention, while cooler colors (towards blue) indicate decreased attention.

In the ASD group, multiple vertical red bands appear in all figures, indicating fluctuating attention. In the Shape Matching task (Figure 12a), notable peaks in attention occur around 22 and 27 s. For the Shape Sorting task (Figure 12b), attention is concentrated between 10 and 20 s. In the Number Matching task (Figure 12c), multiple attention bands are observed throughout the test period. These patterns suggest that attention in the ASD group fluctuates dynamically rather than remaining sustained.

In contrast, the TD group displays more uniform color distributions across all tasks (Figure 12d–f), reflecting more consistent attention over time. Moreover, the dynamic attention patterns in the TD group remain relatively stable across the three tasks, while the ASD group’s patterns vary depending on the task. This indicates that task type may significantly influence attention in individuals with ASD.

These findings offer valuable insights for designing task-specific instructional materials or curricula aimed at improving attention in children with ASD. However, further research is needed to explore this potential.

3.3. ASD and TD Classification Using Machine Learning Models

Signal powers of five frequency bands were employed as features to develop classifiers distinguishing between ASD and typically developing (TD) participants through various machine learning algorithms. The preprocessed data were split into 80% for training and 20% for testing, with stratification applied to ensure the proportion of ASD and TD cases remained consistent in both sets. Optimized hyperparameters were used during model training: SVM used C = 1, gamma = 1, and kernel = RBF; KNN used K = 3; and Random Forest used 200 estimators, a max depth of 5, minimum samples split of 2, minimum samples per leaf of 1, ‘sqrt’ as the maximum features, and ‘gini’ as the split criterion. Multiple evaluation metrics were employed to assess the prediction performance of the algorithms from various aspects.

Accuracy represents the proportion of correctly predicted observations both positive and negative out of the total number of observations. However, in datasets where one class significantly outweighs the other, a model can achieve high accuracy simply by predicting the majority class, while failing to detect the minority class entirely. To address this limitation, additional metrics such as precision, recall, and the F1 score were used. Precision, the ratio of correctly predicted positive cases to all predicted positives, is especially important when false positives carry a high cost. Recall (also called sensitivity or the true positive rate) measures the proportion of actual positive cases that were correctly identified and is crucial when the cost of false negatives is high. The F1 score, which is the harmonic mean of precision and recall, offers a balanced evaluation when there is a trade-off between the two [25,27].

A c c u r a c y = \frac{T P + T N}{T P + T N + F P + F N}

(13)

P r e c i s i o n = \frac{T P}{T P + F P}

(14)

R e c a l l = \frac{T P}{T P + F N}

(15)

F 1 = 2 \frac{P r e c i s i o n \cdot R e c a l l}{P r e c i s i o n + R e c a l l}

(16)

where TP represents true positive, TN is true negative, FP is false positive, and FN is false negative.

Table 2 compares the classification performance of various models—SVM, KNN, Random Forest, Ensemble, and a neural network where ASD is treated as the positive class and TD as the negative. The SVM model achieved high recall (0.9), indicating it most correctly identified ASD case with a little false negatives. However, its precision was slightly lower at 0.84, meaning it produced some false positives by misclassifying TD as ASD. The F1 score, the harmonic mean of precision and recall, was 0.86, with an overall accuracy of 0.84.

In contrast, KNN showed poor recall (0.70), indicating many ASD cases were misclassified as TD. However, its precision was almost perfect (0.97), meaning it unlikely misclassified TD as ASD. The resulting F1 score was 0.78, and accuracy was 0.84. The Random Forest model also had high precision (0.95), but a recall of 0.70, suggesting that it missed some ASD cases. It achieved an F1 score of 0.86 and an accuracy of 0.88. Among all models, the Ensemble method yielded the best performance, with an accuracy of 0.90, an F1 score of 0.87, precision of 0.97, and recall of 0.82. Finally, the neural network showed a little lower performance with 0.79 for accuracy, 0.90 for precision, 0.67 for recall, and 0.72 for F1-score.

95% confidence interval of prediction accuracy across 10-fold cross-validation was calculated for each machine learning model: 0.8 ± 0.12 for SVM, 0.84 ± 0.09 for KNN, 0.88 ± 0.06 for Random Forest, 0.90 ± 0.07 for Ensemble, and 0.79 ± 0.12 for Neural Network.

3.4. Feature Importance Analysis and ASD Map

Various features extracted from the measured EEG data were used to develop classification models based on machine learning algorithms. To better understand which features contributed most to model performance, the permutation importance method was applied to both Random Forest and Ensemble models, and the resulting importance values were averaged.

Figure 13 presents the importance scores for ten EEG features. Among these, AF8-Beta and AF7-Beta emerged as the most positively significant features. This means that when these features were randomly shuffled, the model’s performance declined, indicating their importance in improving prediction accuracy. In contrast, Theta/Alpha_AF7 and Alpha/Beta_AF7 exhibited the most negative importance values, suggesting that the model performed better when these features were shuffled. This implies that these features may have introduced noise or misleading patterns, ultimately hindering the model’s predictive performance.

The two most significant features, AF8-Beta and AF7-Beta, were used to construct a predictive visualization referred to as the ASD map, shown in Figure 14. This map reveals two distinct zones based on the values of these features. A light orange zone, defined by AF8-Beta values ranging from 0 to approximately 5000 and AF7-Beta values from 0 to around 1150, corresponds to the ASD group. In contrast, a light blue zone, characterized by higher values of both features, represents the TD group. When EEG signal measurements fall within the orange region, the subject is likely to be classified as having ASD. This map provides a clear and interpretable guideline for ASD prediction and visually distinguishes between the two groups.

The plots based on these two features showed that group differentiation became more apparent, suggesting that these specific EEG channels and frequency bands serve as effective markers for distinguishing ASD from TD individuals. However, due to the limited sample size and participant heterogeneity, these findings should be considered exploratory. Future studies with larger samples are needed to validate and generalize the results. This approach offers a generalizable framework that can be applied to other tasks involving different types of brain signals. Once such a map is generated for a particular condition or task, it can offer a powerful visual and diagnostic tool for classification and assessment.

Potential confounding factors such as age differences, muscle activity, engagement variability, medication, and sleep could have influenced the results. In our study, preprocessing with ICA and artifact removal mitigated the impact of movement-related noise; however, no separate analyses were conducted to account for age, medication, or sleep effects, and some confounds could not be fully controlled.

4. Conclusions

This study investigated task-based EEG patterns in children during three structured cognitive tasks such as Shape Matching, Shape Sorting, and Number Matching to explore neural indicators of learning engagement. The cognitive states of children with Autism Spectrum Disorder (ASD) were analyzed using power across five frequency bands, power ratios, and dynamic attentional behavior. Machine learning classifiers were developed to accurately distinguish ASD from typically developing (TD) based on EEG data. Key features used in the classification models were identified, and an ASD map was constructed to support interpretation and visualization. The key findings of this study are summarized below.

Children with ASD exhibited elevated delta activity across all tasks, suggesting delayed neural maturation and atypical developmental trajectories. In addition, they showed frontal alpha-to-beta asymmetry and significantly higher theta/beta ratios, particularly during more complex tasks, indicating difficulties in attention regulation and increased cognitive load.

Dynamic changes in cognitive states were also examined. In the ASD group, attention levels fluctuated over time rather than remaining sustained, whereas the TD group demonstrated more consistent and stable attention throughout the tasks. Furthermore, attention patterns in the TD group remained relatively consistent across the three tasks, while those in the ASD group varied depending on the task. These results suggest that task type significantly affects attention in individuals with ASD, offering important insights for designing task-specific instructional materials or curricula aimed at improving attention in this population.

To classify ASD versus TD using EEG data, models were built using traditional machine learning algorithms (Support Vector Machine, K-Nearest Neighbors, Random Forest, and an Ensemble method), as well as a neural network. The Ensemble model achieved the highest accuracy (0.90), while the neural network provided slightly lower performance, with 0.79 for accuracy and 0.72 for F1-score.

Feature importance analysis identified beta activity at electrodes AF7 and AF8 as the most discriminative EEG markers. These features were used to construct an ASD decision map, offering a clear and interpretable framework for ASD prediction and a visual means to distinguish between the two groups.

This study found that EEG signals can serve as useful indicators for monitoring the brain activity of children with ASD, whose responses differed from their TD peers, thereby supporting the study’s hypothesis. Characteristics such as dominant delta waves, pronounced frontal asymmetry, elevated theta/alpha and theta/beta ratios, and inconsistent attention patterns became more pronounced as learning tasks increased in difficulty for children with ASD. These features may hold potential as markers to guide more personalized and effective instructional approaches for individual learners with ASD; however, further research is needed before practical applications can be established.

Although the present study was carefully designed and conducted, several limitations are noted. Using only 4 EEG channels for ICA may reduce the accuracy of artifact removal. Notch filtering and EMG controls were not applied, so some residual noise may remain. The wide age range and inclusion of minimally verbal children could contribute to variability in the EEG features. Importantly, our ML pipeline was designed to avoid subject leakage by keeping each participant’s data fully within a single fold. Despite these factors, the ensemble model achieved high accuracy, and the key EEG features were consistently identified across folds and models, demonstrating robustness of the results.

Future directions for this research involve conducting studies with larger, age-matched cohorts to enhance statistical power and reduce developmental variability. Preregistration will be adopted to improve transparency and reproducibility, while participant-level nested cross-validation will be employed to strengthen the rigor of model evaluation. Longitudinal replication in classroom settings will also be pursued to evaluate ecological validity and scalability. In the present study, learning ability and attention were assessed solely through EEG data; however, incorporating additional biometric measures—such as eye movements, heart rate variability, and skin conductance—would enable a more comprehensive characterization of participants’ physiological and neurological states in children with ASD. Finally, within-subject manipulations of task difficulty will be examined to further investigate how neural and behavioral responses adapt across varying levels of cognitive challenge.

Author Contributions

Conceptualization, J.H., J.C. and K.T.C.; methodology, H.P., R.A., N.Y., J.H. and K.T.C.; software, H.P., J.H. and K.T.C.; validation, H.P. and K.T.C.; formal analysis, H.P. and K.T.C.; investigation, H.P., R.A. and N.Y.; resources, K.W., J.H., J.C. and K.T.C.; data curation, H.P., R.A., N.Y., J.H. and K.T.C.; writing—original draft preparation, H.P.; writing—review and editing, K.W., J.H., J.C. and K.T.C.; visualization, H.P., J.H. and K.T.C.; supervision, K.W., J.H., J.C. and K.T.C.; project administration, J.H., J.C. and K.T.C.; funding acquisition, J.H., J.C. and K.T.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by T-rise grant at Northern Illinois University.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board of NORTHERN ILLINOIS UNIVERSITY (HS25-0128; 27 November 2024) for studies involving humans.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The code for this machine learning framework is publicly available at https://github.com/Harshith8333/EEG-Task-Classification-ASD (accessed on 7 October 2025). Additionally, some or all of the data presented in this study are available from the corresponding author upon request.

Acknowledgments

Authors, H. Penmetsa, R. Abbasi, N. Yellamilli, K. Winkelman, J. Chan, J. Hwang, and K. Cho, acknowledge the support from Westside Children’s Therapy to conduct the tests.

Conflicts of Interest

Author Kimberly Winkelman was employed by the company Westside Children’s Therapy. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Gabard-Durnam, L.J.; Wilkinson, C.; Kapur, K.; Tager-Flusberg, H.; Levin, A.R.; Nelson, C.A. Longitudinal EEG power in the first postnatal year differentiates autism outcomes. Nat. Commun. 2019, 10, 4188. [Google Scholar] [CrossRef]
Da, F.; Ma, Y.; Ma, M.; Mao, J.; Weng, Z.; Yang, C.; Wang, T. Effects of STEM learning on students with autism spectrum disorder and students with intellectual disability: A systematic review and meta-analysis. Humanit. Soc. Sci. Commun. 2025, 12, 1009. [Google Scholar] [CrossRef]
Matthews, N.L.; Honda, H.; Mitchell, M.M.; Johns, A.; Kiefer, S.L.; Mann, M.; Schimmel, K.; Boglio, A.; Hallur, S.; Koke, J.; et al. Building capacity for inclusive informal STEM learning opportunities for autistic learners. Int. J. STEM Educ. 2024, 11, 53. [Google Scholar] [CrossRef]
Ghee Hou, S.; Ahmad, M.R.b. Emotion Classification System for ASD Group by Using Wireless EEG Monitoring Device. Elektr. J. Electr. Eng. 2024, 23, 1–9. [Google Scholar] [CrossRef]
May, T.; Rinehart, N.J.; Wilding, J.; Cornish, K. Attention and basic literacy and numeracy in children with Autism Spectrum Disorder: A one-year follow-up study. Res. Autism Spectr. Disord. 2015, 9, 193–201. [Google Scholar] [CrossRef]
Sanei, S.; Chambers, J.A. EEG Signal Processing and Machine Learning, 2nd ed.; Wiley: Hoboken, NJ, USA, 2022; ISBN 978-1-119-38694-0. [Google Scholar]
Hankus, M.; Ochman-Pasierbek, P.; Brzozowska, M.; Striano, P.; Paprocka, J. Electroencephalography in Autism Spectrum Disorder. J. Clin. Med. 2025, 14, 1882. [Google Scholar] [CrossRef] [PubMed]
Ruffini, R.; Brinciotti, M.; Giovannone, F.; Pisani, F.; Tofani, M.; Sogos, C. Correlations between EEG abnormalities and clinical phenotypes in a population of children with autism spectrum disorder. Res. Autism 2025, 123, 202536. [Google Scholar] [CrossRef]
Wang, J.; Barstein, J.; Ethridge, L.E.; Mosconi, M.W.; Takarae, Y.; Sweeney, J.A. Resting state EEG abnormalities in autism spectrum disorders. J. Neurodev. Disord. 2013, 5, 24. [Google Scholar] [CrossRef]
Edmunds, S.R.; Fogler, J.; Braverman, Y.; Gilbert, R.; Faja, S. Resting frontal alpha asymmetry as a predictor of executive and affective functioning in children with neurodevelopmental differences. Front. Psychol. 2023, 13, 1065598. [Google Scholar] [CrossRef]
Peck, F.C.; Gabard-Durnam, L.J.; Wilkinson, C.L.; Bosl, W.; Tager-Flusberg, H.; Nelson, C.A. Prediction of autism spectrum disorder diagnosis using nonlinear measures of language-related EEG at 6 and 12 months. J. Neurodev. Disord. 2021, 13, 57. [Google Scholar] [CrossRef]
Bosl, W.; Tierney, A.; Tager-Flusberg, H.; Nelson, C. EEG complexity as a biomarker for autism spectrum disorder risk. BMC Med. 2011, 9, 18. [Google Scholar] [CrossRef]
Wang, T.-S.; Wang, S.-S.; Wang, C.-L.; Wong, S.-B. Theta/beta ratio in EEG correlated with attentional capacity assessed by Conners Continuous Performance Test in children with ADHD. Front. Psychiatry 2024, 14, 1305397. [Google Scholar] [CrossRef]
Neuhaus, E.; Santhosh, M.; Kresse, A.; Aylward, E.; Bernier, R.; Bookheimer, S.; Jeste, S.; Jack, A.; McPartland, J.C.; Naples, A.; et al. Frontal EEG alpha asymmetry in youth with autism: Sex differences and social–emotional correlates. Autism Res. 2023, 16, 2364–2377. [Google Scholar] [CrossRef] [PubMed]
Dastgheib, S.S.; Kaufmann, J.M.; Kowallik, A.E.; Schweinberger, S.R. Attention to Social and Non-Social Stimuli in a Continuous Performance Test in Autistic and Typically Developed Participants: An ERP Study. J. Autism Dev. Disord. 2025; ahead of print. [Google Scholar] [CrossRef]
MNE—MNE 1.10.1 Documentation. Available online: https://mne.tools/stable/index.html (accessed on 20 September 2025).
Cho, R.; Zaman, M.; Cho, K.T.; Hwang, J. Investigating brain activity patterns during learning tasks through EEG and machine learning analysis. Int. J. Inf. Technol. 2024, 16, 2737–2744. [Google Scholar] [CrossRef]
Krigolson, O.E.; Hammerstrom, M.R.; Abimbola, W.; Trska, R.; Wright, B.W.; Hecker, K.G.; Binsted, G. Using Muse: Rapid Mobile Assessment of Brain Performance. Front. Neurosci. 2021, 15, 634147. [Google Scholar] [CrossRef]
Hayes, H.B.; Magne, C. Exploring the Utility of the Muse Headset for Capturing the N400: Dependability and Single-Trial Analysis. Sensors 2024, 24, 7961. [Google Scholar] [CrossRef]
Syed, M.K.; Wang, H.; Siddiqi, A.A.; Qureshi, S.; Gouda, M.A. EEG-Based Attention Classification for Enhanced Learning Experience. Appl. Sci. 2025, 15, 8668. [Google Scholar] [CrossRef]
Yuvaraj, R.; Chadha, S.; Prince, A.A.; Murugappan, M.; Islam, M.S.B.; Sumon, M.S.I.; Chowdhury, M.E.H. A Machine Learning Framework for Classroom EEG Recording Classification: Unveiling Learning-Style Patterns. Algorithms 2024, 17, 503. [Google Scholar] [CrossRef]
Krigolson, O.E.; Williams, C.C.; Norton, A.; Hassall, C.D.; Colino, F.L. Choosing MUSE: Validation of a Low-Cost, Portable EEG System for ERP Research. Front. Neurosci. 2017, 11, 109. [Google Scholar] [CrossRef] [PubMed]
Balathay, D.; Narasimhan, U.; Belo, D.; Anandan, K. Quantitative assessment of cognitive profile and brain asymmetry in the characterization of autism spectrum in children: A task-based EEG study. Proc. Inst. Mech. Eng. 2023, 237, 653–665. [Google Scholar] [CrossRef]
Cohen, M.X. Analyzing Neural Time Series Data: Theory and Practice; Issues in clinical and cognitive neuropsychology; The MIT Press: Cambridge, MA, USA, 2014; ISBN 978-0-262-01987-3. [Google Scholar]
Bishop, C.M.; Bishop, H. Deep Learning: Foundations and Concepts; Springer: Cham, Switzerland, 2024; ISBN 978-3-031-45467-7. [Google Scholar]
Géron, A. Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems, 3rd ed.; O’Reilly Media: Beijing, China, 2022; ISBN 978-1-09-812597-4. [Google Scholar]
Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Müller, A.; Nothman, J.; Louppe, G.; et al. Scikit-learn: Machine Learning in Python. arXiv 2018, arXiv:1201.0490. [Google Scholar] [CrossRef]
Dickinson, A.; DiStefano, C.; Senturk, D.; Jeste, S.S. Peak Alpha Frequency is a Neural Marker of Cognitive Function Across the Autism Spectrum. Eur. J. Neurosci. 2017, 47, 643–651. [Google Scholar]
Van der Molen, M.J.W.; Van der Molen, M.W.; Ridderinkhof, K.R.; Hamel, B.C.J.; Curfs, L.M.G.; Ramakers, G.J.A. Auditory and visual cortical activity during selective attention in fragile X syndrome: A cascade of processing deficiencies. Clin. Neurophysiol. 2012, 123, 720–729. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Three learning tasks: (a) Shape Matching task where the test subjects drag each object in box to its matching object, (b) Shape Sorting task where the subjects recreate structure in box using the objects, and (c) Number Matching task where the subjects drag each number to its matching number.

Figure 2. Data collection: (a) Subject participating in tasks, (b) MUSE EEG sensor, and (c) Location of EEG electrode.

Figure 3. Noise removal from EEG by ICA.

Figure 4. Diagram of Ensemble Model Algorithm.

Figure 5. The effect of the number of neighbors (K) on Error in KNN model.

Figure 6. Example of cross-validation for folds 5.

Figure 7. Typical structure of multilayer perceptron neural network.

Figure 8. Comparison of fractional power of five frequency bands calculated from data collected while participants participated in all three tests such as Shape Matching, Shape Sorting, and Number Matching.

Figure 9. Signal ratio (Alpha/Beta) over AF7 (left frontal) and AF8 (right frontal) for subjects engaging in three tasks.

Figure 10. Topographic map of ASD and TD while engaging in Shape Sorting.

Figure 11. Signal ratios (Theta/Alpha, Theta/Beta) over AF7 and AF8.

Figure 12. Dynamic attention change calculated by Morlet wavelet transform.

Figure 13. Feature contribution to Random Forest and Ensemble classification models.

Figure 14. ASD map formed by the most significant features, AF7_Beta and AF8_Beta.

Table 1. Summary of hyperparameters considered for the ML models.

Model	Tuned Parameters	Grid Search Values
SVM	C, gamma, kernel	C = [0.1, 1, 10, 100]; gamma = [0.001, 0.01, 0.1, 1]; Kernel = [Linear, rbf, poly]
KNN	Number of neighbors (K)	K = [1, 3, 5, …, max_safe_k] (odd numbers only, chosen based on dataset size)
Random Forest	n_estimators, max_depth, min_samples_split, min_samples_leaf, max_features, bootstrap, criterion	n_estimators = [100, 200, 300]; max_depth = [5, 10, 15, None]; min_samples_split = [2, 4, 6]; min_samples_leaf = [1, 2, 4]; max_features = [‘sqrt’, ‘log2′]; bootstrap = [True, False]; criterion = [‘gini’, ‘entropy’]

Table 2. Summary of prediction results of machine learning models.

Model	Accuracy	Precision	Recall	F1-Score
SVM	0.84 ± 0.16	0.84 ± 0.21	0.90 ± 0.15	0.86 ± 0.14
KNN	0.84 ± 0.12	0.97 ± 0.07	0.70 ± 0.27	0.78 ± 0.18
Random Forest	0.88 ± 0.08	0.95 ± 0.10	0.82 ± 0.19	0.86 ± 0.11
Ensemble	0.90 ± 0.08	0.97 ± 0.07	0.82 ± 0.18	0.87 ± 0.11
Neural Network	0.79 ± 0.12	0.90 ± 0.13	0.67 ± 0.27	0.72 ± 0.19

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Penmetsa, H.; Abbasi, R.; Yellamilli, N.; Winkelman, K.; Chan, J.; Hwang, J.; Cho, K.T. Investigating Brain Activity of Children with Autism Spectrum Disorder During STEM-Related Cognitive Tasks. Information 2025, 16, 880. https://doi.org/10.3390/info16100880

AMA Style

Penmetsa H, Abbasi R, Yellamilli N, Winkelman K, Chan J, Hwang J, Cho KT. Investigating Brain Activity of Children with Autism Spectrum Disorder During STEM-Related Cognitive Tasks. Information. 2025; 16(10):880. https://doi.org/10.3390/info16100880

Chicago/Turabian Style

Penmetsa, Harshith, Rahma Abbasi, Nagasree Yellamilli, Kimberly Winkelman, Jeff Chan, Jaejin Hwang, and Kyu Taek Cho. 2025. "Investigating Brain Activity of Children with Autism Spectrum Disorder During STEM-Related Cognitive Tasks" Information 16, no. 10: 880. https://doi.org/10.3390/info16100880

APA Style

Penmetsa, H., Abbasi, R., Yellamilli, N., Winkelman, K., Chan, J., Hwang, J., & Cho, K. T. (2025). Investigating Brain Activity of Children with Autism Spectrum Disorder During STEM-Related Cognitive Tasks. Information, 16(10), 880. https://doi.org/10.3390/info16100880

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Investigating Brain Activity of Children with Autism Spectrum Disorder During STEM-Related Cognitive Tasks

Abstract

1. Introduction

2. Methodology

2.1. Participants and Data Collection

2.2. Data Preprocessing

2.3. Machine Learning Models

2.3.1. Traditional Machine Learning Models

Support Vector Machine (SVM)

K-Nearest Neighbors (KNN)

Random Forest (RF)

Ensemble Model

Hyperparameter Optimization

Cross-Validation

2.3.2. Neural Network Models

2.4. Feature Importance Computation

3. Results & Discussion

3.1. EEG Feature Observations

3.1.1. Increased Delta Activity

3.1.2. Frontal Alpha/Beta Ratio Asymmetry

3.1.3. Theta/Alpha (TAR) and Theta/Beta (TBR) Ratios

3.2. Dynamic Attention Analysis

3.3. ASD and TD Classification Using Machine Learning Models

3.4. Feature Importance Analysis and ASD Map

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI