A Convolutional Deep Neural Network Approach to Predict Autism Spectrum Disorder Based on Eye-Tracking Scan Paths
Abstract
:1. Introduction
2. Related Works
3. Proposed Methodology
3.1. Dataset Collection
3.2. Stage of Image Processing
3.2.1. Reducing the Size of Images
3.2.2. Grayscale Image Conversion
3.2.3. Reduction in Dimensions
3.3. Cross-Validation
3.4. T-CNN-ASD Model
- The initial layer of T-CNN-ASD is referred to as the input layer. It accepts images as input, resizes them, and subsequently forwards them to subsequent layers for the purpose of extracting features.
- The convolution layer is responsible for extracting spatial and temporal features from an input image through the process of convolution. It consists of multiple filters or kernels that perform convolution on the entire image. To configure the convolution layer, important parameters such as the number and size of the kernels need to be set. In the T-CNN-ASD model, each image is represented as an array of pixel values. These pixel values are then passed to the convolutional layer. Within this layer, the kernels move across receptive fields in the input image, searching for specific cues like edges, colors, and curves. Figure 9 provides a visual representation of this process [27].T-CNN-ASD is composed of multiple convolutional layers, generating numerous feature maps in the input layer. These maps are then utilized as the input for subsequent layers to detect a wide range of features for learning purposes.
- The Rectified Linear Unit (ReLU) is a linear function that processes the input directly if it is positive, and if it is negative, it transforms it to 0. This activation function has been widely used in various neural network models due to its improved performance and ease of training. It can be represented mathematically as f(x) = max(0, x) [28]. In the context of T-CNN-ASD, the ReLU helps maintain mathematical stability by preventing learned values from getting stuck near 0 or approaching infinity. The process of the ReLU is illustrated in Figure 10.
- Pooling Layer: The feature sets obtained from the previous layer are forwarded to the pooling layer. The purpose of this layer is to decrease the size of large images while still preserving the crucial information. In this study, both maximum and average pooling techniques were employed. These techniques involve extracting patches from the input feature maps. Max pooling selects the maximum value within each area, which represents the most prominent feature, while disregarding the remaining values. For this research, a maximum pooling method with a filter size of and a stride of 2 was utilized. The max-pooling layer is illustrated in Figure 11.The process of average pooling involves computing the average value of each patch in the feature map. In other words, a 2 × 2 square in the feature map is reduced to its average value [6]. Figure 12 illustrates the operation of average pooling.A pooling layer provides a downsampling operation, which reduces the internal dimensionality of the feature maps. It is of note that there is no learnable parameter in any of the pooling layers, whereas filter size, stride, and padding are hyperparameters in pooling operations. Unlike height and width, the depth dimension of the feature maps remains unchanged [26].
- The Fully Connected Layer is considered the last layer in a network. Its purpose is to take the high-level filtered images and convert them into categories or classes with corresponding labels. The output feature maps from the final convolution or pooling layer are typically flattened, meaning they are transformed into a one-dimensional array of numbers. These flattened feature maps are then connected to one or more fully connected layers. In these layers, each input is connected to every output through a weight that can be measured [29]. Once the features extracted by the convolution layers and downsampled by the pooling layers are generated, a subset of fully connected layers is used to map them to the final output of the network [29].
3.5. Tuning the Hyperparameters of T-CNN-ASD
- The architecture of the CNN-ASD model can be optimized by tuning its hyperparameters. One important hyperparameter to consider is the number of neurons in each hidden layer. The optimal number of neurons should be chosen based on the complexity of the problem being solved. In this study, we experiment with different numbers of neurons to identify the value that yields the highest accuracy. It is worth noting that the accuracy of the neural network can also be influenced by the number of layers. The selection of layers plays a crucial role in determining the performance of the prediction model. Using too few layers may lead to underfitting, while employing too many layers may result in overfitting. Hence, tuning the number of layers can lead to improved results.
- Optimizing primary hyperparameters in convolutional and pooling layers:
- Kernel/Filter Size: The size of the kernel refers to the width × height of the filter mask. Various kernel sizes were employed to determine the optimal kernel size that yielded the highest level of accuracy.
- Stride and Padding: The stride refers to the number of steps the filter moves in each step during convolution. It determines the number of pixels that are skipped while traversing the input horizontally and vertically after each element-wise multiplication of the input weights with those of the filter [26]. Figure 13 illustrates the stride parameter.Padding is employed to maintain the output size equal to the input size. The output size is smaller compared to the input size. Padding is a technique utilized to append zero-filled columns and rows in order to preserve the spatial dimensions after convolution. This can potentially enhance performance by preserving information at the boundaries. It is employed to ensure that the output dimension matches the input dimension [26].
- Dropout: When the dataset is small in size, it is possible for overfitting to occur. This means that while the training results may be satisfactory, the test results are not as good. The reason for this is that the model is not able to accurately generalize to data that it has not encountered during training. If the model is unable to effectively generalize to unseen data, it will struggle to perform the intended classification or prediction tasks. To address this issue, dropout is employed as a means of reducing overfitting. Dropout is a regularization technique that decreases the likelihood of overfitting by randomly dropping out neurons during each epoch (or during each batch when using a batch approach). When a unit is dropped, the corresponding neuron is temporarily removed from the network, along with all of its incoming and outgoing connections [31], as illustrated in Figure 14.
- Batch Size (BS): The batch size refers to the number of samples that the network will process simultaneously. This parameter plays a crucial role in enhancing the training process, particularly when fine-tuning is involved [32].
- Epochs refer to the number of times the learning algorithm processes the entire training dataset. It is a hyperparameter that determines how many passes the algorithm will make over the data. Each epoch allows every sample in the training dataset to update the internal model parameters. An epoch consists of one or more batches. The rationale behind using epochs is to ensure that the network has the opportunity to observe previous data and adjust the model parameters, preventing bias towards the most recent data points during training [33].
- Optimizers are responsible for modifying the learning rate and weights of neurons in a neural network in order to minimize the loss function or maximize the accuracy. During the training process, the weights of the neural network are randomly initialized and then updated in each epoch to improve the overall accuracy. The loss function is used to calculate the error by comparing the output of the training data with the actual data at each epoch, and the weights are adjusted accordingly [34]. The optimizers employed include the adaptive moment estimate (Adam) optimizer, stochastic gradient descent (SGD) optimizer, Adadelta optimizer, Root Mean Square Propagation (RMSprop) optimizer, AdaMax optimizer, and Nesterov Accelerated Adam (Nadam) optimizer.
3.6. Model Evaluation
4. Experiments and Results
4.1. Environment Setup
4.2. Sensitivity Analysis
- Results of Different Kernel/Filter Size: The training of the model involved 10 rounds of cross-validation, with each round consisting of 100 epochs and a dropout rate of 20%. Table 2 displays the accuracy achieved for various kernel sizes.The T-CNN-ASD model reached a peak accuracy of 95.59% by employing two layers with 300 and 150 neurons, respectively. The kernel size for these layers was set to 3.
- Results of Different Stride: This paper trained the model using 10 rounds of cross-validation, with 100 epochs and a 20% dropout rate. The kernel size used was 3, and the stride initially set to 1 but later changed to 2. The results can be seen in Table 3.By including two layers with 300 and 150 neurons and a stride of 1, the T-CNN-ASD model achieved an outstanding performance of 95.59%.
- Pooling Layers: The training of the model was conducted using 10 rounds of cross-validation. Each round consisted of 100 epochs with a dropout rate of 20%. The kernel size used was 3. After applying max pooling, average pooling was applied to a single depth slice with a stride of 1. The details are presented in Table 4.It is worth noting that the T-CNN-ASD model attained its peak performance at 95.59% accuracy by utilizing two layers comprising 300 and 150 neurons, respectively, in addition to employing max pooling.
- Dropout: In this study, three layers were utilized with varying neuron counts to identify the optimal architecture and performance. The accuracy results for the T-CNN-ASD model, with different layer and neuron configurations, along with 5 and 10 rounds of cross-validation, are presented in Table 5. The dropout rate was initially set at 20% but was later increased to 50%. Additionally, a kernel size of 3 was employed throughout the experiment.
4.3. Results
4.3.1. Comparison of T-CNN-ASD with ML Models
4.3.2. Comparison of T-CNN-ASD with Other Studies in the Literature
4.4. Discussion
- Various kernel sizes were employed, and it was observed that smaller kernel sizes tend to outperform larger ones. One advantage of favoring small kernel sizes over fully connected networks is the reduction in computational costs and weight sharing, resulting in fewer weights for back-propagation. Conversely, longer kernel sizes are avoided due to their excessively long training time and associated cost, which may lead to the loss of image details [23]. On the other hand, using small kernels aids in detecting small features and capturing image details. It is worth noting that odd kernel sizes are preferred over even ones, such as 2 by 2 or 4 by 4. This preference arises from the symmetry exhibited by all pixels in the previous layer around the output pixel. The absence of this symmetry necessitates additional calculations in the layers when employing even kernel sizes [6]. Nonetheless, some studies have successfully utilized even kernel sizes and achieved satisfactory results.
- The kernel was moved using a 1-then-2 stride technique because the optimal kernel was determined to be 3. Upon comparing the outcomes, it was discovered that moving the kernel by 1 step yielded superior results. This can be attributed to the fact that incrementally shifting the kernel aids in more effectively extracting the features.
- The pooling layer is employed to decrease the spatial size of the input image following convolution. It is positioned between two convolution layers. Applying a fully connected after-convolution layer without pooling would be computationally intensive. Hence, max pooling and average pooling are employed to reduce the spatial size of the input image [26]. Prior to pooling, the average pooling method smooths the image, resulting in the exclusion of sharp features. On the other hand, max pooling identifies the brighter pixels, which in turn determine the lighter pixels. This is particularly useful when the background of the image is dark, such as the images utilized in the research.
- Dropout is a technique commonly employed to mitigate overfitting in neural networks. It involves randomly deactivating neurons, allowing other neurons to compensate and make predictions based on the deactivated ones. Consequently, the neurons within the layers learn the weights independently and do not rely on collaboration with neighboring cells, thereby reducing inter-neuron dependency. This reduction in dependency helps to alleviate overfitting. In our study, we experimented with different dropout rates, specifically the lowest rate of 20% and the highest rate of 50%, to determine the optimal value. Based on the obtained results, a dropout rate of 20% yielded the best performance for the T-CNN-ASD model.
- Additionally, the most optimal outcomes were noted when the batch size was set to 256. Increasing the batch size allows our model to complete each epoch more rapidly during training. However, it is important to consider that using excessively large batches may compromise the quality of the model and hinder its ability to effectively generalize unseen data [32]. Consequently, the batch size is a hyperparameter that requires careful testing and adjustment based on the model’s performance during training.
- Furthermore, it is important to note that selecting the correct number of epochs is crucial. Using a limited number of epochs can lead to underfitting as the neural network does not have sufficient learning time. Conversely, using an excessive number of epochs may result in the model performing well on the training data but struggling to accurately predict new, unseen data. In this study, the optimal number of epochs was determined to be 100.
- In terms of optimizers, the model was tested to find the best one, and a total of six optimizers were employed. Each optimizer has its own unique approach to handling the dataset. Based on the previous findings, the Adam optimizer yielded the highest accuracy. Additionally, it was noted that the learning process was more consistent when using the Adam optimizer.
5. Conclusions and Future Work
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Thabtah, F.; Peebles, D. A new machine learning model based on induction of rules for autism detection. Health Inform. J. 2020, 26, 264–286. [Google Scholar] [CrossRef] [PubMed]
- Bilgen, I.; Guvercin, G.; Rekik, I. Machine learning methods for brain network classification: Application to autism diagnosis using cortical morphological networks. J. Neurosci. Methods 2020, 343, 108799. [Google Scholar] [CrossRef]
- Brigido, E.; Rodrigues, A.; Santos, S. Autism spectrum disorder behavioural profiles: A cluster analysis exploration. Int. J. Disabil. Dev. Educ. 2023, 70, 515–529. [Google Scholar] [CrossRef]
- Chowdhury, T.; Romero, V.; Stent, A. Interactional coordination between conversation partners with autism using non-verbal cues in dialogues. In Proceedings of the First Workshop on Connecting Multiple Disciplines to AI Techniques in Interaction-Centric Autism Research and Diagnosis (ICARD 2023), Prague, Czechia, 12 September 2023; pp. 24–34. [Google Scholar]
- Schaeffer, J.; Abd El-Raziq, M.; Castroviejo, E.; Durrleman, S.; Ferré, S.; Grama, I.; Hendriks, P.; Kissine, M.; Manenti, M.; Marinis, T.; et al. Language in autism: Domains, profiles and co-occurring conditions. J. Neural Transm. 2023, 130, 433–457. [Google Scholar] [CrossRef]
- Wan, G.; Kong, X.; Sun, B.; Yu, S.; Tu, Y.; Park, J.; Lang, C.; Koh, M.; Wei, Z.; Feng, Z.; et al. Applying eye tracking to identify autism spectrum disorder in children. J. Autism Dev. Disord. 2019, 49, 209–215. [Google Scholar] [CrossRef]
- Kanhirakadavath, M.R.; Chandran, M.S.M. Investigation of Eye-Tracking Scan Path as a Biomarker for Autism Screening Using Machine Learning Algorithms. Diagnostics 2022, 12, 518. [Google Scholar] [CrossRef]
- Weiss, R.; Karimijafarbigloo, S.; Roggenbuck, D.; Rödiger, S. Applications of Neural Networks in Biomedical Data Analysis. Biomedicines 2022, 10, 1469. [Google Scholar] [CrossRef]
- Yaneva, V.; Eraslan, S.; Yesilada, Y.; Mitkov, R. Detecting high-functioning autism in adults using eye tracking and machine learning. IEEE Trans. Neural Syst. Rehabil. Eng. 2020, 28, 1254–1261. [Google Scholar] [CrossRef]
- Carette, R.; Elbattah, M.; Cilia, F.; Dequen, G.; Guérin, J.L.; Bosche, J. Learning to Predict Autism Spectrum Disorder based on the Visual Patterns of Eye-tracking Scanpaths. In Proceedings of the HEALTHINF, Prague, Czech Republic, 22–24 February 2019; pp. 103–112. [Google Scholar]
- Shic, F.; Naples, A.J.; Barney, E.C.; Chang, S.A.; Li, B.; McAllister, T.; Kim, M.; Dommer, K.J.; Hasselmo, S.; Atyabi, A.; et al. The autism biomarkers consortium for clinical trials: Evaluation of a battery of candidate eye-tracking biomarkers for use in autism clinical trials. Mol. Autism 2022, 13, 15. [Google Scholar] [CrossRef] [PubMed]
- Frazier, T.W.; Klingemier, E.W.; Parikh, S.; Speer, L.; Strauss, M.S.; Eng, C.; Hardan, A.Y.; Youngstrom, E.A. Development and Validation of objective and quantitative eye tracking- based measures of autism risk and symptom levels. J. Am. Acad. Child Adolesc. Psychiatry 2018, 57, 858–866. [Google Scholar] [CrossRef] [PubMed]
- Elbattah, M. Visualization of Eye-Tracking Scanpaths in Autism Spectrum Disorder: Image Dataset. In Proceedings of the 12th International Conference on Health Informatics (HEALTHINF), Prague, Czech Republic, 22–24 February 2019. [Google Scholar] [CrossRef]
- Fabiano, D.; Canavan, S.; Agazzi, H.; Hinduja, S.; Goldgof, D. Gaze-based classification of autism spectrum disorder. Pattern Recognit. Lett. 2020, 135, 204–212. [Google Scholar] [CrossRef]
- Vabalas, A.; Gowen, E.; Poliakoff, E.; Casson, A.J. Applying Machine Learning to Kinematic and eye Movement features of a Movement imitation task to predict Autism Diagnosis. Sci. Rep. 2020, 10, 8346. [Google Scholar] [CrossRef] [PubMed]
- Duan, H.; Min, X.; Fang, Y.; Fan, L.; Yang, X.; Zhai, G. Visual attention analysis and prediction on human faces for children with autism spectrum disorder. ACM Trans. Multimed. Comput. Commun. Appl. (TOMM) 2019, 15, 1–23. [Google Scholar] [CrossRef]
- Tao, Y.; Shyu, M.L. SP-ASDNet: CNN-LSTM based ASD classification model using observer scanpaths. In Proceedings of the 2019 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), Shanghai, China, 8–12 July 2019; pp. 641–646. [Google Scholar]
- Kociołek, M.; Strzelecki, M.; Obuchowicz, R. Does image normalization and intensity resolution impact texture classification? Comput. Med. Imaging Graph. 2020, 81, 101716. [Google Scholar] [CrossRef] [PubMed]
- Gonzalez, R.C.; Woods, R.E. Digital Image Processing, 4th ed.; Pearson: London, UK, 2018. [Google Scholar]
- Zebari, R.; Abdulazeez, A.; Zeebaree, D.; Zebari, D.; Saeed, J. A comprehensive review of dimensionality reduction techniques for feature selection and feature extraction. J. Appl. Sci. Technol. Trends 2020, 1, 56–70. [Google Scholar] [CrossRef]
- Kaminsky, A.L.; Wang, Y.; Pant, K. An Efficient Batch K-Fold Cross-Validation Voronoi Adaptive Sampling Technique for Global Surrogate Modeling. J. Mech. Des. 2021, 143, 011706. [Google Scholar] [CrossRef]
- Brownlee, J. Data Preparation for Machine Learning: Data Cleaning, Feature Selection, and Data Transforms in Python; Machine Learning Mastery: Vermont, VIC, Australia, 2020. [Google Scholar]
- Jung, Y. Multiple predicting K-fold cross-validation for model selection. J. Nonparametr. Stat. 2018, 30, 197–215. [Google Scholar] [CrossRef]
- Filchev, L.; Pashova, L.; Kolev, V.; Frye, S. Surveys, Catalogues, Databases/Archives, and State-of-the-Art Methods for Geoscience Data Processing. In Knowledge Discovery in Big Data from Astronomy and Earth Observation; Elsevier: Amsterdam, The Netherlands, 2020; pp. 103–136. [Google Scholar]
- Sharma, N.; Jain, V.; Mishra, A. An analysis of convolutional neural networks for image classification. Procedia Comput. Sci. 2018, 132, 377–384. [Google Scholar] [CrossRef]
- Yamashita, R.; Nishio, M.; Do, R.K.G.; Togashi, K. Convolutional neural networks: An overview and application in radiology. Insights Imaging 2018, 9, 611–629. [Google Scholar] [CrossRef]
- Ahlawat, S.; Choudhary, A.; Nayyar, A.; Singh, S.; Yoon, B. Improved handwritten digit recognition using convolutional neural networks (CNN). Sensors 2020, 20, 3344. [Google Scholar] [CrossRef]
- Agarap, A.F. Deep learning using rectified linear units (relu). arXiv 2018, arXiv:1803.08375. [Google Scholar]
- Dao, H. Image Classification Using Convolutional Neural Networks. Bachelor’s Thesis, Oulu University of Applied Sciences, Oulu, Finland, 2020. [Google Scholar]
- Tian, C.; Xu, Y.; Fei, L.; Wang, J.; Wen, J.; Luo, N. Enhanced CNN for image denoising. CAAI Trans. Intell. Technol. 2019, 4, 17–23. [Google Scholar] [CrossRef]
- Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. Dropout: A simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 2014, 15, 1929–1958. [Google Scholar]
- Kandel, I.; Castelli, M. The effect of batch size on the generalizability of the convolutional neural networks on a histopathology dataset. ICT Express 2020, 6, 312–315. [Google Scholar] [CrossRef]
- Afaq, S.; Rao, S. Significance Of Epochs On Training A Neural Network. Int. J. Sci. Technol. Res. 2020, 9, 485–488. [Google Scholar]
- Boob, D.; Lan, G. Theoretical properties of the global optimizer of two layer neural network. arXiv 2017, arXiv:1710.11241. [Google Scholar]
Number of Participants | 59 |
Gender Distribution (M/F) | 38 (64%)/21 (36%) |
Number of Non-ASD | 30 |
Number of ASD-Diagnosed | 29 |
Age (Mean/Median) years | 7.88/8.1 |
CARS (Mean/Median) | 32.97/34.50 |
Number of Neurons | Hidden Layers | The Kernel Size | ||
---|---|---|---|---|
3 | 6 | 9 | ||
50 | Single layer | 92.95 | 93.69 | 93.25 |
100 | 93.87 | 93.79 | 93.05 | |
200 | 93.51 | 94.07 | 83.98 | |
300 | 93.97 | 93.44 | 87.12 | |
400 | 92.87 | 94.16 | 93.14 | |
500 | 93.61 | 94.13 | 84.58 | |
600 | 93.33 | 89.87 | 83.86 | |
80, 40 | Two layers | 94.69 | 91.93 | 93.35 |
100, 50 | 94.87 | 93.80 | 91.04 | |
200, 100 | 94.52 | 90.59 | 86.93 | |
300, 150 | 95.59 | 85.23 | 81.71 | |
400, 200 | 94.96 | 93.06 | 87.24 | |
500, 250 | 93.59 | 85.82 | 81.59 |
Number of Neurons | Hidden Layers | Stride | |
---|---|---|---|
1 | 2 | ||
50 | Single layer | 92.95 | 92.61 |
100 | 93.87 | 92.71 | |
200 | 93.51 | 92.87 | |
300 | 93.97 | 92.31 | |
400 | 92.87 | 93.69 | |
500 | 93.61 | 93.13 | |
600 | 93.33 | 93.60 | |
80, 40 | Two layers | 94.69 | 93.05 |
100, 50 | 94.87 | 94.06 | |
200, 100 | 94.52 | 93.60 | |
300, 150 | 95.59 | 94.14 | |
400, 200 | 94.96 | 93.23 | |
500, 250 | 93.59 | 93.96 |
Number of Neurons | Hidden Layers | Pooling | |
---|---|---|---|
Max Pooling | Average Pooling | ||
50 | Single layer | 92.95 | 92.60 |
100 | 93.87 | 92.70 | |
200 | 93.51 | 92.60 | |
300 | 93.97 | 91.88 | |
400 | 92.87 | 91.49 | |
500 | 93.61 | 93.14 | |
600 | 93.33 | 93.23 | |
80, 40 | Two layers | 94.69 | 91.42 |
100, 50 | 94.87 | 90.48 | |
200, 100 | 94.52 | 92.88 | |
300, 150 | 95.59 | 92.14 | |
400, 200 | 94.96 | 91.80 | |
500, 250 | 93.59 | 91.78 |
No.Neurons | Hidden Layers | Numbers of Folds Cross-Validation | |||
---|---|---|---|---|---|
5 Folds Cross-Validation | 10 Folds Cross-Validation | ||||
Dropout 20% | Dropout 50% | Dropout 20% | Dropout 50% | ||
50 | Single layer | 92.87 | 93.51 | 92.42 | 93.43 |
100 | 92.69 | 93.14 | 93.79 | 93.43 | |
200 | 94.24 | 92.69 | 94.07 | 94.16 | |
300 | 92.78 | 93.32 | 93.97 | 93.70 | |
400 | 93.87 | 93.06 | 92.87 | 93.42 | |
500 | 92.87 | 93.88 | 93.61 | 93.50 | |
600 | 93.69 | 93.23 | 93.33 | 93.98 | |
80, 40 | Two layers | 93.87 | 94.15 | 94.69 | 92.60 |
100, 50 | 94.15 | 94.61 | 94.25 | 94.78 | |
200, 100 | 93.87 | 94.69 | 93.60 | 95.07 | |
300, 150 | 94.53 | 93.60 | 95.59 | 94.16 | |
400, 200 | 94.69 | 93.60 | 94.96 | 94.32 | |
500, 250 | 94.15 | 93.60 | 90.52 | 91.37 |
Number of Neurons | Hidden Layers | The Batch Size | |||
---|---|---|---|---|---|
32 | 64 | 128 | 256 | ||
50 | Single layer | 93.68 | 93.79 | 94.52 | 92.95 |
100 | 93.96 | 92.69 | 93.77 | 93.87 | |
200 | 89.59 | 93.50 | 93.96 | 93.51 | |
300 | 93.51 | 92.95 | 94.42 | 93.97 | |
400 | 88.95 | 93.33 | 93.61 | 92.87 | |
500 | 93.51 | 93.14 | 93.06 | 93.61 | |
600 | 90.33 | 93.78 | 93.61 | 93.33 | |
80, 40 | Two layers | 93.79 | 94.24 | 94.25 | 94.69 |
100, 50 | 92.81 | 94.78 | 94.71 | 94.87 | |
200, 100 | 84.92 | 92.83 | 94.15 | 94.52 | |
300, 150 | 71.13 | 87.73 | 91.78 | 95.59 | |
400, 200 | 87.37 | 93.79 | 90.18 | 94.96 | |
500, 250 | 80.44 | 87.11 | 94.34 | 93.59 |
Number of Neurons | Hidden Layers | Epochs | ||
---|---|---|---|---|
50 | 100 | 150 | ||
50 | Single layer | 86.80 | 92.95 | 92.32 |
100 | 90.90 | 93.87 | 93.05 | |
200 | 90.50 | 93.51 | 92.60 | |
300 | 91.34 | 93.97 | 91.88 | |
400 | 91.70 | 92.87 | 93.87 | |
500 | 91.32 | 93.61 | 92.70 | |
600 | 91.29 | 93.33 | 88.87 | |
80, 40 | Two layers | 92.41 | 94.69 | 92.98 |
100, 50 | 90.05 | 94.87 | 91.40 | |
200, 100 | 91.20 | 94.52 | 90.77 | |
300, 150 | 92.40 | 95.59 | 92.60 | |
400, 200 | 92.60 | 94.96 | 88.90 | |
500, 250 | 87.81 | 93.59 | 89.50 |
Number of Neurons | Hidden Layers | Optimizers | ||||
---|---|---|---|---|---|---|
SGD | Adadelta | RMSprop | AdaMax | Nadam | ||
50 | Single layer | 59.98 | 59.80 | 88.10 | 90.23 | 92.97 |
100 | 60.13 | 57.05 | 92.50 | 91.21 | 92.86 | |
200 | 59.97 | 56.64 | 93.51 | 92.04 | 93.96 | |
300 | 62.51 | 61.97 | 93.16 | 92.42 | 94.44 | |
400 | 61.80 | 61.60 | 92.04 | 92.24 | 91.61 | |
500 | 61.77 | 60.41 | 92.97 | 93.13 | 92.95 | |
600 | 62.33 | 58.88 | 90.13 | 93.71 | 93.43 | |
80, 40 | Two layer | 59.97 | 56.50 | 92.23 | 85.94 | 90.27 |
100, 50 | 59.93 | 58.43 | 93.24 | 85.38 | 95.07 | |
200, 100 | 59.97 | 58.15 | 92.78 | 87.75 | 85.67 | |
300, 150 | 60.48 | 61.30 | 90.80 | 90.50 | 81.60 | |
400, 200 | 59.96 | 58.41 | 94.43 | 91.40 | 85.08 | |
500, 250 | 59.98 | 61.71 | 88.50 | 93.52 | 85.70 |
K-Neighbors | Decision Tree | Random Forest | MLP | T-CNN-ASD | |
---|---|---|---|---|---|
Accuracy | 61.3 | 67.7 | 69.60 | 94.69 | 95.59 |
Sensitivity | 55.10 | 64.42 | 69.34 | 73.91 | 77.60 |
Specificity | 64.24 | 64.22 | 74.31 | 73.43 | 79.91 |
F1-Score | 59.21 | 64.32 | 71.74 | 73.67 | 78.73 |
K-Neighbors | Decision Tree | Random Forest | MLP | T-CNN-ASD | |
---|---|---|---|---|---|
Accuracy | 60 | 65.3 | 70.70 | 93.69 | 94.69 |
Sensitivity | 53.31 | 65.10 | 69.82 | 73.52 | 75.70 |
Specificity | 60.23 | 65.53 | 74.11 | 72.44 | 75.73 |
F1-Score | 56.56 | 65.31 | 71.90 | 73.67 | 75.71 |
No. of Neurons | Hidden Layers | ANN | MLP | MLP | T-CNN-ASD | T-CNN-ASD |
---|---|---|---|---|---|---|
(Gray) | (Gray) | (RGB) | (Gray) | (RGB) | ||
50 | Single layer | 91.00 | 93.52 | 93.69 | 92.42 | 93.14 |
200 | 92.00 | 93.68 | 93.51 | 93.42 | 93.96 | |
500 | 92.00 | 93.23 | 94.07 | 93.61 | 93.22 | |
80, 40 | Two Layers | 91.00 | 93.42 | 93.60 | 94.69 | 92.87 |
ML Algorithms | Accuracy |
---|---|
Naïve Bayes | 61.00 |
Logistic Regression | 70.00 |
SVM | 77.00 |
Random Forest | 70.00 |
ANN | 92.00 |
MLP | 94.07 |
T-CNN-ASD | 95.59 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Alsaidi, M.; Obeid, N.; Al-Madi, N.; Hiary, H.; Aljarah, I. A Convolutional Deep Neural Network Approach to Predict Autism Spectrum Disorder Based on Eye-Tracking Scan Paths. Information 2024, 15, 133. https://doi.org/10.3390/info15030133
Alsaidi M, Obeid N, Al-Madi N, Hiary H, Aljarah I. A Convolutional Deep Neural Network Approach to Predict Autism Spectrum Disorder Based on Eye-Tracking Scan Paths. Information. 2024; 15(3):133. https://doi.org/10.3390/info15030133
Chicago/Turabian StyleAlsaidi, May, Nadim Obeid, Nailah Al-Madi, Hazem Hiary, and Ibrahim Aljarah. 2024. "A Convolutional Deep Neural Network Approach to Predict Autism Spectrum Disorder Based on Eye-Tracking Scan Paths" Information 15, no. 3: 133. https://doi.org/10.3390/info15030133
APA StyleAlsaidi, M., Obeid, N., Al-Madi, N., Hiary, H., & Aljarah, I. (2024). A Convolutional Deep Neural Network Approach to Predict Autism Spectrum Disorder Based on Eye-Tracking Scan Paths. Information, 15(3), 133. https://doi.org/10.3390/info15030133