Article

GM-VGG-Net: A Gray Matter-Based Deep Learning Network for Autism Classification

1
Department of Diagnostic Radiology, City of Hope National Medical Center, Duarte, CA 91010, USA
2
Department of Radiology, Henry Ford Hospital, Detroit, MI 48202, USA
3
Department of Health and Exercise Science, La Sierra University, Riverside, CA 92505, USA
*
Author to whom correspondence should be addressed.
Diagnostics 2025, 15(11), 1425; https://doi.org/10.3390/diagnostics15111425
Submission received: 28 April 2025 / Revised: 1 June 2025 / Accepted: 2 June 2025 / Published: 3 June 2025
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)

Abstract

Background: Around 1 in 59 individuals is diagnosed with Autism Spectrum Disorder (ASD), according to CDC statistics. Conventionally, ASD has been diagnosed using functional brain regions, regions of interest, or multi-tissue-based training in artificial intelligence models. The objective of the present study is to develop an efficient deep learning network for identifying ASD using structural magnetic resonance imaging (MRI)-based brain scans. Methods: In this work, we developed a VGG-based deep learning network capable of diagnosing autism using whole-brain gray matter (GM) tissues. We trained our deep network with 140 MRI T1 images from normal controls and 132 MRI T1 images from ASD patients sourced from the Autism Brain Imaging Data Exchange (ABIDE) dataset. Results: The ASD and normal control (CN) groups did not differ significantly in age (p = 0.23). The mean age of the CN group was 14.62 years (standard deviation: 4.34), and the ASD group had a mean age of 14.89 years (standard deviation: 4.29). Our deep learning model achieved a training accuracy of 97% and a validation accuracy of 96% over 50 epochs without overfitting. Conclusions: To the best of our knowledge, this is the first study to use GM tissue alone for diagnosing ASD using VGG-Net.

1. Introduction

Autism Spectrum Disorder (ASD) is a neurodevelopmental condition characterized by difficulties in social interaction and communication, stereotypic behaviors, and sensory abnormalities [1,2]. In the field of medical imaging, deep learning models have made significant strides in ASD diagnosis, leveraging their ability to learn complex patterns without hand-crafted features [3,4].
Magnetic resonance imaging (MRI)-based studies have identified various biomarkers demonstrating altered gray matter patterns in autism patients compared to the normal control population [5,6]. For example, increased gray matter has been reported in the right angular gyrus, the prefrontal cortex, the left superior and middle frontal gyri, the left precuneus and inferior occipital gyrus, and the right inferior temporal gyrus of autism subjects, alongside diminished gray matter in the left postcentral gyrus and cerebellar regions [7]. Another study compared three groups, normal control (CN), participants with attention deficit hyperactivity disorder (ADHD), and ASD, and found that gray matter volume (GMV) was significantly higher in the ASD group compared to the ADHD and CN groups (p = 0.004); total brain volume (TBV) was also significantly higher in the ASD group (p = 0.015) [8]. A longitudinal volumetric study with 156 participants likewise exhibited statistically significant increases in GMV and TBV in ASD subjects [9]. Even though the sample sizes in the former study were relatively small (33 CN, 44 ADHD, and 19 ASD), the observed group differences in GMV and TBV were statistically validated using Bonferroni correction [8]. These findings suggest that gray matter tissue alone could potentially serve as a useful biomarker for classifying ASD through deep learning approaches.
A study of a 295-subject cohort examined MRI-based brain changes associated with ASD with respect to sex differences: males with ASD displayed increased gray matter volumes in the insula and superior frontal gyrus, with diminished volumes in the inferior frontal gyrus and thalamus, whereas females with ASD exhibited increased gray matter volume in the right cuneus [10]. In addition to gray matter biomarkers, other works have reported white matter changes, including altered white matter connectivity, as significant biomarkers for ASD [11,12,13,14]. For example, a diffusion tensor imaging (DTI)-based study reported 99% classification accuracy for ASD using fivefold cross-validation [15].
Previous studies have applied various deep learning approaches to identify ASD from functional MRI (fMRI) and structural MRI (sMRI) data, reporting a wide range of classification accuracies. A study using a 3D Residual Network (ResNet-18) and multilayer perceptron (MLP) achieved 74% accuracy on fMRI and region of interest (ROI) data [16]. Another study applying a deep neural network (DNN) to fMRI data reported 70% classification accuracy, with ROIs selected based on co-activation levels of brain regions [17]. A hybrid Deep Belief Network (DBN) model that combined fMRI and structural MRI data, including gray and white matter tissues, achieved 65% accuracy using 116 ROIs from both imaging modalities [18]. A connectivity-based study using 7266 gray matter ROIs from the Blood Oxygen Level Dependent (BOLD) signal, tested on 964 subjects from the Autism Brain Imaging Data Exchange (ABIDE) dataset, achieved 60% classification accuracy [19].
Smaller samples have yielded improved accuracy: a study with 80 subjects using a leave-one-out classifier achieved 79% accuracy, rising to 89% for subjects under 20 years of age [20]. A DNN classifier on fMRI data involving 866 subjects (402 ASD and 464 control subjects) showed a high classification accuracy of 88%, using ROIs based on several functional and structural atlases, including the Bootstrap Analysis of Stable Clusters (BASC) and the Craddock 200 (CC200) atlas [21]. A convolutional neural network (CNN) approach, using 126 subjects from the ABIDE database, achieved an impressive 99.39% accuracy over 50 epochs with 20% of the data reserved for validation [22]. Additionally, a multimodal fusion approach incorporating both fMRI and sMRI for 1383 male participants aged 5 to 40 years achieved an accuracy of 85%, with the structural model alone achieving 75% and the functional model 83% [23]. These findings highlight the effectiveness of different deep learning models and imaging modalities in ASD classification, with multimodal approaches offering the highest accuracies.
While fMRI provides valuable physiological information about brain function, it suffers from lower spatial resolution and greater signal attenuation in structural regions. In contrast, sMRI offers higher resolution with less attenuation, making it a promising tool for studying brain anatomy; however, its application in ASD prediction using deep learning models has been relatively underexplored. In this work, we aim to use sMRI images alone to train a deep learning model and predict outcomes. For this purpose, we used the VGG network, introduced by Simonyan and Zisserman in 2014 for the ImageNet Challenge; it has proven effective in large-scale image recognition tasks [24]. Previously, a study used the VGG16 model to distinguish papillary thyroid carcinoma from benign thyroid nodules using cytological images, achieving 97.66% accuracy in cancer detection [25].
In our study, we introduce a modified VGG model, leveraging the strengths of the base VGG model along with multiple weighted layers in a deep neural network, implemented using TensorFlow and Keras, to improve ASD identification in large datasets. To the best of our knowledge, this is the first study to apply the VGG model for ASD identification based solely on sMRI. While the majority of conventional deep learning models in the literature have used multimodal data or multiple tissue types for ASD classification, our approach identifies ASD using gray matter (GM) maps alone, minimizing computational complexity in terms of storage and learning.

2. Materials and Methods

2.1. Dataset

The present study utilized MRI T1-weighted image data from the ABIDE database. ABIDE is a consortium that provides previously collected sMRI and rs-fMRI data from individuals with ASD and normal controls for data sharing within the scientific community [26]. We included a total of 272 subjects in our analysis, and the age difference between the ASD and CN groups was assessed using an independent t-test from the SciPy library (Python 3.8), implemented within the PyCharm platform.
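The group comparison described above can be sketched as follows. This is a minimal illustration of an independent t-test with `scipy.stats.ttest_ind`, using synthetic ages drawn to approximate the reported group means and standard deviations, not the actual ABIDE demographics:

```python
# Illustrative sketch only: synthetic ages, not the real ABIDE data.
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
# Draw synthetic samples matching the reported group sizes, means, and SDs
cn_ages = rng.normal(14.62, 4.34, 140)   # normal controls
asd_ages = rng.normal(14.89, 4.29, 132)  # ASD group

# Independent two-sample t-test on group ages
t_stat, p_value = stats.ttest_ind(cn_ages, asd_ages)
print(f"t = {t_stat:.3f}, p = {p_value:.3f}")
```

A p-value above 0.05 would indicate no significant age difference between the groups, as reported in Section 3.1.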

2.2. Preprocessing of MRI-T1 Images

For data preprocessing, we employed the statistical parametric mapping package SPM12 (Wellcome Department of Cognitive Neurology, London, UK) and MATLAB R2019b (The MathWorks Inc., Natick, MA, USA) with custom software to preprocess our MRI T1 images. The preprocessing steps followed those described earlier [27]. The Diffeomorphic Anatomical Registration Through Exponentiated Lie Algebra (DARTEL) toolbox was used to improve inter-subject image registration of our input images [28]. We segmented gray matter (GM), white matter (WM), cerebrospinal fluid (CSF), skull, and other brain regions using the ‘new segment’ option in the DARTEL toolbox. The gray matter probability maps computed for each scan were spatially normalized to Montreal Neurological Institute (MNI) space (unmodulated, re-sliced to 1 × 1 × 1 mm) and smoothed with a Gaussian filter (9 mm full width at half maximum) [29,30,31,32].
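The smoothing step above was performed in SPM12; purely as an illustrative sketch (not the authors' pipeline), the 9 mm FWHM Gaussian smoothing could be reproduced in Python with `scipy.ndimage`, assuming 1 mm isotropic voxels to match the re-sliced MNI resolution:

```python
# Sketch of 9 mm FWHM Gaussian smoothing (assumes 1 mm isotropic voxels).
import numpy as np
from scipy.ndimage import gaussian_filter

FWHM_MM = 9.0
VOXEL_MM = 1.0
# Convert FWHM to the Gaussian sigma expected by scipy:
# sigma = FWHM / (2 * sqrt(2 * ln 2))
sigma = FWHM_MM / (2.0 * np.sqrt(2.0 * np.log(2.0))) / VOXEL_MM

gm_map = np.random.rand(64, 64, 64).astype(np.float32)  # stand-in GM probability map
smoothed = gaussian_filter(gm_map, sigma=sigma)
print(round(sigma, 3))  # ~3.822
```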

2.3. Conventional VGG-16 Architecture

In the conventional VGG-16 network, convolutional layers are followed by pooling layers in each hidden layer unit. The network starts with 64 filters in the first layer unit and widens to 128, then 256, and finally 512 filters in the deeper hidden layers. Each convolutional layer utilizes a rectified linear unit (ReLU) for activation. Finally, the network incorporates three fully connected layers: the first two with 4096 channels each, and the third with 1000 channels, one per class.
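The layer pattern just described can be sketched with Keras; this is a generic reconstruction of the VGG-16 configuration from [24], not the authors' code:

```python
# Sketch of the conventional VGG-16 layer pattern (configuration D of [24]).
import tensorflow as tf
from tensorflow.keras import layers, models

def build_vgg16(input_shape=(224, 224, 3), n_classes=1000):
    # Block structure: (number of conv layers, filters per conv)
    cfg = [(2, 64), (2, 128), (3, 256), (3, 512), (3, 512)]
    model = models.Sequential(name="vgg16_sketch")
    model.add(layers.Input(shape=input_shape))
    for n_convs, filters in cfg:
        for _ in range(n_convs):
            model.add(layers.Conv2D(filters, 3, padding="same", activation="relu"))
        model.add(layers.MaxPooling2D(pool_size=2, strides=2))
    # Three fully connected layers: 4096, 4096, and one channel per class
    model.add(layers.Flatten())
    model.add(layers.Dense(4096, activation="relu"))
    model.add(layers.Dense(4096, activation="relu"))
    model.add(layers.Dense(n_classes, activation="softmax"))
    return model

model = build_vgg16()
print(model.count_params())  # 138,357,544 parameters for the standard configuration
```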

2.4. Proposed Deep Learning GM-VGG-Net Architecture

The proposed deep learning network is implemented using the TensorFlow 1.4 and Keras platforms. Skull-stripped (segmented and normalized) image data provide a higher probability of achieving valid MNI coordinates for functional activations compared to skull-included input images [33]. Our deep learning architecture starts with 32 filters, followed by 64 and 128 filters, and ends with two final units, each containing 256 filters. We also added batch normalization units, as described below. The proposed deep learning VGG network architecture is shown in Figure 1. However, we have significantly changed the filter layout and layer structure relative to the original VGG network [24,34,35,36].

2.4.1. Input Layer Unit

The input layer consists of preprocessed gray matter (GM) maps from MRI T1-weighted images. To reduce complexity, we selected the 70 slices that contain the most brain tissue (256 × 70 × 256, where 256 × 256 is the in-plane image dimension and 70 is the number of selected slices). Our network was designed with five sequential hidden layer units, each comprising convolution filters (3 × 3), a ReLU activation unit, a maximum pooling layer (2 × 2), and a 25% dropout layer.
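The paper does not specify how the 70 most brain-containing slices were selected. One plausible criterion, shown here purely as a hypothetical sketch (the function name and threshold are our own), is to rank slices by their count of above-threshold GM voxels:

```python
# Hypothetical sketch: selecting the slices with the most brain tissue.
import numpy as np

def select_top_slices(volume, n_slices=70, axis=1, threshold=0.1):
    """Keep the n_slices along `axis` with the most above-threshold voxels."""
    moved = np.moveaxis(volume, axis, 0)
    counts = (moved > threshold).reshape(moved.shape[0], -1).sum(axis=1)
    keep = np.sort(np.argsort(counts)[-n_slices:])  # preserve anatomical order
    return np.moveaxis(moved[keep], 0, axis)

# Toy volume: "brain" occupies slices 10..89 along axis 1
vol = np.zeros((32, 100, 32))
vol[:, 10:90, :] = np.random.rand(32, 80, 32)
sub = select_top_slices(vol, n_slices=70)
print(sub.shape)  # (32, 70, 32)
```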

2.4.2. Hidden Layer Unit

Each hidden layer unit applies convolution filters with a 3 × 3 kernel followed by 2 × 2 maximum pooling, which halves the spatial dimensions of the feature maps. The five hidden layer units use 32, 64, 128, 256, and 256 convolution filters, respectively, yielding pooled feature maps of 128 × 70 × 128 × 32, 64 × 70 × 64 × 64, 32 × 70 × 32 × 128, 16 × 70 × 16 × 256, and 8 × 70 × 8 × 256. In addition, each hidden layer unit in our proposed deep learning network includes a rectified linear unit (ReLU)-based activation, a 25% dropout layer, and batch normalization.

2.4.3. Fully Connected Layer Unit

Our proposed fully connected (FC) layer unit is designed with a flatten layer, a fully connected layer, a batch normalization layer, a ReLU-based activation, a maximum pooling layer, and a 50% dropout layer. The FC layer connects the hidden layers to the output layer unit.

2.4.4. Output Layer Unit

The output layer unit is designed with a dense layer and a sigmoid activation function. The output unit classifies our images into ASD and HC classes.
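The hidden, fully connected, and output layer units described above can be sketched in Keras as below. This is our reconstruction under stated assumptions, not the authors' code: we interpret the reported feature-map sizes as 3D convolutions pooled only along the two spatial axes (pool size 2 × 1 × 2), we use a reduced 64 × 70 × 64 input so the sketch builds quickly, and the fully connected unit is simplified, so the parameter count will not match the paper's 5,176,705.

```python
# Sketch of the GM-VGG-Net structure under stated assumptions.
import tensorflow as tf
from tensorflow.keras import layers, models

def build_gm_vgg_net(input_shape=(64, 70, 64, 1)):
    model = models.Sequential(name="gm_vgg_net_sketch")
    model.add(layers.Input(shape=input_shape))
    # Five hidden layer units: 32, 64, 128, 256, 256 filters
    for filters in [32, 64, 128, 256, 256]:
        model.add(layers.Conv3D(filters, 3, padding="same"))
        model.add(layers.BatchNormalization())
        model.add(layers.Activation("relu"))
        # Pool only the spatial axes, keeping all 70 slices,
        # to match the reported feature-map sizes
        model.add(layers.MaxPooling3D(pool_size=(2, 1, 2)))
        model.add(layers.Dropout(0.25))
    # Fully connected unit: flatten, dense, batch norm, ReLU, 50% dropout
    model.add(layers.Flatten())
    model.add(layers.Dense(256))
    model.add(layers.BatchNormalization())
    model.add(layers.Activation("relu"))
    model.add(layers.Dropout(0.5))
    # Output unit: sigmoid for the binary ASD/HC decision
    model.add(layers.Dense(1, activation="sigmoid"))
    return model

model = build_gm_vgg_net()
print(model.output_shape)  # (None, 1)
```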

3. Results

3.1. Demographic Data

We included a total of 272 subjects, with 132 individuals diagnosed with ASD and 140 matched normal controls. The mean age of the CN group was 14.62 years (SD = 4.34), and the mean age of the ASD group was 14.89 years (SD = 4.29). The CN group consisted of 68 male subjects and 72 female subjects, while the ASD group consisted of 67 males and 65 females. The mean age of males in the CN group was 14.97 years (SD = 4.14), and in the ASD group, it was 15.75 years (SD = 3.77). The mean age of females in the CN group was 13.57 years (SD = 4.56), and in the ASD group, it was 14.02 years (SD = 4.60). No significant age differences were observed between the two groups (p = 0.23), as shown in Table 1.

3.2. Performance Evaluation of GM-VGG Net Classifier

The classification performance of our proposed GM-VGG-Net was evaluated in terms of loss and accuracy. The training and validation loss functions, along with the accuracy of our deep network, are shown in Figure 2. Our proposed deep learning network was validated over 50 epochs, achieving training and validation accuracies of 97% and 96%, respectively. The corresponding loss values were 0.0204 for training and 0.0696 for validation. In this deep learning model, we used the TensorFlow–Keras platform with the Adam optimizer at its default learning rate of 0.001. We fine-tuned the network structure based on loss and accuracy performance to avoid overfitting. The image dataset was split into 70% for training and 30% for validation. The total number of parameters was 5,176,705, of which 5,174,721 were trainable. The model summary is given in Table 2.
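A minimal sketch of this training configuration (Adam at the default 0.001 learning rate, 70/30 train/validation split) is shown below on synthetic stand-in data and a toy model; binary cross-entropy is our assumption, consistent with the sigmoid output:

```python
# Sketch only: synthetic data and a toy model, not the paper's network.
import numpy as np
import tensorflow as tf

rng = np.random.default_rng(42)
X = rng.random((20, 16, 16, 1)).astype("float32")  # stand-in GM inputs
y = rng.integers(0, 2, 20).astype("float32")       # binary ASD/HC labels

# 70/30 train/validation split, as in the paper
n_train = int(0.7 * len(X))
X_train, X_val = X[:n_train], X[n_train:]
y_train, y_val = y[:n_train], y[n_train:]

model = tf.keras.Sequential([
    tf.keras.layers.Input((16, 16, 1)),
    tf.keras.layers.Conv2D(8, 3, activation="relu"),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
# Adam at its default learning rate of 0.001; binary cross-entropy is an
# assumption consistent with the sigmoid output layer
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=0.001),
              loss="binary_crossentropy", metrics=["accuracy"])
history = model.fit(X_train, y_train, validation_data=(X_val, y_val),
                    epochs=2, verbose=0)
print(sorted(history.history))
```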

4. Discussion

In this study, we developed a deep learning network for ASD identification based on the VGG16 architecture, utilizing only structural GM tissue images. We systematically evaluated the network’s performance and achieved high accuracy on sMRI data after 50 epochs of training. To the best of our knowledge, the proposed deep learning model outperformed existing networks for ASD identification using structural GM tissues alone. By using the deep learning network with GM maps exclusively, we reduced the complexity of the training process.
Previous studies on ASD classification using the ABIDE dataset have reported a classification accuracy of 63.89% for gray matter (GM) tissue alone, using tenfold cross-validation with a DBN [18]. The classification accuracy improved to 65% when fMRI data were used along with GM tissue. Finally, by combining features from white matter (WM) tissues, GM, and fMRI, they achieved an accuracy of 65.56% for ASD classification. Our model demonstrates superior classification performance compared to that model [18]; we tested our model on a dataset of 272 samples, whereas their model was evaluated on 185 data samples. In their work, the fMRI-based model showed lower performance than the model using sMRI GM tissue alone, possibly due to the low temporal resolution of the hemodynamic response as well as susceptibility artifacts from signal dropout [37]. Therefore, our GM-VGG-Net model has lower computational complexity while maintaining greater accuracy, as it relies solely on GM tissues.
Another study of male participants from the ABIDE dataset, which incorporated 1383 subjects, exhibited higher performance with an fMRI model than with sMRI: accuracy reached 75% with sMRI alone, 83% with fMRI, and 85% with the fused data. The previously reported model [18] achieved lower accuracy, possibly due to the spatial resolution of their method or the fusion approach they employed, which used early fusion to combine sMRI and fMRI before classification. In contrast, late fusion approaches integrate features based on classification performance during label testing. However, their feature extraction models required more manual involvement, leading to a semi-automated approach. Our method, on the other hand, does not rely on feature selection from the images; instead, our model is trained to identify ASD patterns directly from the whole GM maps.
The architecture of our deep learning network is based on the VGG network, which was developed by Karen Simonyan and Andrew Zisserman for the ImageNet Challenge in 2014 [24]. The conventional VGG addressed the challenges of training deep neural networks for large scale image recognition, reaching higher accuracy. Furthermore, VGG16 has demonstrated 97.66% accuracy on cytological images for papillary thyroid carcinomas [25]. Similarly, we employed small 3 × 3 convolution filters for feature map generation. Our network consists of five sequential hidden layers, each with 2 × 2 max pooling and a stride of 2. As with the VGG architecture, the width of the convolution filters increases sequentially across all hidden layers, starting with 32 filters in the first hidden layer and progressing to 256 filters in the final hidden layer. Unlike conventional neural networks, which typically use smaller input sizes (e.g., 32 × 32 pixels), the VGG network is designed to handle larger input sizes effectively [24,25]. Larger input sizes preserve more substantial brain regions, generating more active feature maps.
However, unlike the original VGG network, we incorporated batch normalization across all five hidden layers, which improved training accuracy. Each hidden layer in our network uses a rectified linear unit (ReLU) activation function, and we applied a uniform 25% dropout rate to prevent overfitting. This dropout rate was fixed by trial and error, as the network learned poorly without it and started to memorize the training data. Our deep learning architecture showed a small difference between training and validation error over 50 epochs, as shown in Figure 3. These results indicate that our network overcame overfitting and learned effectively.
In contrast to the conventional VGG network, which includes three fully connected layers and a dropout layer with a 0.5 rate [24], our network features a single fully connected (FC) unit and an output layer (OL). In our work, the FC unit was designed with a flatten layer, a dense layer (256 filters), a batch normalization layer, an activation layer, and a 50% dropout layer. Moreover, our output layer uses a sigmoid activation, while the conventional VGG network utilizes softmax activation. Our proposed gray matter-based deep learning network achieves higher validation accuracy and lower loss than various existing ASD identification models.
However, there are some limitations to our study. Our deep learning network was trained solely on the ABIDE dataset, and future work should incorporate additional datasets to further validate our approach. Furthermore, our model was tested on 272 MRI images, and a larger dataset is needed to enhance the model’s generalization. Although our model does not require feature extraction during training and validation, our preprocessing, which involved segmenting the GM tissues, was performed semi-automatically using the SPM12 toolbox. In the future, a fully automated approach for GM tissue segmentation should be integrated into the deep learning pipeline alongside the classification model. Moreover, we acknowledge that automated hyperparameter optimization was not applied in this pilot work. While we used standard default settings and fine-tuned the network structure based on observed performance, future work will include efficient optimization techniques to improve efficiency. Additionally, this exploratory pilot work focused on evaluating the proposed GM-VGG-Net using training and validation accuracy and error measures obtained via TensorBoard with Keras. Although these metrics offer a useful baseline, future work will incorporate broader evaluation measures such as the F1-score, AUC-ROC, and confusion matrices. We also plan to benchmark the model against recent state-of-the-art architectures on larger datasets to provide a more rigorous comparative analysis. Despite these challenges, our model achieved strong performance, minimizing classification loss over 50 epochs.
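The additional metrics mentioned above could be computed with scikit-learn, as in this sketch, which uses synthetic labels and scores rather than the model's actual outputs:

```python
# Sketch of F1, AUC-ROC, and confusion-matrix evaluation on synthetic outputs.
import numpy as np
from sklearn.metrics import f1_score, roc_auc_score, confusion_matrix

y_true = np.array([0, 0, 1, 1, 1, 0, 1, 0])                     # ground-truth labels
y_prob = np.array([0.1, 0.4, 0.8, 0.7, 0.3, 0.2, 0.9, 0.6])     # sigmoid scores
y_pred = (y_prob >= 0.5).astype(int)                            # thresholded labels

f1 = f1_score(y_true, y_pred)
auc = roc_auc_score(y_true, y_prob)      # uses scores, not thresholded labels
cm = confusion_matrix(y_true, y_pred)    # rows: true class, columns: predicted
print(f"F1 = {f1:.3f}, AUC = {auc:.3f}")  # F1 = 0.750, AUC = 0.875
print(cm)
```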

5. Conclusions

The developed deep learning network, a modified VGG architecture named the gray matter network (GM-VGG-Net), demonstrates an effective method for classifying ASD using sMRI brain scans based exclusively on gray matter (GM) tissues. Our modified GM-VGG-Net showed a training accuracy of 97% and a validation accuracy of 96% over 50 epochs. This methodology is significant because it is based on sMRI GM maps alone, which streamlines the training process, reduces computational complexity, and outperforms previous models that required multimodal or whole-brain data.

Author Contributions

E.D. was responsible for conceptualization, investigation, validation, and writing—original draft preparation; A.G. contributed to validation and writing—review and editing; S.S. contributed to validation and writing—review and editing; D.A.U. was responsible for conceptualization, investigation, validation, and writing—review and editing; and B.B. was responsible for conceptualization, investigation, validation, and writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

The research presented in this manuscript received no external funding. The data used were obtained from the publicly available ABIDE database, which may have been supported by separate funding sources not related to this study.

Institutional Review Board Statement

Not applicable for this study, as it involved the use of publicly available, de-identified data from the ABIDE database.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data for this study were obtained from the ABIDE database, which operates under defined accessibility protocols as outlined by the source.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Hodges, H.; Fealko, C.; Soares, N. Autism spectrum disorder: Definition, epidemiology, causes, and clinical evaluation. Transl. Pediatr. 2020, 9 (Suppl. S1), S55–S65. [Google Scholar] [CrossRef] [PubMed]
  2. Lord, C.; Brugha, T.S.; Charman, T.; Cusack, J.; Dumas, G.; Frazier, T.; Jones, E.J.H.; Jones, R.M.; Pickles, A.; State, M.W.; et al. Autism spectrum disorder. Nat. Rev. Dis. Prim. 2020, 6, 5. [Google Scholar] [CrossRef] [PubMed]
  3. Ding, Y.; Zhang, H.; Qiu, T. Deep learning approach to predict autism spectrum disorder: A systematic review and meta-analysis. BMC Psychiatry 2024, 24, 739. [Google Scholar] [CrossRef] [PubMed]
  4. Uddin, M.Z.; Shahriar, M.A.; Mahamood, M.N.; Alnajjar, F.; Pramanik, M.I.; Ahad, M.A.R. Deep learning with image-based autism spectrum disorder analysis: A systematic review. Eng. Appl. Artif. Intell. 2024, 127, 107185. [Google Scholar] [CrossRef]
  5. Rafiee, F.; Rezvani Habibabadi, R.; Motaghi, M.; Yousem, D.M.; Yousem, I.J. Brain MRI in Autism Spectrum Disorder: Narrative Review and Recent Advances. J. Magn. Reson. Imaging 2022, 55, 1613–1624. [Google Scholar] [CrossRef]
  6. Wang, M.; Xu, D.; Zhang, L.; Jiang, H. Application of Multimodal MRI in the Early Diagnosis of Autism Spectrum Disorders: A Review. Diagnostics 2023, 13, 3027. [Google Scholar] [CrossRef]
  7. Liu, J.; Yao, L.; Zhang, W.; Xiao, Y.; Liu, L.; Gao, X.; Shah, C.; Li, S.; Tao, B.; Gong, Q.; et al. Gray matter abnormalities in pediatric autism spectrum disorder: A meta-analysis with signed differential mapping. Eur. Child. Adolesc. Psychiatry 2017, 26, 933–945. [Google Scholar] [CrossRef]
  8. Lim, L.; Chantiluke, K.; Cubillo, A.I.; Smith, A.B.; Simmons, A.; Mehta, M.A.; Rubia, K. Disorder-specific grey matter deficits in attention deficit hyperactivity disorder relative to autism spectrum disorder. Psychol. Med. 2015, 45, 965–976. [Google Scholar] [CrossRef]
  9. Lange, N.; Travers, B.G.; Bigler, E.D.; Prigge, M.B.; Froehlich, A.L.; Nielsen, J.A.; Cariello, A.N.; Zielinski, B.A.; Anderson, J.S.; Fletcher, P.T.; et al. Longitudinal volumetric brain changes in autism spectrum disorder ages 6–35 years. Autism Res. 2015, 8, 82–93. [Google Scholar] [CrossRef]
  10. Zhou, D.; Hua, T.; Tang, H.; Yang, R.; Huang, L.; Gong, Y.; Zhang, L.; Tang, G. Gender and age related brain structural and functional alterations in children with autism spectrum disorder. Cereb. Cortex 2024, 34, bhae283. [Google Scholar] [CrossRef]
  11. Gibbard, C.R.; Ren, J.; Seunarine, K.K.; Clayden, J.D.; Skuse, D.H.; Clark, C.A. White matter microstructure correlates with autism trait severity in a combined clinical–control sample of high-functioning adults. NeuroImage Clin. 2013, 3, 106–114. [Google Scholar] [CrossRef] [PubMed]
  12. Ohta, H.; Aoki, Y.Y.; Itahashi, T.; Kanai, C.; Fujino, J.; Nakamura, M.; Kato, N.; Hashimoto, R.-I. White matter alterations in autism spectrum disorder and attention-deficit/hyperactivity disorder in relation to sensory profile. Mol. Autism 2020, 11, 77. [Google Scholar] [CrossRef] [PubMed]
  13. Dimond, D.; Schuetze, M.; Smith, R.E.; Dhollander, T.; Cho, I.; Vinette, S.; Ten Eycke, K.; Lebel, C.; McCrimmon, A.; Dewey, D.; et al. Reduced White Matter Fiber Density in Autism Spectrum Disorder. Cereb. Cortex 2019, 29, 1778–1788. [Google Scholar] [CrossRef] [PubMed]
  14. Zhang, M.; Hu, X.; Jiao, J.; Yuan, D.; Li, S.; Luo, T.; Wang, M.; Situ, M.; Sun, X.; Huang, Y. Brain white matter microstructure abnormalities in children with optimal outcome from autism: A four-year follow-up study. Sci. Rep. 2022, 12, 20151. [Google Scholar] [CrossRef]
  15. ElNakieb, Y.; Ali, M.T.; Elnakib, A.; Shalaby, A.; Soliman, A.; Mahmoud, A.; Ghazal, M.; Barnes, G.N.; El-Baz, A. The Role of Diffusion Tensor MR Imaging (DTI) of the Brain in Diagnosing Autism Spectrum Disorder: Promising Results. Sensors 2021, 21, 8171. [Google Scholar] [CrossRef]
  16. Tang, M.; Kumar, P.; Chen, H.; Shrivastava, A. Deep Multimodal Learning for the Diagnosis of Autism Spectrum Disorder. J. Imaging 2020, 6, 47. [Google Scholar] [CrossRef]
  17. Heinsfeld, A.S.; Franco, A.R.; Craddock, R.C.; Buchweitz, A.; Meneguzzi, F. Identification of autism spectrum disorder using deep learning and the ABIDE dataset. NeuroImage Clin. 2018, 17, 16–23. [Google Scholar] [CrossRef]
  18. Akhavan Aghdam, M.; Sharifi, A.; Pedram, M.M. Combination of rs-fMRI and sMRI Data to Discriminate Autism Spectrum Disorders in Young Children Using Deep Belief Network. J. Digit. Imaging 2018, 31, 895–903. [Google Scholar] [CrossRef]
  19. Nielsen, J.A.; Zielinski, B.A.; Fletcher, P.T.; Alexander, A.L.; Lange, N.; Bigler, E.D.; Lainhart, J.E.; Anderson, J.S. Multisite functional connectivity MRI classification of autism: ABIDE results. Front. Hum. Neurosci. 2013, 7, 599. [Google Scholar] [CrossRef]
Figure 1. Proposed deep learning VGG network for ASD and HC classification.
Figure 2. Training and validation accuracy of the proposed deep learning network for ASD identification.
Figure 3. Training and validation loss of the proposed deep learning network for ASD identification.
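Figures 2 and 3 track per-epoch classification accuracy and binary cross-entropy loss. As a minimal pure-Python sketch (the label/probability values below are hypothetical, not taken from the study), these two metrics are computed for a batch of sigmoid outputs as follows:

```python
import math

def binary_cross_entropy(y_true, y_prob, eps=1e-7):
    """Mean binary cross-entropy loss over a batch."""
    total = 0.0
    for t, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
        total += -(t * math.log(p) + (1 - t) * math.log(1 - p))
    return total / len(y_true)

def accuracy(y_true, y_prob, threshold=0.5):
    """Fraction of predictions on the correct side of the decision threshold."""
    hits = sum(1 for t, p in zip(y_true, y_prob) if (p >= threshold) == bool(t))
    return hits / len(y_true)

# Hypothetical sigmoid outputs for four scans (1 = ASD, 0 = CN)
labels = [1, 0, 1, 0]
probs = [0.9, 0.2, 0.6, 0.4]
print(accuracy(labels, probs))                          # 1.0 (all on the correct side)
print(round(binary_cross_entropy(labels, probs), 3))    # 0.338
```

Note that accuracy can be perfect while the loss is still nonzero, which is why the two curves in Figures 2 and 3 carry complementary information about convergence.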
Table 1. Demographic data.
Variable | CN | ASD
N | 140 | 132
Age (mean ± SD) | 14.62 ± 4.34 | 14.89 ± 4.29 (p = 0.23)
Age (male) (mean ± SD) | 14.97 ± 4.14 | 15.75 ± 3.77
N (male) | 68 | 67
Age (female) (mean ± SD) | 13.57 ± 4.56 | 14.02 ± 4.60
N (female) | 72 | 65
Abbreviations: CN (control group), ASD (Autism Spectrum Disorder), SD (standard deviation), and N (number of subjects). The p-value compares mean age between the two groups and is considered significant below a threshold of 0.05. The total number of subjects is 272.
Table 2. Model summary of proposed deep learning network.
Parameter | Value
Optimizer | Adam
Learning Rate | 0.001
Epochs | 50
Trainable Parameters | 5,174,721
Non-Trainable Parameters | 1,984
Total Parameters | 5,176,705
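The totals in Table 2 follow from standard per-layer parameter formulas for VGG-style networks. As a hedged illustration (the exact GM-VGG-Net layer sizes are not listed here, so the channel counts below are hypothetical), the count for a convolution and a batch-normalization layer can be sketched as:

```python
def conv2d_params(kernel, in_ch, out_ch, bias=True):
    """Trainable parameters in a 2D convolution: (k*k*in + bias) * out."""
    return (kernel * kernel * in_ch + int(bias)) * out_ch

def batchnorm_params(channels):
    """Returns (trainable, non-trainable) counts for batch normalization:
    gamma/beta are learned; the moving mean/variance statistics are not."""
    return 2 * channels, 2 * channels

# First conv layer of a standard VGG-16 (3x3 kernels, 3 input channels, 64 filters)
print(conv2d_params(3, 3, 64))   # 1792
print(batchnorm_params(64))      # (128, 128)
```

The 1,984 non-trainable parameters reported in Table 2 are consistent with batch normalization's running statistics (two per normalized channel), which VGG variants commonly insert after each convolution, though the original architecture details would be needed to confirm this.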
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Daniel, E.; Gulati, A.; Saxena, S.; Urgun, D.A.; Bista, B. GM-VGG-Net: A Gray Matter-Based Deep Learning Network for Autism Classification. Diagnostics 2025, 15, 1425. https://doi.org/10.3390/diagnostics15111425
