Lean-NET-Based Local Brain Connectome Analysis for Autism Spectrum Disorder Classification

Chelef, Aoumria; Yuksel Dal, Demet; Ozturk, Mahmut; Yousif, Mosab A. A.; Koc, Gokce

doi:10.3390/bioengineering13010099


by Aoumria Chelef ¹, Demet Yuksel Dal ², Mahmut Ozturk ³, Mosab A. A. Yousif ¹ and Gokce Koc ⁴,*

¹ Department of Biomedical Engineering, Institute of Graduate Studies, Istanbul University-Cerrahpasa, Istanbul 34320, Türkiye

² Department of Electrical and Electronics Engineering, Engineering Faculty, Fatih Sultan Vakıf University, Istanbul 34015, Türkiye

³ Department of Electrical and Electronics Engineering, Engineering Faculty, Istanbul University-Cerrahpasa, Istanbul 34320, Türkiye

⁴ Department of Biomedical Engineering, Engineering and Architecture Faculty, Istanbul Yeni Yuzyil University, Istanbul 34010, Türkiye

* Author to whom correspondence should be addressed.

Bioengineering 2026, 13(1), 99; https://doi.org/10.3390/bioengineering13010099

Submission received: 1 December 2025 / Revised: 27 December 2025 / Accepted: 30 December 2025 / Published: 15 January 2026

(This article belongs to the Special Issue Neuroimaging Techniques and Applications in Neuroscience)


Abstract

Autism spectrum disorder (ASD) is a neurodevelopmental condition characterized by impairments in social interaction and communication, along with atypical behavioral patterns. Affected individuals often seem isolated in their inner world and exhibit particular sensory reactions. The World Health Organization has indicated a persistent increase in the global prevalence of autism, with approximately 1 in 127 persons affected worldwide. This study contributes to the growing research effort by presenting a comprehensive analysis of functional connectivity patterns for ASD prediction using rs-fMRI datasets. A novel approach was used for ASD identification using the ABIDE II dataset, based on functional networks derived from BOLD signals. The sparse functional brain connectome (Lean-NET) model is employed to construct subject-specific connectomes, from which local graph metrics are extracted to quantify regional network properties. Statistically significant features are selected using Welch’s t-test, then subjected to False Discovery Rate (FDR) correction and classified using a Support Vector Machine (SVM). Our experimental results demonstrate that locally derived graph metrics effectively discriminate ASD from typically developing (TD) subjects and achieve accuracy ranging from 70% up to 91%, highlighting the potential of graph learning approaches for functional connectivity analysis and ASD characterization.

Keywords:

autism spectrum disorder (ASD); rs-fMRI BOLD signal; graph learning; sparse functional brain connectome (Lean-NET); local graph metrics; feature selection; support vector machine (SVM)

1. Introduction

Autism spectrum disorders (ASD) are neurodevelopmental conditions that emerge early in childhood and persist into adulthood, characterized by social-communication difficulties and behavioral abnormalities [1]. Beyond behavioral definitions, neuroimaging studies suggest that ASD also involves atypical functional connectivity between brain regions; this view—often described as a functional disconnection syndrome—highlights disrupted communication across large-scale neural networks [2]. Accordingly, network-based resting-state fMRI (rs-fMRI) has become an important tool for studying ASD. Rs-fMRI is a non-invasive technique that measures blood-oxygen-level-dependent (BOLD) signals to characterize intrinsic brain activity [3]. In this context, the Autism Brain Imaging Data Exchange II (ABIDE II) provides a large multisite rs-fMRI resource for investigating connectivity alterations in ASD [4,5]. Although both single-site and multisite ABIDE data have been used, single-site cohorts are often preferred to reduce variability due to scanner and acquisition differences.

Several studies have leveraged ABIDE rs-fMRI for ASD–TD classification using diverse methodological pipelines. On single-site data (NYU), Thomas et al. [6] applied a 3D convolutional neural network (3D-CNN) to temporal BOLD representations, while Chu et al. [7] proposed a multi-scale graph representation learning framework to model hierarchical functional connectivity (FC). Other works derived time–frequency scalogram features via continuous wavelet transform (CWT) and evaluated multiple machine learning (ML) classifiers, reporting high accuracy in some settings [8].

Extending to multisite cohorts, a range of FC-based machine learning approaches has been reported, such as SVM/kSVM, logistic regression, and random forest, in addition to hybrid deep learning pipelines combining an FC matrix with CNN architectures [9,10,11,12,13]. Notably, performance has been shown to depend strongly on preprocessing and feature construction, and recent comparisons suggest that classical models—particularly (kernel) SVM—often remain competitive with or outperform several deep learning alternatives such as TabNet, XGBoost, and MLP on FC-derived features (AUC ≈ 0.75–0.77) [14,15].

To address the limitations of traditional FC and ML approaches, D.Y. Dal employed graph-theoretical metrics derived from multimodal (structural and functional) brain connectomes with the aim of identifying network signatures, achieving notable results [16]. In a recent work, Yang et al. proposed a multi-view low-rank subspace graph structure learning method that integrates multi-atlas and multi-center rs-fMRI data to reduce heterogeneity and extract consistent FC features, achieving an accuracy of 83.2% on the ABIDE dataset [17].

Despite these advances, two methodological gaps remain salient. First, many pipelines still derive functional connectivity (FC) from dense pairwise Pearson/partial correlations between regional BOLD time series. While widely used, correlation-based FC can be sensitive to noise, preprocessing decisions, and indirect dependencies among regions, potentially inflating spurious edges and reducing subject-level reliability—an issue that becomes more pronounced when individual-level interpretation is required. Second, classification-driven studies often prioritize predictive performance without a rigorous, localized characterization of which regions drive group differences, including appropriate control for multiple comparisons when identifying discriminative regions. Consequently, stable and interpretable node-level signatures that generalize beyond global summary metrics remain comparatively underdeveloped in the ASD rs-fMRI literature.

Accordingly, there remains a need for rigorous, data-driven graph construction methods that can infer subject-specific functional brain networks while mitigating noise, reducing spurious connections, and preserving neurobiological interpretability, especially in the context of heterogeneous multisite datasets. Addressing this need is essential for identifying stable and localized connectivity alterations associated with ASD, beyond global network summaries or purely performance-driven classification models. To address these limitations, recent research has turned toward graph-learning frameworks that infer subject-specific network topology directly from the observed signals while enforcing sparsity and smoothness constraints. In this perspective, the network is learned under a smooth graph-signal assumption, which provides a principled mechanism to control sparsity and suppress spurious edges, consistent with Laplacian learning in smooth graph signal representations [18]. This approach differs conceptually from sparse connectivity estimation based on inverse covariance (precision) matrices, which relies on optimization schemes tailored to sparse inverse covariance estimation [19]. Graph-based modeling has also been increasingly adopted in neuroimaging to quantify structure–function discrepancies across cognitive decline [20].

Importantly, focusing on local (node-wise) graph properties rather than only global network summaries enables the detection of region-specific connectivity disruptions that may relate to the heterogeneous clinical manifestations of ASD. Building on this direction, the present study employs the Lean-NET framework to estimate subject-specific sparse functional networks from ABIDE data, thereby avoiding dense correlation matrices and ad hoc thresholding [21]. Local graph metrics (e.g., node degree, local efficiency, assortativity) are then computed at each node to capture localized alterations in network organization. To obtain statistically robust and interpretable region-level signatures, node-wise group differences are evaluated using Welch’s t-tests with false discovery rate (FDR) correction, explicitly addressing the multiple-comparison problem that can inflate false positives in high-dimensional connectomic analyses. The resulting top-ranked node signatures are subsequently evaluated using a linear SVM with leave-one-out cross-validation, linking classification performance to localized, statistically controlled network features. The central hypothesis is that Lean-NET–derived subject-specific networks reveal robust node-level alterations in ASD and that these localized signatures provide discriminative power for ASD–TD classification.

The main contributions of our work can be summarized as follows:

Construction of subject-specific functional brain networks using a graph learning-based approach (Lean-NET).
Analysis of localized graph-theoretical features to characterize region-specific connectivity alterations.
Statistical feature selection using Welch’s t-test with FDR correction.
Robust classification using a linear SVM with leave-one-out cross-validation.

Through this methodology, shown in Figure 1, we aim to explore and visualize FC patterns in the brain, providing insights into neural dynamics and the differences between the ASD and TD groups.

2. Methodology

2.1. Datasets Description

In this research, rs-fMRI data were obtained from the Autism Brain Imaging Data Exchange II [4], which was assembled from multiple independent neuroimaging sites, each approved by its respective local institutional review board. The dataset comprises rs-fMRI scans from 1112 participants, including 539 individuals diagnosed with ASD and 573 TD participants. For the present analyses, only rs-fMRI data from the NYU site of the ABIDE II cohort were included. To ensure sufficient temporal sampling and improve the stability of subsequent functional connectivity and graph-based estimates, a minimum scan-length criterion was applied after preprocessing and quality control. Specifically, participants with recordings containing fewer than 140 rs-fMRI time points (T < 140) were excluded (5 ASD and 7 TD) because such short time series were considered insufficient for stable subject-level network construction [7]. After applying this criterion, the final analytical sample comprised 74 ASD and 98 TD participants, all satisfying T ≥ 140. Multi-site rs-fMRI acquisitions frequently differ in scanner hardware, acquisition protocols, and image quality, thereby introducing site-specific variance that is unrelated to the actual neural differences between the groups. Restricting the sample to a single site (NYU) provides a relatively large and methodologically homogeneous cohort, which is advantageous for developing and evaluating the proposed approach while minimizing site-related variability. The diagnostic status and demographic details of the participants were obtained from the ABIDE II database and are summarized in Table 1.

2.2. Data Preprocessing

Data were obtained from the New York University (NYU) site of the Autism Brain Imaging Data Exchange II (ABIDE II) dataset [4]. Resting-state fMRI was acquired using a gradient-echo echo-planar imaging sequence (TR = 2000 ms, TE = 15 ms, flip angle = 90°, voxel size = 3 × 3 × 4 mm³), with approximately 180 volumes per scan (≈6 min), whole-brain axial coverage, 4 mm slice thickness, and no interslice gap [4]. Preprocessing was performed using the Configurable Pipeline for the Analysis of Connectomes (C-PAC) and included standard steps such as slice-timing correction, motion realignment, spatial normalization to a standard template space, and nuisance signal regression [22]. Following preprocessing, regional BOLD time series were extracted using the Craddock 200 (CC200) functional atlas (200 ROIs) by computing the mean BOLD signal within each ROI, yielding region-wise time series for subsequent functional connectivity estimation and graph-based analyses [3]. Region-wise extraction and data handling were implemented using Nilearn [23].
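As an illustration of the ROI-averaging step, the sketch below computes one mean time series per atlas label in plain Python. It is a simplified stand-in, not the Nilearn or C-PAC code used in the study: the function name `roi_mean_timeseries`, the flat voxel layout, and the convention that label 0 marks background are all assumptions made for this example.

```python
from collections import defaultdict


def roi_mean_timeseries(voxel_series, voxel_labels):
    """Average voxel time series within each ROI label.

    voxel_series: list of per-voxel time series (each a list of T values).
    voxel_labels: one integer ROI label per voxel (0 = background, assumed).
    Returns a dict {label: mean time series of length T}.
    """
    groups = defaultdict(list)
    for series, label in zip(voxel_series, voxel_labels):
        if label != 0:  # skip voxels outside the atlas
            groups[label].append(series)
    # zip(*members) iterates over time points across all voxels of one ROI
    return {label: [sum(vals) / len(vals) for vals in zip(*members)]
            for label, members in groups.items()}
```

With a 200-ROI atlas such as CC200, the returned dictionary would hold 200 entries, which can then be stacked into the T × N matrix used for graph construction.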

2.3. Graph Construction and Features Extraction on BOLD Signals Using Lean-NET

2.3.1. Connectome (Graph) Construction

The extraction of meaningful features from BOLD signals in fMRI is crucial for understanding brain function and pathology. In recent years, graph-based network construction approaches and their extensions have emerged as powerful tools for analyzing non-stationary and nonlinear signals such as BOLD signals. In this study, FC was not estimated using conventional pairwise correlations but via the Lean-NET framework, which yields a sparse FC representation designed to overcome the limitations of fully connected networks for graph-theoretical analysis. In Lean-NET, sparsity is imposed through the solution of an optimization problem rather than by thresholding, as in standard fNETs. Thresholding applies the same threshold to all subjects, whereas Lean-NET performs subject-specific sparsification.

For each subject, let $x \in \mathbb{R}^{T \times N}$ denote the preprocessed resting-state BOLD time series. A pairwise squared-distance matrix $Z \in \mathbb{R}^{N \times N}$ is first computed as:

$Z_{i j} = \| x_{i} - x_{j} \|_{2}^{2}$

(1)

where $x_{i} \in \mathbb{R}^{T}$ is the time series of ROI $i$. The functional connectivity graph $W \in \mathbb{R}^{N \times N}$ is then learned using the Lean-NET implementation of the Kalofolias graph-learning model [24], which solves:

$\min_{W \geq 0} \langle W, Z \rangle - α \sum_{i} \log d_{i} + β \| W \|_{F}^{2}$

(2)

where $d_{i} = \sum_{j} W_{i j}$ denotes the degree of node $i$, and $α = β = 1$. The resulting matrix is rescaled by a factor $δ = 2$ ($W \leftarrow δ W$) and symmetrized. To retain only robust positive connections, negative entries and edges with very small weights ($W_{i j} < 0.03$) are set to zero.
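The steps surrounding the optimization in Equation (2) can be sketched in plain Python as follows. This is a minimal illustration, not the Lean-NET implementation: the solver itself is omitted, the function names are hypothetical, and symmetrizing by averaging W with its transpose is an assumption, since the text does not state which rule was used.

```python
import math


def pairwise_sq_dist(x):
    """Equation (1): Z[i][j] = ||x_i - x_j||_2^2 for a T x N list of rows."""
    n = len(x[0])
    cols = [[row[i] for row in x] for i in range(n)]  # ROI time series x_i
    return [[sum((a - b) ** 2 for a, b in zip(cols[i], cols[j]))
             for j in range(n)] for i in range(n)]


def objective(w, z, alpha=1.0, beta=1.0):
    """Value of the Equation (2) objective for a candidate non-negative W."""
    n = len(w)
    fit = sum(w[i][j] * z[i][j] for i in range(n) for j in range(n))
    log_deg = sum(math.log(sum(w[i])) for i in range(n))  # needs d_i > 0
    frob = sum(w[i][j] ** 2 for i in range(n) for j in range(n))
    return fit - alpha * log_deg + beta * frob


def postprocess(w, delta=2.0, threshold=0.03):
    """Rescale by delta, symmetrize (assumed: mean of W and W^T), then prune."""
    n = len(w)
    out = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            v = delta * 0.5 * (w[i][j] + w[j][i])
            # negative entries and weights below the threshold are zeroed
            out[i][j] = v if v >= threshold else 0.0
    return out
```

The log-degree term is what keeps every node connected: it diverges as any $d_{i}$ approaches zero, so a minimizer cannot isolate a region, while the Frobenius term spreads weight across edges.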

The hyperparameters of the Lean-NET framework were selected based on prior literature rather than optimized for the present dataset. In particular, the regularization parameters were set to α = β = 1, following established practice in graph-learning formulations to balance data fidelity and sparsity constraints, as originally proposed by Kalofolias [24] and adopted in subsequent neuroimaging studies [18]. This setting provides a stable compromise between enforcing network sparsity and preserving meaningful functional relationships.

In addition, a small-weight pruning threshold ($W_{i j} < 0.03$) was applied to remove residual weak connections and improve numerical stability, consistent with previous Lean-NET–based studies [18]. As these hyperparameters were fixed based on established literature, a preliminary sensitivity analysis was conducted to assess the robustness of this choice. The results indicated that the resulting network topology and the identification of discriminative nodes were not overly sensitive to these parameter settings. Specifically, the threshold primarily eliminated near-zero edges without affecting the dominant functional structure, confirming that the resulting sparse graphs were driven mainly by the Lean-NET optimization rather than by post hoc thresholding. This procedure yields subject-specific adjacency matrices directly inferred from the ROI time series via the Lean-NET approach, which enforces smoothness and sparsity constraints to capture the underlying functional connectivity structure.

2.3.2. Node Level Graph Metrics

After constructing the subject-specific functional networks using the Lean-NET model, we characterized each connectome through a set of node-level graph metrics to capture the local topological properties of brain regions. These measures quantify how each node contributes to information integration and segregation within the overall functional network.

For each subject, the following local graph measures were computed from the adjacency matrix $W = [w_{i j}]$, where $w_{i j}$ represents the connection weight between nodes i and j:

1. Local (node) assortativity ( $r_{i}$ ): evaluates the tendency of nodes to connect with other nodes that have similar degrees, indicating the network’s local structural organization. The local assortativity of node i is defined as [25]:

$r_{i} = \frac{j (j + 1) (\bar{k} - μ_{q})}{2 M σ_{q}^{2}}$

where j is the node’s remaining degree, $\bar{k}$ is the average remaining degree of its neighbors, M is the number of edges in the network, $μ_{q}$ is the global average excess (remaining) degree, and $σ_{q} \neq 0$ is the standard deviation of the remaining-degree distribution.

2. Betweenness centrality ( $B_{i}$ ): measures a node’s importance within the network, defined as the fraction of all shortest paths between any two other nodes in the network that pass through node i. Shortest path lengths were calculated using the inverse of the edge weights in the weighted adjacency matrix W as edge lengths, and $B_{i}$ was computed using an algorithm tailored for weighted graphs, as shown in Equation (3) [26].

$B_{i} = \sum_{j \neq k \neq i} \frac{σ_{j k} (i)}{σ_{j k}}$

(3)

where $σ_{j k}$ is the total number of shortest paths between node j and node k, and $σ_{j k} (i)$ is the number of those paths that pass through i.

3. Clustering coefficient ( $C_{i}$ ): measures how tightly connected a node’s neighbors are to each other [26]. A commonly used definition of the local clustering coefficient is given in Equation (4):

$C_{i} = \frac{2 E_{i}}{K_{i} (K_{i} - 1)}$

(4)

where $E_{i}$ is the number of edges between the neighbors of node i, and $K_{i}$ is the degree of node i.

4. Local distance: the average shortest-path distance from node i to all other nodes in the network [16,26]. It is computed from shortest paths over the length matrix $D^{g}$, where the length of an edge $D_{i j}^{g}$ is defined as the inverse of the weighted connection strength:

$D_{i j}^{g} = \frac{1}{W_{i j}}$

(5)

5. Node degree ( $K_{i}$ ): the most fundamental measure of nodal connectivity, defined as the number of connections incident to node $i$ and reflecting its level of topological integration within the network [26,27]. It is calculated using the binarized version of the adjacency matrix, $A_{i j}$ , where $A_{i j} = 1$ if an edge between nodes $i$ and $j$ is present and $A_{i j} = 0$ otherwise. For a graph G = (V, E) with N nodes, the degree $K_{i}$ of node i is given by

$K_{i} = \sum_{j \in V} A_{i j}$

(6)

6. Local efficiency ( $E_{loc, i}$ ): quantifies the efficiency of information exchange among the immediate neighbors of node i, representing the local fault tolerance of the network [16,26]. It is defined as the average inverse shortest path length between all pairs of nodes in the subgraph formed by the neighbors of node i:

$E_{loc, i} = \frac{1}{K_{i} (K_{i} - 1)} \sum_{j \in G_{i}} \sum_{h \in G_{i}, h \neq j} \frac{1}{d_{j h}}$

(7)

where $G_{i}$ is the set of neighbors of node i, $d_{j h}$ is the shortest path length between nodes j and h within the subgraph $G_{i}$ , and $K_{i}$ is given in Equation (6). Local efficiency is therefore a key measure of functional segregation or modularity.

These metrics were calculated using the Brain Connectivity Toolbox (BCT) [26] for all brain regions to create feature vectors that describe each subject’s local connectivity pattern. Focusing on local rather than global properties helps highlight region-specific differences in how brain areas communicate, which is especially relevant since many studies report that ASD is related to localized hypo- or hyperconnectivity rather than broad, network-wide disruptions [28,29]. By capturing these nodal irregularities, the metrics provide a clearer picture of the connectivity patterns that may contribute to social and cognitive difficulties in ASD, offering insights that global metrics, which only summarize overall network structure, cannot fully reveal [16,30].
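To make three of the definitions above concrete, the following plain-Python sketch computes node degree (Equation (6)), the binary clustering coefficient (Equation (4)), and local distance using edge lengths from Equation (5). It is an illustrative reimplementation, not the BCT code used in the study: the function names are hypothetical, the clustering coefficient is computed on the binarized graph (BCT's weighted variants may give different values), and assigning 0 to nodes with fewer than two neighbors is an assumed convention.

```python
import math


def node_degree(w):
    """Equation (6): number of non-zero edges incident to each node."""
    n = len(w)
    return [sum(1 for j in range(n) if j != i and w[i][j] > 0)
            for i in range(n)]


def clustering_coefficient(w):
    """Equation (4) on the binarized graph: C_i = 2 E_i / (K_i (K_i - 1))."""
    n = len(w)
    coeffs = []
    for i in range(n):
        nbrs = [j for j in range(n) if j != i and w[i][j] > 0]
        k = len(nbrs)
        if k < 2:
            coeffs.append(0.0)  # assumed convention for K_i < 2
            continue
        # E_i: edges that connect two neighbors of node i
        e = sum(1 for a in range(k) for b in range(a + 1, k)
                if w[nbrs[a]][nbrs[b]] > 0)
        coeffs.append(2.0 * e / (k * (k - 1)))
    return coeffs


def local_distance(w):
    """Mean shortest-path length from each node, edge length = 1 / W_ij."""
    n = len(w)
    d = [[0.0 if i == j else (1.0 / w[i][j] if w[i][j] > 0 else math.inf)
          for j in range(n)] for i in range(n)]
    for m in range(n):  # Floyd-Warshall all-pairs shortest paths
        for i in range(n):
            for j in range(n):
                if d[i][m] + d[m][j] < d[i][j]:
                    d[i][j] = d[i][m] + d[m][j]
    # unreachable nodes contribute infinity to the mean
    return [sum(d[i][j] for j in range(n) if j != i) / (n - 1)
            for i in range(n)]
```

For a 200-ROI connectome, each function returns a list of 200 values, one per node, which forms that metric's feature vector for the subject.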

2.3.3. Statistical Testing and Leakage-Free Feature Selection

The first stage of the analysis focuses on statistical testing and feature selection to identify brain regions (nodes) that exhibit significant differences in local graph properties between ASD and TD groups. For each subject, node-wise local graph metrics were computed from the subject-specific functional network (e.g., node degree, local efficiency, assortativity, clustering coefficient, betweenness centrality, and local distance), capturing complementary aspects of local network organization.

To assess whether group differences in each metric were statistically significant, Welch’s two-sample t-test was applied independently at each node, yielding one p-value per node. Welch’s test was selected because it is robust to unequal variances and accommodates unequal group sizes, which are common in clinical neuroimaging datasets. To control for multiple comparisons across nodes, the resulting p-values were adjusted using the Benjamini–Hochberg false discovery rate (FDR) procedure at $q = 0.05$ [31,32]. Nodes surviving FDR correction were considered statistically significant and were subsequently ranked in ascending order of their p-values to prioritize regions showing the strongest ASD–TD effects.
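The two statistical steps can be sketched in plain Python as follows. This is an illustrative version under stated assumptions, not the authors' code: the function names are hypothetical, and the conversion of the t statistic to a p-value is left to a statistics library (for example, `scipy.stats.ttest_ind` with `equal_var=False` performs Welch's test), so `benjamini_hochberg` takes precomputed p-values as input.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom."""
    va = variance(a) / len(a)  # sample variance of group a over n_a
    vb = variance(b) / len(b)
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


def benjamini_hochberg(pvals, q=0.05):
    """Return a list of booleans: True where the node survives FDR at level q."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    cutoff = 0
    # step-up rule: find the largest rank r with p_(r) <= r * q / m
    for rank, idx in enumerate(order, start=1):
        if pvals[idx] <= rank * q / m:
            cutoff = rank
    rejected = [False] * m
    for idx in order[:cutoff]:
        rejected[idx] = True
    return rejected
```

Because the procedure is step-up, a p-value that misses its own rank threshold can still be declared significant when a larger p-value further down the sorted list meets its threshold.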

To evaluate the influence of feature dimensionality, incremental feature sets were formed by selecting the top-$k$ ranked significant nodes for each metric ($k$ = 1–5). This procedure generated classifiers trained on progressively larger sets of discriminative node features, starting from the most significant node and extending to the top five nodes, thereby improving interpretability while limiting redundancy and over-parameterization [33].
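A minimal sketch of how the incremental feature sets could be assembled is given below. The function names and the input layout (one p-value and one significance flag per node) are assumptions made for illustration.

```python
def top_k_nodes(pvals, significant, k):
    """Indices of the k significant nodes with the smallest p-values."""
    ranked = sorted((i for i, keep in enumerate(significant) if keep),
                    key=lambda i: pvals[i])
    return ranked[:k]


def incremental_feature_sets(pvals, significant, k_max=5):
    """Map each k in 1..k_max to its top-k node indices."""
    return {k: top_k_nodes(pvals, significant, k)
            for k in range(1, k_max + 1)}
```

If fewer than k nodes survive correction for a metric, the set simply contains all surviving nodes, so the sets for larger k stop growing.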

2.3.4. Classification

To enhance model robustness and mitigate the curse of dimensionality associated with the 200-ROI feature space [34], classification was performed on low-dimensional feature sets defined by statistically prioritized nodes. For each local graph metric, nodes were ranked according to ASD–TD group differences based on Welch’s t-test p-values (with BH-FDR correction as described in Section 2.3.3), and separate models were trained using the top-$k$ nodes. To examine the effect of feature dimensionality, this procedure was repeated for $k$ = 1–5, and the corresponding accuracy trends are reported in Figure 2.

The number of selected nodes $k$ controls the dimensionality of the feature space used by the SVM. Very small $k$ values may not capture sufficient discriminative information, whereas larger $k$ increases model complexity and the risk of overfitting in a modest sample-size setting. As illustrated in Figure 2, performance improved within the low-dimensional range $k$ = 1–5, and $k = 5$ was selected as a fixed operating point for Table 2 to provide a compact yet informative feature set and a consistent comparison across metrics.

ASD versus TD classification was conducted using a support vector machine (SVM) applied to node-wise local graph metrics derived from Lean-NET adjacency matrices. A linear-kernel SVM [35] was implemented in MATLAB R2025b (fitcsvm). Predictor variables were z-score standardized, and class priors were set to uniform to mitigate class imbalance (74 ASD, 98 TD). Model performance was evaluated using leave-one-out cross-validation (LOOCV) [31], in which each subject served once as the test sample while the remaining subjects formed the training set. Classification accuracy was computed across LOOCV folds for each metric.
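The LOOCV protocol with fold-wise z-score standardization can be sketched in plain Python as below. Note the substitution: to keep the example dependency-free, a nearest-centroid rule stands in for the linear SVM, so this does not reproduce the MATLAB `fitcsvm` model or its uniform class priors. All function names are hypothetical. In a fully nested design the top-$k$ node ranking would also be recomputed inside each fold; that step is omitted here.

```python
from statistics import mean, stdev


def zscore_fit(rows):
    """Column means and sample standard deviations of the training rows."""
    cols = list(zip(*rows))
    mu = [mean(c) for c in cols]
    sd = [stdev(c) or 1.0 for c in cols]  # guard against zero variance
    return mu, sd


def zscore_apply(row, mu, sd):
    return [(v - m) / s for v, m, s in zip(row, mu, sd)]


def nearest_centroid_predict(train_x, train_y, test_row):
    """Stand-in classifier (NOT an SVM): choose the class with the closest mean."""
    best_label, best_dist = None, float("inf")
    for label in sorted(set(train_y)):
        members = [r for r, y in zip(train_x, train_y) if y == label]
        centroid = [mean(c) for c in zip(*members)]
        dist = sum((a - b) ** 2 for a, b in zip(test_row, centroid))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def loocv_accuracy(x, y, predict=nearest_centroid_predict):
    """Leave-one-out accuracy; each subject is the test sample exactly once."""
    hits = 0
    for i in range(len(x)):
        train_x = x[:i] + x[i + 1:]
        train_y = y[:i] + y[i + 1:]
        mu, sd = zscore_fit(train_x)  # statistics from the training fold only
        train_z = [zscore_apply(r, mu, sd) for r in train_x]
        test_z = zscore_apply(x[i], mu, sd)
        hits += predict(train_z, train_y, test_z) == y[i]
    return hits / len(x)
```

Fitting the standardization on the training fold alone keeps the held-out subject from influencing the scaling; any classifier with the same `predict(train_x, train_y, test_row)` signature can be passed in place of the stand-in.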

3. Results and Discussion

Among the local graph measures evaluated in Table 2, node degree yielded the highest overall classification performance, with accuracy, sensitivity, specificity, and F1-score approaching 0.90 under LOOCV. The local efficiency metric demonstrated intermediate yet relatively stable performance, characterized by high precision but lower sensitivity. In contrast, assortativity, betweenness centrality, clustering coefficient, and distance exhibited more limited discriminative capability, with classification accuracies ranging between 0.70 and 0.73.
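For reference, the performance measures named above can be derived from the pooled LOOCV predictions as in the following sketch. The function name is hypothetical, and treating ASD as the positive class is an assumption, since the text does not state which class was coded as positive.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, sensitivity, specificity, precision and F1 from label lists."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)

    def ratio(num, den):
        return num / den if den else 0.0  # avoid division by zero

    return {
        "accuracy": ratio(tp + tn, len(pairs)),
        "sensitivity": ratio(tp, tp + fn),  # recall on the positive class
        "specificity": ratio(tn, tn + fp),
        "precision": ratio(tp, tp + fp),
        "f1": ratio(2 * tp, 2 * tp + fp + fn),
    }
```

With unequal group sizes such as 74 ASD and 98 TD, sensitivity and specificity are worth reporting alongside accuracy, because accuracy alone can mask poor performance on the smaller class.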

As illustrated in Figure 2, the selection of nodes was limited to a range of k = 1 to 5 to avoid the curse of dimensionality, thereby focusing on the most informative and robust regions [36]. Among the local graph measures, node degree consistently yielded the highest classification performance across all values of $k$, followed by local efficiency, whereas metrics such as local assortativity, betweenness centrality, clustering coefficient, and distance exhibited comparatively lower accuracies. This pattern indicates that only a subset of local graph measures is highly sensitive to ASD-related differences and that focusing on the most discriminative nodes provides a compact yet interpretable representation of network alterations.

At the same time, the node-wise graph analysis demonstrated that ASD-related network differences were not uniformly distributed across the connectome but instead manifested a spatially selective pattern. Specifically, only a subset of regions showed statistically meaningful group differences, with local metrics like assortativity identifying approximately 30 discriminative nodes and degree identifying about 50 nodes. This pattern is consistent with previous findings that ASD-related changes are predominantly localized to key hubs within the default mode, salience, and subcortical networks, rather than spread across the whole brain (global alterations).

A more detailed inspection of the full set of statistically significant nodes further revealed that these spatially localized alterations collectively reflect a substantial global shift in functional network topology. Specifically, across the identified hub regions in the ASD group—particularly within frontal and temporal areas—a consistent increase in betweenness centrality and nodal distance was observed. This indicates a pattern of local hyperconnectivity in which these hubs assume a disproportionate role in local information flow while becoming increasingly segregated from distal network modules—a configuration consistent with the hypotheses of enhanced modular segregation and reduced long-range integration in ASD. Although concentrated in specific nodes, these changes are sufficient to modulate global efficiency by disrupting hub-mediated communication pathways. This provides a mechanistic bridge to established global theories of ASD, which emphasize a dual pattern of reduced long-range integration and increased local segregation. Our findings thus support the hypothesis that focal disruptions in hub organization can collectively manifest as widespread network-level dysfunction, a recognized hallmark of ASD neurobiology [16,37,38].

Given that the local degree metric yielded the highest classification performance, subsequent analyses focused on the node-degree distributions of key brain regions, as depicted in Figure 3. Each node is shown with its anatomical label and functional role. The thalamus, a major subcortical relay for sensory, motor, and attentional information, is frequently reported to exhibit atypical thalamo-cortical connectivity in ASD, which has been linked to sensory processing abnormalities and attentional difficulties [39,40]. The hippocampus, which is central to declarative memory, contextual integration, and social cognition, shows disrupted anterior–posterior specialization and altered coupling with cortical and subcortical regions, potentially reflecting impairments in social memory and large-scale network integration [41]. Frontal and orbitofrontal cortices, implicated in executive control, decision-making, and socio-emotional regulation, demonstrate both structural and functional connectivity alterations in ASD, in line with deficits in higher-order cognitive and social processes [42,43]. Temporal cortical regions, particularly posterior and superior divisions, are key for language, auditory processing, and social cognition; atypical connectivity between these areas and subcortical as well as other cortical nodes may contribute to core social-communication difficulties [44,45]. Nodes annotated as “not labeled” in the atlas, likely corresponding to transitional or boundary parcels, may index more diffuse network-level reorganization; however, such findings require cautious interpretation due to potential limitations of the parcellation scheme. Taken together, these observations indicate that ASD-related functional connectivity alterations are concentrated within subcortico-cortical networks supporting sensory processing, memory, executive function, and social cognition, rather than being uniformly distributed across the brain.

These findings indicate that the SVM classifier is sensitive to subtle alterations in brain network organization, and that graph-theoretical metrics indexing information integration and segregation are particularly informative for detecting ASD-related changes. Specifically, betweenness centrality is reduced in the ASD group, suggesting a diminished capacity of certain nodes to serve as critical intermediaries for information transfer across the network. Local efficiency is also lower in ASD, consistent with less effective integration of information within the immediate neighborhood of each node. In addition, reduced clustering coefficient together with increased distance in ASD reflects weaker local grouping of brain regions and less efficient communication pathways.

Several limitations should be considered when interpreting the present findings. First, the comparatively high classification performance relative to some ABIDE-based studies using correlation-derived functional connectivity is likely attributable to methodological choices (namely sparse, subject-specific network construction via Lean-NET and low-dimensional feature modeling) rather than increased model complexity; nevertheless, such gains may not generalize beyond the specific experimental setting. In particular, the exclusive reliance on single-site rs-fMRI data from the NYU cohort of ABIDE II constitutes a major limitation. Although single-site analysis reduces confounds related to scanner hardware, acquisition parameters, and site-specific variability, it substantially limits external validity. Inter-site heterogeneity is a well-recognized challenge in ABIDE datasets and can alter functional connectivity estimates, node-wise graph metrics, and downstream classification performance; consequently, discriminative node patterns and models learned under single-site conditions may not transfer reliably to other sites or clinical scenarios. Second, node-level graph measures are inherently parcellation-dependent, and the regional findings reported here should therefore be interpreted with respect to the CC200 atlas used in this study; alternative atlases or resolutions may yield different node rankings and affect local metrics. Importantly, while the proposed graph-learning and feature-selection pipeline is not intrinsically tied to a single atlas and can be extended to other parcellations, systematic evaluation across multiple atlases and explicit cross-site validation (e.g., training on one site and testing on another, or harmonized multisite protocols) are necessary to establish robustness and generalizability.

4. Conclusions

This study presents an exploratory, proof-of-concept framework for ASD–TD discrimination from rs-fMRI by combining subject-specific sparse functional network estimation (Lean-NET) with node-wise local graph metrics and an interpretable linear SVM. Using the ABIDE II NYU cohort, the proposed pipeline identified localized, statistically prioritized node-level signatures, most prominently degree-based features, which were associated with higher classification performance than the other local metrics examined. These findings support the methodological premise that learning sparse, individualized graphs and focusing on local topology can yield compact and interpretable feature representations for characterizing ASD-related network differences.

Importantly, the current results should not be interpreted as evidence of near-clinical diagnostic applicability. The analyses were restricted to a single site and a single parcellation (CC200), and performance was evaluated under a limited-sample validation setting; therefore, generalizability to other acquisition sites, protocols, and populations remains unestablished. Future work should (i) validate the proposed framework in multi-site settings using appropriate harmonization and cross-site generalization tests, (ii) assess robustness across alternative atlases and resolutions, and (iii) relate discriminative node-level signatures to behavioral and clinical measures to strengthen neurobiological interpretability. Overall, the proposed approach provides a promising methodological basis for localized connectome modeling in ASD, while requiring broader external validation before any claims about diagnostic utility can be made.

Author Contributions

Conceptualization, A.C., D.Y.D. and M.O.; methodology, A.C., D.Y.D. and M.O.; software, A.C., D.Y.D., M.A.A.Y. and M.O.; validation, A.C., D.Y.D. and M.O.; formal analysis, A.C., M.A.A.Y. and G.K.; investigation, A.C., D.Y.D. and M.O.; visualization, A.C. and M.A.A.Y.; writing – original draft preparation, A.C., D.Y.D. and M.O.; writing – review and editing, A.C., D.Y.D., M.A.A.Y., G.K. and M.O.; supervision, D.Y.D. and M.O. All authors have read and agreed to the published version of the manuscript.

Funding

This publication was supported by the Scientific Research Projects Coordination Unit of Istanbul Yeni Yüzyıl University.

Institutional Review Board Statement

This study used publicly available, fully anonymized neuroimaging data from the ABIDE II database. Ethical approval and informed consent were obtained by the original data contributors, and no additional ethical approval was required for the present analysis.

Informed Consent Statement

Informed consent was waived for this study because it involved the analysis of anonymized, publicly available data obtained from an open-access repository. No new data were collected, and no participants can be identified.

Data Availability Statement

Publicly available datasets were analyzed in this study. Resting-state fMRI data were obtained from the Autism Brain Imaging Data Exchange II (ABIDE II) repository, NYU site. The dataset can be accessed upon registration from the ABIDE II website (http://fcon_1000.projects.nitrc.org/indi/abide/abide_II.html) (accessed on 20 June 2025). No new neuroimaging data were collected for this study.

Conflicts of Interest

The authors declare no conflicts of interest.


Figure 1. Proposed methodology for ASD classification using rs-fMRI BOLD signals; r(i) refers to the ID of the node with importance rank i.

Figure 2. SVM classification accuracy using the top-ranked nodes (k = 1, …, 5) from each local graph metric.

Figure 3. Violin plots showing the distribution of node degree values for the top five discriminative ROIs (selected from the degree metric by the smallest Welch’s t-test p-values after BH-FDR correction) in the ABIDE II–NYU cohort. TD participants are shown in yellow and ASD participants in red. For each ROI, the violin depicts the full distribution across subjects; the central marker indicates the mean, and the thick bar indicates the interquartile range. ROI labels are reported on the x-axis (LH/RH denote the left/right hemisphere; Crtx denotes cortex).
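The selection rule named in the caption (ranking nodes by Welch's t-test p-values after BH-FDR correction) can be illustrated with a short sketch of the Benjamini–Hochberg step-up procedure. The per-node p-values below are hypothetical, the function names are illustrative, and the significance level of 0.05 and the choice to keep only nodes that survive correction before ranking are assumptions of this sketch, not settings confirmed by the study.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return BH-adjusted p-values and a reject flag for each hypothesis."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that the adjusted
    # values are monotone in the rank of the raw p-values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    reject = [q <= alpha for q in adjusted]
    return adjusted, reject


def top_k_nodes(p_values, k=5, alpha=0.05):
    """Indices of up to k nodes that survive BH-FDR, smallest p-value first."""
    _, reject = benjamini_hochberg(p_values, alpha)
    significant = [i for i in range(len(p_values)) if reject[i]]
    return sorted(significant, key=lambda i: p_values[i])[:k]


# Hypothetical Welch's t-test p-values, one per node (ROI).
p_values = [0.041, 0.001, 0.205, 0.008, 0.042, 0.060, 0.039, 0.074]

adjusted, reject = benjamini_hochberg(p_values)
selected = top_k_nodes(p_values, k=5)
print(selected)
```

With these example values, five nodes have raw p-values below 0.05 but only two remain significant after correction, which shows how the FDR step can shrink the candidate set before the top-ranked nodes are passed to the classifier.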

Table 1. Overview of the analyzed subjects' demographics.

Characteristic	ASD (n = 74)	TD (n = 98)	p-Value
Age	14.7 ± 7.0	15.1 ± 6.0	0.678
Gender (Male/Female)	64/10	72/26	0.572
Full scale IQ	107.9 ± 16.6	113.2 ± 13.1	0.045

Table 2. Classification performance of linear SVM classifiers, obtained via leave-one-out cross-validation, using each local graph metric separately on its five most statistically significant nodes for ASD vs. TD discrimination.

Metric	Accuracy	Sensitivity	Specificity	Precision	F1-Score
Assortativity	0.72	0.61	0.79	0.65	0.62
Betweenness centrality	0.70	0.68	0.80	0.70	0.64
Degree	0.91	0.91	0.91	0.90	0.89
Clustering Coefficient	0.71	0.67	0.74	0.66	0.67
Distance	0.73	0.74	0.72	0.67	0.70
Efficiency	0.80	0.68	0.80	0.83	0.75
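For reference, the column headings of Table 2 correspond to standard quantities derived from a binary confusion matrix. The sketch below shows these definitions with ASD treated as the positive class. The counts are hypothetical and chosen only to make the arithmetic easy to follow; they are not the confusion matrices behind Table 2, and the function name is illustrative.

```python
def binary_metrics(tp, fn, tn, fp):
    """Standard binary-classification metrics from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)   # recall on the positive (ASD) class
    specificity = tn / (tn + fp)   # recall on the negative (TD) class
    precision = tp / (tp + fp)     # share of predicted positives that are correct
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
    }


# Hypothetical counts (not taken from this study).
metrics = binary_metrics(tp=40, fn=10, tn=45, fp=5)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Under leave-one-out cross-validation each fold contains a single test subject, so one natural way to obtain these counts is to pool the held-out predictions from all folds before computing the metrics. The sketch assumes every denominator is non-zero; a classifier that never predicts the positive class would need separate handling.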
