1. Introduction
Alzheimer’s disease (AD) is a progressive neurodegenerative disorder characterized by irreversible cognitive decline, memory impairment, and behavioral dysregulation [
1]. Its neuropathological hallmarks—amyloid-
plaque accumulation, neurofibrillary tau tangles, synaptic loss, and cortical atrophy—manifest insidiously over years before clinical symptoms appear, making early detection essential yet difficult [
2]. The global prevalence of dementia, predominantly driven by AD, has exceeded 55 million individuals and is projected to reach 152 million by 2050 [
3], imposing substantial healthcare and socioeconomic burdens worldwide [
4].
The AD continuum spans from cognitively normal (CN) individuals to early mild cognitive impairment (EMCI), late mild cognitive impairment (LMCI), and clinical AD. Approximately 10–15% of MCI patients convert to AD annually, and early intervention at the MCI stage can meaningfully slow disease progression [
5]. However, the subtle and overlapping symptomatology between adjacent stages—particularly EMCI versus LMCI—severely limits the sensitivity of conventional neuropsychological assessments [
6,
7], motivating neuroimaging-based approaches for early pathological detection.
Resting-state functional magnetic resonance imaging (rs-fMRI), which measures the blood oxygen level-dependent (BOLD) signal during rest, has emerged as a particularly informative modality for probing brain-wide functional organization in AD [
8]. By characterizing synchronous neural activity patterns across spatially distributed regions, rs-fMRI reveals functional connectivity (FC) disruptions that precede structural atrophy and cognitive symptom onset [
9]. Notably, the default mode network (DMN), a set of regions preferentially active during rest, exhibits characteristic hypo-connectivity in AD patients compared to healthy controls, providing a sensitive and non-invasive imaging biomarker [
10].
Traditional FC analyses assume stationarity—that connectivity patterns remain constant throughout the scanning session. However, converging evidence from neuroimaging studies indicates that brain functional connectivity fluctuates meaningfully over time, and that these dynamic fluctuations carry disease-relevant information beyond static summaries [
11,
12]. Dynamic functional connectivity (dFC), typically constructed via sliding-window Pearson correlation, captures this temporal variability by generating a sequence of connectivity matrices, each representing brain-network interactions within a short temporal window [
13]. Studies have demonstrated that dFC analysis is more sensitive than static FC for detecting early-stage AD pathology, particularly in distinguishing EMCI from CN subjects where pathological changes are subtle and transient [
14].
Despite its appeal, dFC-based AD classification faces three key technical challenges. First, each dFC window of a 116-region atlas yields a 6670-dimensional upper-triangular vector, creating a severe curse of dimensionality that demands effective nonlinear dimensionality reduction [
14]. Second, the temporal ordering of dFC windows encodes trajectory information about brain state transitions, yet most methods discard this structure via simple pooling, or use sequential models (e.g., LSTM) that struggle with long-range dependencies [
15,
16]. Third, clinically acceptable models must be interpretable—a diagnosis without an explanation is difficult to validate or trust [
17].
To address the dimensionality challenge, Variational Autoencoders (VAEs) [
18] have been proposed as an unsupervised dimensionality reduction technique for dFC features. By learning a smooth, low-dimensional latent space constrained by KL divergence regularization, VAEs extract robust, noise-resistant representations that capture nonlinear connectivity patterns inaccessible to linear methods such as PCA [
15].
For temporal sequence modeling, the recently proposed Mamba architecture [
19]—a Selective State Space Model (S6)—offers a compelling alternative to recurrent neural networks and Transformer-based [
20] attention mechanisms. Mamba introduces input-dependent selectivity: the transition matrices
,
,
, and the time-step
are dynamically computed from the input, enabling the model to selectively retain or discard information based on content relevance. Crucially, Mamba achieves linear computational complexity with respect to sequence length—in contrast to Transformer’s quadratic self-attention—making it particularly suitable for long sequences of dFC windows. While Mamba has demonstrated remarkable performance in natural language processing and genomics, its application to fMRI temporal dynamics for disease classification represents a significant and underexplored opportunity.
For interpretable classification, Kolmogorov–Arnold Networks (KAN) [
21]—inspired by the Kolmogorov–Arnold representation theorem—replace the fixed node activations of MLPs with learnable univariate B-spline functions on edges. Each edge function
is independently parameterized and directly visualizable, encoding the contribution of each latent feature to each class logit as an explicit, inspectable curve. This is fundamentally different from post hoc methods such as SHAP, which approximate feature contributions externally and incur additional computational cost. KAN’s intrinsic transparency is particularly valuable in clinical settings where the mechanism of a prediction matters as much as its accuracy.
In this paper, we propose MambaKAN, a unified, end-to-end interpretable framework that integrates VAE, Mamba, and KAN to address the aforementioned limitations synergistically. Specifically, the following are proposed:
A VAE-based dynamic window encoder maps each dFC snapshot independently into a compact 128-dimensional latent vector, effectively reducing the per-window feature dimension from 6670 to 128 while preserving nonlinear connectivity structure and suppressing noise via KL regularization.
A Mamba temporal encoder processes the sequence of 54 latent vectors (1 per dFC window) using selective state space dynamics, learning which temporal windows are most diagnostically relevant and capturing long-range dependencies across the entire scanning session with linear computational cost.
A KAN classifier maps the temporal context vector to diagnostic class probabilities through learnable B-spline activations, providing fully transparent, intrinsic interpretability without any post hoc approximation.
An end-to-end joint training strategy with differential learning rates ensures that VAE features are refined toward classification objectives while the Mamba and KAN modules learn effectively from a rich, pre-trained latent space.
The main contributions of this work are as follows:
This work represents one of the pioneering efforts to apply a Selective State Space Model (Mamba) to the modeling of temporal dynamics in dFC for AD classification, demonstrating superior performance over LSTM and Transformer-based alternatives in capturing clinically relevant brain state trajectories.
We integrate KAN as the classification backbone, providing intrinsic interpretability through visualizable activation functions. This enables direct inspection of how each latent dimension influences classification decisions, without relying on post hoc approximation methods.
We design a principled two-phase joint training strategy with a composite loss function that tightly couples reconstruction fidelity and classification accuracy, ensuring that feature learning is task-aligned and yielding representations that outperform unsupervised pre-training followed by fixed feature extraction.
We provide a multi-layered interpretability analysis that combines Mamba’s temporal selectivity scores with KAN activation curve visualization and gradient-based brain region attribution, offering complementary neuroscientific insights at the temporal, functional, and anatomical levels.
We conduct comprehensive experiments across five clinically relevant classification tasks on the ADNI dataset, including ablation studies and sensitivity analyses, demonstrating consistent improvements over seven competitive baselines.
The remainder of this paper is organized as follows.
Section 2 describes the ADNI dataset and preprocessing pipeline.
Section 3 presents the complete MambaKAN architecture and training procedure.
Section 4 reports experimental results, baseline comparisons, and ablation studies.
Section 5 presents the multi-layered interpretability analysis.
Section 6 analyzes the computational complexity and inference latency of all compared models.
Section 7 discusses findings, limitations, and future directions.
Section 8 concludes the paper.
5. Interpretability Analysis
A central contribution of MambaKAN is its multi-layer interpretability framework, operating simultaneously at three complementary levels: temporal (Mamba), functional (KAN), and anatomical (gradient attribution). This hierarchical analysis provides insights inaccessible to single-level post hoc methods.
5.1. Layer 1: Mamba Temporal Importance
Using the temporal importance scores defined in Equation (
27), we compute the mean
profile for each diagnostic class by averaging over all subjects in the test set.
Figure 2 visualizes these profiles across the 54 dFC windows.
The analysis reveals class-specific temporal patterns consistent with known AD pathophysiology. The AD class (brown curve) exhibits a distinctive biphasic profile: significantly elevated selectivity during early time windows (0–25), with peak z-scores approaching +1.0, followed by a sharp decline to negative values (minimum ) in late windows (30–55). This pattern suggests that early-scan brain states contain the most discriminative AD signatures, while late-scan states actively contradict AD classification—potentially reflecting fatigue-related or attention-related signal degradation that disproportionately affects AD patients. In contrast, the CN, EMCI, and LMCI classes exhibit relatively flat selectivity profiles centered near zero, indicating temporally uniform feature distributions without pronounced critical windows. The LMCI class shows a subtle progressive increase across the scan, intermediate between the stable CN/EMCI profiles and the extreme AD biphasic pattern, consistent with LMCI’s position on the disease continuum. This temporal specificity—particularly the early-window AD peak—provides data-driven evidence for optimizing clinical scanning protocols: shorter, early-focused acquisitions may suffice for AD detection, reducing patient burden and scan costs.
5.2. Layer 2: KAN Activation Curve Analysis
Figure 3 presents the learned B-spline activation curves
for the top-
most influential latent dimensions (ranked by mean absolute activation magnitude) in the first KAN layer for the CN vs. AD binary classification task.
The activation curves reveal three interpretable patterns. Certain latent dimensions exhibit monotonically increasing activation curves for the AD logit, indicating that their elevated values consistently promote AD classification; these likely encode persistent connectivity reduction in the DMN and hippocampal networks. Conversely, latent dimensions with monotonically decreasing curves correspond to features whose suppression is associated with AD pathology, potentially encoding preserved connectivity in frontal networks observed in early-stage patients. Non-monotonic curves (particularly sigmoidal or u-shaped) indicate a nonlinear contribution to classification—such dimensions may correspond to connectivity features that are pathological only outside a normative range. Critically, across all 10 latent dimensions, the AD class (yellow curves) consistently exhibits higher logit values than the CN class (blue curves) without any rank reversals, demonstrating that KAN has learned stable, monotonic class-discriminative features rather than overfitting to noise.
The EMCI vs. LMCI activation curves (
Figure 4) present a striking contrast to the CN vs. AD case. While the EMCI class (blue) consistently maintains higher
logit values than LMCI (yellow) across all dimensions—preserving monotonic class ordering—the inter-class separation is markedly reduced, with curves frequently overlapping or running in close parallel. This compressed separation quantitatively reflects the clinical challenge of distinguishing adjacent MCI stages: EMCI and LMCI share substantial pathological overlap, differing primarily in severity rather than qualitative feature profiles. The smooth, continuous nature of all curves confirms that KAN captures biologically plausible nonlinear relationships rather than spurious discontinuities, even in this difficult discrimination regime.
5.3. Layer 3: Gradient-Based Brain Region Attribution
To map the influence of the final classification decision back onto the anatomical brain, we compute Jacobian-based attribution scores from the class logit to the original dFC matrix via backpropagation through the full pipeline:
where
is the pairwise connectivity feature between ROI
i and ROI
j in window
k, and
is the predicted logit for the target class
c. The attribution matrix
is then thresholded to retain the top 1% most influential connections for visualization.
Figure 5 presents these attribution maps for each diagnostic class.
The chord diagram analysis (
Figure 5) reveals class-specific connectivity patterns that align with known AD neuropathology. Across all four diagnostic classes, the cerebellum—particularly Vermis_10 (cerebellar vermis lobule X)—emerges as the dominant hub, exhibiting the strongest attribution scores and the highest degree of inter-regional connectivity. This cerebellar centrality is consistent with recent evidence implicating cerebellar dysfunction in cognitive decline and AD progression [
9]. Beyond this shared cerebellar foundation, each diagnostic class exhibits distinctive secondary connectivity profiles. The CN class (Class 0) shows the prominent involvement of the caudate nucleus and striatal regions, reflecting intact cognitive control networks. The EMCI class (Class 1) demonstrates elevated attribution in the insula and parahippocampal gyrus, regions associated with emotional processing and episodic memory encoding, potentially indicating early compensatory recruitment. The LMCI class (Class 2) exhibits heightened connectivity in primary sensory cortices—including Heschl’s gyrus (auditory) and olfactory cortex—alongside hippocampal structures, suggesting progressive sensory integration deficits. Most strikingly, the AD class (Class 3) displays a complex high-order cognitive network involving the orbitofrontal cortex (decision-making), superior parietal lobule (spatial attention), and amygdala (emotional regulation), indicating the widespread disruption of executive and limbic systems characteristic of advanced neurodegeneration. These anatomically coherent, class-specific attribution patterns validate that MambaKAN’s learned representations capture biologically meaningful functional connectivity signatures rather than spurious correlations.
The heatmap visualization (
Figure 6) provides a complementary global perspective on connectivity attribution. A striking gradient in attribution intensity is observed across disease stages: the CN class exhibits the weakest overall attribution (scale: 0.0000–0.0010), indicating relatively diffuse and uniform connectivity patterns; EMCI and LMCI classes show progressively stronger attribution (scales: 0.0012 and 0.0016, respectively); and the AD class displays the highest attribution intensity (scale: 0.0020), with prominent bright yellow regions concentrated along cerebellar and limbic system connections. This monotonic increase in attribution magnitude suggests that as AD pathology advances, the model increasingly relies on a narrower set of highly discriminative connectivity features—consistent with the hypothesis that advanced neurodegeneration produces more stereotyped and detectable functional network disruptions. The consistent localization of high-attribution regions to cerebellar–cortical and limbic pathways across all classes reinforces the biological validity of the learned representations.
The ranked attribution bar charts (
Figure 7) provide quantitative confirmation of the qualitative patterns observed in the chord and heatmap visualizations. Vermis_10 dominates all four panels with the longest bars, achieving attribution scores approximately two to three times higher than the second-ranked region in each class. This universal cerebellar primacy suggests that MambaKAN has learned to anchor its classification decisions on a stable, disease-invariant feature set derived from cerebellar connectivity, while modulating class-specific predictions through secondary region recruitment. The divergence in secondary features is particularly informative: the CN class recruits bilateral cerebellum and caudate (cognitive control); EMCI adds the insula and parahippocampal gyrus (emotional and memory processing); LMCI incorporates Heschl’s gyrus and olfactory cortex (sensory integration); and AD engages the orbitofrontal cortex, superior parietal lobule, and amygdala (executive dysfunction and emotional dysregulation). This hierarchical attribution structure—universal cerebellar foundation plus class-specific cortical/limbic modulation—mirrors the known progression of AD pathology from subcortical to cortical regions and validates the neuroscientific plausibility of the learned feature hierarchy.
8. Conclusions
We have presented MambaKAN, a novel end-to-end interpretable deep learning framework for Alzheimer’s disease diagnosis from rs-fMRI dynamic functional connectivity. By integrating three complementary components—a Variational Autoencoder for nonlinear per-window feature compression, a Selective State Space Model (Mamba) for temporally selective sequence modeling, and a Kolmogorov–Arnold Network for intrinsically interpretable classification—MambaKAN addresses fundamental limitations of prior dFC-based approaches: the neglect of temporal dynamics, the opacity of deep learning decision-making, and the misalignment between feature learning and classification objectives.
Experimental evaluation on the ADNI dataset across five clinically relevant classification tasks demonstrates that MambaKAN consistently outperforms seven competitive baselines, with statistically significant improvements over the previous state of the art. The multi-layer interpretability analysis provides three complementary levels of neuroscientific insight: (1) temporal importance maps identifying diagnostically critical scan periods, (2) KAN activation curves revealing the nonlinear functional relationships between latent features and diagnostic decisions, and (3) gradient-based brain region attribution maps identifying key functional connections consistent with established AD neuropathology.
This work establishes a methodological foundation for integrating next-generation sequence modeling architectures (SSMs) and intrinsically interpretable networks into the neuroimaging analysis pipeline. We believe MambaKAN represents an important step toward the clinically deployable, trustworthy AI-assisted diagnosis of neurodegenerative diseases.