Next Article in Journal
Evaluating Middleware Performance in the Transition from Monolithic to Microservices Architecture for Banking Applications
Previous Article in Journal
Developing an AI Assistant for Knowledge Management and Workforce Training in State DOTs
Previous Article in Special Issue
BiLSTM-FuseNet: A Deep Fusion Model for Denoising High-Noise Near-Infrared Spectra
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Softmax-Derived Brain Age Mapping: An Interpretable Visualization Framework for MRI-Based Brain Age Prediction

by
Ting-An Chang
*,
Shao-Yu Yan
,
Kuan-Chih Wang
and
Chung-Wen Hung
*
Department of Electrical Engineering, National Yunlin University of Science and Technology, Yunlin 64002, Taiwan
*
Authors to whom correspondence should be addressed.
Electronics 2026, 15(1), 220; https://doi.org/10.3390/electronics15010220
Submission received: 30 November 2025 / Revised: 26 December 2025 / Accepted: 31 December 2025 / Published: 2 January 2026
(This article belongs to the Special Issue Image and Signal Processing Techniques and Applications)

Abstract

Brain age has been widely recognized as an important biomarker for monitoring adolescent brain development and assessing dementia risk. However, existing model visualization methods primarily highlight brain regions associated with aging, making it difficult to comprehensively reveal broader brain changes. In this study, we developed a VGGNet-based brain age prediction model and proposed the Softmax-Derived Brain Age Mapping algorithm to simultaneously identify brain regions associated with both youthful and aging features. The resulting saliency maps provide explicit representations of developmental and degenerative processes across different brain regions. Brain Age Map analysis revealed that aging features in the healthy group were primarily confined to the frontal cortex, aligning with findings that the frontal lobe is the earliest region to undergo natural senescence. In contrast, the dementia group exhibited widespread aging across the frontal, temporal, parietal, and occipital lobes, as well as the ventricular regions. These results suggest that the spatial distribution of brain aging can serve as a critical biomarker for distinguishing normal aging trajectories from pathological degeneration. From an application perspective, we further explored the potential of the proposed framework in neurodegenerative diseases. The analysis reveals that dementia patients generally exhibit an advanced brain age, with cortical aging being markedly more pronounced than in age-matched healthy samples. Notably, although dementia cases were not included in the training set, the model was still able to localize abnormalities in relevant brain regions, underscoring its potential value as an assistive tool for early dementia diagnosis.

1. Introduction

With the intensification of declining birth rates and population aging, the demand for precision medicine has become increasingly urgent. Predicted Age Difference (PAD) has been shown to be highly associated with cognitive impairment and dementia, making brain age prediction an important tool for assessing neurodevelopment and identifying early cognitive decline. Ref. [1] demonstrated that deep learning models based on Magnetic Resonance Imaging (MRI) can accurately predict brain age, providing reliable and quantifiable indicators for adolescent brain development. Similarly, refs. [2,3,4] reported that PAD is strongly linked to cognitive decline, mild cognitive impairment (MCI), and Alzheimer’s disease (AD) in middle-aged and elderly populations. Consequently, brain age has increasingly been adopted as a formal clinical indicator in recent years.
Most existing models output brain age as a single scalar value, which limits their diagnostic applicability. If visualization techniques are integrated to elucidate how predictions are generated and to map the key brain regions involved, brain age prediction models would no longer serve merely as numerical indicators but rather as comprehensive frameworks that enable region-specific analyses of brain development and aging. Particularly for the early detection of dementia, a model capable of accurately tracking abnormal regional aging would provide substantial clinical value by enabling timely intervention, thereby helping to slow disease progression and improve patient quality of life. Motivated by these considerations, this study proposes a deep learning-based brain age prediction framework that integrates brain MRI with a key region identification algorithm. The framework is capable of assessing brain development in adolescents and assisting in the identification of cognitive decline risk in older adults, holding significant implications for monitoring child development and preventing dementia.
The visualization technique Grad-CAM (Gradient-weighted Class Activation Mapping), proposed by [5], has been widely applied for highlighting salient regions in medical imaging and has inspired multiple variants. In the context of brain age prediction, ref. [6] employed the Guided Backpropagation [7] algorithm and found that the model primarily focused on cerebrospinal fluid, particularly the lateral ventricles, underscoring their importance in brain age estimation. Similarly, ref. [8] utilized SmoothGrad, another gradient-based technique, and identified periventricular, temporal, and insular regions as being highly influential in brain age prediction. However, these approaches face inherent limitations when applied to brain age prediction. Since the output of brain age prediction is a single scalar value, gradient-based visualization techniques such as Grad-CAM and its variants typically highlight only regions strongly associated with aging features. To overcome this limitation, we propose a refined algorithm that can simultaneously identify regions associated with both youthful and aging features, thereby providing more comprehensive information in brain saliency maps.

2. Related Works

We reviewed recent literature on brain age prediction and summarized the best-performing models (Table 1 and Table 2). The results indicate that preprocessing, data augmentation, model architecture, and training strategies all play critical roles in determining model accuracy and generalizability. Common preprocessing techniques include the segmentation of gray matter (GM), white matter (WM), and cerebrospinal fluid (CSF), skull stripping, and image registration, which allow models to focus on brain structures most associated with age-related changes. Bias correction and intensity normalization further help to reduce imaging heterogeneity and smooth the loss surface, thereby facilitating model convergence. Data augmentation methods, such as rotation and translation, are frequently employed to enhance data diversity and reduce overfitting. Model architecture is a critical factor influencing predictive performance. Although conventional convolutional neural networks (CNNs) have achieved certain success in brain age prediction, their performance has not been optimal. As shown in Table 2, CNN-based models such as those reported by [4,9] yielded higher MAE values, indicating that traditional CNNs may not be the best choice for this task. Recent studies suggest that lightweight models exhibit stronger generalizability and improved accuracy, while ensemble approaches can further enhance reliability and mitigate overfitting [10].
However, evidence from the experimental results of [18,19] suggests that data augmentation does not necessarily provide positive benefits. Ref. [18] reported that, under standardized VBM preprocessing, data augmentation offered no significant advantages, and excessive augmentation could even disrupt the model’s ability to learn brain age-related imaging features. Similarly, ref. [19] argued that the introduction of additional spatial variations during MRI acquisition (e.g., subject head motion) substantially reduces model accuracy, indicating that when imaging quality and spatial alignment are already highly standardized, further augmentations such as rotations or translations may act as sources of noise rather than improvements. The effectiveness of data augmentation in brain age prediction remains uncertain under standardized preprocessing.

3. Experiments

This study proposes an interpretable visualization-driven brain age prediction model inspired by the VGGNet architecture, enabling fast and accurate brain age estimation from T1-weighted MRI scans. The proposed Softmax-Derived Brain Age Mapping (SD-BAM) is further applied to generate Brain Age Maps, providing visual clinical support (Figure 1).

3.1. Data Sets

This experiment utilized T1-weighted 3D MRI scans from four publicly available neuroimaging datasets: ABIDE, ADNI, CamCAN, and IXI (Table 3). The CamCAN dataset was obtained from the CamCAN repository [20,21]. These datasets cover an age range of 6 to 90 years (Figure 2), enabling the model to learn structural brain changes across the entire human lifespan. In total, 2339 MRI scans were integrated, with a broad age span and balanced sample distributions across datasets, each containing more than 500 subjects, thereby ensuring sufficient training resources.

3.2. Data Preprocessing

Prior to model training, all T1-weighted 3D brain MRI scans were subjected to preprocessing. Since the proposed model performs six successive downsampling operations by a factor of two, the input images must have edge lengths divisible by 64 (i.e., 2 6 ) to avoid boundary loss. Therefore, we first preprocessed the MNI template (189 × 197 × 233) by resampling it to 128 × 192 × 128 and subsequently used this template for image registration to ensure that all input images conformed to the required dimensions. The preprocessing pipeline for whole-brain (WB) features is as follows. First, non-brain tissues were removed (skull stripping) using the U-net-based brain extraction model provided by ANTsPyNet [22]. Next, each image was rigidly registered to the preprocessed MNI152 template using SimpleITK 2.5.0. Finally, image intensity values were scaled using the 5th and 95th percentiles and normalized to the range [0, 1] (Figure 3).

3.3. Model Architecture Design

Our model design is inspired by VGGNet [23] and further guided by the architectures proposed by [10,24], where fully connected layers are replaced with convolutional layers to achieve a lightweight structure. The model consists of three main stages: feature extraction, nonlinear enhancement, and output adjustment. In the feature extraction stage, five consecutive convolutional blocks (Conv Blocks) are employed. Within each Conv Block, the input passes sequentially through a convolutional layer (Conv Layer), a batch normalization layer (BN), a max pooling layer, and a ReLU activation function. The nonlinear enhancement stage operates similarly to the Conv Blocks, except that the convolutional kernel size is modified from 3 × 3 × 3 to 1 × 1 × 1. Finally, a global average pooling layer and a 1 × 1 × 1 convolutional layer are used in the output adjustment stage to generate continuous brain age predictions. Two different output adjustment strategies were adopted for the brain age prediction task, namely the regression strategy and the softmax strategy. Both strategies share identical feature extraction and nonlinear enhancement modules, with the key difference lying only in the design of the output adjustment block. Table 4 presents the model architecture using the regression strategy. In this configuration, the output adjustment block consists of a global average pooling layer followed by a 1 × 1 × 1 convolutional layer, which transforms the extracted features into a single-channel continuous brain age prediction value. Compared with conventional deep networks such as ResNet [25] and DenseNet [26] (implemented via the MONAI framework “https://github.com/Project-MONAI/MONAI (accessed on 18 December 2025)”as ResNet-18 and DenseNet-18, see Table 5), our architecture is significantly more lightweight and streamlined. By avoiding excessively deep layer stacking, the substantial reduction in parameters effectively mitigates the risk of overfitting.
Table 6 presents the model architecture using the softmax strategy, which transforms the task from regression to classification. In this configuration, the output adjustment block also consists of a global average pooling layer followed by a 1 × 1 × 1 convolutional layer. Unlike the single-channel output of the regression strategy, the convolutional layer here produces a 50-dimensional vector. This vector is then passed through a softmax function to generate the probability distribution over age classes (Table 7), and the final predicted brain age is computed using a weighted-sum approach.
The predicted brain age in the softmax strategy is computed using a weighted-sum approach as follows:
y ^   = c   =   1 50 a c × p c
where y ^ denotes the predicted brain age, a c represents the age corresponding to class c , and p c is the probability of class c being the predicted age.

3.4. Softmax-Derived Brain Age Mapping

To identify the brain regions most influential in the model’s brain age estimation, we designed a visualization algorithm based on Grad-CAM++ (Gradient-weighted Class Activation Mapping++) [27] and Guided Backpropagation [7] tailored to the softmax strategy. This algorithm generates heatmaps of key regions for individual samples, referred to as Brain Age Maps (BAM). We denote this method as Softmax-Derived Brain Age Mapping (SD-BAM), and its architecture is illustrated in Figure 4.
Recalling the output structure of the softmax layer (Table 6), the proposed algorithm computes the activation maps for each age category’s probability by taking the element-wise product of Grad-CAM++ and Guided Backpropagation. We define this resultant map as L c . The underlying principle is as follows: Grad-CAM++ localizes regions that contribute positively to brain age prediction, while Guided Backpropagation captures the fine-grained edge features within those regions. The formula for calculating L c is defined as follows:
L c = p c I ijk guided   ReLU   UpSample   d w d c A ijk d
where p c is the model-predicted probability for class c , I ijk represents the input image, and A ijk d represents the d -th channel’s feature map. The term w d c is the weight coefficient for the d -th channel regarding class c (derived by Grad-CAM++), which is calculated following the Grad-CAM++ formulation [27] as follows:
w d c = i j k 2 p c A ijk d 2 2 2 p c A ijk d 2 + a b t A abt d 3 p c A ijk d 3 ReLU ( p c A ijk d )
The Softmax-Derived Fusion Map (SD-Fusion Map, B) is defined as in (4). Through softmax-weighted fusion, this map reveals both aging and youthful features. The negative regions (blue) indicate features that are strongly associated with brains under the age of 50, whereas the positive regions (red) correspond to features that are strongly associated with brains over the age of 50.
B   = c = 1 50 age c   -   50 p c L c
where age c denotes the age corresponding to class c and L c represents the Guided Grad-CAM++ activation map of class c . Subsequently, the final Brain Age Map (BAM) is obtained through tanh normalization and threshold processing, thereby accentuating the most significant brain regions associated with youthful and aging features:
BAM   =   Threshold tanh B B λ λ percentile   =   0.995
where B denotes the SD-Fusion Map derived from the input image, and λ represents the scaling factor used to clip the pixel intensities within the interval ± λ .

3.5. Training Method

The experiments were conducted using the widely adopted 5-fold cross-validation method, as in previous studies [10,28,29], to train the brain age prediction models. During inference, the final brain age estimate was obtained by averaging the outputs of the five fold-specific models (one trained on each fold). Following the approaches of [2,10,15,16], data augmentation was applied to 70% of the training samples. The augmentation procedures included gamma correction with a correction factor ranging from 0.5 to 2.0 [30], translations of 3–5 voxels, and random rotations of 3–5°. To further examine its effect, additional experiments were conducted without data augmentation for comparison. The training hyperparameters were set as follows: batch size = 4, learning rate = 0.001, and 300 epochs.

4. Experimental Results and Discussion

4.1. Models Evaluation Methods

To quantitatively assess the performance of the proposed brain age prediction models, two primary evaluation metrics were employed: mean absolute error (MAE) and predicted age difference (PAD). The MAE is defined as follows:
MAE   =   1 n   i = 1 n   y ^ i - y i
where y ^ i represents the predicted brain age for the i -th subject, y i denotes the corresponding chronological age, and n is the total number of samples.
In addition, PAD is calculated as the difference between the predicted and actual chronological age:
PAD   =   y ^ - y
This metric is particularly useful for evaluating whether the model exhibits age-dependent bias, such as consistently overestimating or underestimating brain age in specific age groups.

4.2. SD-BAM Evaluation Method

To quantitatively evaluate the regional characteristics of the proposed SD-BAM, a region-of-interest (ROI) analysis was conducted using the Automated Anatomical Labeling (AAL) atlas. The AAL template provides a standardized anatomical parcellation for anatomically consistent evaluation across subjects. For each subject, voxel-wise SD-BAM attention maps were aggregated within each atlas-defined region. The ROI attribution value was computed as the mean attention over all voxels in the region:
ROI r   =   1 V r   v V r BAM ( v )
where BAM ( v ) denotes the SD-BAM attention value at voxel v , and V r represents the set of voxels within ROI r . Positive and negative attributions were analyzed separately to characterize regions that contribute toward or against the predicted brain age. Since rigid registration was employed in the preprocessing stage, inter-subject anatomical variability could not be fully compensated at a fine-grained regional level. As a result, ROI analysis was restricted to coarse anatomical subdivisions to ensure robustness and anatomical consistency.
Specifically, the brain was partitioned into four broad regions: the anterior region (including the frontal lobe), the posterior region (including the occipital lobe and posterior parietal areas), the lateral region (including the temporal lobe), and the central region (including the thalamus, hippocampus, basal ganglia, and periventricular areas). This coarse-level regional grouping mitigates potential misalignment effects introduced by rigid registration while preserving interpretable anatomical trends for SD-BAM evaluation.

4.3. Comparisons of Regression and Softmax Performance

To investigate the effect of dropout rate [31] on different output layer designs, we conducted comparative experiments on models with both regression and softmax strategies. Considering that most studies have employed the Adam optimizer [24,32], while a few have adopted SGD [10], both optimizers were included as variables in this experiment. Apart from these factors, all models shared the same network architecture and dataset settings, and their performance was evaluated under the 5-fold ensemble framework.
As shown in Table 8, both strategies achieved better training performance when using the Adam optimizer compared with SGD. We attribute this to the fact that the learning rate of standard SGD does not decay automatically during the training iterations. For the regression strategy, the best results were obtained with a dropout rate of 0. This can be explained by its single-class output design, in which the absence of a softmax layer after dropout causes the perturbation to directly and more significantly affect the final regression output, thereby degrading performance. In contrast, the softmax strategy performed best with a dropout rate of 0.2, whereas its performance began to decline at a dropout rate of 0.5. These findings suggest that the softmax layer is more tolerant to moderate dropout rates and, compared with the regression strategy, is better able to exploit dropout for alleviating overfitting. Overall, the softmax strategy achieved performance comparable to that of the regression strategy under optimal hyperparameter settings. Given that the regression-by-classification approach of the softmax strategy enhances model interpretability through weighted averaging, we adopted the softmax strategy for subsequent experiments.

4.4. Evaluation of Data Augmentation

This section presents evaluation plots of the model performance on the test set, with a focus on examining the impact of data augmentation on brain age prediction. Figure 5 shows scatter plots of predicted versus chronological ages, where the gray line represents the reference line (x = y) and the red line indicates the regression line of the model predictions. Figure 5a,b correspond to the results with and without data augmentation, respectively. It can be clearly observed that the predictions of the model without augmentation align more closely with the reference line, indicating higher overall accuracy. In contrast, the predictions from the augmented model display greater divergence, suggesting that data augmentation may reduce predictive accuracy under the conditions of this study.
Figure 6 presents the MAE distribution plots, where Figure 6a,b correspond to the prediction results with and without data augmentation, respectively. It can be observed that in the model without augmentation, outliers with MAE greater than 15 occurred less frequently, indicating reduced instances of extreme prediction errors. Moreover, the denser distribution of points within the MAE < 5 range suggests higher prediction accuracy for the majority of samples.
Table 9 compares the performance of models trained with and without data augmentation on the test sets of ABIDE, ADNI, CamCAN, and IXI, as well as on the overall test set. In all cases, the models without augmentation demonstrated superior predictive performance.
For the overall test set, the model with data augmentation yielded an MAE of 4.33, whereas the model without augmentation achieved a reduced MAE of 4.08. The most notable improvement was observed on the CamCAN test set, where the MAE decreased from 5.53 to 4.90. Improvements were also seen in the ADNI and ABIDE datasets, where the MAE decreased from 4.33 to 4.14 and from 2.85 to 2.77, respectively, indicating that data augmentation failed to provide performance gains under the current parameter settings. Table 10 presents a comparison between this study and related works. On the ABIDE test set, our model achieved an MAE of 2.77 years, which is comparable to large-scale studies such as ref. [2] (3.63 years), ref. [15] (3.08 years), and ref. [6] (2.97 years). Overall, this study demonstrates strong data efficiency: using only 1871 MRI scans, our model achieved accuracy close to that of large-scale studies. It should be noted that performance comparisons must be conducted with caution due to variations in datasets and preprocessing methodologies. Furthermore, this study opted to analyze the ABIDE dataset independently to highlight the model’s superior predictive performance within that specific age range.
We summarized studies that exclusively used CamCAN as the test set in Table 11. Because CamCAN covers a wide age range, brain age prediction on this dataset is particularly challenging. In this study, by incorporating multiple datasets to expand the training set, we achieved an MAE of 4.90 on the CamCAN test set, outperforming most existing studies. Notably, ref. [16] combined brain MRI with MEG to reduce the MAE to 4.88 years; however, this approach required additional MEG acquisition, which increases both participant burden and cost. When relying on MRI alone, their model yielded an MAE of 5.33 years. By contrast, our model achieved a comparable level of accuracy using MRI only. These results demonstrate that augmenting training data with multiple datasets can overcome the limitations of insufficient samples and enhance predictive performance on specific datasets.

4.5. Clinical Applications and Efficacy Assessment

We further explored the application of the model in neurodegenerative diseases. Figure 7 compares the prediction results between Alzheimer’s disease (AD) samples and cognitively normal (CN) controls, where orange and gray denote AD and CN samples, respectively. For subjects aged 50–70, many AD samples exhibited “advanced brain age,” with predicted ages substantially exceeding chronological ages and falling outside the variation range of the CN group. In contrast, for subjects aged 80–90, a trend of “delayed brain age” was observed. According to the Limited Lifespan Hypothesis [33], the average human lifespan is approximately 85 years in the absence of biological aging interventions. Therefore, we speculate that individuals surviving to advanced ages tend to have healthier brain conditions, which may explain the manifestation of delayed brain age in this group.
To explain why some AD samples still exhibited delayed brain age, we hypothesize that these individuals were already in a delayed state prior to disease onset. This was particularly evident among subjects over 80 years old, who likely had relatively healthier brain conditions before onset. Even after the accelerated aging associated with AD, it would take time before their brain age predictions shifted to an advanced state. Consequently, as shown in Figure 7, a small number of samples still appeared delayed; however, their predicted brain ages never fell below the minimum values observed in age-matched CN subjects, indicating that they nevertheless exhibited an abnormal aging trajectory. To compare brain age differences between AD and CN groups, we plotted PAD values as shown in Figure 8, where orange and gray represent AD and CN samples, respectively. The results show that, for subjects aged 50–80, most AD samples exhibited “advanced brain age.” Notably, this phenomenon was observed even though the model was not trained on AD data, thereby confirming the strong association between PAD and dementia. For subjects aged 80–90, AD samples did not show clear advancement; however, their trend of “delayed brain age” was weaker compared to CN subjects. We attribute this to the accelerated brain aging caused by AD [34,35,36].

4.6. Softmax-Derived Brain Age Mapping Analysis

We employed the SD-BAM algorithm to analyze regional aging patterns across various ages and physiological conditions. Figure 9 illustrates the BAMs of healthy subjects aged 20, 55, and 70 years. Observations reveal that the model primarily focuses on the ventricular regions. In the 20-year-old sample, the markers appear in deep blue, signifying prominent youthful features. In contrast, the 55-year-old brain exhibits partial orange-yellow markers, indicating early signs of senescence. By age 70, the markers have almost entirely transitioned to red. This progression from blue to red precisely reflects the normal developmental trajectory of the human brain with increasing age. We attribute these findings to the trend of ventricular enlargement as described in [14,37].
Next, we analyzed the Alzheimer’s Disease (AD) samples aged 55 and 70, as illustrated in Figure 10. In contrast to the healthy controls, the model not only identifies ventricular enlargement but also demonstrates significant attention toward cortical contours and the hippocampal regions. While normal aging is accompanied by gradual cortical and hippocampal atrophy, AD induces abnormally accelerated brain aging, leading to more severe atrophy of the cortex and hippocampus, as well as pronounced ventricular expansion. Consequently, the AD samples exhibit almost entirely red aging markers within the BAM [38].
Notably, the training dataset used in this study did not include any AD samples. These findings confirm that AD indeed leads to accelerated brain aging and demonstrate that brain age can serve as a reliable quantitative biomarker. Furthermore, through BAM, researchers can examine neurodegenerative progression across different brain regions in detail, providing an objective spatial basis for diagnosis and clinical assessment.
Figure 11, Figure 12 and Figure 13 present the ROI significance analysis histograms across different age groups (15–20, 55–60, and 70–75 years) and health states (CN, AD), evaluated under three pivot ages (40, 50, and 60) and various threshold settings (None, P50, and P99.5). The left subfigures (a,c,e) display positive weights, representing pixels that significantly contribute to aging features. Conversely, the right subfigures (b,d,f) show negative weights, corresponding to pixels associated with youthful features. For this analysis, the original brain regions were integrated into four major spatial zones: Anterior, Posterior, Lateral, and Central.
All CN subjects included in the ROI analysis were drawn exclusively from the test set. The training set was not used for ROI analysis to avoid potential data leakage and to ensure a fair and unbiased comparison across groups. In contrast, AD subjects were not included in the model training process and were therefore all incorporated into the ROI analysis. The corresponding sample sizes for each group are reported in Table 12.
In this study, P99.5 was selected as the fusion threshold primarily because the responses in the final saliency maps tend to be obscured by noise generated during the fusion process described in Equation (4). Observation of subfigures (a) through (d) reveals that synthesizing multiple saliency maps can lead to a loss of features; consequently, aging-related (positive) pixels are preserved only in the AD group, while youthful (negative) pixels show almost no response. In contrast, by applying a high-intensity mask at the 99.5th percentile, subfigure (e) successfully captures the physiological aging features of the CN group, and subfigure (f) clearly delineates brain regions associated with youthful characteristics. Without this thresholding, it would be impossible to effectively define the underlying physiological significance.
The setting of the pivot age can be adjusted according to the specific analytical goals. Given the wide age span of the dataset (6–90 years, as detailed in Table 3), we established 50 years as the primary pivot point to ensure a balanced distribution. To validate the impact of this parameter, we further tested the ROI performance under three different pivot ages: 40, 50, and 60. Experimental results demonstrate that the responses across all brain regions remained stable across these different pivot age settings, showing no significant fluctuations.
Figure 12e,f present the ROI analysis results using the parameters proposed in this study. From the analysis of aging-related pixels (Positive) in Figure 12e, it is evident that the AD group exhibits extremely high significance across all four regions (Anterior, Posterior, Lateral, and Central). In contrast, the responses in the CN group are primarily concentrated in the Anterior region. This phenomenon reveals the natural pattern of human brain senescence, where evolutionarily later-developing regions, such as the frontal lobe, tend to manifest aging characteristics first [39]. Conversely, in the youthful features (Negative) shown in Figure 12f, the significant response in the Anterior region serves as a metric for assessing brain youthfulness. This quantitative analysis confirms that SD-BAM can effectively differentiate between physiological aging and pathological degeneration, demonstrating its potential as an index for early dementia screening.

4.7. Limitations and Discussions

This study proposed a lightweight model based on VGGNet [23] and evaluated the effects of preprocessing procedures, model design, and data augmentation on predictive performance. Finally, we integrated our newly developed algorithm, Softmax-Derived Brain Age Mapping (SD-BAM), to enable the visualization and identification of key brain regions. The experimental results on the output adjustment layer showed that, compared with the regression strategy, the softmax strategy achieved better MAE performance. However, since the two strategies differ only in a small number of neurons, the overall difference in predictive accuracy was not substantial. From the perspective of model interpretability, the softmax strategy provides a key advantage, as its fixed weighting parameters elegantly assign physical meaning to the feature vectors. Regarding the output structure, this study employed 50 age classes with 2-year intervals. While this configuration yielded competitive results, we recognize that the granularity of age partitioning may influence model sensitivity and error distribution. Future investigations could explore different granularities, such as 1-year or 5-year intervals, to systematically evaluate their impact on predictive performance and clinical utility.
This study observed a clear improvement in predictive performance after data augmentation was removed. We speculate that this may be related to our preprocessing pipeline: MNI registration and skull stripping had already substantially reduced inter-image variations in position and intensity, such that additional augmentation instead introduced unwanted noise. While these results were consistent across our tests, we acknowledge that this conclusion is contingent upon our current methodology. Specifically, the augmentation parameters (3–5 voxel shifts and 3–5° rotations) might be too conservative to offer positive regularization. Moreover, the detrimental effect of Gamma correction may be attributed to its interaction with the intensity normalization process rather than the augmentation itself. PAD (Predicted Age Difference) analysis revealed a tendency of overestimation in younger subjects and underestimation in older subjects, indicating that sample size and distribution influence prediction accuracy. In particular, for individuals over 80 years of age, most samples exhibited delayed brain age. We attribute this to the fact that this age range approaches the human lifespan limit [33], such that those who survive beyond 80 years are more likely to maintain relatively healthy brain conditions. However, this observation should be interpreted with caution, as it may also reflect the uneven sample distribution of the elderly population and the regression-toward-the-mean effect commonly observed in brain age estimation. Consequently, the extent to which this collective delayed brain age phenomenon arises from insufficient training samples versus genuinely healthier brain states remains to be further investigated. Importantly, our analysis shows that even when AD samples appear to exhibit delayed brain age, they still demonstrate a trend of accelerated aging. These results suggest that PAD remains an effective indicator of abnormal brain aging and holds potential as an early screening tool for dementia risk.
The BAM analysis revealed an important finding: when estimating brain age, the model focused particularly on ventricle, cortical and hippocampal regions, which are closely associated with cognitive function. This not only aligns with conclusions from existing neuroscience literature [1,38,40] but also demonstrates the model’s ability to automatically capture core brain regions related to aging and cognitive decline. The SD-BAM algorithm provides visualization of both youthful and aging feature dimensions in brain age prediction models, thereby laying a solid foundation for clinical applications. Despite its effectiveness, the current implementation of SD-BAM is inherently coupled with a classification-based regression framework. This architectural dependency arises from the need to leverage discrete age-class probabilities to weight the feature maps. While this approach excels at capturing stage-specific anatomical markers, it may constrain the direct integration of SD-BAM into pure regression models or emerging backbone architectures that lack a softmax-based output structure. Recognizing this limitation, future research should focus on decoupling the saliency generation mechanism from the classification layer. Developing more generalized SD-BAM variants that can be seamlessly applied to continuous regression tasks and diverse neural network designs remains a priority for enhancing the framework’s versatility. However, the algorithm remains constrained by the classification-weighted output structure, limiting its applicability across all model architectures. Beyond architectural dependencies, as SD-BAM is built upon the Grad-CAM++ framework, it inherits a degree of sensitivity to gradients. This dependency implies that the stability of the generated saliency maps may be susceptible to issues such as gradient vanishing and noise amplification. Therefore, the interpretation of these results should be approached with caution, taking into account these inherent algorithmic limitations. Regarding spatial precision, the use of rigid-body registration in our preprocessing may limit the precision required for investigating subtle brain regions. This study also faces the challenge of high MRI heterogeneity, which could affect the robustness of the saliency regions. In light of these challenges, recent studies [41,42] have demonstrated that strategies for robust feature representation and geometric perturbation modeling can significantly enhance cross-scene consistency. Specifically, the domain transform strategies (e.g., PHFM/QPHFMs) proposed in these works provide stable mappings for complex structural features, which is particularly enlightening for enhancing the stability of SD-BAM under multi-center MRI conditions. Integrating such deformation-resistant feature preservation could mitigate the impact of data heterogeneity and ensure more reliable brain-age representations. Therefore, we believe that future research could incorporate non-linear registration techniques and advanced feature-stabilizing frameworks to enable more granular ROI analysis and mitigate the impact of data heterogeneity. An additional point for consideration is the quantitative evaluation of spatial stability. While our results show consistent neuroanatomical patterns, the reproducibility of these saliency maps across different training folds and random initializations was not formally quantified. Future research should incorporate metrics such as the Dice coefficient to rigorously assess the consistency of these identified biomarkers. Consequently, we believe that future investigations could benefit from non-linear registration techniques and advanced feature-stabilizing frameworks to enable more granular ROI analysis.
Fortunately, our comparison of regression and softmax output strategies showed that, under appropriate dropout rates, both approaches achieved comparable performance. This suggests that models originally designed with regression outputs can be safely converted to the softmax framework without compromising accuracy. In the CamCAN dataset, achieving significant improvements in brain age prediction accuracy has long been challenging. Ref. [16] demonstrated that combining MRI and MEG modalities could enhance model performance, achieving an MAE of 4.88 years. However, our experiments showed that by incorporating external datasets into training, comparable performance (MAE = 4.90 years) could be achieved using MRI alone. This finding indicates that expanding the training dataset is an effective strategy for improving prediction accuracy on specific datasets. Moreover, this approach eliminates the need for additional MEG acquisition, thereby reducing clinical costs while maintaining competitive predictive performance.
We believe that SD-BAM holds significant potential across various advanced applications. It can serve as a critical reference for generative AI in brain synthesis tasks; specifically, these saliency maps can guide generative models to accurately reproduce aging or rejuvenating features in key regions when synthesizing brain images of specific ages or pathological states. Furthermore, SD-BAM has the potential to function as an early biomarker for neurodegenerative diseases, providing essential markers for early lesion detection and dementia risk assessment.

5. Conclusions

This study conducted a comprehensive investigation of brain age prediction models. Experimental results regarding the output adjustment layer indicate that under a regression-based strategy, the absence of Softmax modulation caused Dropout to degrade performance, with the optimal Mean Absolute Error (MAE) achieved at a Dropout rate of 0. Conversely, the Softmax-based strategy benefited positively from Dropout, reaching peak performance at a rate of 0.2. Under their respective optimal hyperparameter configurations, both strategies demonstrated comparable performance, yielding MAEs of 4.34 and 4.33 years, respectively, with the Softmax strategy showing a marginal advantage.
Regarding data augmentation, we observed that omitting augmentation operations actually improved the performance of most models. Notably, the performance of the whole-brain model advanced from an MAE of 4.33 to 4.08 years. This suggests that spatial augmentations, such as translation and rotation, may have compromised the spatial alignment achieved through MNI registration, leading to a degradation of the preprocessing effects. After removing these augmentations, our model achieved an MAE of 4.90 years on the CamCAN test set, outperforming most previous studies that relied solely on CamCAN MRI data.
Clinical application analysis reveals a prevalent ‘advanced brain age’ phenomenon in patients with dementia aged 50 to 80, validating the potential of the Predicted Age Difference (PAD) as a quantitative screening biomarker. Our proposed visualization technique, SD-BAM, effectively identifies both youthful and aging features simultaneously. Experimental evidence shows that when evaluating older brains, the model exhibits high attention toward the ventricles and cortex, accurately reflecting the trends of ventricular enlargement and cortical atrophy associated with senescence. In healthy subjects, the model precisely captures physiological transitions: younger brains are characterized by features in anterior regions (such as the frontal lobe), which are also the earliest areas to manifest signs of aging.
In conclusion, ROI quantitative analysis validates the fundamental patterns of human brain senescence, with ventricular changes persisting throughout the entire aging process. In patients with Alzheimer’s Disease, pathological degeneration causes cortical aging features to be significantly more pronounced than in age-matched healthy controls, effectively overshadowing localized youthful characteristics. SD-BAM not only enhances the depth of brain age assessment but also provides an objective tool for diagnosing neurodegenerative diseases based on spatial distribution evidence.

Author Contributions

Conceptualization, T.-A.C.; Methodology, S.-Y.Y.; Software, S.-Y.Y.; Validation, K.-C.W.; Formal analysis, C.-W.H.; Data curation, K.-C.W.; Writing—original draft, T.-A.C.; Writing—review & editing, C.-W.H.; Visualization, K.-C.W.; Supervision, T.-A.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

Data collection and sharing for this project was provided by the Cambridge Centre for Ageing and Neuroscience (CamCAN). CamCAN funding was provided by the UK Biotechnology and Biological Sciences Research Council (grant number BB/H008217/1), together with support from the UK Medical Research Council and University of Cambridge, UK. This study was partly supported by the National Science and Technology Council, Taiwan, under Contract NSTC 113-2221-E-224-042-, 113-2221-E-224-037, 113-2622-E-224-017, 114-2221-E-224-050, 114-2622-E-224-012 and IRIS “Intelligent Recognition Industry Service Research Center” from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE) in Taiwan.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Franke, K.; Luders, E.; May, A.; Wilke, M.; Gaser, C. Brain maturation: Predicting individual BrainAGE in children and adolescents using structural MRI. NeuroImage 2012, 63, 1305–1312. [Google Scholar] [CrossRef] [PubMed]
  2. Jonsson, B.A.; Bjornsdottir, G.; Thorgeirsson, T.E.; Ellingsen, L.M.; Walters, G.B.; Gudbjartsson, D.F.; Stefansson, H.; Stefansson, K.; Ulfarsson, M.O. Brain age prediction using deep learning uncovers associated sequence variants. Nat. Commun. 2019, 10, 5409. [Google Scholar] [CrossRef] [PubMed]
  3. Liem, F.; Varoquaux, G.; Kynast, J.; Beyer, F.; Masouleh, S.K.; Huntenburg, J.M.; Lampe, L.; Rahim, M.; Abraham, A.; Craddock, R.C.; et al. Predicting brain-age from multimodal imaging data captures cognitive impairment. NeuroImage 2017, 148, 179–188. [Google Scholar] [CrossRef] [PubMed]
  4. Cole, J.H.; Poudel, R.P.K.; Tsagkrasoulis, D.; Caan, M.W.A.; Steves, C.; Spector, T.D.; Montana, G. Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker. NeuroImage 2017, 163, 115–124. [Google Scholar] [CrossRef]
  5. Selvaraju, R.R.; Cogswell, M.; Das, A.; Vedantam, R.; Parikh, D.; Batra, D. Grad-CAM: Visual explanations from deep networks via gradient-based localization. Int. J. Comput. Vis. 2020, 128, 336–359. [Google Scholar] [CrossRef]
  6. Wood, D.A.; Kafiabadi, S.; Busaidi, A.A.; Guilhem, E.; Montvila, A.; Lynch, J.; Townend, M.; Agarwal, S.; Mazumder, A.; Barker, G.J.; et al. Accurate brain-age models for routine clinical MRI examinations. NeuroImage 2022, 249, 118871. [Google Scholar] [CrossRef]
  7. Springenberg, J.T.; Dosovitskiy, A.; Brox, T.; Riedmiller, M. Striving for Simplicity: The All Convolutional. arXiv 2015, arXiv:1412.6806. [Google Scholar] [CrossRef]
  8. Dartora, C.; Marseglia, A.; Mårtensson, G.; Rukh, G.; Dang, J.; Muehlboeck, J.S.; Wahlund, L.O.; Moreno, R.; Barroso, J.; Ferreira, D.; et al. A deep learning model for brain age prediction using minimally preprocessed T1w images as input. Front. Aging Neurosci. 2024, 15, 1303036. [Google Scholar] [CrossRef]
  9. Wang, J.; Knol, M.J.; Tiulpin, A.; Dubost, F.; de Bruijne, M.; Vernooij, M.W.; Adams, H.H.H.; Ikram, M.A.; Niessen, W.J.; Roshchupkin, G.V. Gray matter age prediction as a biomarker for risk of dementia. Proc. Natl. Acad. Sci. USA 2019, 116, 21213–21218. [Google Scholar] [CrossRef]
  10. Peng, H.; Gong, W.; Beckmann, C.F.; Vedaldi, A.; Smith, S.M. Accurate brain age prediction with lightweight deep neural networks. Med. Image Anal. 2021, 68, 101871. [Google Scholar] [CrossRef]
  11. Chang, X.; Jia, X.; Eickhoff, S.B.; Dong, D.; Zeng, W. Multi-center brain age prediction via dual-modality fusion convolutional network. Med. Image Anal. 2025, 101, 103455. [Google Scholar] [CrossRef]
  12. Eltashani, F.; Parreno-Centeno, M.; Cole, J.H.; Papa, J.P.; Costen, F. Brain age prediction using a lightweight convolutional neural network. IEEE Access 2025, 13, 6750–6763. [Google Scholar] [CrossRef]
  13. Zheng, J.; Wang, J.; Zhang, Z.; Li, K.; Zhao, H.; Liang, P. Brain age prediction based on brain region volume modeling under broad network field of view. Comput. Methods Programs Biomed. 2025, 265, 108739. [Google Scholar] [CrossRef] [PubMed]
  14. Pang, Y.; Cai, Y.; Xia, Z.; Gao, X. Predicting brain age using Tri-UNet and various MRI scale features. Sci. Rep. 2024, 14, 13742. [Google Scholar] [CrossRef] [PubMed]
  15. Zhang, X.; Pan, Y.; Wu, T.; Zhao, W.; Zhang, H.; Ding, J.; Ji, Q.; Jia, X.; Li, X.; Lee, Z.; et al. Brain age prediction using interpretable multi-feature-based convolutional neural network in mild traumatic brain injury. NeuroImage 2024, 297, 120751. [Google Scholar] [CrossRef] [PubMed]
  16. Xifra-Porxas, A.; Ghosh, A.; Mitsis, G.D.; Boudrias, M.-H. Estimating brain age from structural MRI and MEG data: Insights from dimensionality reduction techniques. NeuroImage 2021, 231, 117822. [Google Scholar] [CrossRef]
  17. Lancaster, J.L.; Lorenz, R.; Leech, R.; Cole, J.H. Bayesian optimization for neuroimaging pre-processing in brain age classification and prediction. Front. Aging Neurosci. 2018, 10, 28. [Google Scholar] [CrossRef]
  18. Dufumier, B.; Gori, P.; Battaglia, I.; Victor, J.; Grigis, A.; Duchesnay, E. Benchmarking CNN on 3D anatomical brain MRI: Architectures, data augmentation and deep ensemble learning. arXiv 2021, arXiv:2106.01132. [Google Scholar]
  19. Hanson, J.L.; Adkins, D.J.; Bacas, E.; Zhou, P. Examining the reliability of brain age algorithms under varying degrees of participant motion. Brain Inform. 2024, 11, 9. [Google Scholar] [CrossRef]
  20. Taylor, J.R.; Williams, N.; Cusack, R.; Auer, T.; Shafto, M.A.; Dixon, M.; Tyler, L.K.; Henson, R.N.; Consortium, C.-C.A. The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) data repository: Structural and functional MRI, MEG, and cognitive data from a cross-sectional adult lifespan sample. NeuroImage 2017, 144, 262–269. [Google Scholar] [CrossRef]
  21. Shafto, M.A.; Tyler, L.K.; Dixon, M.; Taylor, J.R.; Rowe, J.B.; Cusack, R.; Calder, A.J.; Marslen-Wilson, W.D.; Duncan, J.; Dalgleish, T.; et al. The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) study protocol: A cross-sectional, lifespan, multidisciplinary examination of healthy cognitive ageing. BMC Neurol. 2014, 14, 204. [Google Scholar] [CrossRef]
  22. Tustison, N.J.; Cook, P.A.; Holbrook, A.J.; Johnson, H.J.; Muschelli, J.; Devenyi, G.A.; Duda, J.T.; Das, S.R.; Cullen, N.C.; Gillen, D.L.; et al. The ANTsX ecosystem for quantitative biological and medical imaging. Sci. Rep. 2021, 11, 9068. [Google Scholar] [CrossRef] [PubMed]
  23. Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2015, arXiv:1409.1556. [Google Scholar] [CrossRef]
  24. Ding, W.; Shen, X.; Huang, J.; Ju, H.; Chen, Y.; Yin, T. Brain age prediction based on resting-state functional MRI using similarity metric convolutional neural network. IEEE Access 2023, 11, 57071–57082. [Google Scholar] [CrossRef]
  25. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. arXiv 2015, arXiv:1512.03385. [Google Scholar] [CrossRef]
  26. Huang, G.; Liu, Z.; van der Maaten, L.; Weinberger, K.Q. Densely Connected Convolutional Networks. arXiv 2018, arXiv:1608.06993. [Google Scholar]
  27. Chattopadhay, A.; Sarkar, A.; Howlader, P.; Balasubramanian, V.N. Grad-CAM++: Generalized gradient-based visual explanations for deep convolutional networks. In Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Tahoe, NV, USA, 12–15 March 2018; pp. 839–847. [Google Scholar]
  28. Gong, W.; Beckmann, C.F.; Vedaldi, A.; Smith, S.M.; Peng, H. Optimising a simple fully convolutional network for accurate brain age prediction in the PAC 2019 challenge. Front. Psychiatry 2021, 12, 627996. [Google Scholar] [CrossRef]
  29. Yang, L.; He, S.; Liu, M.; Li, J.; Zhang, Y.; Liu, T. Regularizing brain age prediction via gated knowledge distillation. Proc. Mach. Learn. Res. (PMLR) 2022, 172, 375–389. [Google Scholar]
  30. Rahman, S.; Rahman, M.M.; Abdullah-Al-Wadud, M.; Al-Quaderi, G.D.; Shoyaib, M. An adaptive gamma correction for image enhancement. EURASIP J. Image Video Process. 2016, 2016, 35. [Google Scholar] [CrossRef]
  31. Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. Dropout: A simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 2014, 15, 1929–1958. [Google Scholar]
  32. Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
  33. Olshansky, S.J.; Willcox, B.J.; Demetrius, L.; Beltrán-Sánchez, H. Implausibility of radical life extension in humans in the twenty-first century. Nat. Aging 2024, 4, 1635–1642. [Google Scholar] [CrossRef] [PubMed]
  34. Gautherot, M.; Kuchcinski, G.; Bordier, C.; Sillaire, A.R.; Delbeuck, X.; Leroy, M.; Leclerc, X.; Pruvo, J.-P.; Pasquier, F.; Lopes, R. Longitudinal analysis of brain-predicted age in amnestic and non-amnestic sporadic early-onset Alzheimer’s disease. Front. Aging Neurosci. 2021, 13, 729635. [Google Scholar] [CrossRef] [PubMed]
  35. Papouli, A.; Cole, J.H. Brain age prediction from MRI scans in neurodegenerative diseases. Curr. Opin. Neurol. 2025, 38, 316–321. [Google Scholar] [CrossRef]
  36. Huang, W.; Li, X.; Li, H.; Wang, W.; Chen, K.; Xu, K.; Zhang, J.; Chen, Y.; Wei, D.; Shu, N.; et al. Accelerated brain aging in amnestic mild cognitive impairment: Relationships with individual cognitive decline, risk factors for Alzheimer disease, and clinical progression. Radiol. Artif. Intell. 2021, 3, e200171. [Google Scholar] [CrossRef]
  37. Nestor, S.M.; Rupsingh, R.; Borrie, M.; Smith, M.; Accomazzi, V.; Wells, J.L.; Fogarty, J.; Bartha, R.; The Alzheimer’s Disease Neuroimaging Initiative. Ventricular enlargement as a possible measure of Alzheimer’s disease progression validated using the Alzheimer’s disease neuroimaging initiative database. Brain 2008, 131, 2443–2454. [Google Scholar] [CrossRef]
  38. Lemaitre, H.; Goldman, A.L.; Sambataro, F.; Verchinski, B.A.; Meyer-Lindenberg, A.; Weinberger, D.R.; Mattay, V.S. Normal age-related brain morphometric changes: Nonuniformity across cortical thickness, surface area and grey matter volume? Neurobiol. Aging 2012, 33, 617.e1–617.e9. [Google Scholar] [CrossRef]
  39. Vickery, S.; Patil, K.R.; Dahnke, R.; Hopkins, W.D.; Sherwood, C.C.; Caspers, S.; Eickhoff, S.B.; Hoffstaedter, F. The uniqueness of human vulnerability to brain aging in great ape evolution. Sci. Adv. 2024, 10, eado2733. [Google Scholar] [CrossRef]
  40. Walhovd, K.B.; Fjell, A.M.; Reinvang, I.; Lundervold, A.; Dale, A.; Eilertsen, D.E.; Quinn, B.T.; Salat, D.; Makris, N.; Fischl, B. Age-related changes in brain volumes across the adult lifespan and their relations to declarative memory: A cross-sectional and longitudinal study. Neurobiol. Aging 2005, 26, 1261–1270. [Google Scholar] [CrossRef]
  41. Liu, Y.; Wang, C.; Lu, M.; Yang, J.; Gui, J.; Zhang, S. From Simple to Complex Scenes: Learning Robust Feature Representations for Accurate Human Parsing. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 5449–5462. [Google Scholar] [CrossRef]
  42. Wang, C.; Zhang, Q.; Wang, X.; Zhou, L.; Li, Q.; Xia, Z.; Ma, B.; Shi, Y.-Q. Light-Field Image Multiple Reversible Robust Watermarking Against Geometric Attacks. IEEE Trans. Dependable Secur. Comput. 2025, 22, 5861–5875. [Google Scholar] [CrossRef]
Figure 1. System flow chart.
Figure 1. System flow chart.
Electronics 15 00220 g001
Figure 2. Age distribution of healthy samples across datasets.
Figure 2. Age distribution of healthy samples across datasets.
Electronics 15 00220 g002
Figure 3. Preprocessed Image. Blue vertical and horizontal dashed lines: reference lines for the slice positions.
Figure 3. Preprocessed Image. Blue vertical and horizontal dashed lines: reference lines for the slice positions.
Electronics 15 00220 g003
Figure 4. Architecture of Softmax-Derived Brain Age Mapping.
Figure 4. Architecture of Softmax-Derived Brain Age Mapping.
Electronics 15 00220 g004
Figure 5. Scatter chart of actual age and predicted age: (a) With data augmentation; and (b) Without data augmentation. The gray line represents the reference line (x = y) and the red line indicates the regression line of the model predictions.
Figure 5. Scatter chart of actual age and predicted age: (a) With data augmentation; and (b) Without data augmentation. The gray line represents the reference line (x = y) and the red line indicates the regression line of the model predictions.
Electronics 15 00220 g005
Figure 6. MAE scatter plot: (a) With data augmentation; and (b) Without data augmentation. The red line indicates the regression line of the model predictions.
Figure 6. MAE scatter plot: (a) With data augmentation; and (b) Without data augmentation. The red line indicates the regression line of the model predictions.
Electronics 15 00220 g006
Figure 7. Comparison of Predicted and Actual Age Across AD and CN Groups.
Figure 7. Comparison of Predicted and Actual Age Across AD and CN Groups.
Electronics 15 00220 g007
Figure 8. Bar chart comparison of predicted and actual age across AD and CN groups.
Figure 8. Bar chart comparison of predicted and actual age across AD and CN groups.
Electronics 15 00220 g008
Figure 9. BAM in CN subjects.
Figure 9. BAM in CN subjects.
Electronics 15 00220 g009
Figure 10. BAM in AD subjects.
Figure 10. BAM in AD subjects.
Electronics 15 00220 g010
Figure 11. ROI significance analysis histograms at a pivot age of 40 under various threshold masks. (a) Mask: none, Positive; (b) Mask: none, Negative; (c) Mask: P50, Positive; (d) Mask: P50, Negative; (e) Mask: P99.5, Positive; (f) Mask: P99.5, Negative.
Figure 11. ROI significance analysis histograms at a pivot age of 40 under various threshold masks. (a) Mask: none, Positive; (b) Mask: none, Negative; (c) Mask: P50, Positive; (d) Mask: P50, Negative; (e) Mask: P99.5, Positive; (f) Mask: P99.5, Negative.
Electronics 15 00220 g011
Figure 12. ROI significance analysis histograms at a pivot age of 50 under various threshold masks. (a) Mask: none, Positive; (b) Mask: none, Negative; (c) Mask: P50, Positive; (d) Mask: P50, Negative; (e) Mask: P99.5, Positive; (f) Mask: P99.5, Negative.
Figure 12. ROI significance analysis histograms at a pivot age of 50 under various threshold masks. (a) Mask: none, Positive; (b) Mask: none, Negative; (c) Mask: P50, Positive; (d) Mask: P50, Negative; (e) Mask: P99.5, Positive; (f) Mask: P99.5, Negative.
Electronics 15 00220 g012
Figure 13. ROI significance analysis histograms at a pivot age of 60 under various threshold masks. (a) Mask: none, Positive; (b) Mask: none, Negative; (c) Mask: P50, Positive; (d) Mask: P50, Negative; (e) Mask: P99.5, Positive; (f) Mask: P99.5, Negative.
Figure 13. ROI significance analysis histograms at a pivot age of 60 under various threshold masks. (a) Mask: none, Positive; (b) Mask: none, Negative; (c) Mask: P50, Positive; (d) Mask: P50, Negative; (e) Mask: P99.5, Positive; (f) Mask: P99.5, Negative.
Electronics 15 00220 g013
Table 1. Summary of Best-Performing Brain Age Prediction Models.
Table 1. Summary of Best-Performing Brain Age Prediction Models.
ArticlesDataset (n)Training/Test (n)Test Set Age Range (Years)MAE (Years)
[2]DeCODE (1469) a, IXI (440), UK biobank (12,395)1909/12,395 a45–803.63
[3]LIFE (2354)1177/117719–824.29
[4]BAHC (2001) b1601/20018–904.16
[11]CamCAN (483)-18–885.47
[12]ABIDE I (228), ABIDE II (505), BNU (328), Berlin (142), Cleveland Clinic (150), IXI (544), ADNI (219), Train-39 (135)1801/2256–903.56
[13]Healthy Group (1182) c850/33219–844.60
[14]CamCAN (651)521/13018–897.46
[15]IXI (459), CoRR (266), OASIS (264)
ABIDE I (258), ABIDE II (217), Local Centers (118)
1464/11818–943.08
[6]Local Hospital (19,807)15,146/466118–952.97
[16]CamCAN (613)-18–884.88
[9]Rotterdam Study (8768)5865/55046–964.45
[17]BAHC (2003) d, CamCAN (648)-18–885.08
a DeCODE and IXI are used as training datasets, while the UK Biobank is used as the test dataset. b BAHC includes 14 publicly available datasets: ABIDE (184), Beijing Normal University (179), Berlin School of Brain & Mind (49), CADDementia (12), Cleveland Clinic (31), ICBM (322), IXI (561), MCIC (93), MIRIAD (23), NEO2012 (39), NKI (160), OASIS (288), WUSL (24), TRAIN-39 (36). c The Capital Medical University Xuanwu Hospital, in collaboration with 15 universities and hospitals nationwide, created a self-built dataset of whole-brain sMRI images collected from healthy volunteers using 3.0T MRI scans. d BAHC includes 14 publicly available datasets: ABIDE (184), Beijing Normal University (181), Berlin School of Brain & Mind (49), CADDementia (12), Cleveland Clinic (31), ICBM (322), IXI (561), MCIC (93), MIRIAD (23), NEO2012 (39), NKI (160), OASIS (288), WUSL (24), TRAIN-39 (36).
Table 2. Image Processing Methods and Models of Top Brain Age Prediction Models.
Table 2. Image Processing Methods and Models of Top Brain Age Prediction Models.
ArticlesPreprocessing MethodsData AugmentationsModel Architecture
[2]GM/WM segmentation, bias correction, skull stripping, rigid registration, Jacobian mapsRotation of 0–40°, Translation of 10 voxels.ResNet
WB, Jacobian maps, GM and WM combination model (10-fold cross-validation).
[3]Skull stripping, linear registration, filtering, nonlinear registration, mcflirt, GM/WM segmentation, bias correction, denoising SVR + Random Forest
Multimodal stacking model (connectivity matrix 197, connectivity matrix 444, cortical thickness, cortical surface area, subcortical volumes)
[4]GM segmentation, motion artifact removal, nonlinear registration, resampling include modulation, spatial smoothingRotation of 0–40°, Translation of 10 pixels.CNN
GM normalized volume map model
[11]T1-MRI: GM/WM/CSF segmentation, skull stripping, linear registration, spatial normalization, bias correction
rs-fMRI: Removal of initial volumes, slice-timing correction, motion correction, rigid-body registration, spatial normalization, Gaussian smoothing, denoising, temporal filtering
BrainDCNw
Dual-modality (weighted rs-fMRI functional connectivity + weighted T1 brain volume) model.
(10-fold cross-validation).
[12]Skull stripping, linear registration, GM segmentation, resample and alignWeighted MAE Loss (assign higher importance to samples from underrepresented age groups).Lightweight CNN
GM model.
[13]Brain region segmentation, skull stripping, affine registration, volume normalization RFBLSO
Brain region normalized volume feature model.
[14]Bias correction, skull stripping, affine registration, spatial smoothing, spatial normalization Tri-UNet
WB model.
[15]GM/WM segmentation, skull stripping, linear registrationRotation of 0–40°, Translation of 10 voxels.VGG-13
WB, GM and WM combination model (10-fold cross-validation).
[6]Cropping to the same size, intensity normalization e DenseNet121
WB (non-skull-stripped) model.
[16]MRI: GM/WM/CSF segmentation, cortical segmentation, skull stripping, nonlinear registration, Z-score normalization
MEG: Filtering, resampling, denoising, leakage correction, source localization, brain region segmentation
CCA+GPR
MRI-MEG feature stacking model
(10-fold cross-validation)
[9]GM/WM/CSF segmentation, nonlinear registration, spatial modulationSingle-subject longitudinal model (one subject with one or multiple MRI scans from different time).CNN
GM density map model.
[17]GM/WM segmentation, smoothing, affine registration, nonlinear registration, spatial normalization, modulation, bias correction SVR
GMV and WMV combination model.
e T2-w MRI is used as the input, and for other models, if not specified, T1-w MRI is used as the input.
Table 3. Statistical Data of Healthy Samples from Each Dataset.
Table 3. Statistical Data of Healthy Samples from Each Dataset.
SamplesAverage Age ± Standard DeviationAge RangeFemale (%)
ABIDE56117.1 ± 7.76–5617.6%
ADNI56271.5 ± 7.251–9059.4%
CamCAN65354.8 ± 18.618–8950.5%
IXI56348.7 ± 16.520–8655.6%
Table 4. Model Architecture with Regression Strategy.
Table 4. Model Architecture with Regression Strategy.
Layer ConfigurationInput → Output (Channels)
Conv → BN → Pool → ReLU1 → 32
Conv → BN → Pool → ReLU32 → 64
Conv → BN → Pool → ReLU64 → 128
Conv → BN → Pool → ReLU128 → 256
Conv → BN → Pool → ReLU256 → 256
Conv (1 × 1 × 1) → BN → Pool → ReLU256 → 64
Global Average Pooling-
Dropout-
Conv (1 × 1 × 1)64 → 1
Table 5. Comparison of Parameter Counts between the Proposed Model and Other Architectures.
Table 5. Comparison of Parameter Counts between the Proposed Model and Other Architectures.
ArticleModel ArchitectureParameter Counts
OursOurs2,953,586
[25]ResNet-18 (3D)33,186,546
[26]DenseNet-121 (3D)11,293,874
Table 6. Model Architecture with Softmax Strategy.
Table 6. Model Architecture with Softmax Strategy.
Layer ConfigurationInput → Output (Channels)
Conv → BN → Pool → ReLU1 → 32
Conv → BN → Pool → ReLU32 → 64
Conv → BN → Pool → ReLU64 → 128
Conv → BN → Pool → ReLU128 → 256
Conv → BN → Pool → ReLU256 → 256
Conv (1 × 1 × 1) → BN → Pool → ReLU256 → 64
Global Average Pooling-
Dropout-
Conv (1 × 1 × 1)64 → 50
Softmax-
Weighted Sum50 → 1
Table 7. Softmax Layer Output Illustration.
Table 7. Softmax Layer Output Illustration.
Output of the Softmax Layer
Channel ( c ) 123484950
Age Class ( a c ) 2469698100
Probability ( p i ) p1p2p3p48p49p50
Table 8. Comparison of Regression and Softmax Performance.
Table 8. Comparison of Regression and Softmax Performance.
ModelOptimizerDropout RateTraining Set MAETest Set MAE
RegressionSGD01.194.66
Adam01.194.34
Adam0.21.424.38
SoftmaxSGD01.134.43
Adam01.124.38
Adam0.21.064.33
Adam0.51.264.35
Table 9. Comparison of Model Performance with and Without Data Augmentation.
Table 9. Comparison of Model Performance with and Without Data Augmentation.
Test Set with Data AugmentationTest Set Without Data Augmentation
ABIDE2.852.77
ADNI 4.334.14
CamCAN5.534.90
IXI4.344.33
Whole Test Set4.334.08
Table 10. Comparison of Top Models in Brain Age Prediction Based on MAE.
Table 10. Comparison of Top Models in Brain Age Prediction Based on MAE.
ArticlesTraining/Test (Images)Test Set Age Range (Years)Test Set MAE (Years)
Ours1871/4686–904.08
Ours1871/110 a6–562.77
[12]1801/2256–903.56
[13]850/33219–844.60
[15]1464/11818–943.08
[6]15,146/466118–952.97
[9]5865/55046–964.45
[2]1909/12,39545–803.63
[3]1177/117719–824.29
[4]1601/20018–904.16
a ABIDE only.
Table 11. Comparison of Related Studies Using CamCAN as Test Set.
Table 11. Comparison of Related Studies Using CamCAN as Test Set.
ModelTraining/Test (n Images)Test Set MAE (Years)
[11]-5.47
[14]521/1307.46
[16]
(MRI + MEG)
-4.88
[16]
(MRI only)
-5.33
[17]2003/6485.08
Ours1871/1354.90
Table 12. Sample distribution for ROI analysis.
Table 12. Sample distribution for ROI analysis.
Age GroupCN (Images)AD (Images)
15–2034-
55–652715
70–7556116
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chang, T.-A.; Yan, S.-Y.; Wang, K.-C.; Hung, C.-W. Softmax-Derived Brain Age Mapping: An Interpretable Visualization Framework for MRI-Based Brain Age Prediction. Electronics 2026, 15, 220. https://doi.org/10.3390/electronics15010220

AMA Style

Chang T-A, Yan S-Y, Wang K-C, Hung C-W. Softmax-Derived Brain Age Mapping: An Interpretable Visualization Framework for MRI-Based Brain Age Prediction. Electronics. 2026; 15(1):220. https://doi.org/10.3390/electronics15010220

Chicago/Turabian Style

Chang, Ting-An, Shao-Yu Yan, Kuan-Chih Wang, and Chung-Wen Hung. 2026. "Softmax-Derived Brain Age Mapping: An Interpretable Visualization Framework for MRI-Based Brain Age Prediction" Electronics 15, no. 1: 220. https://doi.org/10.3390/electronics15010220

APA Style

Chang, T.-A., Yan, S.-Y., Wang, K.-C., & Hung, C.-W. (2026). Softmax-Derived Brain Age Mapping: An Interpretable Visualization Framework for MRI-Based Brain Age Prediction. Electronics, 15(1), 220. https://doi.org/10.3390/electronics15010220

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop