1. Introduction
Aging is a universal process marked by profound changes in brain structure, function, and connectivity [
1,
2,
3]. These changes are central to understanding the neural basis of age-related cognitive decline and the heightened vulnerability to neurodegenerative diseases in older adults [
4]. Across the lifespan, specific brain regions, such as the hippocampus, ventricles, and cortical lobes, exhibit dynamic alterations in volume and morphology, reflecting underlying neurobiological processes, including synaptic loss, myelin degradation, and cerebrovascular changes [
5,
6,
7]. One particular method to detail such alterations is to measure the volumetric difference in brain regions from the subjects of various age groups through imaging tools such as MRI [
8,
9]. However, even recent large-scale normative imaging studies face challenges in assembling demographically diverse datasets across the full adult lifespan, often emphasizing midlife participants and predominantly single ethnicity populations [
10,
11,
12].
To address such challenges, we investigate structural brain changes across a large, diverse cohort of subjects aged 20 to 90 years, leveraging high-resolution 3T MRI and neuroimaging tools capable of segmenting 95 brain structures. By integrating datasets from distinct sources, including a Korean population, a widely used European Information eXtraction from Images (IXI) dataset, and the predominantly Caucasian Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset, we aim to capture a comprehensive view of age-related trajectories across different demographics [
13,
14]. Importantly, this integration provides complementary normative references to existing single-cohort efforts by spanning early adulthood through late life and incorporating multiple ethnic groups, thereby supplementing underrepresented ethnicities and age ranges (e.g., pre-50 participants absent in ADNI).
In this study, we utilize the MTech Lab BrainMAP pipeline, an optimized combination of advanced neuroimaging tools for preprocessing and segmentation of MRI images. The pipeline integrates preprocessing techniques, such as denoising, intensity normalization, and spatial normalization, with FastSurfer, a neural network-based tool for a streamlined rapid brain structure segmentation orders of magnitude faster than traditional methods such as FreeSurfer [
15,
16]. Through the pipeline, we assess the combined means and standard deviation of brain region volumes, organized into age groups (21–30, 31–40, 41–50, 51–60, 61–70, 71–80, 81–90). Each brain region was then grouped by its respective lobe location (frontal, parietal, temporal, occipital, cingulate, other). To ensure the robustness of the BrainMAP pipeline, we compare its performance against FreeSurfer, the gold standard method for segmentation of brain structures from MRI images [
17,
18]. Using a sample of 20 subjects from each dataset, we evaluate the agreement between the two methods using metrics such as intraclass correlation coefficients (ICC), and coefficient of variation (CoV). We first corrected for sex and cohort related differences to reduce potential systematic variance, then calculated the percentage change in mean bilateral brain region volumes between age groups to detail patterns of volumetric change in aging. Additionally, we utilized polynomial regression analyses to describe non-monotonic volumetric changes in certain brain volumes. Through these analyses, we aim to contribute to the understanding of normative patterns of brain aging using an integrated dataset of diverse populations for a more generalized result.
2. Materials and Methods
2.1. Subjects
A total of 1833 subjects (M = 970, F = 863) spanning three different datasets were collected for this study. Mean age and demographics details per age group for each dataset are shown in
Table 1.
2.1.1. Korean Dataset
The dataset of 364 Korean subjects spanning ages from 21 to 90 (M = 220, F = 144) with no history of major disease was organized for this study. Three subjects above or equal to age 81 were excluded due to very small sample sizes. All participants were interviewed to ensure they did not have a history of any mental disorder, neurologic disease, psychiatric disorders, or other substance use disorders. The images were obtained from Gachon University School of Medicine using a 3T Philips Achieva Scanner with the same 32-channel heal coil and pulse sequence. The T1-weighted MPRAGE sequence was taken following closely with the ADNI protocol, which utilizes the following parameters: 6.8 ms TR, 3.1 ms TE, 256 mm FOV, 9° flip angle, 1 × 1 × 1.2 mm voxel size, 170 slices without gaps.
2.1.2. IXI Dataset
The IXI dataset, which comprises 563 images of healthy subjects spanning ages from 20 to 86 were obtained from the publicly available IXI dataset (brain-development.org), which includes healthy subjects that were recruited to be scanned across multiple hospital sites in London, UK. For this study, we selected 563 T1-weighted images from unique individuals (312 females, 251 males). The IXI dataset contains data acquired at 1.5 T and 3 T field strengths across different scanner manufacturers (Philips Healthcare, Amsterdam, The Netherlands; GE Healthcare, Chicago, IL, USA); detailed acquisition parameters vary by site and are documented on the dataset’s website.
2.1.3. ADNI Dataset
T1-weighted structural MRI data were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu), consisting of 909 subjects (M = 408, F = 501), spanning ages from 51 to 90. Each baseline 3D T1-weighted image was selected corresponding to the first available scan per individual to avoid redundancy from longitudinal follow-ups. Subjects included had a Clinical Dementia Rating (CDR) global score of ≤0.5 and a Mini-Mental State Examination (MMSE) score of ≥25, to ensure the subjects had normal cognition or very mild impairment. All images were acquired according to standardized ADNI MRI acquisition protocols, which are publicly available and designed to minimize inter-site variability across scanner platforms and field strengths [
14].
Data used in the preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). The ADNI was launched in 2003 as a public–private partnership, led by Principal Investigator Michael W. Weiner, MD. The original goal of ADNI was to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD). The current goals include validating biomarkers for clinical trials, improving the generalizability of ADNI data by increasing diversity in the participant cohort, and providing data concerning the diagnosis and progression of Alzheimer’s disease to the scientific community. For up-to-date information, see
adni.loni.usc.edu. 2.2. Image Processing
Each 3D T1w image from the three datasets were processed through the MTech Lab BrainMAP 1.0 fully automated image processing pipeline, which incorporates a series of preprocessing steps and 3D CNN-based FastSurfer to segment the whole brain into 95 anatomical and cortical structures in under a minute. The preprocessing steps involve normalizing the intensity of the image through N4 biasfield correction, conforming the orientation of the brain to the standard MNI152 space, then resampling the image to a 256 × 256 × 256 1 mm isotropic space. The preprocessed images are then fed to the FastSurfer CNN model that segments the image into 95 structures defined by the Desikan-Killiany-Tourville (DKT) atlas, as well as output the volumetric information of each segmented structure in mm3.
2.3. Statistical Analysis
To validate the segmentation accuracy and consistency of the BrainMAP pipeline, we compared its results against FreeSurfer 7.4, which has been recently enhanced with modules such as SynthSeg, SynthStrip, and SynthMorph for improved robustness and speed [
19]. A subset of 20 subjects from each dataset (Korean, IXI, and ADNI) was processed using both pipelines. For this comparison, brain structures were grouped into six broader anatomical categories: subcortical nuclei, frontal lobe, parietal lobe, temporal lobe, occipital lobe, and limbic lobe. Within each lobe, 90 volumes from corresponding left and right hemisphere structures were combined to reduce interhemispheric redundancy. We computed mean volume differences between FreeSurfer and BrainMAP segmentations, standard deviation of differences, intraclass correlation coefficients (ICC), coefficients of variation (CoV), effect sizes (D), and coefficient of determination (R2). High intraclass correlation coefficients and low coefficients of variation would indicate strong agreement between the pipelines.
For comprehensive analysis and improved interpretability, the 95 anatomical regions (90 bilateral regions, 5 non-bilateral: 3rd and 4th ventricle, brain stem, white matter Hypointensities, cerebrospinal fluid) segmented by the BrainMAP pipeline were grouped into six categories based on neuroanatomical convention: subcortical (hippocampus, amygdala, caudate, accumbens, pallidum, putamen, thalamus, choroid plexus, ventral diencephalon (DC)), frontal lobe (caudal middle frontal, rostral middle frontal, superior frontal, lateral orbito frontal, medial orbito frontal, pars operacularis, pars orbitalis, pars triangularis, precentral), parietal lobe (inferior parietal, superior parietal, supra marginal, paracentral, postcentral, precuneus), temporal lobe (entorhinal, fusiform, insula, inferior temporal, middle temporal, superior temporal, transverse temporal, parahippocampal), occipital lobe (cuneus, lateral occipital, lingual, pericalcarine), cingulate lobe (caudal anterior cingulate, isthmus cingulate, posterior cingulate, rostral anterior cingulate), and other (cerebral white matter, cerebellum, cerebellum white matter, inferior lateral ventricles, 3rd and 4th ventricles, cerebrospinal fluid (CSF), brain stem, white matter (WM) hypointensities).
For each age group, the mean and standard deviation of these bilateral volumes were computed after excluding zero values. In addition, 95% confidence intervals (CI) were calculated using the standard error of the mean for each region’s volume distribution across age. Means of left and right hemisphere volumes were summed for each brain region to obtain combined bilateral measurements, then categorized into six separate tables (one for each anatomical category) to improve readability and to facilitate cross-regional comparison.
Regional volumes were adjusted for sex and cohort effects using linear regression models. For each brain region, volume was modeled as
where Vi represents the observed volume for participant i, β0 is the intercept, β1, β2, and γk are regression coefficients, Sex
i was coded as Male = 1, Female = 0, Age
i represents chronological age as a continuous variable, 1{Cohort
i = k} is an indicator function equal to 1 if participant i belongs to cohort k and 0 otherwise, K represents the total number of datasets, and ε
i is the residual error term. Corrected volumes were computed by removing fitted sex and cohort contributions, preserving age-related variance and individual differences. We used the following equation to compute corrected volumes:
where Vi
corrected denotes the corrected volume for participant i and all other terms are as defined above. To verify removal of cohort and sex effects, we re-tested their residual associations on corrected volumes using the following linear model:
where δ0 is the intercept, δ1 is the regression coefficient for sex, δk are regression coefficients for cohort k, η
i is the residual error term for participant i, and 1{Cohort
i = k} is an indicator function equal to 1 if participant i belongs to cohort k and 0 otherwise. Under successful correction, we expect δ1 ≈ 0 (no residual sex effect) and δk ≈ 0 for all k (no residual cohort effects).
p-values for the post-correction coefficient tests were adjusted using the Benjamini–Hochberg false discovery rate across all regions (and across cohort levels where applicable).
To characterize non-linear age-related trajectories, we implemented a comprehensive polynomial regression analysis fitting linear and cubic models to each brain region. Non-monotonic patterns were identified using peak detection algorithms applied to smooth polynomial curves, with regions classified as U-shaped (local minimum in middle age), inverted-U (local maximum in middle age), or linear. The cubic model used to fit each brain region can be described by the following equation:
where Vi represents mean volume for age group i, α0, α1, α2, and α3 are polynomial coefficients, and ε
i is the error term.
R
2 improvement values over linear models were obtained for each pattern (U-shaped, inverted-U), and improvement values higher than 0.2 were considered more fit for that pattern than the conventional linear pattern. We then obtained three different measures to describe age-related change. Endpoint changes (% change) were used to compare the mean volumetric variability between the youngest age group (21–30) with the oldest age group (81–90). The endpoint changes were derived using the following equation:
where V
young refers to the mean bilateral volume of the youngest age group (21–30 years), and V
old refers to the mean bilateral volume of the oldest age group (81–90).
To describe the largest percentage change between any two adjacent age groups, we obtained the maximum consecutive change using the following equation:
where Vi represents the mean volume for age group i, V{i + 1} represents the mean volume for the subsequent age group, and the age groups are indexed as follows: 1 = (21–30), 2 = (31–40), 3 = (41–50), 4 = (51–60), 5 = (61–70), 6 = (71–80), and 7 = (81–90 years). The maximum operator selects the largest absolute percentage change across all six possible consecutive age group pairs.
To describe the maximum variability across all age groups (comparing the minimum and maximum mean volumes across all age groups), we used the following equation:
where V
max refers to the largest mean bilateral volume value out of all age groups (21–90 years), and V
min refers to the smallest mean bilateral volume value of all age groups (21–90 years).
All analyses were descriptive in nature. No formal hypothesis tests of aging effects were performed, as the aim of this study was to provide normative volumetric references rather than inferential comparisons.
4. Discussion
For this study, normative patterns of structural brain aging across broad and demographically diverse populations spanning ages 20 to 90 years were characterized using an integrated dataset composed of high resolution 3T MRI from three distinct cohorts involving Korean, European (IXI), and predominantly Caucasian (ADNI) populations. Initially, the MTech Lab BrainMAP segmentation and processing pipeline used to obtain the volumes of brain regions was validated for its robustness by calculating its agreement with FreeSurfer, the gold standard for segmentation of brain structures, on 20 subjects randomly selected from each population. The validation analysis demonstrated strong agreement between the BrainMAP pipeline and FreeSurfer pipeline, indicated by the high intraclass correlation coefficient (mean ICC = 0.9965) and low coefficients of variation (mean CoV = 5.55%) across all brain regions.
Subsequently after validation, the pipeline was used to process each 3T MRI in the integrated dataset for volumes of 95 brain regions. First, in order to reduce the effects of glaring issues in sex and cohort differences on brain volume, a regression-based was utilized to isolate age-related brain changes from demographic confounds by fitting separate models for each brain region. This approach modeled each region’s volume as a function of sex, age, and cohort membership, then removed only the fitted sex and cohort contributions while preserving age effects and individual differences. Before correction, sex differences were present in nearly all brain regions (91/95), with males typically showing larger volumes, and cohort effects were equally widespread, reflecting technical differences between datasets. The correction proved highly effective, reducing sex effect sizes by over 80% and completely eliminating cohort effects across all regions. Post-correction validation confirmed successful removal of unwanted effects, with sex differences remaining significant in only 3 regions and cohort effects eliminated entirely. The high preservation of individual differences (ICC = 0.94) and age-related variance (partial r2 = 0.31) indicated successful separation of demographic confounds from meaningful biological variability without artificially removing true age effects.
With the corrected data, we calculated endpoint changes using traditional young-to-old comparisons between the 21–30 and 81–90 age groups to quantify overall age-related volumetric changes. Calculation of percentage change in mean bilateral volumes across age groups showed that ventricular structures (inferior lateral ventricle, 116.8%; lateral ventricle, 115.9%; 3rd ventricle, 79.4%), choroid plexus (77.8%), CSF (41.1%) and white matter hypointensities (122.6%) exhibited the largest volumetric expansions. The frontal cortical regions demonstrated substantial volume reductions of up to 23.2%, whereas the parietal (15.3%), cingulate (16.6%), temporal (16.6%), and occipital (10.7%) showed modest shrinkages. Certain cortical and subcortical regions such as the pallidum (2.6%), insula (3.7%), pericalcarine (2.5%), amygdala (7.3%), and cerebellum cortex (4.9%), showed minimal volume shrinkages over the lifespan. Conversely, certain regions such as the caudate (6.6%), entorhinal (2.7%), brain stem (5.2%) showed modest volumetric increases.
As shown in
Table 3, the most drastic volumetric expansion between age ranges was the white matter hypointensities (122.6%), an area that represent areas of signal abnormality on MRI associated with reduced tissue integrity, often due to cerebrovascular pathology such as ischemic injury, demyelination, and microvascular dysfunction [
20,
21,
22,
23]. The dramatic increase observed likely reflects progressive age-related vascular damage, potentially driven by chronic hypertension, arteriosclerosis, or small vessel disease, ultimately contributing to cognitive decline by disrupting neural connectivity and efficiency [
20,
24]. Regions demonstrating similar magnitude of volumetric expansion included the inferior lateral ventricles (116.8%), lateral ventricles (115.9%), third ventricle (79.4%), consistent with results of previous studies that have attributed the enlargement to generalized parenchymal atrophy and CSF redistribution [
25,
26,
27]. Choroid plexus (77.8%) also showed significant enlargement with age, similar to previous studies that have attributed the correlation with compensatory hypertrophy to perfusion loss [
28].
In aging populations, significant atrophy was also observed within frontal and parietal lobes integral to executive functions, spatial orientation, and sensory integration. The pars triangularis (21.5% reduction), rostral middle frontal gyrus (23.2%), and caudal middle frontal gyrus (13.7%) are critical for language processing, executive functioning, and working memory [
29,
30]. As such, their volumetric declines may contribute directly to cognitive deficits frequently observed in aging populations. Similarly, the inferior parietal lobule (14.2% reduction), involved in visuospatial processing and attention, exhibited atrophy, potentially underpinning age-related spatial and attentional deficits [
31,
32].
Additionally, aging populations demonstrated notable volumetric reductions in the temporal lobe structures associated with functions relating to auditory processing, including the transverse temporal gyrus (14.9% reduction) [
33,
34]. Such patterns could reflect deterioration of auditory processing speed and accuracy common in older adults [
35,
36]. Moderate declines in the cingulate cortex, particularly the caudal anterior cingulate (16.6%) and posterior cingulate (15.6%), suggest vulnerability in regions associated with emotion regulation and episodic memory retrieval, potentially contributing to subtle cognitive and emotional alterations observed in aging [
37,
38,
39].
Conversely, certain regions displayed relative volumetric stability, particularly in the insula (3.7%), caudate nucleus (5.4%), pallidum (2.6%), and pericalcarine (2.5%), regions that support diverse functions, including interoceptive awareness, reward and motor control [
40,
41,
42,
43]. Their minimal volume change suggests preserved functionality or compensatory neuroplastic mechanisms mitigating typical aging effects, contributing to maintained cognitive and emotional functionality in older adults [
44].
To identify potential non-linear age trajectories that might be missed by endpoint comparisons, we applied polynomial regression analysis and peak detection algorithms to classify regions as linear, U-shaped, or inverted-U patterns. While the majority of brain regions showed expected linear age-related changes, seven regions displayed clear non-monotonic trajectories with substantial improvements in model fit over linear approaches (R
2 improvement range: 0.231–0.766). Three regions exhibited U-shaped patterns, characterized by volume minima in middle age: the caudate (6.6% endpoint change, 10.4% range change), pallidum (−2.6% endpoint change, 5.3% range change), and insula (−3.7% endpoint change, 4.3% range change). These basal ganglia and insular cortex regions may experience initial vulnerability during middle age due to metabolic stress or lifestyle factors, followed by structural stabilization through compensatory mechanisms or changes in tissue composition. In contrast, four regions showed inverted-U trajectories with volume maxima in middle age: the entorhinal cortex (2.7% endpoint change, 9.5% range change), hippocampus (−11.3% endpoint change, 12.7% range change), amygdala (−10.9% endpoint change, 10.9% range change), and brainstem (5.2% endpoint change, 5.2% range change). These limbic and memory-related structures may reach peak structural integrity in middle age before succumbing to age-related pathological processes, consistent with their known vulnerability to neurodegenerative changes and early involvement in conditions such as Alzheimer’s disease [
3,
5]. The presence of such patterns highlights the complexity of brain aging processes and suggests that linear models may not fully capture volumetric change dynamics in all brain structures [
5,
45].
This study has several limitations that should be acknowledged. First, the cross-sectional design limits causal inference and does not allow us to distinguish true age-related trajectories from potential cohort effects. Additionally, although preprocessing and regression-based adjustments were performed, residual scanner- or site-specific biases may remain. We attempted to mitigate these issues by statistically correcting for both sex and dataset effects, which substantially reduced systematic variance, but there were still regions that showed significant differences in three regions (3rd ventricle, left and right choroid plexus), and cannot fully exclude subtle harmonization differences. Second, potential confounding factors such as genetic background, environmental exposures, and socioeconomic status were not controlled for and may contribute to inter-cohort variability. Additionally, although the cohort consisted of cognitively healthy subjects, we cannot rule out potential diagnosed diseases or non-neuro related diseases that could affect the imaging of subjects. Third, sample sizes were uneven across age bins, particularly in the oldest groups. While we excluded the Korean 81–90 bin due to insufficient numbers (n = 3), results in the older bins of other cohorts should still be interpreted cautiously given possible small-sample noise. Fourth, although non-monotonic patterns were explicitly modeled and described, longitudinal data are ultimately required to confirm these trajectories and to distinguish age effects from generational influences. Fifth, volumetric variability was higher in certain regions, particularly the cerebellum, likely reflecting segmentation challenges in posterior fossa structures. Sixth, volumetric expansions in ventricles and white matter hypointensities, while consistent with prior aging literature and robust after dataset correction, may remain sensitive to acquisition differences and could partly reflect subclinical pathology in some participants. Finally, interpretations of functional or cognitive implications of volumetric changes remain speculative in the absence of behavioral data, and future multimodal studies are needed to link structural changes with functional outcomes.