1. Introduction
Hydrocephalus constitutes a prevalent but extremely diverse neurological disorder defined by abnormal cerebrospinal fluid (CSF) dynamics and ventricular enlargement [
1,
2]. It encompasses a heterogeneous group of disorders with distinct pathophysiological mechanisms, ranging from acute obstruction and inflammatory responses in post-hemorrhagic hydrocephalus to chronic disturbances of CSF dynamics in idiopathic normal-pressure hydrocephalus, often coexisting with neurodegenerative pathology [
1,
2]. Despite significant breakthroughs in neuroimaging and neurosurgical procedures, hydrocephalus diagnosis, monitoring, and treatment decisions remain difficult in everyday clinical practice [
3]. This issue stems from a wide range of etiologies, diverse clinical presentations, and a lack of widely accepted imaging biomarkers that accurately represent disease severity, progression, and therapy responsiveness [
1,
4,
5].
Neuroimaging is essential in the evaluation of hydrocephalus, providing a foundation for diagnosis, etiologic assessment, longitudinal monitoring, and post-treatment evaluation [
6,
7]. The most often employed modalities are computed tomography (CT) and magnetic resonance imaging (MRI), with each providing complementary benefits [
8,
9]. In real-world clinical settings, imaging methods are rarely standardized, and patients frequently receive numerous examinations across different modalities and time points based on clinical urgency, accessibility, and institutional policy [
10,
11].
Simple linear ventricular measurements have long been used as practical markers of ventricular enlargement due to their simplicity and reproducibility. However, the commonly cited thresholds for these measures were derived primarily from Western European and North American populations and should not be interpreted as biologically universal or population-independent norms. Their clinical utility lies less in absolute cut-off values and more in their role as operational reference points that facilitate communication, longitudinal comparison, and decision support within specific clinical contexts [
12]. The Evans index, which is defined as the ratio of maximum frontal horn width to maximum inner skull diameter, has long been utilized as a screening tool for ventriculomegaly [
13,
14]. Additional linear measures, such as third-ventricle width, temporal horn enlargement, frontal and occipital horn ratio and callosal angle, are frequently reported and aid in the radiologic assessment of hydrocephalus severity and subtype [
9,
15,
16]. Despite their extensive use, these measures are not without limitations. They only provide a rough estimate of ventricular size, are affected by age-related brain atrophy and skull morphology [
17,
18]. Furthermore, their ability to identify minor longitudinal changes has been questioned [
19]. Baseline ventricular anatomy also exhibits substantial inter-individual variability and is influenced by age-related brain changes, comorbidity burden, and broader population-specific health trajectories.
Variability caused by different acquisition techniques and cross-modality assessment is a persistent challenge in hydrocephalus imaging [
20,
21]. While MRI is frequently regarded as the gold standard for ventricular and parenchymal examination, CT remains the preferred modality in many clinical settings [
22,
23]. As a result, basic linear measurements remain the most readily available and routinely used imaging techniques worldwide, especially in settings where CT is the primary modality and MRI access is limited [
12]. Longitudinal imaging assessment is equally challenging due to heterogeneous follow-up intervals, mixed imaging modalities across time points, and clinically driven rather than protocolized imaging schedules [
24]. Patients with hydrocephalus frequently receive repeated imaging to track disease progression, guide treatment decisions, and assess post-intervention results. However, follow-up intervals, imaging modalities, and measurement availability are extremely uneven [
22].
Recent advances in artificial intelligence and automated image processing have yielded promising volumetric and prognostic approaches to hydrocephalus diagnosis [
25,
26,
27]. However, these methods frequently necessitate specialized infrastructure, curated datasets, and extensive validation, which limits their immediate use in many institutions [
28,
29,
30]. Against this background, there is a clear need for research on the performance, agreement, and longitudinal behavior of basic ventricular imaging markers.
The present study addresses this gap by examining a retrospective cohort of adults with hydrocephalus evaluated under routine clinical conditions at a single tertiary center. Linear ventricular measurements are treated as geometric descriptors of ventricular size rather than as population-normalized diagnostic constructs. We aimed to assess cross-modality agreement between CT- and MRI-derived linear ventricular measurements, characterize longitudinal changes in ventricular size over time and following CSF diversion, and evaluate the relationship between baseline imaging features and subsequent VP shunt placement. Through this approach, we aim to contribute evidence that bridges the gap between advanced imaging research and everyday clinical practice, reinforcing the role of accessible, reproducible imaging metrics while acknowledging their limitations.
2. Materials and Methods
2.1. Study Design
We performed a single-center, retrospective observational studyat the Timis County Emergency Hospital ‘Pius Brânzeu’ Timișoara, Romania, including consecutive patients evaluated between January 2023 and December 2024 with a clinical and radiological diagnosis of hydrocephalus. This diagnosis of hydrocephalus was established by the treating neurology or neurosurgery team, supported by radiological evidence of ventricular enlargement and CSF circulation disturbance. All analyses were conducted on de-identified records according to institutional policy and in line with STROBE recommendations for observational studies.
2.2. Participants
Eligible patients were identified using consecutive sampling from institutional clinical and radiological records. All adult patients evaluated for suspected or confirmed hydrocephalus at the Clinical County Emergency Hospital ‘Pius Brânzeu’ Timișoara during the study period were screened for eligibility. Inclusion criteria were: (i) age ≥ 18 years; (ii) evaluation between January 2023 and December 2024; (iii) a clinical diagnosis of hydrocephalus established by the treating neurology or neurosurgery team; (iv) radiological evidence of ventricular enlargement and CSF circulation disturbance. Patients were included irrespective of hydrocephalus etiology or management strategy; inclusion was not limited to cases undergoing VP shunt placement.
At least one cranial CT or MRI examination of sufficient quality was required for review. Patients were excluded if imaging quality was insufficient for reliable linear ventricular measurements, including cases with severe motion artifacts, incomplete ventricular coverage, or postoperative changes obscuring relevant anatomic landmarks. During the study period, 96 patients were identified as having a clinical diagnosis of hydrocephalus. Of these, 28 were excluded, resulting in a final analytic cohort of 68 patients. Because imaging availability and completeness varied across individuals and time points, analytic denominators differed by variable and analysis; these are explicitly reported throughout. A complete-case approach was used for all analyses, and no imputation of missing values was performed.
2.3. Clinical Data and Variables
Demographic variables included age, sex and race/ethnicity at the time of the first available imaging examination. All patients were treated within a publicly funded healthcare system providing universal access. Etiology of hydrocephalus was abstracted from clinical documentation and categorized into predefined groups: post-hemorrhagic, tumor-related, post-infectious, idiopathic/normal-pressure hydrocephalus, other secondary causes, or unknown. Device-related variables included placement of an external ventricular drain (EVD) and VP shunt, recorded as binary indicators of exposure at any point during follow-up (“ever-EVD” and “ever-VP”). Dates of device placement were recorded when available to allow temporal pairing with imaging studies.
2.4. Imaging Acquisition and Selection
All available cranial CT and MRI examinations performed during the study period were screened. Imaging protocols were not standardized and reflected routine clinical indications, including emergency assessment, inpatient monitoring, and outpatient follow-up, for both CT and MRI examinations. When multiple examinations were available for a given patient, scan dates were used to establish temporal sequences and to enable longitudinal and cross-modality analyses, with examinations systematically ordered in ascending chronological order for each patient. For analyses requiring modality pairing, CT–MRI pairs were identified based on temporal proximity. Imaging acquisition and follow-up reflected institutional practice patterns at our center and were not intended to represent standardized or universally adopted imaging protocols.
2.5. Imaging Measurements
CT examinations were performed using multi-detector scanners (64–128 slice) operating under routine clinical protocols, including non-contrast axial acquisitions with slice thickness ranging from 0.6 to 5 mm. MRI studies were acquired on 1.5T and 3T scanners using standard clinical sequences, including axial T1-weighted and T2-weighted images with slice thicknesses of 3–5 mm. The Evans index was defined as the ratio of the maximal frontal horn width of the lateral ventricles to the maximal inner skull diameter on the same axial slice [
14]. Third-ventricle (V3) width was measured as the maximal transverse diameter on axial images at or near the level of the foramina of Monro [
31]. The callosal angle was measured on coronal images reconstructed perpendicular to the anterior–posterior commissural plane at the level of the posterior commissure, defined as the angle between the medial walls of the lateral ventricles [
32]. Temporal horn width was measured as the maximal transverse diameter on axial images [
33]. Measurements were performed using digital caliper tools available on the institutional PACS system. Etiologic categories were recorded for descriptive purposes. Linear ventricular measurements were analyzed as continuous variables or used for descriptive stratification only. No diagnostic classification or validation of population-independent cutoff values was performed or intended.
Transependymal CSF exudation was recorded as present or absent based on periventricular hypoattenuation on CT or periventricular hyperintensity on T2-weighted or fluid-attenuated inversion recovery (FLAIR) MRI sequences [
34]. Ventricular configuration was categorized as bi-, tri-, or tetraventricular enlargement according to radiology reports and imaging review. Certain imaging features were not assessable in all patients due to variability in imaging quality and protocols. For the features that could not be reliably assessed in all patients, results are reported as
n/
N, where
n reflects the number of evaluable examinations.
All measurements were performed by trained clinicians with experience in neuroimaging interpretation (radiology and neurosurgery backgrounds). Measurements were conducted independently for CT and MRI examinations. In cases of ambiguity or borderline measurements, images were reviewed jointly to reach consensus. Formal inter-operator variability was not quantified. Operators were not involved in clinical decision-making regarding VP shunt placement at the time of measurement. To minimize measurement bias, operators adhered to predefined anatomical landmarks and measurement conventions, and ambiguous cases were reviewed jointly to achieve consensus. Measurements were performed without access to longitudinal outcomes or treatment decisions when feasible.
To assess intra-observer reliability, a randomly selected subset of 20 imaging examinations was re-measured by the same observer after a washout period of at least two weeks, blinded to the initial measurements. Intra-observer agreement for linear ventricular measurements was evaluated using intraclass correlation coefficients (ICC; two-way mixed-effects model, absolute agreement) and Bland–Altman analysis.
2.6. Outcomes
The primary outcome was VP shunt placement at any time during follow-up (“ever-VP”), selected as a clinically meaningful endpoint reflecting the decision to initiate long-term CSF diversion. Secondary outcomes included ever-EVD placement and the operating characteristics of simple imaging-based rules derived from ventricular measurements, evaluated against subsequent VP shunt placement. VP shunt placement was analyzed as a contextual clinical outcome reflecting integrated, individualized decision-making rather than as an independent validation standard for imaging measurements
2.7. Longitudinal Assessments
Two longitudinal analyses were prespecified. First, the last-to-first change was calculated for patients with two or more dated imaging examinations by subtracting the initial measurement from the final available measurement, irrespective of imaging modality. This approach was chosen to capture overall directional change in ventricular size during follow-up under real-world imaging conditions within a tertiary-care hospital.
Second, in patients who underwent VP shunt placement, pre–post change was assessed by comparing the last available pre-operative measurement with the first post-operative measurement. The interval between surgery and the post-operative scan was recorded to contextualize early imaging changes following shunting.
2.8. CT–MRI Pairing and Agreement
To assess cross-modality agreement, patients with both CT and MRI examinations were matched by identifying the closest CT–MRI pair within a 90-day window, minimizing the absolute time difference between examinations. If multiple pairs met this criterion, the earliest pair was selected. Agreement between modalities was evaluated using Pearson correlation coefficients, Bland–Altman analysis with mean bias (MRI minus CT) and 95% limits of agreement, and the median absolute difference as a robust measure of dispersion.
2.9. Statistical Analysis
Continuous variables are reported as medians with interquartile ranges due to non-normal distributions and bounded measurement scales. Categorical variables are summarized as counts and percentages. Associations between baseline binary imaging features and VP shunt placement were evaluated using Fisher’s exact test, with odds ratios and exact p-values reported. Operating characteristics of prespecified imaging-based rules were calculated as sensitivity, specificity, positive predictive value, negative predictive value, and balanced accuracy. All tests were two-sided with a significance level of p < 0.05. No formal adjustment for multiple comparisons was applied, given the exploratory nature of the association analyses. Pearson correlation coefficients were used to assess the strength of linear association between CT and MRI measurements at the group level. Agreement between modalities was further evaluated using Bland–Altman analysis to quantify systematic bias and limits of agreement.
Longitudinal imaging data were incomplete, with quantifiable Evans index change available in only a subset of patients. No assumptions were made regarding the randomness of missing data, and formal missing-data modeling or imputation was not performed. Longitudinal analyses were therefore treated as exploratory, and results were interpreted cautiously in light of potential systematic missingness related to clinical follow-up patterns and care pathways. For the association analyses, p-values were adjusted for multiple comparisons using the Holm–Bonferroni method (family-wise error rate 0.05).
2.10. Ethics
This study was approved by the Ethics Committee of Timis County Emergency Clinical Hospital (approval number 574/16 October 2025). The analysis was conducted on de-identified data, involved no interventions, and posed no additional risk to patients.
4. Discussion
Within routine clinical imaging practice, linear ventricular measures demonstrated cross-modality agreement and directionally consistent changes following CSF diversion. Imaging-based criteria were associated with subsequent VP shunt use, supporting the practical utility of commonly applied ventricular markers in hydrocephalus assessment. Rather than relying on standardized imaging protocols or controlled follow-up, this study reflects the range of clinical and imaging scenarios encountered in routine hydrocephalus evaluation at a single Romanian tertiary-care hospital.
Linear indices such as Evans’ index provide only a coarse approximation of ventricular enlargement. Other studies have explored automated CT-based volumetric measures derived using deep learning, including transfer-learning pipelines that leverage MRI-derived labels and manual CT annotations in idiopathic normal pressure hydrocephalus (iNPH) cohorts [
35,
36]. Previous research has shown that these metrics maintain good cross-modality agreement and clinical relevance. Recent work comparing CT, conventional MRI, and mobile low-field MRI reported excellent agreement for Evans index and ventricular width measurements, despite heterogeneous acquisition protocols and real-world follow-up conditions providing conceptual consistency regarding measurement behavior across modalities [
37]. In a cohort of definite iNPH patients, Takagi et al. reported only moderate interobserver agreement for Evans’ index measurements and demonstrated that more than 30% of patients would be missed when relying solely on this threshold [
21].
The study of Kockum showed generally substantial to almost perfect agreement between readers and imaging modalities for most iNPH radiological features, such as Evans’ index, temporal horn size, Sylvian fissures, and callosal angle, with lower agreement for white matter changes and focally enlarged sulci [
38]. The consistency observed between CT- and MRI-derived Evans index and third-ventricle width supports the interchangeability of these modalities for linear ventricular assessment. Previous comparative studies have reported similar findings under controlled conditions [
8,
37]; the results extend this evidence to temporally separated examinations obtained during routine care in our institution, reinforcing the clinical utility of CT-based measurements when MRI is unavailable or delayed.
Ventricular size reduction following VP shunt placement is a commonly expected radiologic finding, although the magnitude and timing of change vary considerably across patients and etiologies [
39]. Prior research has shown that clinical improvement may precede or occur independently of substantial radiologic normalization, particularly in adult and normal-pressure hydrocephalus [
39,
40,
41]. Longitudinal MRI studies in iNPH have shown that postoperative changes may manifest predominantly as redistribution of CSF spaces and changes in indices such as callosal angle or z-axial measures, while the Evans index may change little over time, highlighting its limited sensitivity for tracking shunt-related morphologic recovery in some cohorts [
41]. In our cohort, ventricular measurements demonstrated modest but consistent reductions on early post-operative imaging, supporting the notion that large immediate changes should not be expected in all patients. These data emphasize the need to interpret post-shunt imaging in conjunction with clinical state, rather than as a standalone indicator of therapeutic efficacy. Longitudinal analyses demonstrated only small absolute changes in linear ventricular measurements over time, and the clinical significance of these modest variations remains uncertain. Given the limited magnitude of observed changes, it is difficult to determine whether such differences reflect true disease progression or therapeutic response, or instead fall within the range of expected measurement variability, particularly under non-standardized imaging conditions.
Transependymal CSF exudation is a well-described marker of elevated intraventricular pressure and acute hydrocephalus [
6,
34]. While its presence has been associated with disease severity, it is not considered independently diagnostic [
34,
42]. In this study, transependymal exudation showed the largest effect size among binary imaging markers associated with subsequent VP shunting, albeit without statistical significance. This pattern suggests that transependymal edema may increase specificity when interpreted alongside ventricular size indices, supporting its role as a complementary rather than standalone feature in clinical decision-making.
VP shunt placement reflects an integrated clinical decision that incorporates imaging findings, neurological status, symptom progression, and underlying etiology [
43,
44,
45]. As such, imaging-based rules evaluated against VP utilization do not represent the prediction of a biological endpoint but rather alignment with clinical practice [
44,
45]. The observed trade-offs between sensitivity and specificity across different Evans index thresholds reflect this reality, highlighting that ventricular measures should accompany, rather than replace, full clinical assessment. This perspective aligns with broader literature cautioning against overreliance on single imaging metrics or algorithmic thresholds [
29]. The inherent difficulty of predicting shunt responsiveness in iNPH has also been demonstrated in large registry-based studies using advanced multimodal statistical models; notably, a Disease State Index approach integrating clinical, imaging, CSF, and genetic data achieved only modest discriminatory performance between shunt responders and non-responders, underscoring the limitations of algorithmic prediction in this context [
45]. The heterogeneity of ventricular response following VP shunt placement highlights an important and unresolved challenge in hydrocephalus management. Substantial reductions in ventricular size in some patients contrasted with minimal morphologic change in others, despite similar interventions. This variability reflects a combination of etiologic differences, disease chronicity, baseline ventricular compliance, timing of shunt placement, and coexisting neurodegenerative or vascular pathology.
The use of a complete-case analytic approach resulted in variable denominators across comparisons and may have introduced selection bias. Patients with paired CT and MRI examinations or available longitudinal imaging are likely to differ systematically from those without such data, potentially reflecting differences in clinical severity, follow-up intensity, or care pathways. As a result, subgroup and longitudinal findings should be interpreted cautiously and should not be assumed to be representative of the entire cohort.
The interpretation of linear ventricular measurements must be considered in light of population-specific and demographic factors. In the present study, the thresholds of these measurements were not used to assert normative or pathological states, but rather as pragmatic reference values to explore measurement behavior and clinical alignment within a defined cohort. The development of population-adapted reference ranges and normative datasets remains an important goal for future hydrocephalus research.
4.1. Limitations
Although imaging biomarkers play an important role in the diagnosis and monitoring of hydrocephalus, the findings of this study must be interpreted within clear methodological and biological constraints. This was a single-center, retrospective study conducted at a Romanian tertiary-care hospital in a small, ethnically homogeneous cohort, with limited availability of individual-level socioeconomic data, restricting generalizability to other populations and healthcare contexts.
The cohort included a heterogeneous mix of hydrocephalus etiologies, encompassing both acute and chronic pathophysiological processes, for which ventricular enlargement does not carry uniform biological or clinical meaning. As such, imaging measurements should not be interpreted as etiologically interchangeable, nor as supporting uniform diagnostic thresholds or treatment expectations. Associations were not adjusted for potential confounders such as age, etiology, or comorbidity, as multivariable modeling was not feasible given the modest sample size and small subgroup counts.
The retrospective design and non-standardized imaging schedules resulted in substantial missing data and limited availability of paired cross-modality and longitudinal measurements, with unknown missingness mechanisms. Ventriculoperitoneal shunt placement reflects individualized clinical decision-making rather than an independent biological endpoint, precluding causal inference and introducing potential incorporation bias. Some patients underwent multiple CSF diversion procedures, but longitudinal analyses were not stratified by device sequence or number. Finally, variability in imaging technologies and acquisition parameters reflects real-world practice but increases measurement uncertainty and limits individual-level interpretation. Overall, these findings require validation in larger, multi-center cohorts with diverse etiologies and practice settings.
4.2. Implications and Future Directions
All findings should be interpreted within the context of a single-center study and reflect institutional imaging practices rather than universal standards of hydrocephalus assessment. To confirm these findings, future research should concentrate on prospective, multicenter studies that employ standardized imaging techniques and have predetermined follow-up periods. Combining ventricular measures with clinical and etiologic data can improve risk assessment as well as personalized therapy planning.
Although our cohort represented a variety of hydrocephalus etiologies, formal subtype-specific analyses were not possible due to small sample counts and insufficient modality pairing. As a result, our findings reflect the reliability of linear ventricular measures across imaging modalities, rather than etiology-specific behavior. While CT-MRI agreement appears to be consistent across clinical contexts, the interpretation of these measurements, particularly in chronic forms such as iNPH, must be guided by clinical presentation and additional imaging markers.
Recent advances in artificial intelligence and deep learning have enabled automated ventricular segmentation, volumetric analysis, and prognostic modeling in hydrocephalus, with several studies demonstrating promising diagnostic and predictive performance using imaging-based approaches. However, such performance metrics must be interpreted cautiously, as many AI-based approaches rely on relatively homogeneous, single-center, or highly selected datasets. Limited representativeness and inadequate external validation may result in degraded performance when models are applied to demographically, clinically, or institutionally diverse populations, with the potential to amplify existing health disparities.
However, the apparent ‘performance gap’ between automated volumetric methods and traditional linear measurements reflects not only differences in measurement precision but also substantial differences in data requirements, computational infrastructure, and clinical deployment. Automated pipelines typically rely on standardized MRI protocols, curated training datasets, and extensive external validation, which limits their immediate applicability in routine clinical settings where CT remains the primary modality and imaging protocols are heterogeneous.
In contrast, linear ventricular indices offer modality-agnostic, rapid, and reproducible measurements that can be applied across CT and MRI without specialized software. While these measures lack the granularity of full volumetric segmentation and are insufficient as standalone prognostic tools, they are still widely used. To ensure that digital technologies augment rather than replace clinical judgment in hydrocephalus care, such strategies will require rigorous validation, openness, and ethical oversight. Future research should also assess whether automated CT-based volumetric biomarkers, generated via deep-learning segmentation of ventricular CSF, can complement basic markers in everyday practice, particularly in locations where CT is the dominant modality and MRI is not possible [
35].
Future studies should directly correlate the magnitude of change in linear ventricular measurements with standardized clinical outcome measures, such as the modified Rankin Scale or disease-specific grading systems for normal-pressure hydrocephalus. Establishing such links would help determine whether small imaging changes translate into meaningful functional improvement or clinical deterioration and would provide a clearer framework for evaluating both manual and automated measurement approaches.