1. Introduction
Contrast-enhanced T1-weighted (T1+C) magnetic resonance imaging (MRI) is routine in Radiology protocols for detection of suspected focal brain lesions. A ring-enhancing brain lesion (REBL) is a radiological abnormality describing a hypointense lesion surrounded by a bright rim of contrast enhancement from blood–brain barrier disruption. REBLs may be infective (e.g., pyogenic abscess), or neoplastic (e.g., metastasis) in origin [
1]. In tertiary referral centers with large numbers of patients immunocompromised by underlying disease or treatment, REBLs may pose a significant diagnostic challenge as opportunistic infections (e.g., nocardiosis, toxoplasmosis) add to the list of differential diagnoses (
Figure 1). Because patients with infection can rapidly deteriorate with high morbidity and mortality [
2], rapid and accurate distinction is crucial to guide subsequent diagnostic evaluation and treatment, which are vastly different between infection and neoplasm. When there is diagnostic uncertainty, empirical antibiotics are often administered and brain biopsies may be performed unnecessarily, potentially resulting in avoidable side effects and complications such as antimicrobial resistance or devastating neurosurgical sequelae.
Notwithstanding, limitations to current diagnostic approaches exist [
1,
2,
3,
4,
5,
6]. The classic triad of headache, fever, and focal neurologic deficit is present in less than a quarter of patients on admission [
2,
7,
8]. There is significant overlap in inflammatory markers such as white cell count and C-reactive protein between patients with infection and those with neoplasm [
1], and crucial microbiological investigations such as blood cultures are usually unavailable at presentation. While certain neuroimaging features may aid in distinguishing underlying pathological processes [
3,
4], there are many exceptions to the rule. For example, while satellite lesions are more characteristic of abscesses than neoplasms [
9], these are insensitive markers [
10]. On diffusion-weighted imaging (DWI), cavities of abscesses classically exhibit marked hyperintensity from restricted diffusion of contents, while those of cystic/necrotic neoplasms exhibit hypointensity, but reports of abscesses with DWI-hypointense cavities [
11,
12,
13] and neoplasms with DWI-hyperintense cavities abound [
14,
15,
16]. The subjective nature of radiological assessment, combined with factors such as reader experience, fatigue and high workload, can compromise diagnostic accuracy [
6]. Yet, accurate radiological classification into infection or neoplasm is key to triaging for immediate clinical decision-making on the diagnostic and management pathways, including whether expedited drainage or more elective/facultative biopsy of the brain lesion should be undertaken. Direct sampling of the brain lesion offers the highest diagnostic yield and is imperative for establishing the exact microbiological/histological diagnosis for definitive antimicrobial or oncological treatment. However, it may be associated with untenable risks of neurological deficits and hemorrhage [
5], and patient comorbidities may also preclude neurosurgery under general anesthesia. An automated 24/7 objective and accurate imaging-based classification tool could help guide the final clinical decision for an elective or expedited high-risk biopsy following patient stabilization.
High-resolution three-dimensional (3D) T1+C scans are widely used today due to enhanced small lesion detection, detailed structural characterization, multiplanar capabilities and increasingly shortened scan times [
17,
18,
19,
20]. Radiomics coupled with machine learning (ML) algorithms are powerful analytical techniques that use high-dimensional quantitative data from radiological images for model building and clinical prediction [
21,
22]. Specifically, uniform isotropic voxel sizes in medical images enhance feature stability and reproducibility in radiomics-ML models [
23,
24]. These have demonstrated good potential in brain neoplasm characterization [
25,
26,
27,
28], yet their utility in distinguishing infection from neoplasm remains largely under-explored [
29,
30,
31,
32]. The scanty radiomics studies in the literature, summarized in
Supplementary Table S1, were trained on 2D datasets, based on 4–6 mm thick T1+C and T2 fluid-attenuated inversion recovery (FLAIR) [
29,
30] or DWI MRI alone [
32], and compared brain abscess versus a specific neoplastic etiology [
29,
30,
32].
We hypothesize that 3D T1+C radiomics is valuable in distinguishing infective from neoplastic REBLs as 3D MRI acquisitions offer isotropic voxels that capture lesion morphology and spatial heterogeneity more accurately, providing richer volumetric information for radiomic feature extraction than conventional 2D imaging [
18,
20]. Leveraging this advantage, in this exploratory study, we aim to determine if a radiomics-ML model based solely on a 3D T1+C dataset can distinguish infective from neoplastic REBLs, using retrospective data accrued from two tertiary hospitals in our healthcare system.
4. Discussion
Despite the modest sample size for model development, our radiomics-ML model, based solely on T1+C MRI contrast in a high-resolution 3D acquisition, demonstrated good performance across AUC, sensitivity, specificity and balanced accuracy in external testing in this exploratory study. These findings highlight the clinical value and potential of 3D T1+C imaging in distinguishing infective from neoplastic REBLs.
The strength of our study lies in its high-quality clinical dataset. Board-certified infectious disease physicians meticulously reviewed each patient’s medical records to ensure that only cases with definite or probable diagnoses based on our preset criteria were included in our datasets. The 3D T1+C MRI provided high-resolution structural lesion features for radiomics analysis and allowed for improved detection and localization of small REBLs for annotation. The manual bounding box annotation of each REBL was directly supervised by board-certified neuroradiologists, serving as ground truth labels for radiomics analysis. This rigorous process of data curation provides a strong foundation for model development.
Another strength of our study is the high proportion of immunocompromised patients and the diversity of etiologies within our datasets. Around half of the patients with infective REBLs were immunocompromised. Therefore, we were able to include diverse etiologies of infective REBLs, including various opportunistic infections, in our datasets. Similarly, our oncology service sees patients with diverse malignancies. The intentional inclusion of heterogeneous etiologies in our datasets for model development reflects the epidemiology and the rich radio-pathological case mix of CNS infections and neoplasms at our center. As the patient who presents with REBL is often undifferentiated, and differential diagnoses typically extend beyond two etiologies, models that distinguish abscess from a specific neoplasm may possess limited potential for deployment in a real-world clinico-radiology workflow [
29,
30,
32], compared to our model which distinguishes infective REBLs from neoplastic REBLs. Accordingly, the clinical dilemma of infection versus neoplasm faced by the on-duty radiologist could be better resolved by our model. While our model does not yield a specific diagnosis, rapid and accurate classification would assist clinicians in selecting appropriate management strategies—those assessed to have infection are aggressively managed by expedited abscess drainage and antibiotics, while those assessed to have neoplasm are systemically evaluated, including assessment for an extracranial primary tumor that may be safer to biopsy. Consequently, unnecessary brain biopsy, and associated neurological complications may potentially be avoided.
Deep learning architectures, especially CNNs offer automatic feature learning. However, they typically require substantially larger datasets to achieve stable convergence and generalizable performance [
34,
40]. Given our limited sample size, a radiomics-based framework was adopted to ensure interpretability and robustness within the constraints of this exploratory study. To contextualize our radiomics findings against modern representation learning, we explored end-to-end CNN baselines. Compared with radiomics alone, both CNN baselines demonstrated poorer performance. This was similarly observed in other studies [
41,
42], where deep representations did not consistently outperform hand-crafted radiomics on independent testing since representation learning generally benefits from larger and more diverse datasets to achieve stable generalization. Increasing network depth from ResNet-10 to ResNet-18 did not yield improved discrimination in this cohort, suggesting that additional model capacity did not translate into better generalization under the present data constraints. Additionally, we evaluated hybrid models that combined radiomics features with CNN-extracted representations to assess potential complementarity between engineered and learned features. Similarly, these hybrid configurations did not demonstrate improvement over radiomics alone. Unlike Bo et al. [
29], deep learning-based radiomics features did not result in a better performance than radiomics alone despite our superior lesion count, as their setting involved a narrower binary task with more modality inputs than the present study.
We included a wide range of ML classifiers in our exploratory evaluation. MLP demonstrated superior performance in distinguishing infective from neoplastic REBLs due to several key factors. The model architecture in MLP allows it to effectively capture complex, non-linear relationships within the data that are problematic with simpler models like KNN and DT, as well as with classifiers constrained by linear or quadratic decision functions (LR and QDA) or margin-based separation (SVM). The deep architecture of the MLP is also efficacious in leveraging feature combinations of original + LoG + wavelet, allowing it to better understand the diverse characteristics of the data compared to ensemble methods like Random Forest, AdaBoost and XGBoost. Additionally, the iterative training process of MLP facilitates hyperparameter optimization, enhancing the model’s adaptability and performance for the specific dataset. Among the three radiomic feature categories, texture-based features were the most influential in distinguishing infective from neoplastic REBLs (
Supplementary Table S7). These features quantify intra-lesional heterogeneity and edge sharpness, which are radiologically useful in differentiating neoplastic lesions from abscesses. Shape and intensity-based features contributed complementary morphological and statistical information that also enhanced overall model robustness.
The main limitation of our study is the small sample size of patients with infective REBLs, which we mitigated with data augmentation. While the meticulous review by infectious diseases physicians allowed for the inclusion of patients whose microbiological diagnoses were made from blood or extracranial tissue, reflecting current diagnostic approaches, factors such as our stringent patient inclusion criteria, the lower incidence of infective REBLs compared to neoplastic REBLs, the high mortality of CNS infections (often prior to achieving microbiological diagnosis), and the exclusion of patients without a contrast-enhanced MRI resulted in the relatively small cohort. This challenge was similarly encountered by other authors [
29,
30,
32] in slightly different CNS infection versus neoplasm use cases (
Supplementary Table S1). The sample-size-related limitation highlights the need for further evaluation of overfitting and feature selection stability in future work. To mitigate the limitation, in contrast to other models [
29,
30,
32] in which only one lesion per patient was manually segmented, every lesion of each patient was manually segmented, providing a superior lesion count that far outnumbers that of these other studies [
29,
30,
32] and optimizing the data points for our model training.
The majority of the neoplastic lesions that were wrongly classified by our model were metastases (
Supplementary Table S6) while fewer primary brain tumors (astrocytoma, glioblastoma) were wrongly classified. This correlates well with clinical practice, as cystic metastases resemble abscesses on T1+C, while primary brain tumors tend to have thicker walls and different enhancement patterns than metastases. In particular, small lesion size and smooth thin walls with little wall thickening were features most associated with wrong classification among patients with neoplastic REBLs. Future study with inclusion of additional MR sequences, especially DWI which is clinically useful in abscess vs. tumor differentiation, could improve our model performance.
During the review process for the training/validation set, we had purposefully excluded cases in which no pathogen or neoplasm was identified as well as those whose clinical/radiological response to treatment was suboptimal as these patients may have mixed lesions, i.e., both infection and neoplasm in the same patient. While this ensured that model training was based on a dataset with high diagnostic certainty, there may be selection bias as certain patients, such as those with mixed lesions and those who died before a microbiological/pathological diagnosis can be determined, were excluded from our study. Our model may not perform well in these patients, but would surface uncertainty in classification accordingly, triggering the need for greater attention by clinicians. Importantly, while our radiomics-ML may aid with diagnosis, it should not replace clinical judgement, and the provisional diagnosis should always be revisited and revised when clinical response is not as expected.
While our model, trained on a dataset encompassing a broad range of etiologies, offers greater potential for deployment in a real-world clinico-radiology workflow compared to models that differentiate between two specific pathologies, its performance may be impacted by the heterogeneity of etiologies within the broad diagnostic categories of infective and neoplastic REBLs. This heterogeneity may obscure clinically relevant biological, MRI, and radiomic differences between etiologies within each category, potentially limiting diagnostic precision.
The model was trained on a single-center dataset from the largest tertiary referral hospital in our country, affording a rich case mix of CNS infections and neoplasms. External validation on a dataset from a second tertiary center showed comparable performance, supporting generalizability. Further multicenter validation and the application of harmonization techniques such as ComBat to account for batch effects across institutions with different MRI protocols can further improve generalizability. The complementary value of spatial information as related to known spatial predilections of different pathologies was also not assessed in our study. While N4 bias-field correction was not applied, several factors may limit its impact on our findings: (1) radiomic features were extracted from localized ROIs rather than whole-brain regions, minimizing intra-ROI bias-field variation; and (2) all scans were acquired on scanners from the same vendor with built-in prescan normalization.
In this exploratory study, our radiomics model, based solely on a 3D T1+C dataset, demonstrated potential in distinguishing infective from neoplastic REBLs. This finding emphasizes the value of high-resolution 3D T1+C datasets in clinical radiology for quantitative analytics downstream beyond radiological reading and surgical planning. In addition, a time-efficient bounding box approach to lesion localization enabled inclusion of multiple REBLs for model development, and this could be further scoped for automated contour-based REBL segmentation in the next phase. Incorporating additional MR sequences—particularly DWI, which is clinically useful in abscess vs. tumor differentiation—along with clinical data within a multimodal MRI radiomics-ML framework is likely to further enhance discriminative performance and clinical applicability, and certainly warrants further study. Recently, multimodal radiomics models that combine multiple MRI sequences and different imaging modalities (e.g., perfusion MRI) have shown potential in cancer diagnosis and prognostication [
43,
44,
45,
46]. These necessitate accurate image co-registration across different MRI sequences, which brings attendant challenges, especially in restless patients who might have moved between sequence acquisitions. A multimodal radiomics model could achieve a better performance, but would also face challenges such as large computing demands and limited generalizability in the absence of protocol harmonization across institutions.