Next Article in Journal
Ex-Situ Dual Hypothermic Oxygenated Machine Perfusion in Full-Left-Full-Right Split Liver Transplantation for Two Adult Recipients
Previous Article in Journal
The Past, Present, and Future Care for Refractory Chronic Cough in Children and Adults
Previous Article in Special Issue
Impact of Irradiation on Post-Surgical Residuals of WHO Grade I Meningioma
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Systematic Review

Radiomics in Pituitary Adenomas: A Systematic Review of Clinical Applications and Predictive Models

by
Edoardo Agosti
1,*,
Marcello Mangili
1,
Pier Paolo Panciani
1,
Lorenzo Ugga
2,
Vittorio Rampinelli
3,
Marco Ravanelli
4,
Alessandro Fiorindi
1 and
Marco Maria Fontanella
1,*
1
Neurosurgery Unit, Department of Medical and Surgical Specialties, Radiological Sciences and Public Health, University of Brescia, 25123 Brescia, Italy
2
Department of Advanced Medical and Surgical Sciences, University of Campania “Luigi Vanvitelli”, 80131 Naples, Italy
3
Unit of Otorhinolaryngology—Head and Neck Surgery, Department of Surgical Specialties, Radiological Sciences, and Public Health, University of Brescia, 25123 Brescia, Italy
4
Radiology Unit, Department of Medical and Surgical Specialties, Radiological Sciences and Public Health, University of Brescia, 25123 Brescia, Italy
*
Authors to whom correspondence should be addressed.
J. Clin. Med. 2025, 14(18), 6595; https://doi.org/10.3390/jcm14186595
Submission received: 17 July 2025 / Revised: 28 August 2025 / Accepted: 13 September 2025 / Published: 18 September 2025

Abstract

Background: Radiomics offers quantitative, high-dimensional data from conventional imaging and holds promise for improving diagnosis and treatment of pituitary adenomas (PAs). This systematic review aimed to synthesize current clinical applications of radiomics in PAs, focusing on diagnostic, predictive, and prognostic modeling. Methods: This review followed the PRISMA 2020 guidelines. A systematic search was performed in PubMed, Scopus, and Web of Science on 10 January 2024, and updated on 5 March 2024, using predefined keywords and MeSH terms. Studies were included if they evaluated radiomics-based models using MRI for diagnosis, classification, consistency, invasiveness, treatment response, or recurrence in human PA populations. Data extraction included study design, sample size, MRI sequences, feature types, machine learning algorithms, and model performance metrics. Study quality was assessed via the Newcastle-Ottawa Scale. Descriptive statistics summarized study characteristics; no meta-analysis was performed due to heterogeneity. Results: Out of 341 identified articles, 49 studies met inclusion criteria, encompassing a total of more than 9350 patients. The majority were retrospective (43 studies, 88%). MRI sequences used included T2-weighted imaging (35 studies, 71%), contrast-enhanced T1WI (34 studies, 69%), and T1WI (21 studies, 43%). PyRadiomics was the most common feature extraction tool (20 studies, 41%). Machine learning was employed in 43 studies (88%), predominantly support vector machines (16 studies, 33%), random forests (9 studies, 18%), and logistic regression (9 studies, 18%). Deep learning methods were applied in 17 studies (35%). Regarding diagnostic performance, 22 studies (45%) reported an (AUC) ≥0.85 in test datasets. External validation was performed in only 6 studies (12%). Radiomics applications included histological subtype prediction (14 studies, 29%), surgical outcome prediction (13 studies, 27%), invasiveness assessment (7 studies, 15%), tumor consistency evaluation (8 studies, 16%), and response to medical or radiotherapy treatments (3 studies, 6%). One study (2%) addressed automated segmentation and volumetry. Conclusions: Radiomics enables high-performance, noninvasive prediction of PA subtypes, consistency, invasiveness, treatment response, and recurrence, with 22 studies (45%) reporting AUC ≥0.85. Despite promising results, clinical translation remains limited by methodological heterogeneity, low external validation (6 studies, 12%), and lack of standardization.

1. Introduction

Pituitary adenomas (PAs) represent approximately 10–15% of primary intracranial neoplasms [1,2,3]. While commonly benign, they are clinically significant due to hormone hypersecretion syndromes and their potential to exert mass effects on adjacent anatomical structures. This dual mechanism of impact can severely compromise patient quality of life [1]. Importantly, over 30% of PAs demonstrate invasive behavior, infiltrating the cavernous sinuses (CSs), bony structures, the hypothalamus, or the internal carotid artery, which can complicate both diagnosis and treatment planning [4,5].
Magnetic resonance imaging (MRI) is the primary imaging modality used in the diagnostic work-up of PAs. It plays a crucial role not only in the initial identification and anatomical localization of the lesion, but also in long-term monitoring of tumor progression or recurrence. The strengths of MRI lie in its high-resolution, multiplanar capabilities and superior soft tissue contrast. However, the traditional approach to MRI interpretation is qualitative and dependent on radiologist expertise, leading to potential inter-observer variability and missed subtle findings. The subjective nature of this evaluation has spurred the development of more objective, reproducible tools [6].
Radiomics is a computational technique that extracts a vast array of quantitative features from standard medical imaging, offering a high-dimensional analysis of tumor phenotypes that surpass visual assessment. These features include intensity, texture, shape, and spatial relationships, capturing microstructural heterogeneity and tumor biology [7]. The field has expanded significantly in recent years, particularly when integrated with artificial intelligence (AI) and machine learning (ML). These tools can build predictive models that automatically interpret complex radiomic data, enhancing diagnostic accuracy, prognostication, and treatment personalization [8,9,10,11].
Radiomics has already demonstrated utility in various domains relevant to the clinical management of Pas [12]. Studies have shown its potential in differentiating PAs from other sellar and suprasellar lesions, as craniopharyngiomas and Rathke cleft cysts [13,14]. Additionally, radiomics has proven valuable in preoperative histological subtyping [15,16,17,18,19,20,21,22,23,24,25,26,27,28]. For instance, models trained on multiparametric MRI data have accurately distinguished subtypes, such as silent corticotroph adenomas (SCAs) and null cell adenomas (NCAs), entities that may otherwise be clinically silent yet have different prognostic implications [21,22,23,24]. Furthermore, PA consistency, a key factor in predicting the ease and success of surgical resection, has been correlated with specific radiomics features [29,30,31,32,33,34]. Radiomics also aids in evaluating invasiveness, particularly the prediction of CS invasion through features that align with higher Knosp grades [35,36,37,38,39,40,41]. Radiomic models have also shown promise in predicting responsiveness to pharmacologic treatments, like somatostatin receptor ligands in patients with acromegaly, and in forecasting the effects of radiotherapy. These predictions allow for more tailored treatment plans and may help avoid ineffective or unnecessary therapies [42,43]. Lastly, radiomics can contribute to the prognostication of recurrence and progression after surgical resection [44,45,46,47].
Despite the growing body of literature, no recent systematic review has comprehensively summarized the current applications of radiomics for PAs, the models that have been proposed, and which approaches have shown the most promising performance [48]. Moreover, even if several studies have been published in recent years covering a wide range of applications from histological characterization and invasion prediction to response assessment and outcome forecasting, these findings often are still fragmented across the literature. Therefore, the present systematic review aims to collect, organize, and analyze these updated data to provide a clear and accessible summary of the state-of-the-art in radiomics for PAs and its applications.

2. Materials and Methods

This systematic review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [49]. Two reviewers (E.A. and M.M.) independently performed a comprehensive search of the PubMed, Scopus, and Web of Science databases. The initial search was conducted on 10 January 2025, with a final update on 5 May 2025. The articles identified through this search were published between 2018 and 2025.
The search strategy utilized a combination of keywords and MeSH terms, including “radiomics,” “artificial intelligence,” “machine learning,” “pituitary adenoma,” “pituitary neuroendocrine tumor,” “PitNET,” “MRI,” and “magnetic resonance imaging.” Boolean operators (AND/OR) were employed to construct the search query as follows: (“radiomics” OR “machine learning” OR “artificial intelligence”) AND (“pituitary adenoma” OR “PitNET” OR “pituitary neuroendocrine tumor”) AND (“MRI” OR “magnetic resonance imaging”). Additional relevant studies were identified by manually screening the reference lists of all selected articles.
Studies were eligible for inclusion if they were published in English, involved human subjects, and investigated the application of radiomics (alone or in combination with AI-based methods) for the diagnosis, classification, prediction, or treatment response assessment of PAs using MRI-based features. Both retrospective and prospective original studies were considered. Articles were excluded if they were review articles, editorials, conference abstracts, purely technical or methodological papers without clinical data, or if they involved only phantom or animal models. Studies lacking clearly reported radiomic methodology, outcome measures, or validation procedures were also excluded.
All references were managed using EndNote X9, and duplicate records were removed prior to screening. Two reviewers (E.A. and M.M.) independently screened titles and abstracts for relevance. Full-text screening was then performed on all articles meeting initial inclusion criteria. Discrepancies at any stage were resolved by discussion or consultation with a third reviewer (P.P.P.).

2.1. Data Extraction

Data were extracted systematically from each included article using a pre-defined standardized data collection sheet. Extracted variables included: first author, year of publication, study design, population size, radiomics pipeline (including image acquisition, segmentation, feature extraction, and model development), type of MRI sequences used, clinical endpoint(s) investigated (e.g., differential diagnosis, subtype classification, tumor consistency, invasion, treatment response, or recurrence), and performance metrics (e.g., accuracy, sensitivity, specificity, and area under the curve, i.e., AUC). Additional details were collected on validation methods (internal or external) and the use of AI or ML algorithms.

2.2. Outcomes

The primary objective of this review was to synthesize and characterize the current applications of radiomics in the clinical management of PAs, including diagnostic differentiation, subtype classification, prediction of invasiveness or consistency, treatment response forecasting, and recurrence risk stratification. Secondary outcomes included an evaluation of the types and performance of models developed, common methodological patterns, and the overall level of validation achieved across studies.

2.3. Radiomics Quality Assessment

A qualitative assessment using the Radiomics Quality Score (RQS) framework was performed with the aim of critically analyzing and comparing the clinical applicability and translational potential of the various radiomic models developed [50]. Evaluation of the included studies was conducted according also to the IBSI compliance checklist to examine their reproducibility and adherence to technical standards [51]. Since many items in the IBSI checklist overlap with those in the RQS checklists, we included only the items relevant to image pre-processing steps (Table 1).

2.4. Risk of Bias Assessment

The quality of the observational studies included in the review was evaluated using the Newcastle-Ottawa Scale (NOS), which assesses studies across three domains: selection, comparability, and outcome assessment (Figure 1).
Each study could receive a maximum of 9 points. Those scoring below 7 were classified as low quality and excluded from the analysis. Studies that met or exceeded this threshold were retained, and their quality scores were considered during subgroup analyses and narrative synthesis to explore potential sources of bias. A detailed breakdown of NOS scores for each study is available in Table S1.

2.5. Statistical Analysis

Descriptive statistics were used to summarize study characteristics, radiomics applications, and model performance metrics. Results were presented using medians, ranges, and proportions, where appropriate. Due to methodological heterogeneity across studies—including differences in imaging protocols, feature selection methods, and outcome definitions—a formal meta-analysis was not performed. All data processing and visualization were conducted using R statistical software (version 4.2.0) (https://www.r-project.org, accessed on 3 March 2025).

3. Results

3.1. Literature Review

After removing duplicates, 340 articles were identified. Following a review of titles and abstracts, 76 studies were selected for full-text screening. Of the 74 articles assessed for eligibility, 49 met the inclusion criteria and were retained for the final analysis. The remaining 25 studies were excluded based on the following reasons: 10 were systematic reviews or meta-analyses, 9 did not report selected outcomes and 6 were unrelated to the research question. The articles included in the final analysis were published between 2018 and 2025. An overview of the selection process is presented in the PRISMA flowchart (Figure 2). PRISMA checklist is available in Table S2.

3.2. Data Analysis

A total of 49 studies were included in the systematic review, published between 2018 and 2025, investigating the role of radiomics in the assessment and classification of PAs. The total patient population across all studies was approximately 9351, with individual study sample sizes ranging from 24 to 1045 patients. Of these, 30 studies (61%) reported explicit training and test data splits, with 6 studies (12%) including separate validation cohorts. Regarding MRI sequences, were used T1-weighted imaging (T1WI) in 21 studies (43%), T2-weighted imaging (T2WI) in 35 studies (71%), contrast-enhanced T1WI (CE-T1WI) in 34 studies (69%) and T2-weighted imaging FLAIR (T2WI-FLAIR) in 1 study (2%).
In terms of analytical methodology, shallow ML approaches were utilized in 43 studies (88%), while deep learning (DL) architectures were applied in 17 studies (34%), with some studies using both. The most common ML algorithms were support vector machine (SVM) (19 studies, 44%), random forest (RF) (8 studies, 19%), logistic regression (LR) (8 studies, 19%), and k-nearest neighbors (7 studies, 16%). For DL, convolutional neural networks (CNNs) were used in 8 studies (47%), with architectures such as ResNet, DenseNet, and U-net represented. Feature extraction was performed using PyRadiomics in 23 studies (47%), while Matlab-based custom tools were used in 4 studies (8%), and other platforms including 3D Slicer or proprietary scripts in the remaining cases.
The types of radiomic features extracted varied, but first-order statistics (FOS) were used in 18 studies (37%), shape-based features in 20 studies (41%), and texture features (e.g., GLCM, GLRLM, GLSZM, NGTDM, GLDM) in 25 studies (51%). Wavelet-transformed features were included in 10 studies (20%), and higher-order features (e.g., histogram-based or delta-radiomics) were reported in 5 studies (10%).
Regarding tumor subtype, 35 studies (72%) included PAs with any hormonal secretion profile, 7 studies (14%) focused on functional PAs, including GH-secreting, ACTH, TSH, FSH/LH, and prolactinomas, while other 7 (14%) focused on non-functioning PAs (NFPA). The diagnostic performance was reported using AUC or accuracy values in all studies, with AUCs ranging from 0.55 to 1.00. Most models demonstrated high performance, with 22 studies (45%) reporting AUC values ≥0.85 in the test set. Further-more, 6 studies (12%) included external validation, strengthening the generalizability of their findings (Table 2).
The most frequently represented field was the prediction of histopathological features, explored in 14 studies (29%), making it the dominant area of investigation. Within this category, the preoperative prediction of Ki-67 index was the most common specific endpoint, analyzed in 5 studies, followed by efforts to distinguish subtypes such as SCAs, NCAs, and Tpit/Pit-1/SF-1 families, and to classify functional versus non-functional adenomas. The second most common field was the prediction of response to surgical treatment, addressed in 13 studies (27%). Of these, post-surgical recurrence or regrowth was the most studied endpoint, featured in 4 studies, indicating its clinical relevance. This was followed by investigations into visual outcome, biochemical remission, and intraoperative cerebrospinal fluid leak prediction. The prediction of tumor consistency and prediction of invasiveness were respectively addressed in 8 and 7 studies (16% and 14%). For consistency, all studies focused on distinguishing soft from fibrous tumors, while the invasiveness category primarily targeted CS invasion (6 studies), and to a lesser extent suprasellar extension. Studies on diagnosis of PAs accounted for 4 publications (8%), with most evaluating automated tumor detection from brain MRI, and one study distinguishing cystic PAs from Rathke’s cleft cysts. Lastly, prediction of response to non-surgical treatment, including somatostatin analogs and dopamine agonists (DAs), was investigated in 2 studies (4%), and automated tumor segmentation and volumetry was explored in 1 study (2%) (Table 3).

4. Discussion

Radiomics has increasingly established itself as a transformative tool in the neuro-oncological field, enabling the extraction of high-dimensional, quantitative features from conventional imaging modalities. In the context of PAs, its applications have rapidly expanded across multiple domains, from diagnostic differentiation and tumor subtyping to surgical outcome prediction and response assessment. This systematic review analyzed 49 studies, encompassing over 9300 patients. It highlights both the promise and limitations of current radiomics research in this field and offers a framework for future directions.

4.1. Radiomics Quality Assessment

Through RQS evaluation we observed patterns that echo those identified by Kocak et al. [67] in their recent meta-analysis of 1574 radiomics studies across radiological subspecialties. While the number of published works is steadily increasing, the median RQS remains modest (approximately 30%) reflecting persistent gaps in reproducibility, standardization, and clinical readiness.
Some methodological strengths were recurrently observed. Most studies adequately reported imaging protocols, frequently including acquisition parameters such as magnetic field strength, slice thickness, and TR/TE values. Feature reduction strategies to mitigate overfitting were commonly employed, and performance metrics (particularly ROC curves and AUC values) were almost universally reported, establishing a baseline level of analytical rigor. More recently, an encouraging trend has emerged toward multicenter cohorts and external validation, in contrast to earlier works that relied heavily on single-center data and internal validation techniques.
Nonetheless, several critical limitations persist. None of studies included phantom experiments or test–retest imaging, both of which are essential for evaluating feature robustness across scanners and timepoints. Calibration statistics, integral for assessing the clinical reliability of predictive models, were seldom reported. The integration of radiomic features with non-imaging variables, such as clinical, histopathological, or molecular data, was inconsistent, thus limiting the development of comprehensive, biologically informed models. Most notably, very few studies provided access to code, segmentation masks, or extracted feature sets, undermining transparency and reproducibility, issues explicitly highlighted by the IBSI and by Lambin et al. in their foundational framework [50,51].
Validation remains the principal barrier to clinical applicability. Over 90% of the studies reviewed relied exclusively on internal validation methods, with true external validation still rarely implemented. This raises substantial concerns regarding the generalizability of radiomic signatures across centers, scanners, and patient populations. As Kocak et al. [67] demonstrated, studies employing higher-quality validation strategies consistently achieved superior RQS scores and more reliable results.
This systematic analysis highlights also a marked heterogeneity in adherence to the IBSI preprocessing guidelines, which are crucial for ensuring reproducibility, comparability of extracted features and for enabling clinical translation. Overall, a bimodal distribution emerges: on one side, radiomics-oriented studies, often published in the last five years, present well-defined and transparent pipelines that closely follow IBSI standards; on the other, many deep learning–based works reduce preprocessing to minimal or non-standardized steps. High-quality studies (adherence to checklist >70%), typically report isotropic voxel resampling with explicit target dimensions and sometimes interpolation methods, apply rigorous intensity normalization such as z-score, min–max scaling or sigma-based filtering, include multiscale filters like Laplacian of Gaussian and wavelet with specified parameters, and describe gray-level discretization either through fixed bin width or fixed bin number. They also provide robust ROI processing, often with manual or semi-automatic three-dimensional segmentation validated by interobserver reproducibility measures such as ICC or Dice, and in several cases report artifact correction methods like N4ITK bias field correction. These characteristics are consistently observed in reference studies such as those by Kocak et al. [42], Zeynalova et al. [29], Machado et al. [44], Wan et al. [32], Park et al. [26], Zhang et al. [65], Liu et al. [18], Taslicay et al. [13], Shen et al. [47] Cuocolo et al. [30] and Ugga et al. [15], which stand out for their methodological rigor and reproducibility, allowing replication and multicentric integration. In contrast, low-quality studies in this terms (adherence to the IBSI checklist <30%) omit voxel isotropic resampling and rely on simple resizing or cropping, neglect intensity normalization, do not apply IBSI-compliant filters, and fail to report discretization of gray levels. Segmentation is often absent or replaced with fixed bounding boxes, and no artifact correction is performed, despite the well-known susceptibility of MRI to inhomogeneities. These deficiencies, seen in works such as those by Staartjes et al. [52], Qian et al. [55], Fang et al. [36], Feng et al. [41], Villalonga et al. [58], Liu et al. [27], Wang et al. [33], and Zhang et al. [61], reflect a tendency to prioritize raw CNN learning over methodological transparency, with the consequence that performance metrics obtained in single-center settings cannot be readily generalized. It should be acknowledged that the IBSI compliance checklist, while essential for assessing reproducibility and methodological rigor in traditional handcrafted radiomics, presents inherent limitations when applied to studies employing deep learning techniques, in particular to CNNs. Many IBSI items are specifically designed to evaluate predefined feature extraction and discretization steps, which are not explicitly performed in CNN-based pipelines. As a result, deep learning studies may receive lower IBSI scores, not necessarily due to inferior quality, but because of a methodological mismatch.

4.2. Diagnostic and Subtype Classification

Although the biochemical diagnosis of functioning PAs is relatively straightforward, imaging-based classification remains relevant for differential diagnosis, particularly in the case of silent subtypes or incidentalomas. Numerous studies have explored the use of radiomics in identifying hormonal secretion profiles and tumor subtypes. Baysal et al. [28] demonstrated that artificial neural networks trained on T2WI radiomic features could distinguish seven different hormone-secreting profiles, with AUCs ranging from 0.74 to 0.96, including high performance in distinguishing GH-secreting adenomas (AUC = 0.89) and prolactinomas (AUC = 0.95). Similar subtype classification efforts were conducted by Peng et al. [20], who developed machine learning classifiers (SVM, KNN, NB) capable of distinguishing transcription factor–based PitNET families (Tpit, Pit-1, SF-1) with an AUC up to 0.95 based on multiparametric MRI-derived features.
Specific studies targeted NFPAs, with a focus on histological subtypes like silent corticotroph adenomas (SCAs). Rui et al. [22] and Wang et al. [23] both reported high-performing models for preoperative SCA prediction (AUCs > 0.90), combining radiomic features from T1WI, T2WI, and CE-T1WI sequences with clinical data. Zhang et al. [21] extended this approach to distinguish NCAs, achieving concordance indices up to 0.86 using radiomic nomograms based on T1WI features. Furthermore, in GH-secreting tumors, Park et al. [26] applied radiomics to assess granulation patterns, yielding AUC of 0.83, underscoring radiomics relevance in treatment stratification for acromegaly. A similar model is presented by Liu et al. [27] achieving an AUC of 0.92, with DCA showing that their prediction model had a better net benefit than either the treatment or no treatment schemes when the threshold probability was 0.254 to 0.798.

4.3. Prediction of Tumor Consistency

Preoperative assessment of tumor consistency is vital for surgical planning, as fibrous tumors often pose greater technical challenges during resection. Radiomics has demonstrated utility in this domain, primarily through the analysis of T2WI signal textures. Zeynalova et al. [29] used artificial neural networks to predict tumor consistency with an AUC of 0.71, outperforming conventional signal intensity ratios. Cuocolo et al. [30] further refined this approach using an ensemble learning classifier trained on T2WI-derived features, achieving an AUC of 0.99 and an accuracy of 93%.
Other studies expanded this paradigm to include multiparametric MRI. Wan et al. [32] analyzed T1WI, T2WI, and CE-T1WI sequences using 3D segmentation and showed that texture and wavelet features outperformed shape features in consistency prediction (AUC = 0.90). In GH-secreting adenomas, Fan et al. [54] developed an elastic net-based radiomic model with external validation that reached AUCs of 0.83–0.81 when combined with Knosp grade with DCA confirming its net benefit for preoperative surgical planning. Moreover, they validated the constructed MR radiomics model through a completely independent multicenter prospective validation set, offering robustness among different image acquisition protocols. These findings suggest that tumor consistency, a previously subjective and intraoperative assessment, can now be inferred with reasonable accuracy using non-invasive, preoperative imaging and radiomics.

4.4. Assessment of Invasiveness and Aggressiveness

Cavernous sinus invasion, as graded by the Knosp classification, significantly impacts surgical decision-making. Radiomics provides an opportunity to enhance preoperative prediction of invasiveness, especially in borderline Knosp grades. Niu et al. [35] developed a radiomic-clinical nomogram that integrated CE-T1WI features such as tumor sphericity and internal carotid artery wrapping, yielding an AUC of 0.90. Their DCA demonstrated that at threshold probabilities above 20%, the radiomics nomogram provided greater net benefit compared to conventional clinical-radiological models or treating all/none strategies. Similarly, Zhang et al. [14] analyzed 3D CE-T1WI radiomic features and reported AUCs of 0.86 (training) and 0.73 (validation) for predicting invasiveness.
Radiomics has also been applied to define broader histopathological aggressiveness criteria, including proliferation indices such as Ki-67 and p53. Ugga et al. [15] showed that T2WI-based features could predict high Ki-67 expression with an AUC of 0.87 using a KNN classifier. Wang et al. [25] took a multimodal approach, combining radiomic features and Knosp grade to identify aggressive tumors (AUC = 0.94), while Fan et al. [16] applied similar methods in acromegalic patients, achieving AUCs of 0.96 and 0.89 across training and test cohorts and the DCA demonstrated that both a radiomic signature and derived nomogram had tangible utility in acromegaly management Liu et al. [18] introduced a novel application of dynamic contrast-enhanced MRI (DCE-MRI) for texture feature extraction. By analyzing pharmacokinetic maps such as Ktrans and Kep, they developed a model that achieved an AUC of 0.96 in distinguishing aggressive from non-aggressive PitNETs. Their DLR model DCA curves demonstrated a strong agreement between predicted and observed outcomes in the internal test set and showed superior clinical benefit in the preoperative prediction of concurrent high Ki-67 expression and PIT-1 positivity, marking an innovative step in functional radiomics.

4.5. Prediction of Surgical Outcomes

A key clinical challenge in PA management is the recurrence or progression of residual tumor post-surgery. Galm et al. [68] showed that preoperative T1WI texture metrics such as pixel intensity could stratify recurrence risk in NFPAs, with lower intensity associated with higher recurrence. Machado et al. [44] reported that KNN and RF classifiers trained on CE-T1WI radiomic features achieved AUCs up to 0.98 and 0.96 for recurrence prediction, with 3D segmentation outperforming 2D approaches. Zhang et al. [45] developed a radiomic score using SVMs and demonstrated its value as an independent prognostic factor (AUC = 0.87). In a broader cohort including all adenoma types, Zhang et al. [60] later compared clinical-pathological and radiomic classifiers, reporting improved predictive accuracy with the inclusion of radiomic features (AUC = 0.84 vs. 0.65); moreover DCA results showed that both their delta-radiomic and combined models achieved superior net benefit compared to clinical variables alone, in both development and independent test cohorts.
Post-surgical hormonal remission has also been evaluated. Fan et al. [53] analyzed over 13,000 CE-T1WI features and built a model with AUCs of 0.83–0.81 across training and test cohorts, significantly outperforming models based on clinical parameters alone. They performed a DCA, revealing clinically meaningful benefit at threshold probabilities >13% in the primary cohort and >25% in the validation cohort. These findings indicate that radiomics can contribute meaningfully to post-operative outcome prediction.

4.6. Prediction of Response to Medical and Radiotherapy

Radiomics is increasingly being evaluated as a tool to predict the efficacy of pharmacologic therapies, particularly in functioning PAs. Kocak et al. [42] applied texture analysis to T2WIs in acromegaly patients and found that radiomic classifiers (KNN) outperformed qualitative and semi-quantitative signal assessments in predicting response to somatostatin receptor ligands (AUC = 0.85). In this study there were significant differences, with the best performances shown by the quantitative texture analysis based radiomic model, in terms of sensitivity, specificity and AUC-ROC. The other radiomic models were the 2D ROI-based quantitative rSI evaluation model and the 3D segmentation-based quantitative rSI evaluation model, that showed questionable performances if compared to qualitative (visual) rSI evaluation and granulation pattern-based evaluation. Galm et al. [68] correlated T1WI texture parameters with IGF-1 normalization, showing an odds ratio of 5.96 for higher pixel intensity after adjusting for clinical covariates.
Park et al. [43] applied ensemble classifiers to predict DAs response in prolactinoma patients, achieving an AUC of 0.81. Importantly, conventional imaging markers such as T2 intensity or cystic changes were not predictive, highlighting the incremental value of radiomics. Fan et al. [69] further extended the application to radiotherapy outcomes, showing that CE-T1WI-based radiomic models (AUC = 0.92) surpassed clinical-only models (AUC = 0.86). The combined model incorporating both radiomic and clinical data achieved an AUC of 0.96, offering a compelling argument for radiomics-guided treatment stratification in post-operative settings.

4.7. Technical and Methodological Considerations

Most of the reviewed studies utilized T1WI, T2WI, and CE-T1WI sequences, with PyRadiomics as the dominant feature extraction tool. Feature types commonly included FOS, shape descriptors, and texture metrics (GLCM, GLRLM, GLSZM), with increasing use of wavelet and higher-order features. Most studies relied on SVMs, RF, and logistic regression models, though DL architectures (e.g., CNNs, ResNet, DenseNet) have become more prevalent in recent years, particularly in larger datasets such as those reported by Zhang et al., Liu et al., and Gargya et al.
Despite high reported accuracies (AUC ≥ 0.80 in 81,6% of studies), generalizability remains a concern. Only 6 studies (12%) performed external validation. Moreover, adherence to radiomics quality standards (e.g., IBSI, RQS) was inconsistently reported, and segmentation protocols varied widely. Few studies applied robustness testing or accounted for variability in imaging acquisition.

4.8. Limitations and Future Directions

While current evidence underscores the potential of radiomics to enhance the clinical management of PAs, several limitations hinder its translation into routine practice. Most studies were retrospective and single-center, with substantial heterogeneity in imaging protocols, segmentation methods, feature extraction, and model validation. Only studies by Fan et al. [54] and Li et al. [24] employed external validation, and adherence to radiomics quality standards was inconsistently reported. The absence of standardized pipelines and variability in performance metrics limit reproducibility and comparability. Future research should emphasize prospective, multicenter studies, standardized methodologies, and clinical integration to advance radiomics toward reliable, personalized care in PA management. High-quality research embodies the potential for integration, harmonization, and reproducibility, whereas lower-quality studies, despite innovations in deep learning, often lack the methodological rigor required for reliable translation. The future of the field lies in the convergence of deep learning approaches with IBSI-standardized preprocessing and adherence to best practices outlined in the RQS framework, as reinforced by Kocak et al. [67] and Lambin et al. [50]

5. Conclusions

This systematic review highlights the expanding role of radiomics in the clinical management of PAs, demonstrating high diagnostic performance across applications such as subtype classification, tumor consistency, invasiveness, and treatment outcome prediction. Radiomic models achieved AUC ≥0.85 in 45% of studies, primarily using ML and multiparametric MRI. However, clinical adoption remains limited due to methodological heterogeneity, lack of standardized workflows, and scarce external validation (4%). To enable broader translation into the clinical decision-making process, several concrete recommendations should be prioritized: robustness testing through phantom studies, longitudinal imaging, and inter-observer segmentation variability; rigorous validation, with preference for external and multicenter datasets; transparency through the sharing of code, annotations, and feature sets; integrative modeling that combines radiomic, clinical, and molecular data; and an emphasis on clinical applicability, including the use of calibration and decision curve analyses, as well as the design and registration of prospective studies to evaluate radiomic signatures in real-world workflows.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/jcm14186595/s1, Table S1: This table summarized the NOS scores of each study included in the systematic review; Table S2: PRISMA checklist (Reference [70] are cited in the Supplementary Materials).

Author Contributions

Conceptualization, E.A., M.M., P.P.P., L.U., V.R., M.R., A.F., and M.M.F.; methodology, E.A., M.M., P.P.P., L.U., V.R., M.R., A.F., and M.M.F.; validation, E.A., M.M., P.P.P., L.U., V.R., M.R., A.F., and M.M.F.; formal analysis, E.A., M.M., P.P.P., L.U., V.R., M.R., A.F., and M.M.F.; investi-gation, E.A. and M.M.; resources, E.A., M.M., P.P.P., L.U., V.R., M.R., A.F., and M.M.F.; data curation, E.A., M.M., P.P.P., L.U., V.R., M.R., A.F., and M.M.F.; writing—original draft prepa-ration, E.A., M.M., P.P.P., L.U., V.R., M.R., A.F., and M.M.F.; writing—review and editing, E.A., M.M., P.P.P., L.U., V.R., M.R., A.F., and M.M.F.; visualization, E.A., M.M., P.P.P., L.U., V.R., M.R., A.F., and M.M.F.; supervision, E.A., M.M., P.P.P., L.U., V.R., M.R., A.F., and M.M.F.; project administration, E.A., M.M., P.P.P., L.U., V.R., M.R., A.F., and M.M.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data available in a publicly accessible repository.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

AI = artificial intelligence, AUC = area under the curve, CS = cavernous sinus, CNN = con-volutional neural network, DCE-MRI = dynamic contrast-enhanced magnetic resonance imaging, DL = deep learning, ML = machine learning, FOS = first-order statistics, GLCM = Gray Level Gray Level Co-occurrence Matrix, GLRLM = Gray Level Run Length Matrix, GLSZM = Gray Level Size Zone Matrix, LR = logistic regression, MRI = magnetic resonance imaging, GLDM = Gray Level Dependence Matrix, NCA = null cell adenoma, NFPA = non-functioning pituitary adenoma, NGTDM = Neighboring Gray Tone Difference Matrix, DA = dopamine agonist, PA = pituitary adenoma RF = random forest, SCA = silent cortico-troph adenoma, SVM = support vector machine.

References

  1. Melmed, S. Pituitary-Tumor Endocrinopathies. N. Engl. J. Med. 2020, 382, 937–950. [Google Scholar] [CrossRef]
  2. Ostrom, Q.T.; Price, M.; Neff, C.; Cioffi, G.; Waite, K.A.; Kruchko, C.; Barnholtz-Sloan, J.S. CBTRUS Statistical Report: Primary Brain and Other Central Nervous System Tumors Diagnosed in the United States in 2015–2019. Neuro Oncol. 2022, 24, v1–v95. [Google Scholar] [CrossRef]
  3. Asa, S.L.; Mete, O.; Perry, A.; Osamura, R.Y. Overview of the 2022 WHO Classification of Pituitary Tumors. Endocr. Pathol. 2022, 33, 6–26. [Google Scholar] [CrossRef]
  4. Mehta, G.U.; Lonser, R.R. Management of Hormone-Secreting Pituitary Adenomas. Neuro Oncol. 2017, 19, 762–773. [Google Scholar] [CrossRef] [PubMed]
  5. Raverot, G.; Dantony, E.; Beauvy, J.; Vasiljevic, A.; Mikolasek, S.; Borson-Chazot, F.; Jouanneau, E.; Roy, P.; Trouillas, J. Risk of Recurrence in Pituitary Neuroendocrine Tumors: A Prospective Study Using a Five-Tiered Classification. J. Clin. Endocrinol. Metab. 2017, 102, 3368–3374. [Google Scholar] [CrossRef] [PubMed]
  6. Molitch, M.E. Diagnosis and Treatment of Pituitary Adenomas: A Review. JAMA 2017, 317, 516–524. [Google Scholar] [CrossRef] [PubMed]
  7. Scapicchio, C.; Gabelloni, M.; Barucci, A.; Cioni, D.; Saba, L.; Neri, E. A Deep Look into Radiomics. Radiol. Med. 2021, 126, 1296–1311. [Google Scholar] [CrossRef]
  8. Lambin, P.; Rios-Velazquez, E.; Leijenaar, R.; Carvalho, S.; van Stiphout, R.G.P.M.; Granton, P.; Zegers, C.M.L.; Gillies, R.; Boellard, R.; Dekker, A.; et al. Radiomics: Extracting More Information from Medical Images Using Advanced Feature Analysis. Eur. J. Cancer 2012, 48, 441–446. [Google Scholar] [CrossRef]
  9. Maniaci, A.; Lavalle, S.; Gagliano, C.; Lentini, M.; Masiello, E.; Parisi, F.; Iannella, G.; Cilia, N.D.; Salerno, V.; Cusumano, G.; et al. The Integration of Radiomics and Artificial Intelligence in Modern Medicine. Life 2024, 14, 1248. [Google Scholar] [CrossRef]
  10. Gillies, R.J.; Kinahan, P.E.; Hricak, H. Radiomics: Images Are More than Pictures, They Are Data. Radiology 2016, 278, 563–577. [Google Scholar] [CrossRef]
  11. Ugga, L.; Perillo, T.; Capasso, S.; Negroni, D.; Cuocolo, R. Radiomics in Meningiomas: Pathological and Biomolecular Correlation. In Meningiomas: From Pathology to Clinics; Maiuri, F., Del Basso De Caro, M., Eds.; Springer Nature: Cham, Switzerland, 2024; pp. 121–130. ISBN 978-3-031-76680-0. [Google Scholar]
  12. Zheng, B.; Zhao, Z.; Zheng, P.; Liu, Q.; Li, S.; Jiang, X.; Huang, X.; Ye, Y.; Wang, H. The Current State of MRI-Based Radiomics in Pituitary Adenoma: Promising but Challenging. Front. Endocrinol. 2024, 15, 1426781. [Google Scholar] [CrossRef] [PubMed]
  13. Taslicay, C.A.; Dervisoglu, E.; Ince, O.; Mese, I.; Taslicay, C.; Bayrak, B.Y.; Cabuk, B.; Anik, I.; Ceylan, S.; Anik, Y. A Novel Fusion of Radiomics and Semantic Features: MRI-Based Machine Learning in Distinguishing Pituitary Cystic Adenomas from Rathke’s Cleft Cysts. J. Belg. Soc. Radiol. 2024, 108, 9. [Google Scholar] [CrossRef] [PubMed]
  14. Zhang, Y.; Chen, C.; Tian, Z.; Xu, J. Discrimination between Pituitary Adenoma and Craniopharyngioma Using MRI-Based Image Features and Texture Features. Jpn. J. Radiol. 2020, 38, 1125–1134. [Google Scholar] [CrossRef]
  15. Ugga, L.; Cuocolo, R.; Solari, D.; Guadagno, E.; D’Amico, A.; Somma, T.; Cappabianca, P.; Del Basso de Caro, M.L.; Cavallo, L.M.; Brunetti, A. Prediction of High Proliferative Index in Pituitary Macroadenomas Using MRI-Based Radiomics and Machine Learning. Neuroradiology 2019, 61, 1365–1373. [Google Scholar] [CrossRef] [PubMed]
  16. Fan, Y.; Chai, Y.; Li, K.; Fang, H.; Mou, A.; Feng, S.; Feng, M.; Wang, R. Non-Invasive and Real-Time Proliferative Activity Estimation Based on a Quantitative Radiomics Approach for Patients with Acromegaly: A Multicenter Study. J. Endocrinol. Investig. 2020, 43, 755–765. [Google Scholar] [CrossRef]
  17. Shu, X.-J.; Chang, H.; Wang, Q.; Chen, W.-G.; Zhao, K.; Li, B.-Y.; Sun, G.-C.; Chen, S.-B.; Xu, B.-N. Deep Learning Model-Based Approach for Preoperative Prediction of Ki67 Labeling Index Status in a Noninvasive Way Using Magnetic Resonance Images: A Single-Center Study. Clin. Neurol. Neurosurg. 2022, 219, 107301. [Google Scholar] [CrossRef]
  18. Liu, F.; Zang, Y.; Feng, L.; Shi, X.; Wu, W.; Liu, X.; Song, Y.; Xu, J.; Gui, S.; Chen, X. Concomitant Prediction of the Ki67 and PIT-1 Expression in Pituitary Adenoma Using Different Radiomics Models. J. Imaging Inform. Med. 2024, 38, 394–409. [Google Scholar] [CrossRef]
  19. Sathya, A.; Goyal-Honavar, A.; Chacko, A.G.; Jasper, A.; Chacko, G.; Devakumar, D.; Seelam, J.A.; Sasidharan, B.K.; Pavamani, S.P.; Thomas, H.M.T. Is Radiomics a Useful Addition to Magnetic Resonance Imaging in the Preoperative Classification of PitNETs? Acta Neurochir. 2024, 166, 91. [Google Scholar] [CrossRef]
  20. Peng, A.; Dai, H.; Duan, H.; Chen, Y.; Huang, J.; Zhou, L.; Chen, L. A Machine Learning Model to Precisely Immunohistochemically Classify Pituitary Adenoma Subtypes with Radiomics Based on Preoperative Magnetic Resonance Imaging. Eur. J. Radiol. 2020, 125, 108892. [Google Scholar] [CrossRef]
  21. Zhang, S.; Song, G.; Zang, Y.; Jia, J.; Wang, C.; Li, C.; Tian, J.; Dong, D.; Zhang, Y. Non-Invasive Radiomics Approach Potentially Predicts Non-Functioning Pituitary Adenomas Subtypes before Surgery. Eur. Radiol. 2018, 28, 3692–3701. [Google Scholar] [CrossRef]
  22. Rui, W.; Qiao, N.; Wu, Y.; Zhang, Y.; Aili, A.; Zhang, Z.; Ye, H.; Wang, Y.; Zhao, Y.; Yao, Z. Radiomics Analysis Allows for Precise Prediction of Silent Corticotroph Adenoma among Non-Functioning Pituitary Adenomas. Eur. Radiol. 2022, 32, 1570–1578. [Google Scholar] [CrossRef]
  23. Wang, H.; Chang, J.; Zhang, W.; Fang, Y.; Li, S.; Fan, Y.; Jiang, S.; Yao, Y.; Deng, K.; Lu, L.; et al. Radiomics Model and Clinical Scale for the Preoperative Diagnosis of Silent Corticotroph Adenomas. J. Endocrinol. Invest. 2023, 46, 1843–1854. [Google Scholar] [CrossRef]
  24. Li, H.; Zhao, Q.; Zhang, Y.; Sai, K.; Xu, L.; Mou, Y.; Xie, Y.; Ren, J.; Jiang, X. Image-Driven Classification of Functioning and Nonfunctioning Pituitary Adenoma by Deep Convolutional Neural Networks. Comput. Struct. Biotechnol. J. 2021, 19, 3077–3086. [Google Scholar] [CrossRef]
  25. Wang, X.; Dai, Y.; Lin, H.; Cheng, J.; Zhang, Y.; Cao, M.; Zhou, Y. Shape and Texture Analyses Based on Conventional MRI for the Preoperative Prediction of the Aggressiveness of Pituitary Adenomas. Eur. Radiol. 2023, 33, 3312–3321. [Google Scholar] [CrossRef]
  26. Park, Y.W.; Kang, Y.; Ahn, S.S.; Ku, C.R.; Kim, E.H.; Kim, S.H.; Lee, E.J.; Kim, S.H.; Lee, S.-K. Radiomics Model Predicts Granulation Pattern in Growth Hormone-Secreting Pituitary Adenomas. Pituitary 2020, 23, 691–700. [Google Scholar] [CrossRef]
  27. Liu, C.-X.; Heng, L.-J.; Han, Y.; Wang, S.-Z.; Yan, L.-F.; Yu, Y.; Ren, J.-L.; Wang, W.; Hu, Y.-C.; Cui, G.-B. Usefulness of the Texture Signatures Based on Multiparametric MRI in Predicting Growth Hormone Pituitary Adenoma Subtypes. Front. Oncol. 2021, 11, 640375. [Google Scholar] [CrossRef]
  28. Baysal, B.; Eser, M.B.; Dogan, M.B.; Kursun, M.A. Multivariable Diagnostic Prediction Model to Detect Hormone Secretion Profile From T2W MRI Radiomics with Artificial Neural Networks in Pituitary Adenomas. Medeni. Med. J. 2022, 37, 36–43. [Google Scholar] [CrossRef] [PubMed]
  29. Zeynalova, A.; Kocak, B.; Durmaz, E.S.; Comunoglu, N.; Ozcan, K.; Ozcan, G.; Turk, O.; Tanriover, N.; Kocer, N.; Kizilkilic, O.; et al. Preoperative Evaluation of Tumour Consistency in Pituitary Macroadenomas: A Machine Learning-Based Histogram Analysis on Conventional T2-Weighted MRI. Neuroradiology 2019, 61, 767–774. [Google Scholar] [CrossRef] [PubMed]
  30. Cuocolo, R.; Ugga, L.; Solari, D.; Corvino, S.; D’Amico, A.; Russo, D.; Cappabianca, P.; Cavallo, L.M.; Elefante, A. Prediction of Pituitary Adenoma Surgical Consistency: Radiomic Data Mining and Machine Learning on T2-Weighted MRI. Neuroradiology 2020, 62, 1649–1656. [Google Scholar] [CrossRef] [PubMed]
  31. Zhu, H.; Fang, Q.; Huang, Y.; Xu, K. Semi-Supervised Method for Image Texture Classification of Pituitary Tumors via CycleGAN and Optimized Feature Extraction. BMC Med. Inform. Decis. Mak. 2020, 20, 215. [Google Scholar] [CrossRef]
  32. Wan, T.; Wu, C.; Meng, M.; Liu, T.; Li, C.; Ma, J.; Qin, Z. Radiomic Features on Multiparametric MRI for Preoperative Evaluation of Pituitary Macroadenomas Consistency: Preliminary Findings. J. Magn. Reson. Imaging 2022, 55, 1491–1503. [Google Scholar] [CrossRef]
  33. Wang, H.; Zhang, W.; Li, S.; Fan, Y.; Feng, M.; Wang, R. Development and Evaluation of Deep Learning-Based Automated Segmentation of Pituitary Adenoma in Clinical Task. J. Clin. Endocrinol. Metab. 2021, 106, 2535–2546. [Google Scholar] [CrossRef]
  34. Chen, J.M.; Wan, Q.; Zhu, H.Y.; Ge, Y.Q.; Wu, L.L.; Zhai, J.; Ding, Z.M. The value of conventional magnetic resonance imaging based radiomic model in predicting the texture of pituitary macroadenoma. Zhonghua Yi Xue Za Zhi 2020, 100, 3626–3631. [Google Scholar] [CrossRef]
  35. Niu, J.; Zhang, S.; Ma, S.; Diao, J.; Zhou, W.; Tian, J.; Zang, Y.; Jia, W. Preoperative Prediction of Cavernous Sinus Invasion by Pituitary Adenomas Using a Radiomics Method Based on Magnetic Resonance Images. Eur. Radiol. 2019, 29, 1625–1634. [Google Scholar] [CrossRef]
  36. Fang, Y.; Wang, H.; Feng, M.; Chen, H.; Zhang, W.; Wei, L.; Pei, Z.; Wang, R.; Wang, S. Application of Convolutional Neural Network in the Diagnosis of Cavernous Sinus Invasion in Pituitary Adenoma. Front. Oncol. 2022, 12, 835047. [Google Scholar] [CrossRef]
  37. Kim, M.; Kim, H.S.; Park, J.E.; Park, S.Y.; Kim, Y.-H.; Kim, S.J.; Lee, J.; Lebel, M.R. Thin-Slice Pituitary MRI with Deep Learning-Based Reconstruction for Preoperative Prediction of Cavernous Sinus Invasion by Pituitary Adenoma: A Prospective Study. AJNR Am. J. Neuroradiol. 2022, 43, 280–285. [Google Scholar] [CrossRef]
  38. Park, H.; Nam, Y.K.; Kim, H.S.; Park, J.E.; Lee, D.H.; Lee, J.; Kim, S.; Kim, Y.-H. Deep Learning-Based Image Reconstruction Improves Radiologic Evaluation of Pituitary Axis and Cavernous Sinus Invasion in Pituitary Adenoma. Eur. J. Radiol. 2023, 158, 110647. [Google Scholar] [CrossRef] [PubMed]
  39. Zhang, C.; Heng, X.; Neng, W.; Chen, H.; Sun, A.; Li, J.; Wang, M. Prediction of High Infiltration Levels in Pituitary Adenoma Using MRI-Based Radiomics and Machine Learning. Chin. Neurosurg. J. 2022, 8, 21. [Google Scholar] [CrossRef] [PubMed]
  40. Fang, Y.; Wang, H.; Cao, D.; Cai, S.; Qian, C.; Feng, M.; Zhang, W.; Cao, L.; Chen, H.; Wei, L.; et al. Multi-Center Application of a Convolutional Neural Network for Preoperative Detection of Cavernous Sinus Invasion in Pituitary Adenomas. Neuroradiology 2024, 66, 353–360. [Google Scholar] [CrossRef] [PubMed]
  41. Feng, T.; Fang, Y.; Pei, Z.; Li, Z.; Chen, H.; Hou, P.; Wei, L.; Wang, R.; Wang, S. A Convolutional Neural Network Model for Detecting Sellar Floor Destruction of Pituitary Adenoma on Magnetic Resonance Imaging Scans. Front. Neurosci. 2022, 16, 900519. [Google Scholar] [CrossRef]
  42. Kocak, B.; Durmaz, E.S.; Kadioglu, P.; Polat Korkmaz, O.; Comunoglu, N.; Tanriover, N.; Kocer, N.; Islak, C.; Kizilkilic, O. Predicting Response to Somatostatin Analogues in Acromegaly: Machine Learning-Based High-Dimensional Quantitative Texture Analysis on T2-Weighted MRI. Eur. Radiol. 2019, 29, 2731–2739. [Google Scholar] [CrossRef]
  43. Park, Y.W.; Eom, J.; Kim, S.; Kim, H.; Ahn, S.S.; Ku, C.R.; Kim, E.H.; Lee, E.J.; Kim, S.H.; Lee, S.-K. Radiomics With Ensemble Machine Learning Predicts Dopamine Agonist Response in Patients With Prolactinoma. J. Clin. Endocrinol. Metab. 2021, 106, e3069–e3077. [Google Scholar] [CrossRef]
  44. Machado, L.F.; Elias, P.C.L.; Moreira, A.C.; Dos Santos, A.C.; Murta Junior, L.O. MRI Radiomics for the Prediction of Recurrence in Patients with Clinically Non-Functioning Pituitary Macroadenomas. Comput. Biol. Med. 2020, 124, 103966. [Google Scholar] [CrossRef] [PubMed]
  45. Zhang, Y.; Ko, C.-C.; Chen, J.-H.; Chang, K.-T.; Chen, T.-Y.; Lim, S.-W.; Tsui, Y.-K.; Su, M.-Y. Radiomics Approach for Prediction of Recurrence in Non-Functioning Pituitary Macroadenomas. Front. Oncol. 2020, 10, 590083. [Google Scholar] [CrossRef] [PubMed]
  46. Chen, Y.-J.; Hsieh, H.-P.; Hung, K.-C.; Shih, Y.-J.; Lim, S.-W.; Kuo, Y.-T.; Chen, J.-H.; Ko, C.-C. Deep Learning for Prediction of Progression and Recurrence in Nonfunctioning Pituitary Macroadenomas: Combination of Clinical and MRI Features. Front. Oncol. 2022, 12, 813806. [Google Scholar] [CrossRef]
  47. Shen, C.; Liu, X.; Jin, J.; Han, C.; Wu, L.; Wu, Z.; Su, Z.; Chen, X. A Novel Magnetic Resonance Imaging-Based Radiomics and Clinical Predictive Model for the Regrowth of Postoperative Residual Tumor in Non-Functioning Pituitary Neuroendocrine Tumor. Medicina 2023, 59, 1525. [Google Scholar] [CrossRef]
  48. Cuocolo, R.; Imbriaco, M. Machine Learning Solutions in Radiology: Does the Emperor Have No Clothes? Eur. Radiol. 2021, 31, 3783–3785. [Google Scholar] [CrossRef]
  49. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 Statement: An Updated Guideline for Reporting Systematic Reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef]
  50. Lambin, P.; Leijenaar, R.T.H.; Deist, T.M.; Peerlings, J.; de Jong, E.E.C.; van Timmeren, J.; Sanduleanu, S.; Larue, R.T.H.M.; Even, A.J.G.; Jochems, A.; et al. Radiomics: The Bridge between Medical Imaging and Personalized Medicine. Nat. Rev. Clin. Oncol. 2017, 14, 749–762. [Google Scholar] [CrossRef]
  51. The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Image-Based Phenotyping | Radiology. Available online: https://pubs.rsna.org/doi/10.1148/radiol.2020191145 (accessed on 25 August 2025).
  52. Staartjes, V.E.; Serra, C.; Muscas, G.; Maldaner, N.; Akeret, K.; van Niftrik, C.H.B.; Fierstra, J.; Holzmann, D.; Regli, L. Utility of Deep Neural Networks in Predicting Gross-Total Resection after Transsphenoidal Surgery for Pituitary Adenoma: A Pilot Study. Neurosurg. Focus. 2018, 45, E12. [Google Scholar] [CrossRef]
  53. Fan, Y.; Liu, Z.; Hou, B.; Li, L.; Liu, X.; Liu, Z.; Wang, R.; Lin, Y.; Feng, F.; Tian, J.; et al. Development and Validation of an MRI-Based Radiomic Signature for the Preoperative Prediction of Treatment Response in Patients with Invasive Functional Pituitary Adenoma. Eur. J. Radiol. 2019, 121, 108647. [Google Scholar] [CrossRef]
  54. Fan, Y.; Hua, M.; Mou, A.; Wu, M.; Liu, X.; Bao, X.; Wang, R.; Feng, M. Preoperative Noninvasive Radiomics Approach Predicts Tumor Consistency in Patients With Acromegaly: Development and Multicenter Prospective Validation. Front. Endocrinol. 2019, 10, 403. [Google Scholar] [CrossRef] [PubMed]
  55. Qian, Y.; Qiu, Y.; Li, C.-C.; Wang, Z.-Y.; Cao, B.-W.; Huang, H.-X.; Ni, Y.-H.; Chen, L.-L.; Sun, J.-Y. A Novel Diagnostic Method for Pituitary Adenoma Based on Magnetic Resonance Imaging Using a Convolutional Neural Network. Pituitary 2020, 23, 246–252. [Google Scholar] [CrossRef] [PubMed]
  56. Zhang, W.; Sun, M.; Fan, Y.; Wang, H.; Feng, M.; Zhou, S.; Wang, R. Machine Learning in Preoperative Prediction of Postoperative Immediate Remission of Histology-Positive Cushing’s Disease. Front. Endocrinol. 2021, 12, 635795. [Google Scholar] [CrossRef] [PubMed]
  57. Zhang, Y.; Chen, C.; Huang, W.; Cheng, Y.; Teng, Y.; Zhang, L.; Xu, J. Machine Learning-Based Radiomics of the Optic Chiasm Predict Visual Outcome Following Pituitary Adenoma Surgery. J. Pers. Med. 2021, 11, 991. [Google Scholar] [CrossRef]
  58. Villalonga, J.F.; Solari, D.; Cuocolo, R.; De Lucia, V.; Ugga, L.; Gragnaniello, C.; Pailler, J.I.; Cervio, A.; Campero, A.; Cavallo, L.M.; et al. Clinical Application of the “Sellar Barrier’s Concept” for Predicting Intraoperative CSF Leak in Endoscopic Endonasal Surgery for Pituitary Adenomas with a Machine Learning Analysis. Front. Surg. 2022, 9, 934721. [Google Scholar] [CrossRef]
  59. Gargya, S.; Jain, S. CAD System Design for Pituitary Tumor Classification Based on Transfer Learning Technique. Curr. Med. Imaging 2023, 20, E15734056246146. [Google Scholar] [CrossRef]
  60. Zhang, Y.; Zheng, J.; Huang, Z.; Teng, Y.; Chen, C.; Xu, J. Predicting Visual Recovery in Pituitary Adenoma Patients Post-Endoscopic Endonasal Transsphenoidal Surgery: Harnessing Delta-Radiomics of the Optic Chiasm from MRI. Eur. Radiol. 2023, 33, 7482–7493. [Google Scholar] [CrossRef]
  61. Zhang, Y.; Chen, C.; Huang, W.; Teng, Y.; Shu, X.; Zhao, F.; Xu, J.; Zhang, L. Preoperative Volume of the Optic Chiasm Is an Easily Obtained Predictor for Visual Recovery of Pituitary Adenoma Patients Following Endoscopic Endonasal Transsphenoidal Surgery: A Cohort Study. Int. J. Surg. 2023, 109, 896–904. [Google Scholar] [CrossRef]
  62. Behzadi, F.; Alhusseini, M.; Yang, S.D.; Mallik, A.K.; Germanwala, A.V. A Predictive Model for Intraoperative Cerebrospinal Fluid Leak During Endonasal Pituitary Adenoma Resection Using a Convolutional Neural Network. World Neurosurg. 2024, 189, e324–e330. [Google Scholar] [CrossRef]
  63. Da Mutten, R.; Zanier, O.; Ciobanu-Caraus, O.; Voglis, S.; Hugelshofer, M.; Pangalu, A.; Regli, L.; Serra, C.; Staartjes, V.E. Automated Volumetric Assessment of Pituitary Adenoma. Endocrine 2024, 83, 171–177. [Google Scholar] [CrossRef]
  64. Ishimoto, Y.; Ide, S.; Watanabe, K.; Oyu, K.; Kasai, S.; Umemura, Y.; Sasaki, M.; Nagaya, H.; Tatsuo, S.; Nozaki, A.; et al. Usefulness of Pituitary High-Resolution 3D MRI with Deep-Learning-Based Reconstruction for Perioperative Evaluation of Pituitary Adenomas. Neuroradiology 2024, 66, 937–945. [Google Scholar] [CrossRef] [PubMed]
  65. Zhang, Y.; Huang, Z.; Zhao, Y.; Xu, J.; Chen, C.; Xu, J. Radiomics Using Multiparametric Magnetic Resonance Imaging to Predict Postoperative Visual Outcomes of Patients with Pituitary Adenoma. Asian J. Surg. 2024, 48, 166–172. [Google Scholar] [CrossRef] [PubMed]
  66. Agosti, E.; Cuocolo, R.; Mangili, M.; Rampinelli, V.; Veiceschi, P.; Cappelletti, M.; Panciani, P.P.; Piazza, A.; Bove, I.; Solari, D.; et al. Radiomics for Preoperative Assessment of Pituitary Adenoma Consistency with T2-Weighted MRI: A Multicenter Study. J. Neurol. Surg. Part B Skull Base 2025. [Google Scholar] [CrossRef]
  67. Kocak, B.; Keles, A.; Kose, F.; Sendur, A. Quality of Radiomics Research: Comprehensive Analysis of 1574 Unique Publications from 89 Reviews. Eur. Radiol. 2025, 35, 1980–1992. [Google Scholar] [CrossRef]
  68. Galm, B.P.; Martinez-Salazar, E.L.; Swearingen, B.; Torriani, M.; Klibanski, A.; Bredella, M.A.; Tritos, N.A. MRI Texture Analysis as a Predictor of Tumor Recurrence or Progression in Patients with Clinically Non-Functioning Pituitary Adenomas. Eur. J. Endocrinol. 2018, 179, 191–198. [Google Scholar] [CrossRef]
  69. Fan, Y.; Jiang, S.; Hua, M.; Feng, S.; Feng, M.; Wang, R. Machine Learning-Based Radiomics Predicts Radiotherapeutic Response in Patients With Acromegaly. Front. Endocrinol. 2019, 10, 588. [Google Scholar] [CrossRef]
  70. Tricco, A.C.; Lillie, E.; Zarin, W.; O’Brien, K.K.; Colquhoun, H.; Levac, D.; Moher, D.; Peters, M.D.J.; Horsley, T.; Weeks, L.; et al. PRISMA Extension for Scoping Reviews (PRISMAScR): Checklist and Explanation. Ann. Intern. Med. 2018, 169, 467–473. [Google Scholar] [CrossRef]
Figure 1. Newcastle-Ottawa Scale variables.
Figure 1. Newcastle-Ottawa Scale variables.
Jcm 14 06595 g001
Figure 2. PRISMA flow chart.
Figure 2. PRISMA flow chart.
Jcm 14 06595 g002
Table 1. Radiomics quality assessment through RQS and IBSI checklists.
Table 1. Radiomics quality assessment through RQS and IBSI checklists.
Author, YearMethodRQS (36)IBSI (%)
Zhang, 2018 [21]Handcrafted1443
Niu, 2018 [35]Handcrafted1643
Kocak, 2018 [42]Handcrafted1286
Staartjes, 2018 [52]Deep Learning1314
Zeynalova, 2019 [29]Handcrafted1486
Ugga, 2019 [15]Handcrafted1471
Fan, 2019 [53]Handcrafted1643
Fan, 2019 [16]Handcrafted1843
Fan, 2019 [54]Handcrafted1743
Qian, 2020 [55]Deep Learning1029
Peng, 2020 [20]Handcrafted1243
Cuocolo, 2020 [30]Handcrafted1471
Machado, 2020 [44]Handcrafted1086
Park, 2020 [26]Handcrafted13100
Zhu, 2020 [31]Deep Learning929
Chen, 2020 [34]Handcrafted1443
Zhang, 2020 [45]Handcrafted114
Li, 2021 [24]Deep Learning1371
Liu, 2021 [27]Handcrafted1414
Park, 2021 [43]Handcrafted1271
Wan, 2021 [32]Handcrafted12100
Wang, 2021 [33]Handcrafted1314
Zhang, 2021 [56]Handcrafted1414
Zhang, 2021 [57]Handcrafted1243
Baysal, 2022 [28]Deep Learning1857
Chen, 2022 [46]Multimodal1343
Fang, 2022 [36]Deep Learning1429
Feng, 2022 [41]Deep Learning1529
Kim, 2022 [37]Deep Learning1457
Park, 2022 [38]Deep Learning1357
Rui, 2022 [22]Handcrafted1729
Shu, 2022 [17]Deep Learning1357
Villalonga, 2022 [58]Handcrafted1314
Zhang, 2022 [39]Handcrafted1471
Gargya, 2023 [59]Deep Learning1057
Shen, 2023 [47]Handcrafted1686
Wang, 2023 [25]Handcrafted1657
Wang, 2023 [23]Handcrafted1586
Zhang, 2023 [60]Handcrafted1686
Zhang, 2023 [61]Deeep Learning1714
A, 2024 [19]Handcrafted1457
Behzadi, 2024 [62]Deep Learning1543
Da Mutten, 2024 [63]Deep Learning1343
Fang, 2024 [40]Deep Learning1429
Ishimoto, 2024 [64]Deep Learning−271
Liu, 2024 [18]Combined1986
Taslicay, 2024 [13]Handcrafted1486
Zhang, 2024 [65]Handcrafted15100
Agosti, 2025 [66]Handcrafted1571
Table 2. Summary of anamnestic and radiomics data of the studies included in the sys-tematic review. Abbreviations: NF, non-functioning; OC, optic chiasma; ICA, internal ca-rotid artery; GH, growth hormone; IFPA, invasive functional pituitary adenoma; PMA, pi-tuitary macro-adenoma; PRL, prolactin; ACTH, adrenocorticotropic hormone; CPA, cystic pituitary adenoma; RBF-SVM, support vector ma-chine with the radial basis function ker-nel; SVM, support vector machine; MLP, multi-layer perceptron; ANN, artificial neural network; CRNN, convolutional recurrent neural network; kNN, k-nearest neighbors; NB, Naïve Bayes; ET, Extra Trees; RF, Random Forest; LightGBM, light gradient boosting ma-chine; DT, Decision Tree; AdaBoost, Adaptive Boosting; XGBoost, Extreme Gradient Boosting; GBDT, gradient boosting decision tree; BFGS, Broy-den-Fletcher-Goldfarb-Shanno; IF, isolation forest; oSVM, one-class support vector machine; PNN, probabilistic neural network; SNR, signal-to-noise ratio; CNR, con-trast-to-noise ratio; ICA, internal carotid artery; FOS, First Order Statistics; GLCM, Gray Level Gray Level Co-occurrence Matrix; GLRLM, Gray Level Run Length Matrix; GLSZM, Gray Level Size Zone Matrix; NGTDM, Neighboring Gray Tone Difference Matrix; GLDM, Gray Level Dependence Matrix; FOSW, First Order Statistics applied over Wavelet-filtered images; FOSG, First Order Statistics applied over Gradient filtered images; CRNN, Con-volutional Recurrent Neural Network.
Table 2. Summary of anamnestic and radiomics data of the studies included in the sys-tematic review. Abbreviations: NF, non-functioning; OC, optic chiasma; ICA, internal ca-rotid artery; GH, growth hormone; IFPA, invasive functional pituitary adenoma; PMA, pi-tuitary macro-adenoma; PRL, prolactin; ACTH, adrenocorticotropic hormone; CPA, cystic pituitary adenoma; RBF-SVM, support vector ma-chine with the radial basis function ker-nel; SVM, support vector machine; MLP, multi-layer perceptron; ANN, artificial neural network; CRNN, convolutional recurrent neural network; kNN, k-nearest neighbors; NB, Naïve Bayes; ET, Extra Trees; RF, Random Forest; LightGBM, light gradient boosting ma-chine; DT, Decision Tree; AdaBoost, Adaptive Boosting; XGBoost, Extreme Gradient Boosting; GBDT, gradient boosting decision tree; BFGS, Broy-den-Fletcher-Goldfarb-Shanno; IF, isolation forest; oSVM, one-class support vector machine; PNN, probabilistic neural network; SNR, signal-to-noise ratio; CNR, con-trast-to-noise ratio; ICA, internal carotid artery; FOS, First Order Statistics; GLCM, Gray Level Gray Level Co-occurrence Matrix; GLRLM, Gray Level Run Length Matrix; GLSZM, Gray Level Size Zone Matrix; NGTDM, Neighboring Gray Tone Difference Matrix; GLDM, Gray Level Dependence Matrix; FOSW, First Order Statistics applied over Wavelet-filtered images; FOSG, First Order Statistics applied over Gradient filtered images; CRNN, Con-volutional Recurrent Neural Network.
Author, YearPatients MR SequencesML Algorithms Used
or AI Model (ML or DL)
Software Used for Features ExtractionType of Radiomic FeaturesTumor SubtypeAUC ROC (or Accuracy %)
Total (N)Training Dataset (%)Test Dataset (%)Validation Dataset
(%)
Zhang, 2018 [21]1126733-T1WI
CE-T1WI
RBF-SVMMatLabIntensity
Shape and size
Texture
Wavelet-based
NFT1WITraining 0.831
Test0.804
CE-T1WITraining0.634
Test0.510
Niu, 2018 [35]1945050-T2WI
CE-T1WI
Linear SVMMatLabIntensity
Shape and size
Texture
Wavelet-based
ICA wrapped degree
Any Training 0.852
Test 0.826
Kocak, 2018 [42]47---T2WIkNN
C4.5
PyRadiomics
(3D-Slicer extension)
FOS
GLCM
GLRLM
GLSZM
NGTDM
GLDM
Wavelet-based
GHQuantitative TA0.847
ROI-based quantitative rSI0.581
Staartjes, 2018 [52]140---CE-T1WIMLP (DL)
LR
--AnyMLP (DL)0.962
LR0.86
Zeynalova, 2019 [29]55---T2WIANNPyRadiomics-Any (PMA)0.710
Ugga, 2019 [15]896040-T2WIkNNPyRadiomicsShape
FOS
GLCM
GLRLM
GLSZM
NGTDM
GLDM
Any0.87
Fan, 2019 [53]1636634-T1WI
T2WI
CE-T1WI
SVMInhouse
program written in Matlab 2015b
Intensity
Shape and size
Texture
Wavelet-based

Any (IFPA)
Training 0.832
Validation0.811
Fan, 2019 [16]1386535-T1WI
T2WI
CE-T1WI
SVMPyRadiomicsShape
FOS
Texture
Wavelet-based
GHTraining 0.96
Validation0.89
Fan, 2019 [54]188533116T1WI
T2WI
CE-T1WI
SVM
LR
PyRadiomicsFOS
Shape and size
GLCM
GLRLM
GLSZM
Wavelet-based
GHTraining0.83
Validation0.81
Qian, 2020 [55]1498020-T1WI
T2WI
CNN--Any-
Peng, 2020 [20]235---T1WI
T2WI
CE-T1WI
SVM
kNN
NB
PyRadiomicsShape
GLCM
GLRLM
GLSZM
GLDM
AnySVMT1WI0.8762
T2WI0.9549
CE-T1WI0.8806
KNNT1WI0.8598
T2WI0.9266
CE-T1WI0.7947
NBT1WI0.8492
T2WI0.9324
CE-T1WI0.8309
Cuocolo, 2020 [30]898020-T2WIETPyRadiomicsHistogram
GLCM
GLRLM
GLSZM
NGTDM
GLDM
Any0.99
Machado, 2020 [44]27---CE-T1WI kNN
RF
LR
SVM
MLP
PyRadiomicsFOS
GLCM
GLRLM
GLSZM
NGTDM
GLDM
FOSW
FOSG
NF MLP 0.929
RF 0.877
SVM0.860
LR0.929
kNN0.979
Park, 2020 [26]69---T2WI-PyRadiomicsShape
FOS
GLCM
GLRLM
GLSZM
NGTDM
GLDM
GH0.834
Zhu, 2020 [31]374---T1WI
T2WI
-DenseNet-ResNet based
Autoencoder framework CRNN
-Any-
Chen, 2020 [34]1017129-T1WI
T2WI
CE-T1WI
---AnyT1WI
Training 0.90
Test0.91
T2WITraining 0.86
Test 0.83
CE-T1WITraining 0.90
Test0.89
CombinedTraining 0.92
Test 0.91
Zhang, 2020 [45] 50---T2WI
CE-T1WI
SVMPython (v. 3.10.7)SVR
GLCM
NGTDM
NF0.87
Li, 2021 [24]18554Group 1 24
Group 2 13
9T1WI
T2WI
CE-T1WI
T2WI-FLAIR
CNN--AnyInternal Validation 10.8063
Internal validation 20.7881
External
independent testing
0.8478
Liu, 2021 [27]49---T1WI
T2WI
CE-T1WI
-PyRadiomicsShape
FOS
GLCM
GLRLM
GLSZM
NGTDM
GLDM
GHROI1 T1C0.893
T1WI0.918
T2WI0.823
Radiomics 0.908
ROI2 T1C0.860
T1WI 0.898
T2WI0.812
Radiomics0.880
Park, 2021 [43]1778020-T2WIRF
LightGBM
ET
PyRadiomicsFOS
GLCM
GLRLM
GLSZM
PRLTraining0.81
Test0.81
Wan, 2021 [32]1566931-T1WI
T2WI
CE-T1WI
RF
SVM
MatLab-Any
(PMA)
0.90
Wang, 2021 [33]1638020-T1WI
T2WI
CE-T1WI
Linear SVM
RF
ET
kNN
DT
GDBT
AdaBoost
MLP
XGBoost
PyRadiomicsKnosp grade
adenoma volume
adenoma diameters
OC height
ICA contact degree
AnyLinear SVC 0.762
RF 0.824
ET 0.865
KNN 0.920
DT 0.597
GBDT0.807
AdaBoost 0.817
MLP 0.856
XGBoost 0.826
Zhang, 2021 [56]10458020-T1WI
T2WI
CE-T1WI
GBDT
RF
AdaBoost
XGBoost
LR
NB
DT
MLP
--ACTHXGBoost0.712
GBDT 0.734
RF 0.726
AdaBoost 0.699
NB 0.681
LR0.701
DT 0.664
MLP0.700
Stacking 0.743
Zhang, 2021 [57]131---T2WISVM
RF
LDA
PyRadiomicsShape
FOS
GLCM
GLRLM
GLSZM
NGTDM
GLDM
AnySVM 0.824
LDA0.801
RF0.751
Baysal, 2022 [28]130701515T2WIANN
(BFGS algorithm)
PyRadiomicsShape
FOS
High order features
AnyNF 0.87
GH0.89
PRL0.95
ACTH 0.94
PH0.74
FSH-LH 0.96
TSH0.95
Chen, 2022 [46]7880-20T2WI
CE-T1WI
MLP
CNN
--NFCNN0.84
MLP0.73
Multimodal CNN-MLP0.85
Fang, 2022 [36]371---CE-T1WICNN-- Validation fold 10.89
Validation fold 20.98
Validation fold 30.89
Validation fold 40.96
Validation fold 50.93
Feng, 2022 [41]695---CE-T1WICNN- Any0.98
Kim, 2022 [37]67---CE-T1WIDL-Depth of Invasion
Degree of contact with intracavernous ICA
Any1-mm-slice MR0.79
3-mm-slice MR0.61
Park, 2022 [38]104---CE-T1WIDL--AnyReader 11-mm-slice MR
0.91
3-mm-slice MR
0.88
Reader 21-mm-slice MR
0.92
3-mm-slice MR
0.87
Rui, 2022 [22]3028020-T1WI
T2WI
CE-T1WI
LDA
SVM
RF
GBM
PyRadiomics (3D-Slicer extension)Shape (3D)
Shape (2D)
FOS
GLCM
GLRLM
GLSZM
NGTDM
GLDM
NFEnsemble0.927
Shu, 2022 [17]2618020-T2WI
CE-T1WI
DLU-net neural network-AnyCE-T1WI87.4%,
T2WI89.4%
CE-T1WI + T2WI89.2%
Villalonga, 2022 [58]1448020-T1WI
T2WI
CE-T1WI
IF
local
outlier factor oSVM
Python-Any-
Zhang, 2022 [39]19690-10CE-T1WISVMPyRadiomicsFOS
Shape (3D)
GLCM
GLSZM
GLRLM
NGTDM
GLDM
Any 0.86
Gargya, 2023 [59]-----CNN (VGG 16, VGG19, ResNet-50, Inception V3)
SVM
kNN
PNN
--AnyVGG16 89%
VGG19 91.5%
Resnet 50 91%
Inception V3 96%
Shen, 2023 [47]1147030-T1WI
T2WI
CE-T1WI
LRR softwareShape
FOS
GLCM
GLRLM
GLSZM
NGTDM
Wavelet-based
NFClinical + radiomics features 0.929
Only radiomics features0.844
Wang, 2023 [25]2467822-CE-T1WILRLIFExSHAPE_Volume (mL)
SHAPE_Volume (vx)
SHAPE_Sphericity
SHAPE_Surface area
SHAPE_Compacity
DISCRETIZED_Q3
DISCRETIZED_Kurtosis
GLCM
GLRM
NGLDM
GLZLM
AnyTraining0.916
Test0.935
Wang, 2023 [23]2958812-T1WI
T2WI
CE-T1WI
Elasticnet LinearSVC
RF
ET
kNN
DT
GBDT AdaBoost
MLP
XGBoost
PyRadiomics-NFLinearSVCTraining0.931
Test0.937
Elasticnet Training0.908
Test0.915
RF Training0.848
Test0.82
ET Training0.831
Test0.845
KNN Training0.836
Test0.762
DT Training0.615
Test0.622
GBDT Training0.862
Test0.819
AdaBoost Training0.667
Test0.793
MLP Training0.903
Test0.905
XGBoost Training0.879
Test0.868
Zhang, 2023 [60]1307030-T2WILinear SVMPyRadiomicsShape
Histogram
Texture
Wavelet-based
AnyDelta-radiomic model Training0.821
Test0.811
Combined model Training0.841
Test0.840
Zhang, 2023 [61]2208020-T2WICNN--Any-
A, 2024 [19]2226733-T1WI
T2WI
CE-T1WI
SVM
LR
RF
MLP
PyRadiomicsShape features
FOS
GLCM
GLRLM
GLSZM
AnyMulti-sequence (LR)Training0.935
Test0.886
Validation0.840
AnyMulti-sequence
(MLP)
Training0.957
Test0.913
Validation0.758
Behzadi, 2024 [62]2207030-T1WI
T2WI
CNN--Any 0.898
Da Mutten, 2024 [63]213919-CE-T1WICNN--Any-
Fang, 2024 [40]7298911-CE-T1WICNN
(ResNet-50)
--Any 0.92
Ishimoto, 2024 [64]24---CE-T1WIDL-SNR
CNR
Any-
Liu, 2024 [18]2478020-T1WI
T2WI
CE-T1WI
LR
SVM
MLP
PyRadiomics
(ML model)
ResNet50 (DL model)
FOS
Shape
Texture
AnyML modelLRTraining0.789
Test0.547
SVMTraining0.904
Test0.645
MLPTraining0.812
Test0.620
DL modelLRTraining1.000
Test0.808
SVMTraining1.000
Test0.812
MLPTraining1.000
Test0.765
DLR modelLRTraining1.000
Test0.810
SVMTraining1.000
Test0.810
MLPTraining0.994
Test0.778
Taslicay, 2024 [13]65---T1WI
T2WI
CE-T1WI
SVM
LR
LGB
PyRadiomics (3D-slicer)FOS
GLCM
GLRLM
GLSZM
NGTDM
GLDM
Any
(CPA)
SVM0.956
LR0.956
LGB0.951
Zhang, 2024 [65]1527030-T1WI
T2WI
CE-T1WI
SVMPyRadiomicsFOS
Shape (3D)
Shape (2D)
GLCM
GLRLM
GLSZM
NGTDM
GLDM
AnyT1WI Training0.784
Test0.767
T2WITraining0.724
Test0.763
CE-T1WITraining0.822
Test0.794
MultiparametricTraining0.851
Test0.847
Agosti, 2025 [66]394801010T2WIETPyRadiomicsShape
FOS
GLCM
GLRLM
GLSZM
NGTDM
Wavelet-based
Any 0.59
Table 3. Summary of radiomics applications for PAs. Abbreviations: CI, confidential interval; CS, cavernous sinus; SF, sellar floor; PIT-1, positive pituitary transcription factor 1; ICA, internal carotid artery; NFPA, non-functioning pituitary adenoma; NCA, null cell adenoma; CSF, cerebrospinal fluid; GTR, gross total resection; PMA, pituitary macroadenoma; SA, somatostatin analogues; DA, dopamine agonist; MRI, magnetic resonance imaging; GH, growth hormone; IFPA, invasive functional pituitary adenoma.
Table 3. Summary of radiomics applications for PAs. Abbreviations: CI, confidential interval; CS, cavernous sinus; SF, sellar floor; PIT-1, positive pituitary transcription factor 1; ICA, internal carotid artery; NFPA, non-functioning pituitary adenoma; NCA, null cell adenoma; CSF, cerebrospinal fluid; GTR, gross total resection; PMA, pituitary macroadenoma; SA, somatostatin analogues; DA, dopamine agonist; MRI, magnetic resonance imaging; GH, growth hormone; IFPA, invasive functional pituitary adenoma.
General Field of ApplicationSpecific EndpointAuthor, Year
Prediction of consistencyDistinction between soft and fibrous tumorsZeynalova, 2019 [29]
Fan, 2019 [54]
Cuocolo, 2020 [30]
Zhu, 2020 [31]
Chen, 2020 [34]
Wan, 2021 [32]
Wang, 2021 [33]
Agosti, 2025 [66]
Prediction of invasivenessPrediction of CS invasionNiu, 2018 [35]
Fang, 2022 [36]
Kim, 2022 [37]
Park, 2022 [38]
Zhang, 2022 [39]
Fang, 2024 [40]
Prediction of SF invasionFeng, 2022 [41]
Prediction of
histopathological
features
Preoperative prediction
of Ki67
Ugga, 2019 [15]
Fan, 2019 [16]
Shu, 2022 [17]
Prediction of Ki67 and PIT-1Liu, 2024 [18]
Distinction between high-risk and low-risk PitNETs (WHO 2021 classifcation)A, 2024 [19]
Distinction among Tpit, Pit-1, and SF-1 subfamiliesPeng, 2020 [20]
Distinction between NCAs and other NFPA subtypesZhang, 2018 [21]
Distinction between SCAs and other NFPA subtypesRui, 2022 [22]
Wang, 2023 [23]
Distinction between functioning and nonfunctioning PAsLi, 2021 [24]
Prediction of aggressiveness (Ki-67 ≥ 3%, positive p53 staining, high mitotic count)Wang, 2023 [25]
Prediction of granulation pattern of GH-secreting PAsPark, 2020 [26]
Liu, 2021 [27]
Prediction of hormonal secretion patternsBaysal, 2022 [28]
Prediction of response to surgical treatmentPrediction of post-surgical recurrence or regrowthMachado, 2020 [44]
Zhang, 2020 [45]
Chen, 2022 [46]
Shen, 2023 [47]
Prediction of post-surgical visual outcomeZhang, 2021 [57]
Zhang, 2023 [61]
Zhang, 2023 [60]
Zhang, 2024 [65]
Prediction of post-surgical biochemical remissionFan, 2019 [53]
Zhang, 2021 [56]
Prediction of intraoperative CSF leak Villalonga, 2022 [58]
Behzadi, 2024 [62]
Prediction of the likelihood of GTRStaartjes, 2018 [52]
Prediction of response to non-surgical treatmentPrediction of response to SA in GH-secreting PMAsKocak, 2018 [42]
Prediction of response to DAsPark, 2021 [43]
Diagnose PAsDetection of
pituitary tumors from brain MRI
Qian, 2020 [55]
Gargya, 2023 [59]
Ishimoto, 2024 [64]
Distinction between pituitary cystic adenomas and Rathke’s cleft cystsTaslicay, 2024 [13]
Automated tumor segmentation and volumetry Lesion detection, evaluation of progression of pituitary incidentalomas and detection of residual tumor.Da Mutten, 2024 [63]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Agosti, E.; Mangili, M.; Panciani, P.P.; Ugga, L.; Rampinelli, V.; Ravanelli, M.; Fiorindi, A.; Fontanella, M.M. Radiomics in Pituitary Adenomas: A Systematic Review of Clinical Applications and Predictive Models. J. Clin. Med. 2025, 14, 6595. https://doi.org/10.3390/jcm14186595

AMA Style

Agosti E, Mangili M, Panciani PP, Ugga L, Rampinelli V, Ravanelli M, Fiorindi A, Fontanella MM. Radiomics in Pituitary Adenomas: A Systematic Review of Clinical Applications and Predictive Models. Journal of Clinical Medicine. 2025; 14(18):6595. https://doi.org/10.3390/jcm14186595

Chicago/Turabian Style

Agosti, Edoardo, Marcello Mangili, Pier Paolo Panciani, Lorenzo Ugga, Vittorio Rampinelli, Marco Ravanelli, Alessandro Fiorindi, and Marco Maria Fontanella. 2025. "Radiomics in Pituitary Adenomas: A Systematic Review of Clinical Applications and Predictive Models" Journal of Clinical Medicine 14, no. 18: 6595. https://doi.org/10.3390/jcm14186595

APA Style

Agosti, E., Mangili, M., Panciani, P. P., Ugga, L., Rampinelli, V., Ravanelli, M., Fiorindi, A., & Fontanella, M. M. (2025). Radiomics in Pituitary Adenomas: A Systematic Review of Clinical Applications and Predictive Models. Journal of Clinical Medicine, 14(18), 6595. https://doi.org/10.3390/jcm14186595

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop