You are currently viewing a new version of our website. To view the old version click .
Cancers
  • Article
  • Open Access

4 December 2025

Enhancing PRRT Outcome Prediction in Neuroendocrine Tumors: Aggregated Multi-Lesion PET Radiomics Incorporating Inter-Tumor Heterogeneity

,
,
,
,
,
,
,
,
and
1
Department of Physics and Astronomy, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
2
Department of Basic and Translational Research, BC Cancer Research Institute, Vancouver, BC V5Z 1L3, Canada
3
Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, 1205 Geneva, Switzerland
4
Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
This article belongs to the Section Methods and Technologies Development

Simple Summary

This study explored how aggregating radiomic features from multiple PET-identified lesions can predict disease progression and Time to Progression (TTP) in patients with Neuroendocrine Tumors (NETs) treated with [177Lu]Lu-DOTA-TATE. Using different lesion selection and aggregation strategies, we found that selecting the top lesions sorted by descending minimum Standard Uptake Value (SUVmin) improved progression prediction, while aggregating all lesion features or using the top five by descending SUVmean yielded more accurate TTP predictions. The best-performing models were Logistic Regression (LR) for progression and Random Survival Forest (RSF) for TTP, highlighting the value of spatial heterogeneity features. Overall, the findings show that strategic lesion selection and aggregation can enhance predictive accuracy and support more personalized PRRT treatment planning.

Abstract

Introduction: Peptide Receptor Radionuclide Therapy (PRRT) with [177Lu]Lu-DOTA-TATE is effective in treating advanced Neuroendocrine Tumors (NETs), yet predicting individual response in this treatment remains a challenge due to inter-lesion heterogeneity. There is a lack of standardized, effective methods for using multi-lesion radiomics to predict progression and Time to Progression (TTP) in PRRT-treated patients. This study evaluated how aggregating radiomic features from multiple PET-identified lesions can be used to predict disease progression (event [progression and death] vs. event-free) and TTP. Methods: Eighty-one NETs patients with multiple lesions underwent pre-treatment PET/CT imaging. Lesions were segmented and ranked by minimum Standard Uptake Value (SUVmin) (both descending and ascending), SUVmean, SUVmax, and volume (descending). From each sorting, the top one, three, and five lesions were selected. For the selected lesions, radiomic features were extracted (using the Pyradiomics library) and lesion aggregation was performed using stacked vs. statistical methods. Eight classification models along with three feature selection methods were used to predict progression, and five survival models and three feature selection methods were used to predict TTP under a nested cross-validation framework. Results: The overall appraisal showed that sorting lesions based on SUVmin (descending) yields better classification performance in progression prediction. This is in addition to the fact that aggregating features extracted from all the lesions, as well as the top five lesions sorted by SUVmean, lead to the highest overall performance in TTP prediction. The individual appraisal in progression prediction models trained on the single top lesion sorted by SUVmin (descending) showed the highest recall and specificity despite data imbalance. The best-performing model was the Logistic Regression (LR) classifier with Recursive Feature Elimination (RFE) (recall: 0.75, specificity: 0.77). In TTP prediction, the highest concordance index was obtained using a Random Survival Forest (RSF) trained on statistically aggregated features from the top five lesions ranked by SUVmean, selected via Univariate C-Index (UCI) (C-index = 0.68). Across both tasks, features from the Gray Level Size Zone Matrix (GLSZM) family were consistently among the most predictive, highlighting the importance of spatial heterogeneity in treatment response. Conclusions: This study demonstrates that informed lesion selection and tailored aggregation strategies significantly impact the predictive performance of radiomics-based models for progression and TTP prediction in PRRT-treated NET patients. These approaches can potentially enhance model accuracy and better capture tumor heterogeneity, supporting more personalized and practical PRRT implementation.

1. Introduction

Peptide Receptor Radionuclide Therapy (PRRT) has emerged as a pivotal treatment for patients with advanced Neuroendocrine Tumors (NETs), offering targeted cytotoxicity via radiolabeled somatostatin analogs [1]. The U.S. FDA-approved radiopharmaceutical [177Lu]Lu-DOTA-TATE specifically binds to somatostatin receptors (SST) expressing gastroenteropancreatic NETs and is used particularly in patients with progressive disease following conventional therapy [2]. Despite its clinical efficacy, the ability to accurately predict individual treatment outcomes remains limited.
Prior to the treatment, Positron Emission Tomography (PET) imaging using SSTR-targeted radiopharmaceuticals often uncovers a substantial tumor burden in patients. This typically includes not only the primary tumor, but also multiple metastatic lesions scattered throughout the body [3]. Each of these metastatic lesions can display unique inter- and intra-lesion biological characteristics and behaviors, such as differences in growth rate, receptor density, metabolic activity, and response to therapy [4]. These variations can significantly impact disease progression and treatment outcomes. Therefore, understanding and assessing the heterogeneity within and among these lesions is crucial for accurate prognosis and effective personalized treatment planning [5].
Radiomics provides a powerful, non-invasive means to quantitatively analyze imaging features from PET/CT scans, offering the potential to uncover prognostic and predictive biomarkers relevant to PRRT response [6]. Traditionally, analyses have centered on a single representative lesion, typically the largest or most metabolically active, potentially missing valuable clinical insights from other lesions [5]. However, determining the optimal approach to integrate radiomic data from multiple lesions within a single patient remains a challenge, particularly due to the variability in lesion count across individuals [3,4,7].
Several studies have shown that incorporating additional Regions of Interest (ROIs), such as other lesions [4,5,8], the peritumoral area [9,10], and even healthy organs [11] alongside the primary tumor can enhance outcome prediction. In this context, Captier et al. [4] introduced RadShap, a model- and modality-agnostic explanation tool for multiregional radiomic models, built on Shapley values [12]. RadShap assigns prediction contributions to individual image regions. Aggregative models using RadShap can outperform single-lesion models for both survival prediction and tumor subtype classification, highlighting the prognostic value of regions beyond the primary tumor. Wilk et al. [8] assessed integrating radiomic data from multiple lesions into survival models for lung cancer. Using PET and PET interpolated to CT resolution features from 115 patients, they tested two strategies: aggregating lesion features or combining model outputs. Survival models were evaluated with Monte Carlo cross-validation. Including all lesions significantly improved prediction (C-index from ~0.61 to ~0.63), showing added value beyond the primary tumor.
Salimi et al. [10] explored improving survival prediction using machine learning (ML) in 2926 head and neck cancer CT scans by including radiomic features from peritumoral tissue around the tumor. The results showed that adding peritumoral radiomics consistently improved prediction, increasing the C-index from 0.64 (tumor only) up to 0.68, highlighting the prognostic value of tissue surrounding the tumor. In another study, Salimi et al. [11] evaluated the added value of radiomic features extracted from healthy organs (referred to as Organomics, or non-tumor radiomics) in predicting survival for non-small cell lung cancer patients using PET/CT images and ML. Using data from 154 patients, features from tumors and 33 healthy organ regions were analyzed across various input combinations, feature selection methods, and ML models. The results showed that Organomics (especially from PET and CT) significantly contributed to model performance.
Despite these advances, there remains no standardized or validated approach for leveraging radiomic information from multiple lesions to predict progression and Time to Progression (TTP) in PRRT-treated patients. In this study, we hypothesize that metastatic lesions differ in their biological contribution to treatment outcome, and that ranking lesions according to imaging-derived markers of SSTR expression and tumor burden provides a meaningful way to identify those most informative for prediction. Guided by this rationale, we evaluated five lesion-ranking strategies: descending maximum Standard Uptake Value (SUVmax), SUVmean, and lesion volume, as well as both descending and ascending SUVmin. Uptake-based rankings reflect differences in receptor density and viable tumor activity, while lesion volume captures the prognostic impact of tumor burden. Including both high- and low-uptake SUVmin orderings enables assessment of lesions with minimal radiopharmaceutical accumulation, which may represent biologically distinct disease components.
This study systematically compares combinations of lesion prioritization and feature aggregation methods to determine how radiomic features from multiple lesions can be optimally utilized to predict disease progression and TTP following [177Lu]Lu-DOTA-TATE therapy. Our goal is to provide a robust framework that captures inter-tumor heterogeneity and improves model performance, ultimately supporting more accurate patient stratification for PRRT.

2. Materials and Methods

The overview of the current study is provided in Figure 1. In the following sections, we further elaborate our methods.
Figure 1. Workflow of the study design. (NET: Neuroendocrine Tumors, RFE: Recursive Feature Elimination, MRMR: Minimum Redundancy Maximum Relevance, UCI: Univariate C-Index, MI: Mutual Information, RF: Random Forest, GNB: Gaussian Naive Bayes, DT: Decision Tree, XGB: eXtreme Gradient Boosting, MLP: Multi-layer Perceptron, LR: Logistic Regression, SVM: Support Vector Machine, KNN: K-Nearest Neighbors, CB: CatBoost, CoxPH: Cox Proportional Hazards, GLMB: Generalized Linear Model Boosting, GLMN: Generalized Linear Model Net, RSF: Random Survival Forests, AUCC: Area Under the Characteristic Curve, ACC: Accuracy, PRE: Precision, REC: Recall, BAC: Balanced Accuracy, SPE: Specificity).

2.1. Data Collection

In this study, we retrospectively analyzed 81 NETs patients (13 progression-free) who underwent pre-treatment 68Ga-DOTA-TOC and 68Ga-DOTA-TATE PET/CT imaging (similar accuracy in detection of SSTR positivity [13]) followed by [177Lu]Lu-DOTA-TATE therapy. Detailed patients’ clinical data is provided in the study by Zamboni et al. [3]. In the current cohort of 81 patients, the median follow-up was 34 months (range 0.7–61.1 months). For the 24-month TTP analysis, 71 patients had evaluable data, 39 of which experienced progression, and 10 of them died.
The analysis was conducted in two phases: (1) classification of patients based on disease progression status (event vs. event-free), and (2) prediction of TTP to assess survival outcomes. Death was also considered as progression in both tasks.

2.2. Lesion Segmentation

All metastatic lesions were segmented using a standardized semi-automatic workflow (MIM Software Inc., Cleveland, OH, USA), which employs a gradient-based algorithm for tumor boundary detection. All lesions in our study were segmented using MIM gradient-based contouring (PETEdge and PETEdge+ tools) [14] which is recognized as a robust and reproducible method for PET tumor delineation [15,16,17,18], and proved to be more reproducible compared to manual segmentation [18]. An experienced investigator carefully verified every detected lesion, removed physiologic uptake, and segmented all lesions using PETEdge (for smaller lesions) and PETEdge+ (for larger, irregular shaped lesions) tools. To ensure consistency, all PET/CT studies were independently re-evaluated by a nuclear medicine physician with expertise in PRRT imaging for metastatic lesion calling. Although formal inter-observer variability metrics were not computed, this dual-review and consensus procedure minimized segmentation inconsistencies and ensured reliable lesion contours throughout the dataset.

2.3. Data Preparation

All images were converted from Bq to SUV. To manage the high-intensity injection site, SUV values were clipped based on the highest pixel value within all segmented lesions per patient. This allowed normalization to the 0–20 range based on global minimum and maximum across images without losing relevant signal, ensuring that lesion intensity information was preserved. Lesions were then ranked based on five criteria: descending SUVmax, SUVmean, and lesion volume, as well as descending and ascending SUVmin (we refer to the ascending order as SUVmin_LH (low to high), while the descending order is simply denoted as SUVmin). Notably, for SUVmin, both descending and ascending orderings were considered to capture variations in lesion uptake characteristics, particularly those with the lowest radiopharmaceutical accumulation.
This sorting process yielded five distinct lesion ranking lists. From each list, the top 1, 3, and 5 lesions were selected for subsequent radiomic feature extraction and analysis, enabling evaluation of the impact of lesion selection strategy on predictive performance. The choice to analyze the top 1, 3, and 5 lesions for each ranking strategy was motivated by the need to balance biological interpretability with statistical feasibility. These subset sizes allowed us to evaluate whether adding a small number of highly ranked lesions provides incremental predictive value while avoiding the noise and overfitting risk associated with including larger numbers of lesions with potentially low clinical relevance. This approach also ensured methodological consistency across ranking strategies and aligns with prior multi-lesion radiomics studies that limit analysis to a small set of the most informative lesions [8].

2.4. Radiomic Features Extraction

One hundred and seven radiomic features, including shape, intensity, and texture from different families of shape (n = 14), First Order (FO, n = 18), Gray Level Co-occurrence Matrix (GLCM, n = 24), Gray Level Run Length Matrix (GLRLM, n = 16), Gray Level Size Zone Matrix (GLSZM, n = 16), Gray Level Dependence Matrix (GLDM, n = 14), and Neighboring Gray Tone Difference Matrix (NGTDM, n = 5) were extracted from each selected lesion using the PyRadiomics library (version 3.1.0) [19]. Table S1 in the Supplementary Material includes all the features. Feature extraction was performed using a bin width of 0.3125. Both images and segmentations were resampled to an isotropic voxel size of (2, 2, 2) mm3, using B-spline interpolation for images and nearest-neighbor interpolation for segmentations. A masked voxel kernel with a radius of 3 was applied to restrict calculations to within the regions of interest (ROIs).

2.5. Lesion Aggregation

To aggregate lesion-level information, two approaches were employed: stacking and statistical aggregation. In the stacking method, feature values from the top 3 and top 5 lesions (including SUVs, volume, and radiomics) were concatenated horizontally to form a combined feature vector. In contrast, statistical aggregation summarized the features across the selected lesions using the key descriptive statistics of minimum, maximum, mean, median, variance, skewness, kurtosis, and coefficient of variation in each feature along the selected lesions and the lesions concatenated horizontally. As expected, aggregation was not applicable for the top 1 lesion dataset, where only features from an individual lesion were available. Additionally, to evaluate the potential benefit of incorporating information from all lesions, a separate dataset was created by statistically aggregating features across all segmented lesions per patient. This allowed assessment of whether comprehensive lesion inclusion enhances model performance.

2.6. Progression Prediction

The dataset was normalized and subjected to a nested cross-validation framework comprising 4 outer loops and 3 inner loops. The Z-score normalization was performed on train dataset and then the mean and standard deviation transformed to test in each outer fold. Class imbalance was addressed using the Synthetic Minority Oversampling Technique (SMOTE) [20] on the train dataset in the inner fold during training. To reduce redundancy, highly correlated features (Spearman correlation > 0.80 lead to the best results) were removed. Multiple correlation thresholds were evaluated during preliminary experiments, and the cutoff of 0.80 for progression prediction provided the best balance between removing redundant features and preserving informative variability across patients. Feature selection was performed iteratively to identify the 15 most predictive features (lead to the best results) using three algorithms: Boruta [21] and Recursive Feature Elimination (RFE) [22], both with a Random Forest (RF) [23] core, and Minimum Redundancy Maximum Relevance (MRMR) [24]. Limiting the final feature subset to 15 features was based on systematic testing across several feature-selection methods, where this number consistently produced the most stable performance across nested cross-validation and bootstrapping while preventing overfitting.
In each fold, the selected features were input into eight ML classifiers: RF, Gaussian Naive Bayes (GNB) [25], Decision Tree (DT) [26], eXtreme Gradient Boosting (XGB) [27], Multi-layer Perceptron (MLP) [28], Logistic Regression (LR) [29], Support Vector Machine (SVM) [30], and K-Nearest Neighbors (KNN) [31]. Hyperparameter tuning was conducted via grid search and 3 inner folds cross-validation. Model performance was evaluated after 1000 bootstrapping on each test fold using several metrics: Area Under the Characteristic Curve (AUCC), accuracy (ACC), precision, recall, F1-score, balanced accuracy (BAC), and specificity. To compare model performances, the Mann–Whitney U test was applied, and p-values were corrected for multiple comparisons using the False Discovery Rate (FDR) method by Benjamini–Hochberg [32].

2.7. Time to Progression

After the normalization, the highly correlated features (Spearman correlation > 0.95 lead to the best results) were removed and the maximum selected features in each fold was 8 (this led to the best results). The correlation threshold for TTP prediction was selected because survival modeling is more sensitive to redundancy and sample-size constraints. Preliminary evaluation of several thresholds showed that 0.95 produced the highest stability and reproducibility. Likewise, capping the selected features at 8 was determined empirically, as this level maximized predictive performance across feature-selection techniques while maintaining an appropriate ratio of predictors to events for survival analysis.
The same nested cross-validation framework was applied for TTP prediction. Feature selection methods included Boruta, Univariate C-Index (UCI) [33], and Mutual Information (MI) [34]. These features were then used to train five survival models: CatBoost (CB) [35], Cox Proportional Hazards (CoxPH) [36], Generalized Linear Model Boosting (GLMB) [37], Generalized Linear Model Net (GLMN) [38], and Random Survival Forests (RSF) [39]. Hyperparameter tuning was conducted via grid search and model performance was assessed using 1000 bootstrapping on the test folds using C-index. Comparisons between models were conducted using the Mann–Whitney U test, with Benjamini–Hochberg FDR correction used to adjust for multiple testing.

3. Results

Table 1 shows the location of selected lesions based on the different sorting methods including SUVmin_LH, SUVmin, SUVmax, SUVmean, and volume. Across all sorting strategies, liver metastases represented the largest proportion of selected lesions, followed by bone and lymph nodes. Methods prioritizing higher uptake or larger lesions (SUVmax, SUVmean, volume) consistently selected a greater number of liver lesions, whereas SUVmin-based ranking yielded a more diverse distribution that captured a higher proportion of bone, peritoneal, and soft-tissue lesions. This pattern persisted when expanding the selection from the top one to the top three and top five lesions, indicating that SUVmin-based ranking tends to highlight biologically heterogeneous lesions across multiple anatomical sites rather than focusing predominantly on liver-related lesions. The overall distribution underscores the variability in lesion selection driven by different ranking strategies and supports the premise that multi-lesion analysis captures a broader spectrum of disease heterogeneity.
Table 1. Anatomical distribution of lesions selected according to each ranking strategy (SUVmin_LH, SUVmin, SUVmax, SUVmean, and volume), illustrating how lesion location varies across the different sorting methods.
The results of this study are presented in overall and individual schemes. The overall appraisal of the results provides a general idea of how the ML models performed being trained on different datasets, aggregation strategies, and sorting methods by considering the evaluation metrics averaged over all combinations of ML algorithms and feature selections. However, the individual assessment attempts to introduce the best performing models in each dataset along with the most important selected features.

3.1. Progression Prediction

3.1.1. Overall Appraisal

Table 2 shows the evaluation metrics averaged on all combinations of ML algorithms and feature selections for the top three sorting and aggregation strategies used to classify disease progression status (event vs. event-free) in each dataset. Comparing the average performance of the models, it is evident that sorting lesions based on the SUVmin and volume is a better lesion selection strategy than other sorting approaches in all the datasets. For instance, using radiomic features extracted from the top one lesion sorted based on SUVmin-enabled ML models show the highest overall performance.
Table 2. The top 3 sorting and aggregation approaches in each dataset with the highest average performance metrics in progression prediction.
Although there are performance differences between lesion selection strategies, missing a progressing patient has direct implications for treatment decisions and follow-up planning. Models trained on lesions ranked by SUVmin consistently achieved higher recall, indicating better sensitivity to early biological changes associated with progression. The observed gap between recall and specificity reflects the limited number of progression-free patients, which creates a substantial class imbalance. To mitigate this, SMOTE was applied during training; however, some instability is expected when the minority class is small. Distribution of the average recall and specificity values of the models are shown in Figure 2a,b.
Figure 2. Distribution of recall (a) and specificity (b) values of all the models in each dataset, aggregation, and sorting strategies. The horizontal line within each box indicates the median, while the box edges represent the interquartile range (25th to 75th percentiles). Whiskers extend to data points within 1.5× the interquartile range, and individual dots represent outliers. SUV: Standardized Uptake Value, LH: Low-to-High.
The Mann–Whitney U test results presented in Figure 3 show if the difference in datasets and sorting in Table 2 are statistically significant in terms of recall and specificity. Generally, the single top lesion sorted based on SUVmin gives superior recall and specificity. A more inclusive picture of the comparison between the average performance of all the models in each lesion selection and sorting approaches is provided in Figure S1 in the Supplementary Material.
Figure 3. Mann–Whitney U test results comparing the average machine performance trained on all the datasets and sorting approaches used in progression event prediction presented in Table 2 regarding (a) recall and (b) specificity. It compares row (Model A) vs. column (Model B): purple = A significantly better, light blue = A significantly worse, red = No difference. SUV: Standardized Uptake Value, LH: Low-to-High.

3.1.2. Individual Appraisal

Table 3 shows evaluation metrics of the top three ML models with the highest performance used in classifying patients based on disease progression status (event vs. event-free) in each dataset along with the most important selected features. The universal-best model turned out to be RFE_LR trained on the radiomic features of a single lesion sorted based on SUVmin.
Table 3. The top 3 ML models and features in each dataset with the highest performance metrics in progression prediction.
SUVmin-sorted lesions could also result in higher predictive power in the top five dataset. This is while sorting based on lesion volume could give more predictive models in the top three dataset. According to Table 3, lesion selection proved to be effective in enhancing machine performance. Also, stacking radiomic features appeared to be a better way of feature aggregation in both top three and top five datasets.
The Mann–Whitney U test results in Figure 4 prove the superiority of the Top_1_SUVmin_RFE_LR combination compared to other combinations presented in Table 3 regarding recall and specificity.
Figure 4. Mann–Whitney U test results comparing the models with the highest performance metrics in progression prediction presented in Table 3 regarding (a) recall and (b) specificity. It compares row (Model A) vs. column (Model B): purple = A significantly better, light blue = A significantly worse, red = No difference. SUV: Standardized Uptake Value, RFE: Recursive Feature Elimination, LR: Logistic Regression, MRMR: Minimum Redundancy Maximum Relevance, GNB: Gaussian Naive Bayes, KNN: K-Nearest Neighbors.
Turning to the most important features, it is immediately obvious that at least one feature from GLSZM and shape families were among the highly predictive features almost every time regardless of the dataset or feature selection method. Moreover, in the stacked-aggerated top three and top five datasets, a greater number of features were selected from the topper lesions. This indicates that lesion importance is directly related to its rank.

3.2. Time to Progression

3.2.1. Overall Appraisal

Table 4 shows C-indices averaged on all combinations of ML algorithms and feature selections for the top three sorting and aggregation strategies used for TTP analysis. The average performance of the models shows that statistical aggregation of features extracted from all lesions leads to the highest overall performance. Sorting the top five lesions based on the SUVmean can also yield the same results. This highlights the fact that including all lesions does not necessarily improve the model performance in SSTR PET.
Table 4. The top 3 sorting and aggregation approaches in each dataset with the highest average performance in TTP analysis.
The Mann–Whitney U test results presented in Figure 5 show if the difference in datasets and sorting in Table 4 are statistically significant. Due to the statistically insignificant difference between overall machine performance between when it is trained on all features and when only the top five SUVmean-sorted lesions are selected, the top five dataset can be used instead. A more inclusive picture of the comparison between the average performance of all the models on each lesion selection and sorting approaches is provided in Figure S2 in the Supplementary Material.
Figure 5. Mann–Whitney U test results comparing the top 3 sorting and aggregation approaches in each dataset with the highest average performance in TTP analysis regarding C-index. It compares row (Model A) vs. column (Model B): purple = A significantly better, light blue = A significantly worse, red = No difference. SUV: Standardized Uptake Value, LH: Low-to-High.

3.2.2. Individual Appraisal

Table 5 shows the C-indices of the top three ML models with the highest performance in TTP analysis along with the most important selected features. The universal-best model appeared to be UCI_RSF trained on the statistically aggregated features of the top five lesions sorted based on SUVmean.
Table 5. The top 3 ML models and features in each dataset with the highest performance in TTP analysis.
This highlights the fact that higher performance can also be achieved in TTP studies by including a smaller number of lesions. SUVmin and SUVmax can also lead to acceptable results depending on the number of lesions selected. According to Table 5, enhancing machine performance is possible by lesion selection. In contrast to progression prediction analysis, models performed better in TTP when fed by statistically aggregated features. Figure 6 shows Mann–Whitney U test results comparing the models presented in Table 5.
Figure 6. Mann–Whitney U test results comparing the top 3 ML models and features in each dataset with the highest performance in TTP analysis regarding C-indices. It compares row (Model A) vs. column (Model B): purple = A significantly better, light blue = A significantly worse, red = No difference. SUV: Standardized Uptake Value, LH: Low-to-High. GLMN: Generalized Linear Model Net, MI: Mutual Information, CoxPH: Cox Proportional Hazards, UCI: Univariate C-Index, GLMB: Generalized Linear Model Boosting, RSF: Random Survival Forests, CB: CatBoost.
The important selected features in Table 5 highlight the importance of GLSZM and GLDM families. Measures of variability such as kurtosis, skewness, and standard deviation turned out to be better ways of aggregation as most of the important features were such aggregations of original radiomic features among the selected lesions.

4. Discussion

This study presented a comprehensive evaluation of lesion selection and feature aggregation strategies for radiomics-based prediction of progression and TTP in patients with NETs undergoing [177Lu]Lu-DOTA-TATE PRRT. While prior work in PET radiomics has largely focused on a single lesion per patient, often selected based on size or metabolic activity [3], our results demonstrate that the method of lesion selection and aggregation significantly impacts model performance, which is also in a good agreement with other studies [3,4,5,8]. This finding emphasizes the need for carefully designed frameworks to better capture the complexity of tumor heterogeneity in NETs.
Traditionally, radiomics studies have relied on features extracted from a single representative lesion, typically the largest or the one with the highest SUV [3]. However, this approach may miss critical biological heterogeneity present across the full disease burden. Our results show that lesion selection based on SUVmin consistently yielded better performance in progression prediction than SUVmax or volume. This suggests that lesions with low uptake which potentially represent dedifferentiated or less receptor-expressing clones may carry significant prognostic value. Such lesions might be more resistant to PRRT, contributing disproportionately to disease progression despite their smaller size or lower visual prominence [40]. Including these SUVmin lesions enhances model sensitivity to biologically aggressive disease. Moreover, methods based on higher uptake or larger size mainly picked liver lesions, while SUVmin (Table 1) captured more bone, peritoneal, and soft-tissue sites. This trend held from the top one through top five selections, showing that SUVmin favors more biologically heterogeneous lesions rather than focusing predominantly on liver-related lesions.
Furthermore, the results of this study indicate that using multi-lesion radiomic features, rather than focusing on a single lesion, can yield improved outcomes in TTP prediction, as demonstrated in Table 4 and Table 5. Another finding of our study was the association between SUVmean and TTP. Previous studies have shown that higher SUVmean values predict better response to PRRT in the NETs [41,42], as well as higher lesion absorbed dose [43]. In our study, again, according to Table 4 and Table 5, both in overall and individual analysis of models, it is obvious that SUVmean-based sorting led to the superior results.
This supports recent arguments in the literature that highlight the limitations of “largest-lesion” heuristics and advocate for data-driven lesion prioritization strategies that reflect clinically relevant biology [4]. Our findings reinforce the need to revise radiomics workflows to move beyond a “one-lesion-fits-all” approach and adopt more nuanced lesion ranking criteria.
Another key contribution of this work is the comparison of stacking versus statistical aggregation techniques for multi-lesion data integration. For progression classification, stacking features across the top-ranked lesions yielded superior results in some scenarios, particularly in the top three and top five lesion datasets. This approach retains lesion-specific signatures that might otherwise be lost in statistical summaries. However, for TTP prediction, statistical aggregation consistently outperformed stacking. Models trained on statistical descriptors (e.g., mean, skewness, kurtosis) of radiomic features across lesions achieved higher C-indices, indicating better survival prediction.
These findings suggest that task-specific aggregation strategies are necessary: detailed lesion-level granularity and shape may benefit classification tasks with discrete outcomes, while statistical summaries may better capture global tumor dynamics relevant to longitudinal endpoints like progression time. Notably, aggregating features from only the top five lesions, ranked by SUVmean, produced comparable to using all lesions, which is a result with important implications for clinical and computational feasibility.
Consistently across both tasks, features from GLSZM category were among the most predictive. GLDM family was also important in TTP analysis. These texture matrices capture patterns of intra-tumoral heterogeneity and spatial distribution of uptake intensities. Specifically, GLSZM reflects the size and distribution of zones with similar uptake intensity, while GLDM quantifies the degree of gray-level differences between neighboring voxels [44]. These features may indicate how uniform or heterogeneous the tumor is in terms of tracer uptake, which could be associated with biological behavior such as aggressiveness, vascularity, cellular density, and somatostatin receptor density in the case of SSRT PET. The selection of radiomic features such as GLSZM_SZNUN and GLDM_SDE, which reflect non-uniformity and dependency strength, respectively, highlights the role of spatial disorder in predicting PRRT outcomes. This complements biological insights from prior research that intra-patient and intra-tumoral heterogeneity, as reflected in PET imaging, correlates with more aggressive disease and poor therapeutic response and prognosis in various cancers [2,4]. While texture features such as GLSZM and GLDM showed strong predictive value, their biological interpretation should be viewed cautiously. These metrics likely capture aspects of intra-tumoral heterogeneity, but their exact correspondence to histopathological or molecular characteristics remains an area of ongoing research, and definitive biological links cannot be assumed.
Moreover, several highly ranked features involved higher-order statistics derived by applying statistical measures like kurtosis, skewness, and standard deviation to the radiomic values across lesions. These aggregation metrics reflect the variability of radiomic expression within and across lesions that may encode biologically meaningful patterns related to receptor expression, dedifferentiation, or treatment resistance. For example, high skewness might suggest that some lesions behave very differently from others, high kurtosis could indicate outlier lesions with extreme characteristics, and standard deviation is able to capture the overall heterogeneity. From a clinical standpoint, such variability may reflect heterogeneous biology, variable treatment response, or the presence of aggressive subclones, thus offering interpretable indices that can support decision-making in managing NETs.
Several prior studies have demonstrated that incorporating radiomic features from regions beyond the primary lesion can improve predictive performance. For example, Wilk et al. [8] showed that including all lesions significantly enhanced survival model performance in lung cancer patients. Our findings are consistent with these results and further extend them by introducing and systematically evaluating lesion ranking and aggregation methods tailored for PRRT.
In contrast to studies incorporating peritumoral [9,10] or non-tumor/healthy-organ radiomics [11,45], our study focused on lesion-based modeling. However, these complementary approaches underscore the growing recognition that prognostically relevant information may exist outside the tumor itself.
The study provides practical recommendations for radiomics implementation in the context of PRRT. First, our findings suggest that a small, systematically selected subset of lesions can offer similar or even better predictive power than all-lesion approaches, enabling more efficient workflows. This is particularly valuable in NETs, where metastatic burden is often extensive, and manual lesion segmentation is labor-intensive. Second, lesion selection based on SUVmin (progression prediction task) or SUVmean (TTP prediction task), rather than SUVmax or volume alone, appears to better stratify risk and capture clinically relevant disease biology. Finally, statistical aggregation of features, particularly for TTP modeling, may offer a scalable and interpretable alternative to stacking, reducing model complexity and potential overfitting, especially in modestly sized datasets.
These insights are directly aligned with the broader aim of radiomics: to derive quantitative biomarkers from imaging that reflect tumor biology, support treatment decisions, and improve patient stratification in personalized oncology [4].
To evaluate the transparency and reproducibility of our study, we applied the METRICS checklist [46] and achieved a score of 68.2%, placing it in the “Good” quality category (Figures S3 and S4). This indicates strong performance in model validation, feature robustness, and clinical relevance, while also identifying areas for improvement in reproducibility and standardization.
This study has several limitations. Although nested cross-validation with bootstrapping was employed to mitigate bias, external validation on independent cohorts is essential. The number of patients who remained progression-free was relatively small (n = 13), potentially affecting model calibration and sensitivity to class imbalance. Furthermore, while this study focused exclusively on imaging features, integrating clinical, molecular, or histopathologic data could enhance predictive accuracy. In addition, incorporating Explainable AI (XAI) frameworks could aid in translating complex radiomic models into clinically actionable tools.
Future studies may explore automated lesion detection, reduce manual segmentation efforts, and enhance reproducibility. Moreover, longitudinal imaging and temporal radiomics could further refine TTP predictions by capturing dynamic treatment response. Another important consideration is model generalizability. Radiomics can be sensitive to variations in scanners, reconstruction protocols, segmentation workflows, and institutional acquisition differences. Although our nested cross-validation reduces overfitting within this dataset, confirming robustness would require harmonized multicenter cohorts, standardized imaging, and reconstruction protocols, and external validation on datasets acquired under different clinical conditions. Such steps are essential before these models can be translated into broadly applicable clinical tools [47,48]. Future research could also explore combining lesion extension and feature aggregation to evaluate its performance on progression and TTP prediction. Finally, our lesion selection and aggregation methodology may be extended to other radiopharmaceutical therapies and tumor types where heterogeneous tumor burden is prevalent.

5. Conclusions

This study demonstrated that lesion selection and feature aggregation strategies significantly influence the predictive performance of radiomics-based models for disease progression and TTP in NET patients undergoing PRRT. Selecting a single lesion based on SUVmin optimizes classification performance, while using the top five lesions ranked by SUVmean with statistical aggregation enhances survival prediction. Features from GLSZM and GLDM families were consistently important, underscoring the value of spatial heterogeneity in risk stratification. These findings support the use of targeted, data-driven lesion selection, and task-specific aggregation approaches to improve efficiency, accuracy, and clinical applicability of radiomics in personalized therapy.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/cancers17233887/s1, Table S1. Radiomic features adopted.; Figure S1. Mann-Whitney U test results comparing the average machine performance trained on all the datasets and sorting approaches used in progression event prediction regarding (a) recall and (b) specificity. It compares row (Model A) vs. column (Model B): purple = A significantly better, light blue = A significantly worse, red = No difference. SUV: Standardized Uptake Value, LH: Low-to-High.; Figure S2. Mann-Whitney U test results comparing the average machine performance trained on all the datasets and sorting approaches used in time to progression (TTP) prediction regarding C-index. It compares row (Model A) vs. column (Model B): purple = A significantly better, light blue = A significantly worse, red = No difference. SUV: Standardized Uptake Value, LH: Low-to-High.; Figure S3. First page of METRICS checklist.; Figure S4. Second page of METRICS checklist.

Author Contributions

Conceptualization, M.S. and A.S.; methodology, M.S. and G.H.; formal analysis, M.S. and O.G.; resources: A.S., Y.M., A.D., C.G.Z., S.J., and M.K.; data curation, M.S. and A.R.S.; writing—original draft preparation, M.S., O.G., and F.Y.; writing—review and editing, A.R., H.Z., F.Y., and A.S.; supervision, A.R., H.Z., F.Y., and A.S.; project administration, M.S.; funding acquisition, A.R. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Canadian Institutes of Health Research (CIHR) Project Grant PJT-173231 and Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Horizons Grant DH-2025-00119.

Institutional Review Board Statement

All procedures performed in this study were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its last amendments or comparable ethical standards. This study was approved by the University of Iowa Institutional Review Board (IRB#200903778). The study was granted a full waiver of HIPAA Authorization (informed consent) based on the DHHS Federal wide Assurance #FWA00003007 for this project involving retrospective secondary analysis of existing data.

Data Availability Statement

Data is available upon reasonable requests.

Conflicts of Interest

Arman Rahmim is the co-founder of Ascinta Technologies Inc. and Marc Kruzer is employed by MIM Software Inc.

Abbreviations

The following abbreviations are used in this manuscript:
PRRTPeptide Receptor Radionuclide Therapy
NETsNeuroendocrine Tumors
FDAFood and Drug Administration
[177Lu]Lu-DOTA-TATELutetium-177 DOTA-TATE
SSTRSomatostatin Receptor
PETPositron Emission Tomography
CTComputed Tomography
ROI/ROIsRegion(s) of Interest
RadShapRadiomics Shapley (model explanation tool)
MLMachine Learning
TTPTime to Progression
BqBecquerel
SUV/SUVsStandardized Uptake Value(s)
SUVminMinimum Standardized Uptake Value
SUVmeanMean Standardized Uptake Value
SUVmaxMaximum Standardized Uptake Value
GLCMGray Level Co-occurrence Matrix
GLRLMGray Level Run Length Matrix
GLSZMGray Level Size Zone Matrix
GLDMGray Level Dependence Matrix
NGTDMNeighboring Gray Tone Difference Matrix
FOFirst Order
SMOTESynthetic Minority Oversampling Technique
RFERecursive Feature Elimination
MRMRMinimum Redundancy Maximum Relevance
RFRandom Forest
GNBGaussian Naive Bayes
DTDecision Tree
XGBeXtreme Gradient Boosting
MLPMulti-Layer Perceptron
LRLogistic Regression
SVMSupport Vector Machine
KNNK-Nearest Neighbors
AUC/AUCCArea Under the Curve/Area Under the Characteristic Curve
ACCAccuracy
BACBalanced Accuracy
FDRFalse Discovery Rate
UCIUnivariate C-Index
MIMutual Information
CBCatBoost
CoxPHCox Proportional Hazards
GLMBGeneralized Linear Model Boosting
GLMNGeneralized Linear Model Net
RSFRandom Survival Forest
AIArtificial Intelligence
XAIExplainable Artificial Intelligence

References

  1. Van Essen, M.; Krenning, E.P.; Kam, B.L.R.; De Jong, M.; Valkema, R.; Kwekkeboom, D.J. Peptide-Receptor Radionuclide Therapy for Endocrine Tumors. Nat. Rev. Endocrinol. 2009, 5, 382–393. [Google Scholar] [CrossRef]
  2. Hofland, J.; Brabander, T.; Verburg, F.A.; Feelders, R.A.; De Herder, W.W. Peptide Receptor Radionuclide Therapy. J. Clin. Endocrinol. Metab. 2022, 107, 3199–3208. [Google Scholar] [CrossRef]
  3. Gadens Zamboni, C.; Dundar, A.; Jain, S.; Kruzer, M.; Loeffler, B.T.; Graves, S.A.; Pollard, J.H.; Mott, S.L.; Dillon, J.S.; Graham, M.M.; et al. Inter- and Intra-Tumoral Heterogeneity on [68Ga]Ga-DOTA-TATE/[68Ga]Ga-DOTA-TOC PET/CT Predicts Response to [177Lu]Lu-DOTA-TATE PRRT in Neuroendocrine Tumor Patients. EJNMMI Rep. 2024, 8, 39. [Google Scholar] [CrossRef]
  4. Captier, N.; Orlhac, F.; Hovhannisyan-Baghdasarian, N.; Luporsi, M.; Girard, N.; Buvat, I. RadShap: An Explanation Tool for Highlighting the Contributions of Multiple Regions of Interest to the Prediction of Radiomic Models. J. Nucl. Med. 2024, 65, 1307–1312. [Google Scholar] [CrossRef]
  5. Sabouri, M.; Yousefirizi, F.; Gharibi, O.; Dundar, A.; Zamboni, C.; Jain, S.; Menda, Y.; Rahmim, A.; Shariftabrizi, A. PET Intra-Patient Inter-Lesion Radiomics Feature Aggregation to Enhance PRRT Progression Predictions in Patients with Neuroendocrine Tumors. J. Nucl. Med. 2025, 66, 252122. [Google Scholar]
  6. Orlhac, F.; Nioche, C.; Klyuzhin, I.; Rahmim, A.; Buvat, I. Radiomics in PET Imaging. PET Clin. 2021, 16, 597–612. [Google Scholar] [CrossRef]
  7. Geady, C.; Patel, H.; Peoples, J.; Simpson, A.; Haibe-Kains, B. Radiomic-Based Approaches in the Multi-Metastatic Setting: A Quantitative Review. BMC Cancer 2025, 25, 538. [Google Scholar] [CrossRef]
  8. Wilk, A.M.; Swierniak, A.; d’Amico, A.; Suwiński, R.; Fujarewicz, K.; Borys, D. Towards the Use of Multiple ROIs for Radiomics-Based Survival Modelling: Finding a Strategy of Aggregating Lesions. Comput. Methods Programs Biomed. 2025, 269, 108840. [Google Scholar] [CrossRef]
  9. Yousefirizi, F.; Gowdy, C.; Klyuzhin, I.S.; Sabouri, M.; Tonseth, P.; Hayden, A.R.; Wilson, D.; Sehn, L.H.; Scott, D.W.; Steidl, C.; et al. Evaluating Outcome Prediction via Baseline, End-of-Treatment, and Delta Radiomics on PET-CT Images of Primary Mediastinal Large B-Cell Lymphoma. Cancers 2024, 16, 1090. [Google Scholar] [CrossRef]
  10. Salimi, Y.; Hajianfar, G.; Mansouri, Z.; Amini, M.; Sanaat, A.; Zaidi, H. Enhancing Overall Survival Prediction in Head and Neck Cancer with CT Peritumoral Radiomics and Machine Learning. In Proceedings of the 2024 IEEE Nuclear Science Symposium (NSS), Medical Imaging Conference (MIC) and Room Temperature Semiconductor Detector Conference (RTSD), Tampa, FL, USA, 26 October 2024; pp. 1–2. [Google Scholar]
  11. Salimi, Y.; Hajianfar, G.; Mansouri, Z.; Sanaat, A.; Amini, M.; Shiri, I.; Zaidi, H. Organomics: A Concept Reflecting the Importance of PET/CT Healthy Organ Radiomics in Non–Small Cell Lung Cancer Prognosis Prediction Using Machine Learning. Clin. Nucl. Med. 2024, 49, 899–908. [Google Scholar] [CrossRef]
  12. Rozemberczki, B.; Watson, L.; Bayer, P.; Yang, H.-T.; Kiss, O.; Nilsson, S.; Sarkar, R. The Shapley Value in Machine Learning. In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, Vienna, Austria, 23–29 July 2022. [Google Scholar]
  13. Poeppel, T.D.; Binse, I.; Petersenn, S.; Lahner, H.; Schott, M.; Antoch, G.; Brandau, W.; Bockisch, A.; Boy, C. Differential Uptake of 68Ga-DOTATOC and 68Ga-DOTATATE in PET/CT of Gastroenteropancreatic Neuroendocrine Tumors. In Theranostics, Gallium-68, and Other Radionuclides; Baum, R.P., Rösch, F., Eds.; Recent Results in Cancer Research; Springer: Berlin/Heidelberg, Germany, 2013; Volume 194, pp. 353–371. ISBN 978-3-642-27993-5. [Google Scholar]
  14. Geets, X.; Lee, J.A.; Bol, A.; Lonneux, M.; Grégoire, V. A Gradient-Based Method for Segmenting FDG-PET Images: Methodology and Validation. Eur. J. Nucl. Med. Mol. Imaging 2007, 34, 1427–1438. [Google Scholar] [CrossRef]
  15. Werner-Wasik, M.; Nelson, A.D.; Choi, W.; Arai, Y.; Faulhaber, P.F.; Kang, P.; Almeida, F.D.; Xiao, Y.; Ohri, N.; Brockway, K.D.; et al. What Is the Best Way to Contour Lung Tumors on PET Scans? Multiobserver Validation of a Gradient-Based Method Using a NSCLC Digital PET Phantom. Int. J. Radiat. Oncol. Biol. Phys. 2012, 82, 1164–1171. [Google Scholar] [CrossRef]
  16. Takeda, K.; Takanami, K.; Shirata, Y.; Yamamoto, T.; Takahashi, N.; Ito, K.; Takase, K.; Jingu, K. Clinical Utility of Texture Analysis of 18F-FDG PET/CT in Patients with Stage I Lung Cancer Treated with Stereotactic Body Radiotherapy. J. Radiat. Res. 2017, 58, 862–869. [Google Scholar] [CrossRef]
  17. Liberini, V.; De Santi, B.; Rampado, O.; Gallio, E.; Dionisi, B.; Ceci, F.; Polverari, G.; Thuillier, P.; Molinari, F.; Deandreis, D. Impact of Segmentation and Discretization on Radiomic Features in 68Ga-DOTA-TOC PET/CT Images of Neuroendocrine Tumor. EJNMMI Phys. 2021, 8, 21. [Google Scholar] [CrossRef]
  18. Ha, S.; Choi, H.; Paeng, J.C.; Cheon, G.J. Radiomics in Oncological PET/CT: A Methodological Overview. Nucl. Med. Mol. Imaging 2019, 53, 14–29. [Google Scholar] [CrossRef]
  19. Van Griethuysen, J.J.M.; Fedorov, A.; Parmar, C.; Hosny, A.; Aucoin, N.; Narayan, V.; Beets-Tan, R.G.H.; Fillion-Robin, J.-C.; Pieper, S.; Aerts, H.J.W.L. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res. 2017, 77, e104–e107. [Google Scholar] [CrossRef]
  20. Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic Minority Over-Sampling Technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
  21. Kursa, M.B.; Rudnicki, W.R. Feature Selection with the Boruta Package. J. Stat. Softw. 2010, 36, 1–13. [Google Scholar] [CrossRef]
  22. Guyon, I.; Weston, J.; Barnhill, S.; Vapnik, V. Gene Selection for Cancer Classification Using Support Vector Machines. Mach. Learn. 2002, 46, 389–422. [Google Scholar] [CrossRef]
  23. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  24. Radovic, M.; Ghalwash, M.; Filipovic, N.; Obradovic, Z. Minimum Redundancy Maximum Relevance Feature Selection Approach for Temporal Gene Expression Data. BMC Bioinform. 2017, 18, 9. [Google Scholar] [CrossRef]
  25. Hand, D.J.; Yu, K. Idiot’s Bayes: Not So Stupid after All? Int. Stat. Rev. 2001, 69, 385. [Google Scholar] [CrossRef]
  26. Breiman, L.; Friedman, J.H.; Olshen, R.A.; Stone, C.J. Classification and Regression Trees, 1st ed.; Routledge: Abingdon, UK, 2017; ISBN 978-1-315-13947-0. [Google Scholar]
  27. Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13 August 2016; pp. 785–794. [Google Scholar]
  28. Rosenblatt, F. The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain. Psychol. Rev. 1958, 65, 386–408. [Google Scholar] [CrossRef]
  29. Cox, D.R. The Regression Analysis of Binary Sequences. J. R. Stat. Soc. Ser. B Stat. Methodol. 1958, 20, 215–232. [Google Scholar] [CrossRef]
  30. Cortes, C.; Vapnik, V. Support-Vector Networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
  31. Cover, T.; Hart, P. Nearest Neighbor Pattern Classification. IEEE Trans. Inf. Theory 1967, 13, 21–27. [Google Scholar] [CrossRef]
  32. Benjamini, Y.; Hochberg, Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J. R. Stat. Soc. Ser. B Stat. Methodol. 1995, 57, 289–300. [Google Scholar] [CrossRef]
  33. Sierra, B.B.; McLean, C.; Hall, P.S.; Vallejos, C.A. The C-Index Multiverse. arXiv 2025, arXiv:2508.14821. [Google Scholar] [CrossRef]
  34. Peng, H.; Long, F.; Ding, C. Feature Selection Based on Mutual Information Criteria of Max-Dependency, Max-Relevance, and Min-Redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 2005, 27, 1226–1238. [Google Scholar] [CrossRef]
  35. Prokhorenkova, L.; Gusev, G.; Vorobev, A.; Dorogush, A.V.; Gulin, A. CatBoost: Unbiased Boosting with Categorical Features. arXiv 2017, arXiv:1706.09516. [Google Scholar]
  36. Cox, D.R. Regression Models and Life-Tables. J. R. Stat. Soc. Ser. B Stat. Methodol. 1972, 34, 187–202. [Google Scholar] [CrossRef]
  37. Bühlmann, P.; Yu, B. Boosting with the L2 Loss: Regression and Classification. J. Am. Stat. Assoc. 2003, 98, 324–339. [Google Scholar] [CrossRef]
  38. Friedman, J.; Hastie, T.; Tibshirani, R. Regularization Paths for Generalized Linear Models via Coordinate Descent. J. Stat. Softw. 2010, 33, 1–22. [Google Scholar] [CrossRef] [PubMed]
  39. Ishwaran, H.; Kogalur, U.B.; Blackstone, E.H.; Lauer, M.S. Random Survival Forests. Ann. Appl. Stat. 2008, 2, 841–860. [Google Scholar] [CrossRef]
  40. Lee, H.; Kipnis, S.T.; Niman, R.; O’Brien, S.R.; Eads, J.R.; Katona, B.W.; Pryma, D.A. Prediction of 177Lu-DOTATATE Therapy Outcomes in Neuroendocrine Tumor Patients Using Semi-Automatic Tumor Delineation on 68Ga-DOTATATE PET/CT. Cancers 2023, 16, 200. [Google Scholar] [CrossRef] [PubMed]
  41. Hotta, M.; Sonni, I.; Thin, P.; Nguyen, K.; Gardner, L.; Ciuca, L.; Hayrapetian, A.; Lewis, M.; Lubin, D.; Allen-Auerbach, M. Visual and Whole-Body Quantitative Analyses of 68 Ga-DOTATATE PET/CT for Prognosis of Outcome after PRRT with 177Lu-DOTATATE. Ann. Nucl. Med. 2024, 38, 296–304. [Google Scholar] [CrossRef]
  42. Ladwa, R.; Pattison, D.A.; Smith, J.; Goodman, S.; Burge, M.E.; Rose, S.; Dowson, N.; Wyld, D. Pretherapeutic68 Ga-DOTATATE PET SUV Predictors of Survival of Radionuclide Therapy for Metastatic Neuroendocrine Tumors. J. Clin. Oncol. 2020, 38, 4609. [Google Scholar] [CrossRef]
  43. Akhavanallaf, A.; Peterson, A.B.; Fitzpatrick, K.; Roseland, M.; Wong, K.K.; El-Naqa, I.; Zaidi, H.; Dewaraja, Y.K. The Predictive Value of Pretherapy [68Ga]Ga-DOTA-TATE PET and Biomarkers in [177Lu]Lu-PRRT Tumor Dosimetry. Eur. J. Nucl. Med. Mol. Imaging 2023, 50, 2984–2996. [Google Scholar] [CrossRef]
  44. Abbasian Ardakani, A.; Bureau, N.J.; Ciaccio, E.J.; Acharya, U.R. Interpretation of Radiomics Features–A Pictorial Review. Comput. Methods Programs Biomed. 2022, 215, 106609. [Google Scholar] [CrossRef]
  45. Yusufaly, T.I.; Zou, J.; Nelson, T.J.; Williamson, C.W.; Simon, A.; Singhal, M.; Liu, H.; Wong, H.; Saenz, C.C.; Mayadev, J.; et al. Improved Prognosis of Treatment Failure in Cervical Cancer with Nontumor PET/CT Radiomics. J. Nucl. Med. 2022, 63, 1087–1093. [Google Scholar] [CrossRef]
  46. Kocak, B.; Akinci D’Antonoli, T.; Mercaldo, N.; Alberich-Bayarri, A.; Baessler, B.; Ambrosini, I.; Andreychenko, A.E.; Bakas, S.; Beets-Tan, R.G.H.; Bressem, K.; et al. METhodological RadiomICs Score (METRICS): A Quality Scoring Tool for Radiomics Research Endorsed by EuSoMII. Insights Imaging 2024, 15, 8. [Google Scholar] [CrossRef]
  47. Gharibi, O.; Hajianfar, G.; Sabouri, M.; Mohebi, M.; Bagheri, S.; Arian, F.; Yasemi, M.J.; Bitarafan Rajabi, A.; Rahmim, A.; Zaidi, H.; et al. Myocardial Perfusion SPECT Radiomic Features Reproducibility Assessment: Impact of Image Reconstruction and Harmonization. Med. Phys. 2025, 52, 965–977. [Google Scholar] [CrossRef]
  48. Bagheri, S.; Hajianfar, G.; Sabouri, M.; Gharibi, O.; Yazdani, B.; Aghaee, A.; Nickfarjam, A.M.; Yazdani, A.; Aliasgharzadeh, A.; Moradi, H.; et al. Impact of Field-of-View Zooming and Segmentation Batches on Radiomics Features Reproducibility and Machine Learning Performance in Thyroid Scintigraphy. Clin. Nucl. Med. 2025, 50, 683–694. [Google Scholar] [CrossRef] [PubMed]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.