Next Article in Journal
Dose–Response Relationship Between BRAF V600E Abundance and Cervical Lymph Node Metastasis in Papillary Thyroid Cancer
Previous Article in Journal
Desmoplastic Fibromas of the Bone: A Systematic Review of Clinical Presentation and Surgical Treatment
Previous Article in Special Issue
Role of MRI-Based Radiomics in Sinonasal Cancer Management: A Scoping Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Predicting Remaining Survival of Glioblastoma Patients with Radiomics Analysis Based on 18F-DOPA PET Images

by
Jing Qian
1,*,
Deanna Hasenauer
2,
William G. Breen
1,
Paul D. Brown
1,
Christopher H. Hunt
3,
Mark S. Jacobson
3,
Derek R. Johnson
3,
Timothy J. Kaufmann
3,
Bradley J. Kemp
3,
Sani H. Kizilbash
4,
Val J. Lowe
3,
Michael W. Ruff
5,
Jann N. Sarkaria
1,
Joon H. Uhm
6,
Mark J. Zakhary
7,
Maasa H. Seaberg
8,
Hok Seum Wan Chan Tseung
1,
Elizabeth S. Yan
1,
Yan Zhang
1,
Nadia N. Laack
1 and
Debra H. Brinkmann
1,*
add Show full author list remove Hide full author list
1
Department of Radiation Oncology, Mayo Clinic, Rochester, MN 55905, USA
2
Department of Radiation Oncology, Mayo Clinic, Jacksonville, FL 32224, USA
3
Department of Radiology, Mayo Clinic, Rochester, MN 55905, USA
4
Department of Medical Oncology, Mayo Clinic, Rochester, MN 55905, USA
5
Department of Neurology, Mayo Clinic, Rochester, MN 55905, USA
6
Department of Neurosurgery, Mayo Clinic, Rochester, MN 55905, USA
7
University of Florida Health Proton Therapy Institute, Jacksonville, FL 32206, USA
8
Department of Radiation Oncology, University of California San Francisco Medical Center, San Francisco, CA 94143, USA
*
Authors to whom correspondence should be addressed.
Cancers 2025, 17(21), 3560; https://doi.org/10.3390/cancers17213560
Submission received: 2 September 2025 / Revised: 24 October 2025 / Accepted: 29 October 2025 / Published: 3 November 2025
(This article belongs to the Special Issue Radiomics and Imaging in Cancer Analysis)

Simple Summary

Post-treatment surveillance of glioblastoma is challenging due to the complications of discriminating against treatment effects using conventional imaging. This study proposes to address the challenge by linking radiomics features from 18F-DOPA PET images with remaining survival in glioblastoma patients. Employing customized feature selection algorithms and manifold learning, quantitative features were extracted for machine learning, resulting in heightened sensitivity for detecting changes in survival risk. The proposed map for remaining survival has the potential to enhance personalized medicine. Overall, this research underscores the potential effectiveness of 18F-DOPA PET and machine learning models in forecasting survival outcomes and monitoring tumor progression among glioblastoma patients undergoing dose-escalated radiation therapy.

Abstract

Background: Post-treatment prognosis and monitoring are critical for determining the timing of salvage treatment in glioblastoma patients but has been challenging due to difficulties differentiating progression from treatment effects in conventional images. This exploratory study aimed to establish the correlation of radiomics image features from time series of amino acid tracer 18F-DOPA PET images, with outcomes, using machine learning and dimension reduction analysis. Methods: 18F-DOPA PET images were collected for a patient cohort with wild-type IDH and unmethylated MGMT who underwent dose-escalated radiation therapy. Quantitative features were derived from the high uptake region (T/N > 2.0) in pre- and post-radiation therapy follow-up 18F-DOPA PET images. A customized workflow was utilized for pre-selecting predictive features, followed by manifold learning. Machine learning algorithms were employed to establish associations between imaging features and remaining survival (RS), defined as the time between a follow-up scan and date of death. Results: The ML models exhibited 81–83% ROC_AUC in predicting RS evaluated on an independent test dataset. A RS map is proposed for monitoring tumor alterations through serial 18F-DOPA PET scans, demonstrating superior sensitivity and better correlation with survival compared to the RANO criteria. Conclusions: Our study demonstrates that ML models utilizing FU 18F-DOPA PET images have the potential to effectively predict future survival outcomes in patients with glioblastoma treated with dose-escalated radiation therapy. The capability to assess changes in tumor over time through imaging can potentially assist in patient stratification and the selection of salvage treatments, while also aiding in distinguishing treatment effects from genuine tumor progression.

1. Introduction

Glioblastoma [1] is the most common primary malignant brain tumor with poor prognosis. It is typically treated with surgical resection followed by concurrent radiotherapy and chemotherapy [2], sometimes supplemented with tumor treating fields [3], and then adjuvant temozolomide (TMZ). Due to the aggressive nature of glioblastoma, despite these multimodality treatments, the median overall survival is less than 2 years after diagnosis [4]. Salvage treatment is commonly administered upon disease recurrence post-treatment to extend patient survival [5,6]. However, accurately pinpointing recurrence, whether through radiographic or clinical means, poses a considerable challenge due to the complexities arising from the effects of multimodal therapy.
Following treatment for glioblastoma, treatment response surveillance imaging is performed using standard T1-weighted (with and without contrast) and T2-weighted magnetic resonance imaging (MRI) sequences. Unfortunately, despite modern imaging assessment tools, it remains difficult to differentiate tumor progression from treatment effect [7,8], even with established response assessment criteria such as the response assessment in neuro-oncology (RANO) criteria [9,10]. The interaction between radiotherapy and chemotherapy can enhance contrast uptake through various mechanisms, for example, through a disruption of the blood–brain barrier and/or through an alteration in cell metabolism [11], potentially leading to contrast enhancement on imaging with image features similar to those related to true progressive disease (PD), falsely suggesting tumor progression. This phenomenon, known as pseudoprogression (PsP), occurs with reported incidence rates ranging from 9% to 32% [12,13,14]. An incorrect interpretation of progression may either delay time-sensitive salvage treatment or interrupt the effective primary treatment, both resulting in suboptimal tumor control. This difficulty of differentiating treatment effect from tumor progression may become more prevalent with advancements in treatment options that may increase radiographic treatment effect [15,16], such as radiation dose escalation, radiosensitizers, immunotherapy, and other systemic therapies. Consequently, there is a critical need for improved methods to differentiate treatment effect from tumor progression.
In recent years, advanced imaging techniques have emerged as valuable tools for post-treatment surveillance. Examples include advanced MRI modalities such as perfusion and diffusion MRI [17,18,19,20], as well as positron emission tomography (PET) utilizing amino acid tracers [21,22,23,24]. These methods offer functional and metabolic data that can enhance traditional anatomical imaging, aiding in the differentiation of treatment effects from tumor cell proliferation. Studies [25,26,27,28] have suggested PET with amino-acid tracers can provide more specific uptake in tumor tissue than in areas of radiation-induced normal tissue response. Moreover, there have been significant advancements in image analysis techniques. Complex modeling approaches like radiomics [29,30,31,32] and deep learning (DL) [33,34] are now commonly employed for extracting image features and predicting prognosis.
In this exploratory study, we utilize radiomics and machine learning, including nonlinear dimension reduction techniques, to study a time series of PET images with the amino acid tracer 3,4-dihydroxy-6-18F-fluoro-L-phenylalanine (18F-DOPA), on a cohort of newly glioblastoma patients treated with escalated radiation dose [16]. Although potential survival benefits have been demonstrated in the trial with 18F-DOPA PET-guided escalated dose, the disconnect between progression-free survival (PFS) per conventional MR and overall survival (OS) of the patients suggests a high incidence rate of PsP, making the standard of care post-treatment surveillance difficult. This study focuses on the cohort of patients with poorer prognosis (wild isocitrate dehydrogenase (IDH) and unmethylated O6-methylguanine–DNA methyltransferase (MGMT) status). As an exploratory and hypothesis-generating study limited by the data size, our aim is to predict remaining survival after each follow-up (FU) image, thereby demonstrating the potential value of 18F-DOPA as a prognostic tool and a potential aid in differentiating treatment effect from tumor progression.

2. Materials and Methods

2.1. Patients and 18F-DOPA PET Surveillance Images

This study analyzed the 18F-DOPA PET surveillance images from patients enrolled in an Institutional Review Board (IRB)-approved prospective trial (NCT01991977). All participants received 18F-DOPA-guided dose-escalated radiation therapy (DERT). Inclusion criteria in this secondary analysis required patients to have newly diagnosed glioblastoma with IDH-wildtype and unmethylated MGMT promoter status, confirmed by pre-RT molecular testing. Patients without a known MGMT status were excluded. All patients underwent at least one FU 18F-DOPA PET/CT scan. For each available PET FU scan, a corresponding conventional MRI set was also acquired, which has been used to assess progression with RANO criteria; however, not all MRI follow-ups had matched PET acquisitions. The absence of certain PET FU scans was attributable to factors such as 18F-DOPA synthesis failures, patient non-compliance with pre-scan protocols (e.g., fasting), or other logistical constraints. For analysis, patients were randomly assigned to training and testing cohorts in a 70:30 ratio. All FU images were organized by patient and separated into completely independent training and testing datasets accordingly. By this approach, no FU image of any patient in the test cohort was exposed in the training dataset.

2.2. Outcome Endpoints

This study aimed to assess the risk of tumor progression at each FU timepoint, recognizing that risk status evolves dynamically over time. FU PET images provide a temporal snapshot of the tumor’s state at each respective timepoint. However, conventional endpoints such as progression-free survival (PFS) or overall survival (OS) do not capture this time-varying nature of post-treatment tumor behavior. To address this, we introduced the concept of “remaining survival” (RS), defined as the time interval between a given FU imaging acquisition and the date of death, offering a more temporally resolved and clinically relevant outcome measure. Considering that median survival is reported to be 12 months or less from progression in the literature [11,26,32], FU scans were categorized as either “high-risk” or “low-risk” for progressive disease based on whether they had at least 12 months of remaining survival from the time of a given 18F-DOPA FU scan. A model-based risk transition timepoint was defined as the first FU timepoint when a “high risk” image was observed for a patient.

2.3. Radiomics Feature Extraction

PET scans were acquired using either a Discovery 690XT or Discovery MI PET/CT scanner from GE (Chicago, IL, USA), with a standardized acquisition protocol [16,35] to ensure consistency across patients. The reconstructed PET data were resampled into isotropic voxels of 2 × 2 × 2 mm prior to analysis. Feature extraction was performed with the PyRadiomics [36] platform (version 3.0.1), generating a comprehensive set of quantitative descriptors: 26 geometric (shape) features, 19 first-order statistical measures, and 70 texture metrics. The latter included 24 gray level co-occurrence matrix (GLCM), 16 gray level dependence matrix (GLDM), 16 gray level run length matrix (GLRLM), and 14 gray level size zone matrix (GLSZM) features, following the standardized definitions established by the Image Biomarker Standardization Initiative [37,38]. All shape descriptors were reported in voxel-based units. For reference, Table S1 in the Supplementary Materials summarizes the features extracted in this work.
Feature extraction was limited to regions of elevated tracer uptake, defined by a tumor-to-normal-brain (T/N) SUV ratio greater than 2.0. For the scan without T/N SUV ratio higher than 2.0, the features were automatically set to 0 by design, reflecting the absence of above-threshold uptake. This T/N ratio, established in a prior clinical trial [39] through biopsy correlation and multimodal image comparison, was prospectively adopted in our dose-escalation protocol to guide dose painting on gross tumor volume [16]. This high-uptake zone has previously been shown in our work to correspond to the biologically most aggressive portion of the lesion and to provide greater predictive value than the entire tumor volume [35]. To minimize inter-observer variability, tumor regions were delineated using an SUV-threshold-based autosegmentation approach. Each segmentation was subsequently reviewed by both an experienced medical physicist and a nuclear medicine physician, who carefully excluded areas of physiological uptake, such as striatal signals.

2.4. Radiomics Feature Selection

Feature selection is a crucial step in reducing input size to eliminate noise in data before machine learning training, and is particularly vital for small datasets. This study developed a multi-stage feature selection algorithm depicted in Figure 1. Exclusive to the training image set, the algorithm comprised three stages. Stage I evaluated the predictivity of each feature to remaining survival, categorizing feature values into two groups based on the optimal threshold, determined by the lowest p-value. Stage II involved 5 folds cross-validation of the features selected in Stage I to ensure their consistent predictivity across random subsets of the data. The underlying hypothesis was that a progressive or preventive feature should exhibit the same behavior across different subsets of the data. Both the entire training group and subsets consisting of FU1 images acquired at consistent timepoints after RT were considered in Stage II. Only features selected for all sets were deemed robustly predictive and advanced to Stage III. In Stage III, known as pair correlation filtering, the bivariate Pearson correlation coefficient was calculated for selected features categorized as shape, first order, and texture. If the correlation coefficient between two features exceeded 0.8, one of the features was eliminated. Priority was given to features that had been previously reported in the literature [11,26,32] to minimize the introduction of new features and leverage the additional robustness from independent testing using a similar, although not identical, amino acid PET tracer. Following these stages, only a limited number of features remained. While feature selection reduced noise, it also diminished information from discarded features. Further improvement on noise reduction and data orthogonality was achievable through feature compression and abstraction with manifold learning [36]. In this study, we utilized Uniform Manifold Approximation and Projection (UMAP) [37], a nonlinear dimension reduction technique to map the data structure in a high-dimensional space into a lower-dimensional space by preserving both global and local data structures. The selected features were projected into space of two features: UMAP x and UMAP y. As per standard practice in machine learning, a Standard Scaler, implemented in Scikit-Learn [40] (version 1.5.1), was applied to the projected features before feeding into the ML models.
In order to assess the efficacy of the feature selection pipeline and the individual contributions of its components, ablation tests were conducted by systematically removing elements of the pipeline. Ablation Test 1 involved the elimination of the third stage of feature selection, resulting in a larger set of selected features prior to manifold learning. This test aimed to evaluate the ability of manifold learning to manage higher-dimensional data. Ablation Test 2, on the other hand, excluded manifold learning before machine learning to assess model performance in the absence of feature abstraction.

2.5. ML Algorithms

Two machine learning (ML) approaches were employed to explore the relationship between the features selected from the FU 18F-DOPA PET images and remaining survival. The first approach used AutoML with Amazon’s AutoGluon [41] (version 1.0.0), a new open-source framework of machine learning models. Auto ML is designed to automatically select the appropriate ML algorithm and optimize hyperparameters. The second ML algorithm employed was the Explainable Boosting Machine (EBM) which offers the advantage of producing interpretable model outputs. The model was built using Microsoft (Seattle, WA, USA)’s InterpretML library [42] (version 0.5.0). This is an implementation of the Generalized Adaptative Model (GA2M) [43] which replaces single variables in a linear model with single-feature models (shape functions) that utilize bagging and gradient boosting, as well as additional terms for feature pairing interactions. Because of its additive nature, the contribution of each feature to the model and to the classification for each individual patient can be easily quantified and visualized. It has been shown to have comparable classification performance [42] with other popular models like Random Forest and XGBoost, but with much better interpretability.
Both models examined in this study underwent validation using an independent test dataset, which comprised FU PET scans from the patients whose information was excluded from the model training process.
For comparison, we implemented a univariate baseline using TBRmax (maximum T/N SUV ratio per scan), a reported predictive marker in multiple amino-acid PET glioma studies [32,44,45,46]. The decision threshold was selected on the training set (via a simple grid search to maximize accuracy) and then fixed and applied to the independent test set for evaluation.

3. Results

3.1. Patient Cohort and Servalliance Images

A total of 33 patients were identified with the inclusion criteria stated. The span of OS ranged from 5 months to 41 months, with a median at 15 months. Ages of patients ranged from 19 to 77 years with a median of 55 years, comprising 58% male and 42% female. The number of 18F-DOPA FU images for a given patient ranged from 1 to 14 with a median of 3. The first FU images were acquired consistently at 1 month after completing RT (ranging from 27 days to 42 days with a median of 33 days). Subsequent 18F-DOPA FU frequency determined for each patient based on the latest available FU images. The training and testing sets have 103 FU 18F-DOPA images from 25 patients and 37 FU 18F-DOPA images from 10 patients, respectively. The patients in the testing set were not exposed to the training set.

3.2. Radiomics Feature Selection

Starting with 115 radiomics features extracted from PET images, individual predictivity evaluation reduced the pool of the features to 104. After cross-validation stages, a total of 47 features (including 11 shape features, 12 first-order features, and 24 texture features) were identified. These features demonstrated the potential to stratify patients into distinct groups with statistical differences, following the application of an optimized threshold. After the third stage of pair correlation filtering, only 6 features, including 3 shape features, 1 first order feature and 2 texture features, were identified. They are summarized in Table 1 and their brief explanation can be found in Supplementary Table S2. The dimension reduction stage with UMAP further reduces those 6 features into 2 projected features (UMAP x and UMAP y) and their visualization is plotted in Figure 2. The feature count reduction flow is also summarized in Supplementary Table S3.

3.3. Model Prediction

Utilizing the two projected UMAP features, Autogluon (version 0.5.0) was used to train 219 classification models with high-quality presets, encompassing Random Forest, Light GBM, and various ensemble models, among others, based on the training dataset. The evaluation metric used was the Area Under Curve (AUC), and the best-trained model (WeightedEnsemble_L3) achieved an AUC of 0.81 when applied to the independent test data. Based on 2000 bootstrap resamples, the model’s AUC had a 95% confidence interval of [0.650, 0.942]. WeightedEnsemble_L3 is a three-level ensemble model employing Autogluon’s multi-layer stacking strategy [41]. At the bottom layer, it consists of 110 individual models with nonzero weights, which feed into the second layer composed of 7 ensembled models, ultimately contributing to the top layer (WeightedEnsemble_L3). A schematic of the WeightedEnsemble_L3 architecture and its receiver operating characteristic (ROC) curve on the test dataset are depicted in Figure 3a and Figure 3b, respectively.
The EBM model was also trained with the same training dataset. When it was applied to the test data, it achieved the AUC of 0.83. Similarly, based on 2000 bootstrap resamples, the EBM model’s AUC had a 95% confidence interval of [0.681, 0.944]. The ROC curve on the test dataset based on EBM model is shown in Figure 3b.
A sensitivity test to the trained models was also performed by redefining high risk status to RS ≤ 10 months and RS ≤ 14 months. The AUCs of AutoGluon are 0.80 and 0.81, respectively. The AUCs of EBM are 0.82 and 0.83, respectively.
On the training set, the baseline model identified an optimal TBRmax threshold of 2.16, yielding an AUC of 0.72. Applied unchanged to the independent test set, it achieved an AUC of 0.66.
Ablation Test 1, which omitted Stage III feature selection, and Ablation Test 2, which excluded manifold learning, both resulted in inferior model performance when assessed using the reserved test dataset. In Ablation Test 1, Autogluon achieved an AUC of 0.71, while EBM attained 0.68. Conversely, in Ablation Test 2, Autogluon achieved an AUC of 0.80, while EBM reached 0.73.
To depict the changes in tumor biology associated with the RS of each patient, we proposed a RS map, which represents the temporal evolution of a patient’s remaining life with respect to the pre-RT imaging timepoint. Figure 4 showcases the predictions of RS risk for each PET image in the test dataset, utilizing the EBM model. In this figure, the pre-RT PET image is designated as timepoint 0 for each patient. Each square on the RS map corresponds to a subsequent 18F-DOPA PET FU imaging timepoint. PET scans that are predicted to be high-risk according to the model are depicted as dark blue squares, while low-risk PET scans are represented by light blue squares. For reference, a cyan diamond is provided to indicate the timepoint at which radiographic progression was determined solely based on the RANO criteria derived from conventional MRI. As shown in the figure, the model is also applied to pre-RT images, which are not included in the training dataset. Although pre-RT images are very different from FU images because they are followed with interventions, it is reasonable to assume that pre-RT images should present high risk for survival if intervention (multimodal therapies) was not applied. The model prediction is consistent with this assumption, classifying 86% of pre-RT images as high risk.
The EBM model’s predicted transition timepoints from low-risk to high-risk status were compared with the radiographical progression timepoints determined using RANO criteria based on conventional MRIs for the entire cohort, as shown in Table 2. The MRIs were typically obtained within a day before or after the 18F-DOPA scans to ensure simultaneous assessment. Based on RANO progression calls, the mean RS of the cohort from the timepoint of the progression calls was found to be 8.0 months, with a standard deviation of 6.2 months. In contrast, based on the model predictions, the mean RS at the timepoint of model-predicted risk transition was 10.0 months, with a standard deviation of 5.6 months, excluding the patients without a model-predicted risk transition. Among the patients in this study, 33.3% had RANO progression calls coincident with model-predicted risk transition timepoints. This subgroup had an average survival of 10 months with a standard deviation of 6.2 months after the RANO progression timepoints. 91% of this coincident subgroup had remaining survival less than or equal to 12 months after the RANO progression call. For 45.4% of patients, the model predicted risk transition occurred earlier than the RANO call. 6.1% of patients had a model predicted risk transition timepoint later than the RANO call. In the remaining 15.2% of patients, no 18F-DOPA scans were acquired after the RANO call, and all 18F-DOPA scans prior to and coincidence with the RANO call timepoint were predicted to be low risk, so a risk transition timepoint was not reached for the available scans. Only 40% of this subgroup had RS ≤ 12 months after the RANO progression call. Averaged remaining survival and the percentage of RS ≤ 12 months are reported in Table 2.
Both AutoGluon and EBM provide feature-importance estimates to aid interpretability. Applied to the independent test set, the importances of the UMAP-projected features are summarized in Table 3; for comparability, values are normalized to sum to 1. Because AutoGluon does not include explicit pairwise interaction terms, the “interaction” entry is zero for AutoGluon, whereas EBM reports contributions from both main effects and modeled pairwise interactions.
The EBM model also provides a means to evaluate the contribution of each feature to the model prediction at an individual level. Figure 5 illustrates an example of feature interpretation for a patient in the test cohort (RT_FDOPA40). The probability of being classified as high risk is calculated for each image, and the contributions (weights) of the shape function for each feature or the interaction between two variables are plotted quantitatively. A positive value indicates favoring a high-risk classification, while a negative value favors a low-risk classification. The final decision is a balance between all of the features.
To further assess clinical utility, we conducted a decision curve analysis (DCA). The results are presented in Figure 6a. The Kaplan–Meier survival curves comparing progression-free survival (PFS; RANO-defined) with model-predicted low-risk survival (MLRS) are presented in Figure 6b.

4. Discussion

Differentiating true tumor progression from treatment-related effects remains a persistent and unresolved challenge in the post-treatment surveillance of glioblastoma. The RANO criteria, which rely on conventional MRI sequences, have limited accuracy—particularly in the context of escalated radiation doses. In the clinical trial involving dose escalation, PFS based on RANO criteria has shown no correlation with OS, suggesting that many progression assessments may be confounded by treatment effects. These limitations underscore the need for more informative imaging modalities to improve the accurate identification of true progression. In this study, we evaluated the utility of longitudinal post-treatment 18F-DOPA PET imaging for predicting RS in glioblastoma patients. Given the reported correlation between pathologically confirmed PFS and OS [26,32], We explored the use of RS as a surrogate but dynamic marker for OS to capture the evolving risk status of the tumor. Our findings indicate that incorporating 18F-DOPA PET into post-treatment surveillance can enhance clinical decision-making by providing additional prognostic value. This PET-based model has potential clinical value in prognostication and may support more personalized decisions regarding salvage treatments.
In outcome analysis for relatively small datasets, for example, for uncommon cancers like glioblastoma, feature selection and dimensionality reduction are essential for mitigating noise and uncovering meaningful patterns in high-dimensional data. While selecting only the most predictive features can reduce noise, it risks discarding complementary or interacting information. Conversely, applying dimensionality reduction directly to large, noisy feature sets can obscure relevant signals. Striking a balance between noise reduction and information preservation is a core challenge in machine learning and an area of active research. To address this, we developed a tailored pipeline that first identifies robust predictive features to narrow the input space, followed by manifold learning using UMAP for further dimensionality reduction and feature abstraction. This combination preserved key information while minimizing redundancy, leading to optimal performance across models, as shown in Figure 3. Ablation tests demonstrated that injecting too many raw features prior to manifold learning (Ablation Test 1), or omitting manifold learning altogether (Ablation Test 2), consistently degraded model performance in both AutoML and EBM frameworks. Beyond improving model accuracy, UMAP also enabled visualization of the high-dimensional feature space (Figure 2), facilitating the identification of feature patterns and clusters that would otherwise be hidden in high dimensional data. Notably, patients with shorter remaining survival (RS) tended to cluster in regions with lower UMAP x and y values, while those with longer RS exhibited the opposite trend, suggesting distinct underlying feature distributions. Although manifold learning techniques like UMAP have been widely applied in genomics (for example, in clustering tumor subtypes), they remain underutilized in radiomics, particularly with diagnostic imaging data. To our knowledge, this study represents the first application of UMAP to PET radiomics in glioblastoma, demonstrating both its analytical and interpretive value.
Both the AutoML (Autogluon) and EBM models demonstrated reasonable and comparable ROC_AUC. The AutoML approach takes care of many machine learning details in the background, including data scaling, model selection and ensemble, as well as cross validation. It significantly reduces the domain knowledge related to machine learning and enhances the utilization of machine learning tools. The multiple layer architecture design of Autogluon, with model ensembles, also helps to maintain the model’s robustness. The performance degradation in the ablation tests is less severe compared to that of the EBM model. For comparison, a univariate baseline model using TBRmax, a metric reported as prognostic in multiple PET glioma studies [32,44,45,46], performed weaker than both AutoML and EBM on the training and test sets, though it remained predictive; notably, the optimal TBRmax threshold selected on the training set was similar to the optimal cutoffs reported elsewhere [32,45]. The superior performance of AutoML and EBM underscores the added value of multivariable modeling over single-parameter approaches. Furthermore, the AUCs achieved with AutoML and EBM are comparable to, or modestly higher than, previously reported multivariable radiomics model [32]; however, direct comparison is not appropriate given differences in PET tracers, cohort composition, and treatment protocols. Because our cohort underwent 18F-DOPA-guided dose escalation, evaluation and external validation in patients treated with standard-dose regimens are warranted as such data become available.
Both AutoGluon and EBM provide native feature-importance estimates (Table 3). The resulting profiles differ by algorithm: AutoGluon places most of the weight on UMAP-x and does not model interaction terms, whereas EBM, while also ranking UMAP-x highest, shows a more balanced distribution across features and includes an explicit UMAP x–y interaction effect. The EBM model, known for its interpretability, also provides valuable insights into how each feature contributes to the model’s decision for individual images. This information is particularly useful for clinicians in evaluating changes in tumor status, as it transforms the machine learning model from a black box into a glass box, as illustrated by the example in Figure 5. This interpretability provides clinicians with a deeper understanding of the model’s decision-making process and highlights the evolution of critical predictive features. By offering more detailed information beyond a simple “yes” or “no” classification, this interpretability empowers clinicians in making informed clinical decisions based on the model’s insights.
In the decision-curve analysis on the independent test set (Figure 6a), both AutoGluon and EBM showed positive net benefit across clinically plausible thresholds (~0.10–0.40), outperforming “treat-all” and “treat-none.” AutoGluon provided greater benefit at lower–mid thresholds (~0.10–0.35), consistent with early-warning use, whereas EBM was comparable and modestly higher at higher thresholds (~0.40–0.50). These findings suggest that model-guided surveillance offers greater clinical utility than default strategies across common operating points, with the threshold tailored to local risk tolerance and practice patterns. Complementing this, Kaplan–Meier curves (Figure 6b) comparing RANO-defined PFS with model-predicted low-risk survival (MLRS) show distinct temporal patterns consistent with earlier model signaling in a subset of patients. Taken together, the DCA and KM results support the potential for actionable early warnings, warranting confirmation in larger, multi-institutional cohorts.
Based on 18F-DOPA PET images, the RS map suggested in this work provides a tool to monitor the patients’ status associated with their survival, as a surrogate to tumor progression. With significant treatment complications and high rate of incorrect progressional calls, especially for patients treated with escalated radiation dose, the remaining survival becomes a more reliable indicator than the PFS determined by the conventional methods. 18F-DOPA PET images can provide a superior sensitivity to detect tumor progression. For the patients included in this study, 41.2% showed earlier indications from 18F-DOPA PET than the conventional MRIs, and their shorter survival after progressional call also confirmed the better sensitivity of 18F-DOPA PET. In clinical practice, earlier detection of progression risk transition could alter management in practice: prompting earlier confirmatory imaging and consideration of salvage therapies (e.g., re-resection, re-irradiation, systemic therapy, or trial enrollment). For the patients with later indication from 18F-DOPA PET, their survival after progressional call is significantly longer than the average but their RS after model predicted risk transition is consistent with the average RS, which suggests that 18F-DOPA PET may also have better capability to distinguish true tumor progression from treatment effects.
In this study, we introduce RS as a pragmatic endpoint and use risk transition with respect to RS as a surrogate marker of progression—information that is readily obtainable and clinically manageable when histopathologic confirmation is not feasible. Guided by prior reports, we adopted RS ≤ 12 months as the primary threshold; sensitivity analyses across 10–14 months yielded consistent findings, indicating robustness to threshold selection. We emphasize that RS is not a substitute for formal progression adjudication (e.g., RANO) but provides complementary, time-anchored evidence when progression determination is challenging. Future work is necessary to prospectively validate the relationship between RS-based risk transitions and pathology-confirmed progression.
Although our work has shown promising applications of 18F-DOPA PET images and the radiomics modeling based on 18F-DOPA PET, this study has several limitations that should be acknowledged. Firstly, the small sample size poses a primary limitation. A larger sample size, ideally from cross-institutional data, will be extremely helpful to validate the ML models and improve their accuracy for personalized medicine. As an amino-acid PET tracer that is not FDA approved, the available data are still very scarce and will take time to accumulate more data, but we optimistically expect that more evidence of the strength of 18F-DOPA PET will continue to be demonstrated. Secondly, our investigation only included patients treated with escalated RT dose, and it remains unclear whether the models and prognostic delta features would hold consistent for a patient cohort treated with a standard dose protocol. Although we believe that 18F-DOPA PET imaging can be a promising tool for the surveillance of glioblastoma treated with standard dose protocol, prospective trials are required to determine its clinical impact and integration pathways. Thirdly, our study focused exclusively on patients with unmethylated MGMT and wild-type IDH1, reflecting where adequate, consistent longitudinal 18F-DOPA PET FU imaging was available. In accordance with WHO CNS5 (2021) excluding IDH1-mutant glioma from glioblastoma, our cohort focuses on IDH1-wildtype. As noted in prior studies [16,47,48], patients with IDH mutation or MGMT promoter methylation often have longer survival trajectories; however, in our dose-escalation cohort, 18F-DOPA PET FU imaging in these subgroups was frequently incomplete or discontinued after RANO-based progression determinations, yielding too few complete longitudinal series for reliable stratified modeling. Therefore, in this study, we have not explored the differences in radiomic features between cohorts with different histologic biomarkers, which could potentially provide valuable insights. Fourthly, although the manifold learning combined with feature selection turns out to be an effective way to reduce noise and makes high dimensional data visible, it comes with a price too. The manifold learning decreases the interpretation of the features and makes it less straightforward to understand what changes in an image contributes to the model decision. Of course, there is no guarantee that such a dominant and interpretable feature may exist at all. Noise reduction, feature abstraction and interpretability are all very active research topics in machine learning, and they hold very critical meanings to outcome studies, including our work. Given these limitations, this work should be regarded as exploratory and hypothesis-generating, and it warrants subsequent studies in larger, more heterogeneous cohorts and across diverse radiotherapy protocols to evaluate generalizability.

5. Conclusions

Radiomic features extracted from PET images acquired with the amino acid tracer 18F-DOPA offer a unique tool to capture glioblastoma tumor status after radiotherapy treatment and can be used to monitor tumor progression and potentially improve upon standard MRI surveillance. Based on radiomics features extracted from automatically contoured tumor core volumes with a high T/N SUV ratio, this work focused on building a radiomics model to predict RS, which is used as a practical surrogate for tumor progression.
With an in-house developed feature selection algorithm followed by manifold learning, the prediction model for RS was constructed using two machine learning algorithms: Auto-ML and Explainable Boost Machine. These models demonstrated promising performance, achieving AUC of 81% and 83%, respectively, when evaluated on an independent test set. Notably, the models also exhibited high accuracy (>80%) in identifying aggressive tumor biology when applied to pre-RT images, which were not included in the training set. These findings suggest that the radiomics features extracted from FU 18F-DOPA PET scans can serve as valuable additions to the traditional conventional MR approach, which lacked correlation between overall survival (OS) and progression-free survival (PFS) in detecting tumor progression. Moreover, the proposed remaining survival map provides a tool for monitoring the evolution of tumor progression.
Moving forward, our research aims to expand in two directions. Firstly, we plan to investigate the MRI data, specifically diffusion and perfusion MRI sequences acquired concurrently with PET images. This analysis will provide further insights into tumor heterogeneity and its temporal changes, complementing the findings from this study. Secondly, we aim to broaden the cohort size by incorporating data from different protocols and institutions. As part of this effort, we are organizing a cross-institutional study to utilize 18F-DOPA PET, with planned harmonization of acquisition/reconstruction parameters, centralized data curation/QA, and a prespecified analysis pipeline. This collaboration will prospectively enroll patients for 18F-DOPA-guided dose escalation radiotherapy, enabling external validation and assessment of generalizability across centers. This expansion will enable us to validate the robustness of our models and prognostic features and facilitate the development of multivariable models for more personalized medicine approaches. We firmly believe that amino-acid PET tracers, including 18F-DOPA, hold immense potential to enhance glioblastoma treatment and management, presenting exciting research and clinical opportunities.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/cancers17213560/s1, Table S1: The extracted features in this study include 26 shape, 19 first-order and 70 texture features from each scan. And the texture features can be furthered categorized into 24 gray level co-occurrence matrix (GLCM), 16 gray level dependence matrix (GLDM), 16 gray level run length matrix (GLRLM), and 14 gray level size zone matrix (GLSZM)); Table S2: The list of identified delta features showing significant differences between preRT and postRT images, with a brief description of each feature; Table S3: Feature count reduction flow in the feature processing steps utilized in this study.

Author Contributions

Conceptualization, J.Q., D.H., W.G.B., N.N.L. and D.H.B.; methodology, J.Q., D.H. and D.H.B.; software, J.Q.; validation, J.Q.; formal analysis, J.Q.; data curation, D.H., W.G.B., P.D.B., C.H.H., M.S.J., D.R.J., T.J.K., B.J.K., S.H.K., V.J.L., M.W.R., J.N.S., J.H.U., M.J.Z., M.H.S., H.S.W.C.T., E.S.Y., Y.Z., N.N.L. and D.H.B.; writing—original draft preparation, J.Q., D.H., W.G.B. and D.H.B.; writing—review and editing: all authors. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Cancer Institute of the National Institutes of Health (grant number R01CA178200 to N.N.L., D.H.B.); Center for Clinical and Translational Science of Mayo Clinic (2022 Advance the Practice Research Award to J.Q.).

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board of Mayo Clinic (protocol code 13-005106 and 2013-07-24).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data generated and analyzed during this study are included in this article. Extracted radiomics features and the source code are available upon request.

Conflicts of Interest

V.J.L. serves as a consultant for Bayer Schering Pharma, Piramal Life Sciences, Life Molecular Imaging, Eisai Inc., AVID Radiopharmaceuticals, Eli Lilly and Company, and Merck Research and receives research support from GE Healthcare, Siemens Molecular Imaging, AVID Radiopharmaceuticals, and the NIH (NIA, NCI). The other authors have no conflicts of interest to declare. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Louis, D.N.; Perry, A.; Wesseling, P.; Brat, D.J.; Cree, I.A.; Figarella-Branger, D.; Hawkins, C.; Ng, H.K.; Pfister, S.M.; Reifenberger, G.; et al. The 2021 WHO Classification of Tumors of the Central Nervous System: A summary. Neuro-Oncology 2021, 23, 1231–1251. [Google Scholar] [CrossRef]
  2. Stupp, R.; Weber, D.C. The Role of Radio- and Chemotherapy in Glioblastoma. Oncol. Res. Treat. 2005, 28, 315–317. [Google Scholar] [CrossRef]
  3. Taphoorn, M.J.B.; Dirven, L.; Kanner, A.A.; Lavy-Shahaf, G.; Weinberg, U.; Taillibert, S.; Toms, S.A.; Honnorat, J.; Chen, T.C.; Sroubek, J.; et al. Influence of Treatment With Tumor-Treating Fields on Health-Related Quality of Life of Patients With Newly Diagnosed Glioblastoma. JAMA Oncol. 2018, 4, 495–504. [Google Scholar] [CrossRef] [PubMed]
  4. Henriksen, O.M.; Álvarez-Torres, M.d.M.; Figueiredo, P.; Hangel, G.; Keil, V.C.; Nechifor, R.E.; Riemer, F.; Schmainda, K.M.; Warnert, E.A.H.; Wiegers, E.C.; et al. High-Grade Glioma Treatment Response Monitoring Biomarkers: A Position Statement on the Evidence Supporting the Use of Advanced MRI Techniques in the Clinic, and the Latest Bench-to-Bedside Developments. Part 1: Perfusion and Diffusion Techniques. Front. Oncol. 2022, 12, 810263. [Google Scholar] [CrossRef] [PubMed]
  5. Kim, D.; Lee, J.H.; Kim, N.; Lim, D.H.; Song, J.H.; Suh, C.-O.; Wee, C.W.; Kim, I.A. Optimizing Recurrent Glioblastoma Salvage Treatment: A Multicenter Study Integrating Genetic Biomarkers From the Korean Radiation Oncology Group (21-02). Neurosurgery 2024, 95, 584–595. [Google Scholar] [CrossRef]
  6. Cuncannon, M.; Wong, M.; Jayamanne, D.; Guo, L.; Cove, N.; Wheeler, H.; Back, M. Role of delayed salvage bevacizumab at symptomatic progression of chemorefractory glioblastoma. BMC Cancer 2019, 19, 445. [Google Scholar] [CrossRef]
  7. Hygino da Cruz, L.C., Jr.; Rodriguez, I.; Domingues, R.C.; Gasparetto, E.L.; Sorensen, A.G. Pseudoprogression and Pseudoresponse: Imaging Challenges in the Assessment of Posttreatment Glioma. Am. J. Neuroradiol. 2011, 32, 1978–1985. [Google Scholar] [CrossRef]
  8. Thust, S.C.; Bent, M.J.v.D.; Smits, M. Pseudoprogression of brain tumors. J. Magn. Reson. Imaging 2018, 48, 571–589. [Google Scholar] [CrossRef]
  9. Wen, P.Y.; Chang, S.M.; Bent, M.J.V.D.; Vogelbaum, M.A.; Macdonald, D.R.; Lee, E.Q. Response Assessment in Neuro-Oncology Clinical Trials. J. Clin. Oncol. 2017, 35, 2439–2449. [Google Scholar] [CrossRef]
  10. Chukwueke, U.N.; Wen, P.Y. Use of the Response Assessment in Neuro-Oncology (RANO) criteria in clinical trials and clinical practice. CNS Oncol. 2019, 8, CNS28. [Google Scholar] [CrossRef]
  11. Brandes, A.A.; Franceschi, E.; Tosoni, A.; Blatt, V.; Pession, A.; Tallini, G.; Bertorelle, R.; Bartolini, S.; Calbucci, F.; Andreoli, A.; et al. MGMT Promoter Methylation Status Can Predict the Incidence and Outcome of Pseudoprogression After Concomitant Radiochemotherapy in Newly Diagnosed Glioblastoma Patients. J. Clin. Oncol. 2008, 26, 2192–2197. [Google Scholar] [CrossRef]
  12. Wick, W.; Chinot, O.L.; Bendszus, M.; Mason, W.; Henriksson, R.; Saran, F.; Nishikawa, R.; Revil, C.; Kerloeguen, Y.; Cloughesy, T. Evaluation of pseudoprogression rates and tumor progression patterns in a phase III trial of bevacizumab plus radiotherapy/temozolomide for newly diagnosed glioblastoma. Neuro-Oncology 2016, 18, 1434–1441. [Google Scholar] [CrossRef] [PubMed]
  13. Topkan, E.; Topuk, S.; Oymak, E.; Parlak, C.; Pehlivan, B. Pseudoprogression in Patients With Glioblastoma Multiforme After Concurrent Radiotherapy and Temozolomide. Am. J. Clin. Oncol. 2012, 35, 284–289. [Google Scholar] [CrossRef] [PubMed]
  14. Young, R.; Gupta, A.; Shah, A.; Graber, J.; Zhang, Z.; Shi, W.; Holodny, A.; Omuro, A. Potential utility of conventional MRI signs in diagnosing pseudoprogression in glioblastoma. Neurology 2011, 76, 1918–1924. [Google Scholar] [CrossRef] [PubMed]
  15. Kim, M.M.; Sun, Y.; Aryal, M.P.; Parmar, H.A.; Piert, M.; Rosen, B.; Mayo, C.S.; Balter, J.M.; Schipper, M.; Gabel, N.; et al. A Phase 2 Study of Dose-intensified Chemoradiation Using Biologically Based Target Volume Definition in Patients With Newly Diagnosed Glioblastoma. Int. J. Radiat. Oncol. 2021, 110, 792–803. [Google Scholar] [CrossRef]
  16. Laack, N.N.; Pafundi, D.; Anderson, S.K.; Kaufmann, T.; Lowe, V.; Hunt, C.; Vogen, D.; Yan, E.; Sarkaria, J.; Brown, P.; et al. Initial Results of a Phase 2 Trial of 18F-DOPA PET-Guided Dose-Escalated Radiation Therapy for Glioblastoma. Int. J. Radiat. Oncol. 2021, 110, 1383–1395. [Google Scholar] [CrossRef]
  17. Patel, P.; Baradaran, H.; Delgado, D.; Askin, G.; Christos, P.; Tsiouris, A.J.; Gupta, A. MR perfusion-weighted imaging in the evaluation of high-grade gliomas after treatment: A systematic review and meta-analysis. Neuro-Oncology 2017, 19, 118–127. [Google Scholar] [CrossRef]
  18. Leimgruber, A.; Ostermann, S.; Yeon, E.J.; Buff, E.; Maeder, P.P.; Stupp, R.; Meuli, R.A. Perfusion and diffusion MRI of glioblastoma progression in a four-year prospective temozolomide clinical trial. Int. J. Radiat. Oncol. 2006, 64, 869–875. [Google Scholar] [CrossRef]
  19. Prager, A.; Martinez, N.; Beal, K.; Omuro, A.; Zhang, Z.; Young, R. Diffusion and Perfusion MRI to Differentiate Treatment-Related Changes Including Pseudoprogression from Recurrent Tumors in High-Grade Gliomas with Histopathologic Evidence. Am. J. Neuroradiol. 2015, 36, 877–885. [Google Scholar] [CrossRef]
  20. Kim, M.M.; Parmar, H.A.; Aryal, M.P.; Mayo, C.S.; Balter, J.M.; Lawrence, T.S.; Cao, Y. Developing a Pipeline for Multiparametric MRI-Guided Radiation Therapy: Initial Results from a Phase II Clinical Trial in Newly Diagnosed Glioblastoma. Tomography 2019, 5, 118–126. [Google Scholar] [CrossRef]
  21. Prather, K.Y.; O’nEal, C.M.; Westrup, A.M.; Tullos, H.J.; Hughes, K.L.; Conner, A.K.; A Glenn, C.; Battiste, J.D. A systematic review of amino acid PET in assessing treatment response to temozolomide in glioma. Neuro-Oncol. Adv. 2022, 4, vdac008. [Google Scholar] [CrossRef]
  22. Cicone, F.; Galldiks, N.; Papa, A.; Langen, K.-J.; Cascini, G.L.; Minniti, G. Repeated amino acid PET imaging for longitudinal monitoring of brain tumors. Clin. Transl. Imaging 2022, 10, 457–465. [Google Scholar] [CrossRef]
  23. Nabavizadeh, A.; Bagley, S.J.; Doot, R.K.; Ware, J.B.; Young, A.J.; Ghodasara, S.; Zhao, C.; Anderson, H.; Schubert, E.; Carpenter, E.L.; et al. Distinguishing Progression from Pseudoprogression in Glioblastoma Using18F-Fluciclovine PET. J. Nucl. Med. 2023, 64, 852–858. [Google Scholar] [CrossRef]
  24. Darcourt, J.; Chardin, D.; Bourg, V.; Gal, J.; Schiappa, R.; Blonski, M.; Koulibaly, P.-M.; Almairac, F.; Mondot, L.; Le Jeune, F.; et al. Added value of [18F]FDOPA PET to the management of high-grade glioma patients after their initial treatment: A prospective multicentre study. Eur. J. Nucl. Med. 2023, 50, 2727–2735. [Google Scholar] [CrossRef] [PubMed]
  25. Galldiks, N.; Langen, K.-J. Anwendung der Aminosäure-PET in der Diagnostik und Therapie von zerebralen Gliomen [Use of Amino Acid PET in the Diagnostic and Treatment Management of Cerebral Gliomas]. Fortschritte Neurol.·Psychiatr. 2012, 80, 17–23. [Google Scholar] [CrossRef]
  26. Galldiks, N.; Dunkl, V.; Stoffels, G.; Hutterer, M.; Rapp, M.; Sabel, M.; Reifenberger, G.; Kebir, S.; Dorn, F.; Blau, T.; et al. Diagnosis of pseudoprogression in patients with glioblastoma using O-(2-[18F]fluoroethyl)-l-tyrosine PET. Eur. J. Nucl. Med. 2015, 42, 685–695. [Google Scholar] [CrossRef] [PubMed]
  27. Galldiks, N.; Dunkl, V.; Ceccon, G.; Tscherpel, C.; Stoffels, G.; Law, I.; Henriksen, O.M.; Muhic, A.; Poulsen, H.S.; Steger, J.; et al. Early treatment response evaluation using FET PET compared to MRI in glioblastoma patients at first progression treated with bevacizumab plus lomustine. Eur. J. Nucl. Med. 2018, 45, 2377–2386. [Google Scholar] [CrossRef]
  28. Galldiks, N.; Abdulla, D.S.; Scheffler, M.; Wolpert, F.; Werner, J.-M.; Hüllner, M.; Stoffels, G.; Schweinsberg, V.; Schlaak, M.; Kreuzberg, N.; et al. Treatment Monitoring of Immunotherapy and Targeted Therapy Using 18F-FET PET in Patients with Melanoma and Lung Cancer Brain Metastases: Initial Experiences. J. Nucl. Med. 2021, 62, 464–470. [Google Scholar] [CrossRef]
  29. Narang, S.; Lehrer, M.; Yang, D.; Lee, J.; Rao, A. Radiomics in glioblastoma: Current status, challenges and potential opportunities. Transl. Cancer Res. 2016, 5, 383–397. [Google Scholar] [CrossRef]
  30. Kim, J.Y.; Park, J.E.; Jo, Y.; Shim, W.H.; Nam, S.J.; Kim, J.H.; Yoo, R.-E.; Choi, S.H.; Kim, H.S. Incorporating diffusion- and perfusion-weighted MRI into a radiomics model improves diagnostic performance for pseudoprogression in glioblastoma patients. Neuro-Oncology 2019, 21, 404–414. [Google Scholar] [CrossRef]
  31. Lohmann, P.; Lerche, C.; Bauer, E.; Steger, J.; Stoffels, G.; Blau, T.; Dunkl, V.; Filss, C.P.; Stegmayr, C.; Neumaier, B.; et al. Differentiation of Pseudoprogression from Tumor Progression in Glioblastoma Patients Based on Fet Pet Radiomics. Neuro-Oncology 2017, 19, vi148–vi149. [Google Scholar] [CrossRef]
  32. Lohmann, P.; Elahmadawy, M.A.; Gutsche, R.; Werner, J.-M.; Bauer, E.K.; Ceccon, G.; Kocher, M.; Lerche, C.W.; Rapp, M.; Fink, G.R.; et al. FET PET Radiomics for Differentiating Pseudoprogression from Early Tumor Progression in Glioma Patients Post-Chemoradiation. Cancers 2020, 12, 3835. [Google Scholar] [CrossRef] [PubMed]
  33. Lee, J.; Wang, N.; Turk, S.; Mohammed, S.; Lobo, R.; Kim, J.; Liao, E.; Camelo-Piragua, S.; Kim, M.; Junck, L.; et al. Discriminating pseudoprogression and true progression in diffuse infiltrating glioma using multi-parametric MRI data through deep learning. Sci. Rep. 2020, 10, 20331. [Google Scholar] [CrossRef] [PubMed]
  34. Moassefi, M.; Faghani, S.; Conte, G.M.; Kowalchuk, R.O.; Vahdati, S.; Crompton, D.J.; Perez-Vega, C.; Cabreja, R.A.D.; Vora, S.A.; Quiñones-Hinojosa, A.; et al. A deep learning model for discriminating true progression from pseudoprogression in glioblastoma patients. J. Neuro-Oncol. 2022, 159, 447–455. [Google Scholar] [CrossRef] [PubMed]
  35. Qian, J.; Herman, M.G.; Brinkmann, D.H.; Laack, N.N.; Kemp, B.J.; Hunt, C.H.; Lowe, V.; Pafundi, D.H. Prediction of MGMT Status for Glioblastoma Patients Using Radiomics Feature Extraction From 18F-DOPA-PET Imaging. Int. J. Radiat. Oncol. 2020, 108, 1339–1346. [Google Scholar] [CrossRef]
  36. van Griethuysen, J.J.M.; Fedorov, A.; Parmar, C.; Hosny, A.; Aucoin, N.; Narayan, V.; Beets-Tan, R.G.H.; Fillion-Robin, J.-C.; Pieper, S.; Aerts, H.J.W.L. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res. 2017, 77, e104–e107. [Google Scholar] [CrossRef]
  37. Zwanenburg, A.; Vallières, M.; Abdalah, M.A.; Aerts, H.J.W.L.; Andrearczyk, V.; Apte, A.; Ashrafinia, S.; Bakas, S.; Beukinga, R.J.; Boellaard, R.; et al. The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Image-based Phenotyping. Radiology 2020, 295, 328–338. [Google Scholar] [CrossRef]
  38. Lei, M.; Varghese, B.; Hwang, D.; Cen, S.; Lei, X.; Desai, B.; Azadikhah, A.; Oberai, A.; Duddalwar, V. Benchmarking Various Radiomic Toolkit Features While Applying the Image Biomarker Standardization Initiative toward Clinical Translation of Radiomic Analysis. J. Digit. Imaging 2021, 34, 1156–1170. [Google Scholar] [CrossRef]
  39. Pafundi, D.H.; Laack, N.N.; Youland, R.S.; Parney, I.F.; Lowe, V.J.; Giannini, C.; Kemp, B.J.; Grams, M.P.; Morris, J.M.; Hoover, J.M.; et al. Biopsy validation of 18F-DOPA PET and biodistribution in gliomas for neurosurgical planning and radiotherapy target delineation: Results of a prospective pilot study. Neuro-Oncology 2013, 15, 1058–1067. [Google Scholar] [CrossRef]
  40. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. arXiv 2012, arXiv:1201.0490. [Google Scholar]
  41. Erickson, N.; Mueller, J.; Shirkov, A.; Zhang, H.; Larroy, P.; Li, M.; Smola, A. AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data. arXiv 2020, arXiv:2003.06505. [Google Scholar]
  42. Nori, H.; Jenkins, S.; Koch, P.; Caruana, R. InterpretML: A Unified Framework for Machine Learning Interpretability. arXiv 2019, arXiv:1909.09223. [Google Scholar] [CrossRef]
  43. Lou, Y.; Caruana, R.; Gehrke, J. Intelligible models for classification and regression. In Proceedings of the The 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Beijing, China, 12–16 August 2012; pp. 150–158. [Google Scholar] [CrossRef]
  44. Patel, C.B.; Fazzari, E.; Chakhoyan, A.; Yao, J.; Raymond, C.; Nguyen, H.; Manoukian, J.; Nguyen, N.; Pope, W.; Cloughesy, T.F.; et al. 18F-FDOPA PET and MRI characteristics correlate with degree of malignancy and predict survival in treatment-naïve gliomas: A cross-sectional study. J. Neuro-Oncol. 2018, 139, 399–409. [Google Scholar] [CrossRef] [PubMed]
  45. Celli, M.; Caroli, P.; Amadori, E.; Arpa, D.; Gurrieri, L.; Ghigi, G.; Cenni, P.; Paganelli, G.; Matteucci, F. Diagnostic and Prognostic Potential of 18F-FET PET in the Differential Diagnosis of Glioma Recurrence and Treatment-Induced Changes After Chemoradiation Therapy. Front. Oncol. 2021, 11, 721821. [Google Scholar] [CrossRef]
  46. Qian, J.; Pafundi, D.H.; Breen, W.G.; Brown, P.D.; Hunt, C.H.; Jacobson, M.S.; Johnson, D.R.; Kaufmann, T.J.; Kemp, B.J.; Kizilbash, S.H.; et al. Analysis of imaging signatures in 18F-DOPA PET of glioblastoma treated with dose-escalated radiotherapy. Front. Oncol. 2025, 15, 1623313. [Google Scholar] [CrossRef]
  47. Hegi, M.E.; Diserens, A.-C.; Gorlia, T.; Hamou, M.-F.; De Tribolet, N.; Weller, M.; Kros, J.M.; Hainfellner, J.A.; Mason, W.; Mariani, L.; et al. MGMT Gene Silencing and Benefit from Temozolomide in Glioblastoma. N. Engl. J. Med. 2005, 352, 997–1003. [Google Scholar] [CrossRef]
  48. Stupp, R.; Mason, W.P.; van den Bent, M.J.; Weller, M.; Fisher, B.; Taphoorn, M.J.B.; Belanger, K.; Brandes, A.A.; Marosi, C.; Bogdahn, U.; et al. Radiotherapy plus Concomitant and Adjuvant Temozolomide for Glioblastoma. N. Engl. J. Med. 2005, 352, 987–996. [Google Scholar] [CrossRef]
Figure 1. High level schematic workflow for feature extraction, selection, compression, model training and testing.
Figure 1. High level schematic workflow for feature extraction, selection, compression, model training and testing.
Cancers 17 03560 g001
Figure 2. UMAP Visualization of Selected Features. The figure displays a visualization of the 2 projected UMAP features derived from the 6 features selected after pair correlation filtering. Each data point is colored to indicate the deviation of the RS from 12 months. RS less than 12 months (high risk) is represented in blue, with darker blue indicating shorter remaining survival. Conversely, RS longer than 12 months (low risk) is represented in red, with deeper shades of red suggesting longer remaining survival.
Figure 2. UMAP Visualization of Selected Features. The figure displays a visualization of the 2 projected UMAP features derived from the 6 features selected after pair correlation filtering. Each data point is colored to indicate the deviation of the RS from 12 months. RS less than 12 months (high risk) is represented in blue, with darker blue indicating shorter remaining survival. Conversely, RS longer than 12 months (low risk) is represented in red, with deeper shades of red suggesting longer remaining survival.
Cancers 17 03560 g002
Figure 3. ML architecture and performance. (a) Multiple layer architecture of Autogluon WeightedEnsemble_L3 model. The bottom layer has 110 models. The second layer has 7 models. (b) AUC_ROC curve of the test data for Autoglueon ML model. Ablation 1 is the test without Stage III of the feature selection in the pipeline; Ablation 2 is the test without manifold learning after feature selection. Chance level is random guess. (c) AUC_ROC curves of the test data for the EBM model.
Figure 3. ML architecture and performance. (a) Multiple layer architecture of Autogluon WeightedEnsemble_L3 model. The bottom layer has 110 models. The second layer has 7 models. (b) AUC_ROC curve of the test data for Autoglueon ML model. Ablation 1 is the test without Stage III of the feature selection in the pipeline; Ablation 2 is the test without manifold learning after feature selection. Chance level is random guess. (c) AUC_ROC curves of the test data for the EBM model.
Cancers 17 03560 g003
Figure 4. EBM model predicted FU map for each patient from the test cohort based on their pre-RT and FU PET images. The X axis reflects the elapsed time in months from pre-RT imaging to the respective FU imaging timepoint. Y axis is the anonymized patient ID. For each Pre-RT or FU PET image timepoint, dark blue shading indicates a remaining survival prediction by the model to be high risk, whereas light blue shading indicates a low-risk prediction. Cyan diamonds represent the RANO radiographic progression timepoint as determined by conventional MR images. Red square represents the pre-RT images. OS for each patient is listed in the far-right column.
Figure 4. EBM model predicted FU map for each patient from the test cohort based on their pre-RT and FU PET images. The X axis reflects the elapsed time in months from pre-RT imaging to the respective FU imaging timepoint. Y axis is the anonymized patient ID. For each Pre-RT or FU PET image timepoint, dark blue shading indicates a remaining survival prediction by the model to be high risk, whereas light blue shading indicates a low-risk prediction. Cyan diamonds represent the RANO radiographic progression timepoint as determined by conventional MR images. Red square represents the pre-RT images. OS for each patient is listed in the far-right column.
Cancers 17 03560 g004
Figure 5. The contribution of each feature or feature interaction to the model decision for (1) Pre-RT (FU0), (2) FU1 and (3) FU2 images of test patient RT_FDOPA40. The left column (a) shows the weights of each shape function, including both single variable and two-variable interactions, at each image acquisition timepoint. Values above 0 favor a high-risk classification and values below 0 favor a low-risk classification. The right column (b) illustrates a single slice of the tumor at the corresponding timepoints.
Figure 5. The contribution of each feature or feature interaction to the model decision for (1) Pre-RT (FU0), (2) FU1 and (3) FU2 images of test patient RT_FDOPA40. The left column (a) shows the weights of each shape function, including both single variable and two-variable interactions, at each image acquisition timepoint. Values above 0 favor a high-risk classification and values below 0 favor a low-risk classification. The right column (b) illustrates a single slice of the tumor at the corresponding timepoints.
Cancers 17 03560 g005
Figure 6. Clinical utility: (a) Decision curve analysis on the independent test set. Net benefit (y axis) is plotted against threshold probability (x axis) for AutoGluon (red) and EBM (green), with treat-all (orange dashed) and treat-none (blue dashed) reference strategies. Both models yield positive net benefit across clinically plausible thresholds. (b) Kaplan–Meier curves comparing progression-free survival (PFS; RANO-defined) with model-predicted low-risk survival (MLRS).
Figure 6. Clinical utility: (a) Decision curve analysis on the independent test set. Net benefit (y axis) is plotted against threshold probability (x axis) for AutoGluon (red) and EBM (green), with treat-all (orange dashed) and treat-none (blue dashed) reference strategies. Both models yield positive net benefit across clinically plausible thresholds. (b) Kaplan–Meier curves comparing progression-free survival (PFS; RANO-defined) with model-predicted low-risk survival (MLRS).
Cancers 17 03560 g006
Table 1. Selected Features with Optimized Cut Points and p-values. Selected features identified using the in-house developed feature selection algorithm, along with their corresponding optimized cut points for categorizing high vs. low risk, and associated p-values between low risk and high risk data. The feature selection process is entirely based on the training dataset.
Table 1. Selected Features with Optimized Cut Points and p-values. Selected features identified using the in-house developed feature selection algorithm, along with their corresponding optimized cut points for categorizing high vs. low risk, and associated p-values between low risk and high risk data. The feature selection process is entirely based on the training dataset.
CategoryFeaturesOptimized
Cut Point
p-Value
ShapeMeshVolume10.114<0.001
MajorAxisLength0.477<0.001
LeastAxisLength0.194<0.001
First OrderVariance0.001<0.001
Texture[GLCM] Correlation0.003<0.001
[GLRLM] LongRunEmphasis0.499<0.001
Table 2. Comparison between the EBM model predicted high-risk to low-risk transition timepoint based on 18F-DOPA PET images and the RANO progression call based on conventional MR images for different subgroups. For each category, the percentages of patients whose remaining survival is equal or less than 12 months after RANO progression calls or model predicted risk transition are also given.
Table 2. Comparison between the EBM model predicted high-risk to low-risk transition timepoint based on 18F-DOPA PET images and the RANO progression call based on conventional MR images for different subgroups. For each category, the percentages of patients whose remaining survival is equal or less than 12 months after RANO progression calls or model predicted risk transition are also given.
MP Coincident with RANOMP Earlier than RANOMP Later than RANOMissing Transition
Percentage of the cohort population33.3%45.4%6.1%15.2%
RS after RANO PC (months)10.4 ± 6.27.1 ± 5.316.5 ± 3.514.3 ± 5.7
% (RS ≤ 12 months) after RANO PC91%87%0%40%
RS after MP Transition (months)10.4 ± 6.210.1 ± 5.910.5 ± 0.7N/A
% (RS ≤ 12) after MP Transition91%80%100%N/A
MP: Model Prediction. PC: Progressional call. RS: Remaining survival.
Table 3. Feature-importance comparison for the AutoGluon AutoML and EBM models. Importances are computed using each model’s native estimator and normalized to sum to 1. The pairwise interaction term (UMAP x–y) is reported only for EBM and is not available in AutoGluon.
Table 3. Feature-importance comparison for the AutoGluon AutoML and EBM models. Importances are computed using each model’s native estimator and normalized to sum to 1. The pairwise interaction term (UMAP x–y) is reported only for EBM and is not available in AutoGluon.
FeaturesAutogluonEBM
UMAP x0.9100.417
UMAP y0.070.292
UMAP x–y00.291
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Qian, J.; Hasenauer, D.; Breen, W.G.; Brown, P.D.; Hunt, C.H.; Jacobson, M.S.; Johnson, D.R.; Kaufmann, T.J.; Kemp, B.J.; Kizilbash, S.H.; et al. Predicting Remaining Survival of Glioblastoma Patients with Radiomics Analysis Based on 18F-DOPA PET Images. Cancers 2025, 17, 3560. https://doi.org/10.3390/cancers17213560

AMA Style

Qian J, Hasenauer D, Breen WG, Brown PD, Hunt CH, Jacobson MS, Johnson DR, Kaufmann TJ, Kemp BJ, Kizilbash SH, et al. Predicting Remaining Survival of Glioblastoma Patients with Radiomics Analysis Based on 18F-DOPA PET Images. Cancers. 2025; 17(21):3560. https://doi.org/10.3390/cancers17213560

Chicago/Turabian Style

Qian, Jing, Deanna Hasenauer, William G. Breen, Paul D. Brown, Christopher H. Hunt, Mark S. Jacobson, Derek R. Johnson, Timothy J. Kaufmann, Bradley J. Kemp, Sani H. Kizilbash, and et al. 2025. "Predicting Remaining Survival of Glioblastoma Patients with Radiomics Analysis Based on 18F-DOPA PET Images" Cancers 17, no. 21: 3560. https://doi.org/10.3390/cancers17213560

APA Style

Qian, J., Hasenauer, D., Breen, W. G., Brown, P. D., Hunt, C. H., Jacobson, M. S., Johnson, D. R., Kaufmann, T. J., Kemp, B. J., Kizilbash, S. H., Lowe, V. J., Ruff, M. W., Sarkaria, J. N., Uhm, J. H., Zakhary, M. J., Seaberg, M. H., Wan Chan Tseung, H. S., Yan, E. S., Zhang, Y., ... Brinkmann, D. H. (2025). Predicting Remaining Survival of Glioblastoma Patients with Radiomics Analysis Based on 18F-DOPA PET Images. Cancers, 17(21), 3560. https://doi.org/10.3390/cancers17213560

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop