1. Introduction
Prostate cancer (PCa) is the most commonly diagnosed malignancy in men and remains a major contributor to cancer-related mortality worldwide [
1]. Its biological behavior is highly heterogeneous, ranging from indolent disease to aggressive metastatic spread involving bone, lymph nodes, and visceral organs [
2]. PSMA-targeted PET/CT has markedly enhanced diagnostic accuracy by enabling sensitive detection of both primary and metastatic lesions, even at low PSA levels [
3]. However, a subset of patients exhibit PSMA-negative PET/CT findings despite clinical progression [
4,
5], revealing a disconnect between tumor activity and tracer uptake. This highlights a biological gap in lesion-based imaging and underscores the need for systemic biomarkers capable of detecting recurrence when conventional approaches fail. This limitation further suggests that clinically relevant disease biology may extend beyond visually detectable tumor lesions and may instead involve broader systemic alterations not captured by conventional lesion-centered imaging approaches.
Clinical outcomes in PCa are shaped not only by tumor characteristics but also by systemic host factors, including hormonal, metabolic, and immune regulation [
6,
7]. Although androgen deprivation therapy and next-generation antiandrogens have improved survival, many patients with advanced or PSMA-negative disease continue to relapse [
8,
9], reflecting complex tumor–host interactions [
10]. Identifying imaging correlates of these systemic processes may improve prognostication and guide precision treatment. In particular, endocrine and neurohormonal pathways may influence tumor progression, treatment resistance, and systemic adaptation, making endocrine-related imaging biomarkers an area of growing interest in precision oncology.
Radiomics provides a non-invasive approach to quantify tissue heterogeneity from standard imaging [
11]. In PCa, radiomics derived from PSMA-PET and CT has shown potential in tumor characterization, recurrence prediction, and treatment response assessment [
12,
13], yet prior studies largely focused on tumor regions, neglecting the physiological information contained in normal organs. Evidence from other cancers suggests that radiomic features from uninvolved organs can reflect systemic effects of malignancy [
14,
15]. Such findings support the concept that cancer-related imaging phenotypes may emerge not only from tumors themselves, but also from host organs involved in systemic metabolic, endocrine, and inflammatory regulation.
Given the central hormonal and metabolic roles of endocrine organs, including the adrenal glands, thyroid, the hypothalamus–pituitary complex, and testes, subtle imaging changes within these structures may serve as biomarkers of systemic disease activity. Accordingly, we propose a multi-organ radiomics framework that integrates quantitative features from [18F]DCFPyL PSMA-PET/CT with clinical parameters to predict clinical progression in PSMA-negative PCa, aiming to identify endocrine-based imaging signatures of tumor–host interactions. To our knowledge, this represents one of the first studies to investigate endocrine organ radiomics in PSMA-negative prostate cancer using a multi-organ PET/CT-based framework focused on systemic disease characterization and progression prediction.
2. Methods and Materials
The flowchart of the study is presented in
Figure 1, with the methodological workflow illustrated in detail. The following sections provide a comprehensive description of the study methods.
2.1. Patient Dataset
This study used the 101-patient cohort originally reported by Harsini et al. [
5]. All patients had biochemical recurrence following radical prostatectomy and underwent a baseline [
18F]DCFPyL PET/CT scan that was negative according to both visual assessment and SUV-based criteria. The median PSA level at the time of imaging was 0.56 ng/mL (range: 0.4–11.3). Clinical progression (CP) was defined as the radiologic emergence of new lesions on follow-up imaging, including PET/CT, MRI, CT, or bone scan; PSA-only progression was not considered CP. Based on this definition, 67 patients experienced clinical progression during follow-up, while 34 patients remained progression-free. Although 36 patients (36%) received salvage radiotherapy (sRT) within 3 months after imaging, all models were designed to use only baseline variables available at the time of imaging. Accordingly, post-imaging treatment variables (e.g., RT after PET, dose-related parameters) were excluded from all predictive models to prevent temporal data leakage and ensure clinical applicability.
2.2. PET/CT Imaging Protocol
Patients fasted for 4 h before imaging. They received an intravenous injection of 237–474 MBq [18F]DCFPyL, adjusted by body weight with ±10% target activity variation. Imaging was performed 120 min post-injection from the vertex/top of head to mid-thigh/proximal femurs using a Discovery PET/CT 600 or 690 scanner (GE Healthcare). A non-contrast CT was first acquired for localization and attenuation correction using 120 kV, automatic mA selection 30–200 mA, and noise index 20. PET images were then acquired at 2–4 min per bed position, adjusted for patient girth, and reconstructed using ordered subset expectation maximization with point spread function modeling.
2.3. Image Segmentation
Normal endocrine organs were segmented manually on the CT component of each PET/CT scan. The adrenal glands, thyroid, and testes were delineated individually due to their clear anatomical boundaries. Given the limitations of low-dose CT in resolving central brain structures, the hypothalamus and pituitary gland were segmented as a single combined region of interest to reduce partial-volume effects and improve feature stability. All segmentations were performed by a board-certified radiologist with >10 years of oncologic imaging experience, ensuring consistency and anatomical accuracy. Additionally, all PET/CT scans included full cranial coverage up to the vertex, confirming that hypothalamic sellar structures were fully captured and not truncated at the skull base.
2.4. Radiomic Feature Extraction
Radiomic features were extracted from attenuation-corrected PET and CT volumes following Image Biomarker Standardization Initiative (IBSI) recommendations. First-order intensity metrics and second-order texture matrices (GLCM, GLRLM, GLSZM, NGTDM) were computed using the PyRadiomics v3.1.0 library, which adheres to standardized IBSI feature definitions [
16].
2.5. Image Preprocessing
To maximize feature reproducibility, CT volumes were resampled to 1.0 mm isotropic spacing and PET volumes to 2.0 mm spacing. CT intensities were clipped to –500 to 500 HU, and PET intensities were limited to 0–30 SUV. Fixed-bin discretization (25 HU for CT; 0.25 SUV for PET) was applied, and PET images were re-segmented to exclude negative values. All preprocessing steps were standardized across the cohort.
2.6. Data Preprocessing
Clinical, CT-radiomic, and PET-radiomic features were merged by patient identifier. Clinical variables that could introduce temporal leakage, including RT after PET, ADT after PET, and dose-related variables, were removed before model development. Missing values were handled using median imputation, chosen for its robustness in small datasets and reduced sensitivity to outliers. Near-zero variance features were removed using variance thresholding. Highly correlated features were filtered using a Pearson correlation threshold of , retaining only nonredundant predictors. Continuous features were then standardized using z-score normalization.
To prevent information leakage, all preprocessing steps, including median imputation, variance filtering, correlation filtering, and feature scaling, were performed exclusively within the training subset and then applied to the corresponding validation fold.
2.7. Feature Selection
Feature selection was performed independently within the training subset using three predefined methods: ANOVA F-test with SelectKBest, LASSO-based selection using L1-regularized logistic regression with internal cross-validation, and recursive feature elimination using class-weighted logistic regression. The number of selected features was fixed at k = 5 for all models and configurations to reduce model selection bias and improve comparability across experiments. Selected features were recorded to assess feature selection stability across model configuration.
2.8. Model Development
Five machine learning classifiers were evaluated: logistic regression, random forest, gradient boosting, support vector machine with a radial basis function kernel, and histogram-based gradient boosting. Logistic regression and support vector machine models use class weighting to account for class imbalance. Random forest uses balanced subsampling. All models were trained using only the selected features from the training subset.
Model configurations included clinical-only, CT-only, PET-only, CT + clinical, PET + clinical, multi-organ CT-only, multi-organ PET-only, true CT + PET fusion, and CT + PET + clinical fusion models. Endocrine organ radiomic inputs included the adrenal glands, hypothalamus–pituitary (Hyp-Pit) complex, testes, and thyroid.
2.9. Validation Strategy
Model performance was evaluated using a stratified train/test split framework with 70% training data and 30% held-out testing data. All preprocessing operations, including median imputation, feature scaling, and feature selection, were performed exclusively within the training subset prior to application to the independent test set to reduce potential information leakage. Final performance metrics, bootstrap confidence intervals, calibration analysis, and DeLong statistical comparisons were calculated using held-out test set predictions.
2.10. Performance Evaluation and Statistical Analysis
The primary performance metric was the area under the receiver operating characteristic curve (AUC). Secondary metrics included accuracy, sensitivity, specificity, precision, F1-score, Brier score, and log loss. Binary predictions were generated using predicted probabilities and standard classification thresholds. Uncertainty was estimated using bootstrap resampling with 1000 iterations, generating 95% confidence intervals for AUC, accuracy, sensitivity, specificity, precision, F1-score, Brier score, and log loss. Statistical comparison between candidate models and the best clinical-only baseline model was performed using the DeLong test for correlated ROC curves. The clinical-only model was used as the reference comparator.
2.11. Calibration Analysis
Model calibration was assessed using Brier score, log loss, and calibration plots. Calibration curves compared the mean predicted probability with the observed event frequency across probability bins. ROC and calibration plots were generated for the top-performing models.
2.12. Feature Stability and Interpretability
Feature selection stability was assessed by recording the selected features in the held-out test set validation and counting their selection frequency across model configurations. For the final best-performing model, feature importance was estimated using model-specific importance values when available, coefficient magnitudes for linear models, or permutation importance when intrinsic feature importance was not available. SHAP (SHapley Additive exPlanations) analysis was performed for representative top-performing CT-based, PET-based, clinical-only, and multimodal fusion models to evaluate feature importance and model interpretability.
2.13. Noise and Signal Considerations
Because normal endocrine tissues may have low physiologic PSMA uptake, the PET signal in these structures may be influenced by blood pool activity and image noise. Low-dose CT also provides limited soft-tissue contrast, particularly for small structures such as the Hyp-Pit region. To reduce noise-related bias, standardized preprocessing, variance filtering, correlation filtering, fold-specific feature selection, and out-of-fold validation were applied. Nevertheless, the analysis remains exploratory, and the absence of external validation, endocrine laboratory biomarkers, and segmentation reproducibility testing limits biological interpretation.
4. Discussion
This study demonstrates that radiomic profiling of normal endocrine organs may reveal systemic imaging signatures related to clinical progression in men with PSMA-negative prostate cancer. By integrating CT- and PET-derived features from the adrenal glands, Hyp-Pit complex, thyroid, and testes with clinical variables, we observed that structurally normal but hormonally active organs carry quantifiable textural patterns relevant to disease behavior and systemic tumor–host interactions [
14,
15,
17]. The strongest multimodal fusion model, integrating TESTIS_CT and TESTIS_PET radiomics with clinical variables using RFE and gradient boosting, achieved an AUC of 0.758 (95% CI: 0.653–0.849), while the best clinical-only model achieved an AUC of 0.727 (95% CI: 0.618–0.833). Although multimodal integration numerically improved discrimination performance and specificity, the incremental improvement relative to the clinical-only baseline remained modest (ΔAUC = 0.031) and did not reach statistical significance in DeLong analysis (
p = 0.569). Therefore, these findings should be interpreted as exploratory and hypothesis-generating rather than confirmatory. Calibration analysis also demonstrated moderate agreement between predicted probabilities and observed outcomes across the strongest multimodal fusion, PET + clinical, and CT + clinical models, although these findings should be interpreted cautiously given the limited cohort size and absence of external validation.
Among the evaluated endocrine organs, the adrenal glands demonstrated distinct CT- and PET-based radiomic patterns related to clinical progression. CT-derived biomarkers such as first-order kurtosis, HU standard deviation, and GLSZM Zone Entropy were frequently identified in predictive pipelines, suggesting structural heterogeneity and variable tissue organization, while PET-derived features including GLSZM_SizeZoneNonUniformityNormalized and NGTDM_Busyness reflected metabolic heterogeneity and irregular uptake patterns. In the final analysis, ADRENAL_CT + clinical models achieved an AUC of 0.700 (95% CI: 0.590–0.809), whereas ADRENAL_PET + clinical models achieved an AUC of 0.695 (95% CI: 0.593–0.798), both demonstrating relatively high specificity values (0.816 and 0.789, respectively). Although adrenal-only imaging models demonstrated more limited discrimination performance, integration with clinical variables improved overall model balance and stability. Biologically, these findings may reflect adrenal cortical remodeling and endocrine adaptation related to chronic hypothalamic–pituitary–adrenal activation and sustained androgen synthesis. Previous translational studies have demonstrated that, even after castration, the adrenal glands continue to produce androgen precursors such as DHEA and androstenedione through CYP17A-mediated pathways, potentially contributing to persistent androgen receptor signaling and castration-resistant progression [
18,
19]. Accordingly, increased radiomic heterogeneity within the adrenal glands may reflect subtle systemic hormonal remodeling rather than direct tumor involvement. Nevertheless, these biological interpretations remain exploratory, and future studies incorporating endocrine laboratory biomarkers such as cortisol, ACTH, and DHEA-S will be necessary to validate whether adrenal radiomic phenotypes truly reflect HPA axis activity and endocrine adaptation in PSMA-negative prostate cancer.
The hypothalamus–pituitary complex also emerged as an important contributor within several predictive pipelines, particularly when integrated with clinical variables. CT- and PET-derived features such as GLCM Autocorrelation, Idmn, and NGTDM Contrast, which reflect spatial regularity, texture organization, and metabolic heterogeneity, were repeatedly identified among higher-performing models. In the final analysis, HYPO_PIT_CT + clinical models achieved an AUC of 0.709 (95% CI: 0.591–0.813), while HYPO_PIT_PET + clinical models achieved an AUC of 0.726 (95% CI: 0.616–0.830), demonstrating relatively balanced sensitivity and specificity profiles. These findings suggest that radiomic alterations within the Hyp-Pit region may reflect systemic endocrine adaptation related to androgen suppression and disease progression. Chronic GnRH and LH feedback during androgen deprivation therapy has been related to pituitary remodeling and altered vascularity, which may contribute to subtle imaging heterogeneity potentially detectable through radiomic analysis [
2,
20]. When combined with PSA kinetics and treatment-related variables, these imaging signatures may represent indirect markers of endocrine axis adaptation or treatment-related physiologic stress rather than direct tumor-related changes. Nevertheless, the biological interpretation of these findings remains exploratory, and future validation incorporating LH, FSH, GnRH analog treatment data, endocrine biomarkers, and dedicated pituitary MRI correlation will be required to determine whether these radiomic signatures truly reflect hypothalamic–pituitary remodeling in PSMA-negative prostate cancer.
Testis-derived radiomics demonstrated some of the strongest and most consistent predictive performance across CT-based, PET-based, and multimodal fusion pipelines. In the final analysis, the strongest TESTIS_CT + clinical model achieved an AUC of 0.729 (95% CI: 0.619–0.834), while the strongest TESTIS_PET + clinical model achieved an AUC of 0.733 (95% CI: 0.630–0.826). Moreover, the highest-performing overall multimodal fusion model combined TESTIS_CT and TESTIS_PET radiomics with clinical variables and achieved an AUC of 0.758 (95% CI: 0.653–0.849). Key radiomic biomarkers included HU standard deviation, GLCM Contrast, GLCM Autocorrelation, and GLSZM entropy-related features, reflecting tissue heterogeneity, structural irregularity, and gray-level nonuniformity. SHAP analysis further demonstrated that TESTIS-derived radiomic biomarkers consistently contributed among the strongest predictive features across both standalone and fusion models. Biologically, these findings align with the central role of testicular androgen production in sustaining androgen receptor signaling and prostate cancer progression. Chronic androgen deprivation therapy and endocrine adaptation may induce subtle structural remodeling, altered tissue density, fibrosis, or vascular heterogeneity within the testes that become detectable through radiomic analysis. The recurrent selection of TESTIS-derived biomarkers across independent pipelines suggests that testicular imaging phenotypes may reflect systemic hormonal adaptation related to recurrent or treatment-resistant disease rather than direct tumor involvement. Nevertheless, these interpretations remain exploratory, and future studies integrating testosterone levels, endocrine biomarkers, treatment response data, and longitudinal imaging will be required to clarify the biological basis and clinical significance of testicular radiomic heterogeneity in PSMA-negative prostate cancer [
7].
The thyroid gland also demonstrated complementary predictive value, particularly within PET-based and clinically integrated models. In the final analysis, the strongest THYROID_PET + clinical model achieved an AUC of 0.678 (95% CI: 0.546–0.791), while THYROID_CT + clinical models achieved an AUC of 0.642 (95% CI: 0.523–0.751). Relevant radiomic biomarkers included GLSZM SizeZoneNonUniformity, first-order entropy, GLCM Autocorrelation, and texture heterogeneity features related to metabolic variability and signal complexity. Although thyroid-based models did not achieve the same level of performance as TESTIS- or HYPO_PIT-based pipelines, they demonstrated a moderate predictive contribution and relatively balanced specificity profiles. Biologically, these imaging patterns may reflect altered thyroid hormone regulation, vascular remodeling, or systemic inflammatory responses related to androgen deprivation therapy and endocrine adaptation. Experimental evidence suggests that thyroid hormones may enhance androgen signaling through both genomic and non-genomic pathways, including activation of androgen-related transcriptional programs and MAPK/ERK signaling cascades involved in prostate cancer progression [
21]. Accordingly, increased texture entropy or heterogeneous uptake within the thyroid may represent indirect imaging correlates of systemic hormonal disequilibrium rather than organ-specific pathology. Nevertheless, these biological interpretations remain speculative, and future studies integrating thyroid function biomarkers, endocrine laboratory testing, and longitudinal imaging will be required to determine whether thyroid radiomic phenotypes have clinically meaningful associations with progression risk in PSMA-negative prostate cancer.
Integrating findings across endocrine organs suggests that systemic endocrine remodeling may leave quantifiable imaging signatures within hormonally active normal tissues. Across multiple predictive pipelines, recurrent radiomic biomarkers such as autocorrelation, entropy-related features, HU standard deviation, and gray-level nonuniformity were repeatedly related to progression prediction, indicating shared patterns of structural and metabolic heterogeneity. CT-based radiomics generally demonstrated greater structural stability and specificity, whereas PET-based models showed relatively higher sensitivity but appeared more vulnerable to metabolic noise and physiologic uptake variability. Importantly, multimodal fusion models integrating CT radiomics, PET radiomics, and clinical variables consistently achieved the strongest overall predictive performance, while true CT + PET fusion without clinical integration demonstrated substantially lower discrimination. These findings suggest that endocrine organ radiomics may provide complementary system-level information that contextualizes conventional clinicopathologic variables rather than functioning as independent predictive biomarkers.
A major observation of this study was the consistently strong performance of clinical variables across nearly all predictive pipelines. The strongest clinical-only configuration achieved an AUC of 0.727 (95% CI: 0.618–0.833), demonstrating that conventional clinicopathologic predictors retained substantial prognostic value in PSMA-negative prostate cancer. Dominant clinical contributors identified through SHAP analysis included Treatment Category, T Category, pN stage, Site of Lesion, and Primary Treatment, emphasizing the importance of tumor burden, treatment history, and disease stage in recurrence prediction. Although multimodal fusion models achieved slightly higher discrimination performance, DeLong analysis demonstrated that the incremental improvement beyond the clinical-only baseline remained modest and statistically nonsignificant. These findings highlight that the primary contribution of endocrine organ radiomics may lie in complementing and refining established clinical information rather than replacing it.
Biologically, the associations identified in this study should be interpreted cautiously and considered hypothesis-generating. Radiomic biomarkers reflect image-derived structural and textural variation rather than direct hormonal, metabolic, or molecular activity. Although prior biological evidence supports potential links between androgen signaling, adrenal steroidogenesis, thyroid hormone regulation, and endocrine axis remodeling in prostate cancer progression [
19,
20,
21], the present study did not include endocrine laboratory biomarkers, pituitary MRI, or radiogenomic validation. Accordingly, the proposed endocrine-related mechanisms remain indirect and exploratory. Future studies integrating comprehensive hormonal profiling, including TSH, free T
4/T
3, cortisol, DHEA-S, LH, FSH, testosterone, and GnRH-related treatment data, together with external multi-center validation and longitudinal imaging, will be necessary to determine whether endocrine organ radiomic phenotypes truly reflect biologically meaningful systemic adaptation related to progression and treatment resistance in PSMA-negative prostate cancer.
5. Limitations
This study has several important limitations: First, it was a retrospective single-center analysis with a relatively small cohort of PSMA-negative patients, limiting statistical power and generalizability. Accordingly, the findings should be interpreted as exploratory and hypothesis-generating rather than definitive. Although multimodal fusion models numerically improved discrimination performance relative to the clinical-only baseline, these improvements did not reach statistical significance in DeLong analysis, likely reflecting both the modest incremental value of endocrine organ radiomics and the limited statistical power of the cohort. In addition, the absence of external validation and the relatively small sample size contributed to wide confidence intervals across several model configurations.
Manual segmentation of small endocrine structures on low-dose CT may introduce observer variability, partial-volume effects, and segmentation uncertainty, particularly within the hypothalamus–pituitary region. Interobserver reproducibility was not formally assessed. Furthermore, imaging alone cannot reliably identify subclinical endocrine abnormalities, and systematic chart reviews for thyroid disease, thyroid medication use, thyroidectomy, adrenal nodules, or other endocrine disorders were not performed. These factors may have introduced unrecognized biologic and imaging confounders.
Several additional technical and biological limitations should also be considered. The PET signal within normal endocrine and brain tissues is relatively low and partially influenced by physiologic blood pool activity, while low-dose CT provides limited soft-tissue contrast, increasing susceptibility to imaging noise and unstable radiomic measurements despite standardized preprocessing and strict fold-specific validation procedures. Moreover, radiomic biomarkers capture image-derived structural and textural variation rather than direct hormonal or molecular activity. Therefore, the proposed biological interpretation of endocrine organ radiomics as surrogate markers of systemic endocrine remodeling remains indirect and unvalidated. Follow-up imaging was also not protocolized, raising the possibility of lead-time bias, while post-imaging treatment heterogeneity may have influenced progression outcomes despite exclusion of post-imaging treatment variables from predictive modeling to avoid temporal data leakage. Future studies should incorporate external multi-center validation cohorts, endocrine laboratory biomarkers, radiogenomic analysis, interobserver reproducibility testing, and standardized imaging protocols to determine the biological and clinical robustness of these findings.