Deep Learning Treatment Recommendations for Patients Diagnosed with Non-Metastatic Castration-Resistant Prostate Cancer Receiving Androgen Deprivation Treatment

Li, Chunyang; Bohman, Julia; Patil, Vikas; Mcshinsky, Richard; Yong, Christina; Burningham, Zach; Samore, Matthew; Halwani, Ahmad S.

doi:10.3390/biomedinformatics5030042

Open AccessArticle

Deep Learning Treatment Recommendations for Patients Diagnosed with Non-Metastatic Castration-Resistant Prostate Cancer Receiving Androgen Deprivation Treatment

by

Chunyang Li

^1,2,*

,

Julia Bohman

^1,2

,

Vikas Patil

^1,2,

Richard Mcshinsky

^1,2,

Christina Yong

^1,2

,

Zach Burningham

^1,2,

Matthew Samore

^1,2 and

Ahmad S. Halwani

^1,2,3

¹

Department of Internal Medicine, University of Utah, Salt Lake City, UT 84112, USA

²

VA Salt Lake City Health Care System, Salt Lake City, UT 84148, USA

³

Huntsman Cancer Institute, Salt Lake City, UT 84112, USA

^*

Author to whom correspondence should be addressed.

BioMedInformatics 2025, 5(3), 42; https://doi.org/10.3390/biomedinformatics5030042

Submission received: 13 June 2025 / Revised: 15 July 2025 / Accepted: 31 July 2025 / Published: 4 August 2025

(This article belongs to the Special Issue Integrating Health Informatics and Artificial Intelligence for Advanced Medicine)

Download

Browse Figures

Versions Notes

Abstract

Background: Prostate cancer (PC) is the second leading cause of cancer-related death in men in the United States. A subset of patients develops non-metastatic, castration-resistant PC (nmCRPC), for which management requires a personalized consideration for appropriate treatment. However, there is no consensus regarding when to switch from androgen deprivation therapy (ADT) to more aggressive treatments like abiraterone or enzalutamide. Methods: We analyzed 5037 nmCRPC patients and employed a Weibull Time to Event Recurrent Neural Network to identify patients who would benefit from switching from ADT to abiraterone/enzalutamide. We evaluated this model using differential treatment benefits measured by the Kaplan–Meier estimation and milestone probabilities. Results: The model achieved an area under the curve of 0.738 (standard deviation (SD): 0.057) for patients treated with abiraterone/enzalutamide and 0.693 (SD: 0.02) for patients exclusively treated with ADT at the 2-year milestone. The model recommended 14% of ADT patients switch to abiraterone/enzalutamide. Analysis showed a statistically significant absolute improvement using model-recommended treatments in progression-free survival (PFS) of 0.24 (95% confidence interval (CI): 0.23–0.24) at the 2-year milestone (PFS rate increasing from 0.50 to 0.74) with a hazard ratio of 0.44 (95% CI: 0.39–0.50). Conclusions: Our model successfully identified nmCRPC patients who would benefit from switching to abiraterone/enzalutamide, demonstrating potential outcome improvements.

Keywords:

deep learning; non-metastatic castration-resistant prostate cancer; survival analysis; electronic health records; treatment recommendation

1. Introduction

Prostate cancer is the second leading cause of cancer-related death and one of the most commonly diagnosed cancers for men in the United States [1,2]. Disease trajectory shows considerable variation across individuals. It is often asymptomatic in early stages and remains indolent for some patients, while for others it behaves aggressively. There is also heterogeneity in the cancer’s change from indolent to aggressive, with some patients experiencing a quick progression at diagnosis, and others progressing suddenly at different points over time [3,4,5].

Progression of prostate cancer is defined by the disease stage and/or response to prior treatment. Patients with localized disease typically receive definitive treatment such as radical prostatectomy or radiation therapy, after which approximately half of patients will experience disease progression [6,7]. Rising prostate-specific antigen (PSA) levels indicate further progression and inform the decision to initiate treatment of androgen deprivation therapy (ADT). After beginning ADT, patients have a median of 19 months before PSA levels again begin to rise, a phenomenon known as castration resistance [8]. This stage—non-metastatic, castration-resistant prostate cancer (nmCRPC)—is defined differently by varying bodies of experts, but by all definitions involves “castrate” levels of testosterone (<50 ng/mL), a notable rise or rises in the PSA level, and no evidence of metastatic disease on imaging [9,10,11]. Despite being crucial to treatment decisions, there is no consistency in how PSA values are tracked over time, with heterogeneity in when measurements are taken and how key metrics like the PSA doubling time (PSADT) are calculated [4]. Research examining nmCRPC treatment practices and outcomes in the Veterans Health Administration (VHA) suggests that providers often do not calculate PSADT, and that patients are often not started on newer treatments, even when their PSADT meets the criteria for high risk [12].

Patients with nmCRPC experience progression-free survival for an average of 25–30 months prior to developing metastatic disease, and 33% of nmCRPC patients progress to bone metastasis within 2 years [13]. A PSADT less than 10 months has been shown to be a clinically significant threshold for identifying patients at the time of nmCRPC with faster progression to metastasis or death [14,15]. Clinical trials have demonstrated an improvement in clinical outcomes when patients with a PSADT ≤ 10 months are treated with more aggressive treatments such as enzalutamide, apalutamide, darolutamide, and others [16,17,18,19]. Patients with less aggressive disease are often maintained on ADT.

Not all patients have the same level of data available for their PSA values over time, and with different formulations for calculating PSADT based on available PSA data [4], the implementation of clinical decision-making based on PSADT ≤ 10 months in real-world settings fails to reflect the knowledge gleaned from clinical trials. Furthermore, decisions to escalate treatment are subject to a variety of factors often not captured in clinical trials, such as advanced age or other demographics [20]. It is thus crucial to identify individual patient risk among the general population beyond those patients represented in clinical trials and academic medical centers.

Artificial intelligence approaches, such as deep learning (DL), have been proposed as the next frontier in supporting clinical decision-making in medicine [21]. Compared to traditional statistical analysis, survival analysis using artificial intelligence allows for more effective use of longitudinal data, requires no a priori assumptions regarding the model, and can leverage high-dimensional data such as radiology and pathology reports. The feasibility of applying DL has been previously explored in prostate cancer, specifically in the diagnosis of disease using radiographic imaging [22], biopsy information [23], stratification of patient risk based on 3-dimensional prostate gland structures [24], contouring of targets for radiation therapy [25], prediction of aggressive disease from molecular features [26], and prediction of survival after different radiotherapy treatments [27]. No studies have attempted to stratify patient risk at the decision-making stage of nmCRPC using longitudinal yet commonly obtained data that are available through the routine care and treatment of patients.

2. Materials and Methods

This study is approved under IRB# 00129914. The analysis was conducted in AWS VA Enterprise Cloud Prospect. The cohort study follows the Strengthening Reporting of Observational Studies in Epidemiology (STROBE) reporting guidelines.

We created a DL model capable of providing treatment recommendations based on sequential disease-based data. Treatment recommendations were based on model predictions of individual benefits of switching to more aggressive treatments than ADT, thereby providing critical information to nmCRPC patients and their providers for making treatment decisions.

Using data from the VHA and pharmacy dispensation records from Veteran Affairs Corporate Data Warehouse, we identified a nationwide cohort of 13,557 patients diagnosed with prostate cancer from 1 January 2006, through 31 December 2019, who later developed nmCRPC. One year after the start of nmCRPC was selected as the landmark time, and patients who developed metastatic disease or died within that 1-year period were excluded. Figure 1 provides the patient cohort flow diagram.

Features for time to event, or progression-free survival (PFS), included age at nmCRPC date, body mass index (BMI), Gleason score, ethnicity, time from prostate cancer diagnosis to nmCRPC, Charlson comorbidity index (CCI) 6 months prior to nmCRPC date, treatment history, and PSA values. Details about feature processing and handling of missing data are provided in Appendix A.

Data were split into 10 folds for a 10-fold cross-validation, with each fold randomly sampled without replacement and stratified by the treatments patients received over time to ensure that the treatment distribution over time within each fold reflected the original dataset. This split enabled balanced treatment distributions among the training, testing, and validation sets. For each cross-validation iteration, eight folds were used for training, one fold for testing and one fold for validation. This process resulted in 10 validation sets that covered the entirety of the dataset. We evaluated the performance of the model by averaging its performance across these sets.

Our outcome of interest was time to metastatic disease or death from the landmark date of one year post-nmCRPC. Two prediction models were separately trained based on two data cohorts, respectively, defined as patients exclusively treated with ADT and patients treated with abiraterone (abi) or enzalutamide (enza). Each cohort was split into training, testing, and validation sets. For patients who received ADT, the individual event probabilities were obtained by combining the predicted event probabilities for the validation set. Their counterfactual event probabilities (i.e., probability of experiencing the event if they had received abi or enza treatment) were obtained by fitting them into the trained prediction model with the abi or enza cohort 10 times and averaging the predicted probabilities.

The deep learning architecture used for our predictive models and treatment recommendation model was the Weibull Time to Event Recurrent Neural Network (WTTE-RNN) model [28], a prediction model initially created for web churn using a DL survival analysis approach. Time to the next event is assumed to follow a Weibull distribution, and a recurrent neural network (RNN) model outputs the distribution scale and shape parameters. Figure 2 shows how the model made predictions using our data. We implemented WTTE-RNN using a gated recurrent unit (GRU) as the RNN. Further details concerning model architecture and hyperparameters used can be found in Appendix B.

Model performance was evaluated by the area under the receiver operating characteristic curve (AUROC) and calibration plots. We based our clinical benefit analysis on Mehltretter et al. [29], adapting their approach to fit a survival analysis scenario. Two years post-nmCRPC was chosen as the milestone time point for our event-free survival evaluation.

We compared the predicted event rate under the hypothetical case wherein all patients were assigned the model-recommended treatment (model-recommended treatment event rate) with the predicted event rate under the observed case where patients were treated with ADT (true event rate). We predicted the event rate under every possible treatment for each patient and took the treatment with the highest PFS probability as the model-recommended treatment per patient. To obtain the model-recommended treatment event rate, the data for each patient were adjusted to reflect their model-recommended treatment and then the model predicted the event rate using these adjusted data. We then found the difference between the mean event rate for the true data and the mean model-recommended treatment event rate. As an additional comparison, we plotted the Kaplan–Meier survival curves for the model-recommended treatment–adjusted data versus the true data.

We also compared the PFS among patients who happened to have received the model-recommended treatment, and the PFS among patients whose true treatments differed from the model-recommended treatment. This approach allowed a comparison of the actual event rate between the group that was concordant with model recommendations by chance and the group that was discordant with model recommendations by chance. Analysis of non-hypothetical cases allowed for better measurement of the DL survival model’s ability to correctly identify patients who were assigned ADT but could have benefitted from the receipt of abi or enza. We only included patients from the ADT cohort for this analysis.

A similar analysis was conducted based on the PSADT ≤ 10 months treatment recommendation rule. A binary indicator was created based on PSADT, with 1 representing patients identified as risky (i.e., needing more aggressive treatment) with PSADT ≤ 10 months, and 0 representing patients identified as not risky (i.e., not requiring treatment beyond ADT) with PSADT > 10 months. However, 57.4% of patients were missing PSADT. Missingness in PSADT is common and often indicative of a better underlying condition as it likely was not necessary to measure PSA as frequently for such patients. We compared 3 different approaches for handling missingness in PSADT: a complete case analysis; encoding the binary indicator arising from PSADT as 0 for patients with missing values; and imputing missing values of PSADT using the MissForest package [30]. Patients with a PSADT binary indicator of 1 who were given new treatments, as well as patients who stayed on ADT and had a PSADT binary indicator of 0, were counted as concordant. Patients who were not given new treatments in response to a PSADT indicator of 1 were counted as discordant. We compared the 2-year milestone event rate differences between these concordant and discordant groups for each of the 3 methods for handling missingness of PSADT. Results from comparing event rates in those concordant or discordant with PSADT-based treatment recommendations were compared with results from the DL survival model event rates in those concordant or discordant with model-recommended treatments.

3. Results

Our final cohort consisted of 5037 patients with nmCRPC. Among these, 1254 patients received abi or enza, and 3783 exclusively received ADT within one year of nmCRPC. The median number of records per patient for individuals who received ADT only versus abi/enza within one year of nmCRPC was 9 (IQR: 5–14) versus 21 (IQR: 13–31), respectively, with maximum records of 57 and 82, respectively.

There were 2153 patients (42.7%) who experienced at least one event (i.e., death or metastasis) during the observation period. Of patients who experienced an event, 731 (14.5%) developed metastatic disease and 1842 (36.6%) died during the observation period. The median follow-up time was 2.5 years (interquartile range [IQR]: 1.5–4.3 years). Baseline characteristics of the study cohort are summarized in Table 1. There was a high degree of missingness for the Gleason score. Details around how missing values were filled in are in Appendix A.

The average AUROC for the model trained on the abi/enza cohort at the 2-year milestone was 0.738 (standard deviation [SD]: 0.057), and the average AUROC for the model trained on the ADT cohort at the 2-year milestone was 0.693 (SD: 0.02). We determined variable importance by excluding one variable at a time from the model and comparing each resulting AUROC against the AUROC of the model with all variables included. In these comparisons, the larger the drop in AUROC, the more important the variable. We ran the analysis 100 times and took the average variable importance of each variable, which are displayed in Figure 3. The prediction target for calculating importance was death or metastasis within 2 years of nmCRPC. Treatment was identified as the most important variable, followed by the PSA and BMI for the abi/enza cohort. The treatment feature reflects patients’ treatment history, including the duration of ADT prior to receiving abi/enza, as well as other treatments administered. No information around treatments collected after the time of nmCRPC was included. For the ADT cohort, age was the most important variable, followed by CCI and PSA.

At 2 years, the model recommended that 3254 (86.0%) of 3783 patients stay on ADT for the maximum benefit, and 529 patients (14.0%) would derive a greater benefit from switching to a more aggressive treatment. Patient characteristics of the group recommended to stay on ADT and of those recommended to take abi/enza are detailed in Appendix C.

Within the ADT cohort, the difference between the rate of PFS at the 2-year milestone based on true data versus the data adjusted to reflect model treatment recommendations was 0.1, with a PFS rate of 0.703 for the unadjusted data and a PFS rate of 0.803 for the adjusted data.

The rate of PFS at 2 years among patients concordant with model treatment recommendations, i.e., patients given ADT for whom the model recommended ADT, was 0.74. The rate of PFS at 2 years among patients discordant with model treatment recommendations was 0.50. We observed a significant absolute PFS rate improvement of 0.24 (95% CI: 0.23–0.24), a relative reduction of 47.32%, and an odds ratio of 0.36. The hazard ratio for those whose treatments were concordant versus discordant with the model recommendation was 0.44 (95% CI: 0.39–0.50).

The complete case analysis for adherence to PSADT-based treatment recommendations had a PFS rate at the 2-year milestone of 0.73 among concordant patients and a PFS rate of 0.58 among discordant patients. The absolute improvement was 0.15 (95% CI: 0.08–0.19), a relative improvement of 35.38%, and an odds ratio of 0.52. The hazard ratio for those whose treatment was concordant versus discordant with the PSADT-recommended treatment was 0.69 (95% CI: 0.58–0.82).

In Table 2, we report results for patients concordant/discordant with model-recommended treatments versus patients concordant/discordant with PSADT-based treatment recommendations. The rate improvements and hazard ratios for those whose treatments were concordant/discordant with model-recommended treatments are significantly higher than those concordant/discordant with PSADT-based recommended treatments both in the complete case analysis and using imputed PSADT. Interestingly, imputing the binary indicator arising from a PSADT with 0 yielded better results than using imputed values from MissForest to create the binary indicator.

We also compared differences in concordant/discordant patients according to model-recommended treatments versus PSADT-based recommendations by plotting Kaplan–Meier survival curves, as shown in Figure 4. Discordant patients demonstrated worse survival outcomes under both paradigms. Further details comparing PSADT with the DL model are in Appendix D.

4. Discussion

We used DL to create a personalized medicine model that can identify patients with nmCRPC who would likely benefit from switching from ADT to abi or enza. We demonstrated that our model can recommend treatment and potentially improve patient outcomes using a few commonly available and easily extractable clinical features, allowing for easy clinical implementation and interpretation.

The current clinical standard for treatment recommendations is based on PSADT. However, the PSA is not always well measured across patients, and it can be difficult for conventional algorithms to accurately capture how the PSA changes over time and in response to treatment [4]. We aimed to incorporate change in the PSA, alongside treatment, as time-series inputs used to predict individual patients’ outcomes under different treatments received after nmCRPC prognosis, thereby yielding personalized treatment recommendations. We were able to make individualized predictions and recommendations, demonstrating the average potential clinical benefits at a group level as well as examining group differences. We were able to obtain a significantly greater PFS based on concordance with the DL model recommendations compared to concordance with recommendations based on PSADT. Through this thorough investigation of potential clinical benefit, we sought to facilitate an understanding of DL prediction and increase clinician trust in the prognostic products resulting from DL methods.

We adapted the approach applied in Mehltretter et al. [29] to a survival analysis setting using time-series EHR data. We applied WTTE-RNN, a model that has been used effectively in prognosis prediction for breast cancer patients [31] and predicting equipment failure [32]. To our knowledge, ours is the first study to apply this model in prostate cancer. WTTE-RNN can automatically process time-series information, eliminating the need to perform manual feature engineering. In addition, this method provides a fuller picture, capturing more patient-level characteristics than PSADT. The model also offers flexibility in use, since the time-dependent hazards are obtained from Weibull shape and scale parameters; it can provide predictions to an infinite time domain without an increasing number of neuron units that could lead to model overfitting.

As seen in Table 1, there are significant differences in key characteristics like BMI, Gleason score, and PSADT between the ADT cohort versus the abi/enza cohort. Such differences highlight the importance of restricting the analysis of potential benefits from following treatment recommendations based on the DL model to just the ADT cohort.

A limitation of this study is the observational nature of our dataset; patients who received different treatments may have other fundamental differences. Our analysis of potential clinical benefit did not control for potential confounding. While it is suggestive of benefits arising from more aggressive treatments for specific individuals based on deep learning recommendations, it does not definitively prove the existence of clinically significant benefits.

In clinical practice, most patients are treated with ADT after their nmCRPC date. Previous work has found that 60.6% of patients with nmCRPC are treated with ADT in VHA hospitals [12]. Those who are given more aggressive treatments, such as second-generation antiandrogens, are typically patients with severe disease and a less favorable prognosis. This limits possible inferences from our model, and we therefore chose to examine only patients receiving ADT, who likely form a more homogenous cohort.

We have laid critical groundwork for future use of a treatment recommendation model as a tool in a clinical context. An important next step for clinical adoption is further improving the AUROC of the model. Currently the model achieves an AUROC around 0.7, whereas clinicians would require a higher level of accuracy. Incorporating more data features could potentially boost model performance. Future work includes adding more temporal features to the DL model, such as more lab results, clinical visit notes, related ICD codes, etc. With more features available, more investigations could be performed to determine the relative contributions of each set of predictor variables in predicting patient outcomes, as well as increasing model performance. After adding more features, validating the model using an external dataset would be the next step in translating this tool into clinical practice to inform treatment decisions for patients with nmCRPC. Additional steps needed before implementing this model in clinical practice in the general US population would include prospective validation studies using both VHA and non-VHA cohorts, as well as explorations of how clinicians could most effectively and efficiently operationalize the model within a shared decision framework.

5. Conclusions

In this study, we demonstrated that a DL survival model using time-series EHR data to predict the risk of metastatic disease or death in patients with nmCRPC can make individualized treatment recommendations for those exclusively treated with ADT. If incorporated within EHR systems, the model’s predictions and recommendations can provide valuable information to guide clinicians and patients in making treatment decisions.

Author Contributions

Conceptualization, C.L., J.B., Z.B. and A.S.H.; Data curation, V.P. and R.M.; Formal analysis, C.L.; Methodology, C.L.; Supervision, Z.B. and A.S.H.; Writing—original draft, C.L. and J.B.; Writing—review and editing, C.L., J.B., C.Y. and M.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

This study is approved under IRB #00129914.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data include protected health information and are not available. Detailed information on model tuning and implementation is available upon request.

Acknowledgments

We would like to thank Siamack Ayandeh for the creation of the analytics study mart environment and the Veterans Health Administration Office of Research and Development for funding cloud credits.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

abi	abiraterone
ADT	androgen deprivation therapy
AUROC	area under the receiving operating characteristic
BMI	body mass index
CCI	Charlson comorbidity index
CI	confidence interval
DL	deep learning
EHR	electronic heath records
enza	enzalutamide
IQR	interquartile range
nmCRPC	non-metastatic castration resistant prostate cancer
PC	prostate cancer
PFS	progression free survival
PSA	prostate specific antigen
PSADT	prostate specific antigen doubling time
RNN	recurrent neural network
SD	standard deviation
VHA	Veterans Health Administration
WTTE-RNN	Weibull time to event recurrent neural network

Appendix A

Features for time to event, or progression-free survival (PFS), were either static or time-series. Static features included age at nmCRPC date, body mass index (BMI), Gleason score, ethnicity, time from prostate cancer diagnosis to nmCRPC, and Charlson comorbidity index (CCI) 6 months prior to nmCRPC diagnosis date. Time-series features included treatment, PSA, number of days from prostate cancer diagnosis to the record time, and nmCRPC status. Variation in PSA under various treatments was included to inform the model how aggressive and or refractory the disease was to current treatment at any point in time. We coded the nmCRPC status as a longitudinal indicator with three categories: pre-nmCRPC, onset of nmCRPC, and post-start of nmCRPC. For prediction, time-series data was exclusively taken from the period between the diagnosis of PC to initiation of ADT or abi/enza. Categorical variables were converted to binary indicators, and all variables were standardized to the same unit.

Missing data for static features was imputed using the mean (continuous features) or median/mode (discrete features) values within each fold of the data split. For time-series features, the missing values were filled in using the latest available non-missing value prior to the missing value. If no prior value was available, missing values were imputed with the mean values within each fold. We adopted this imputation approach to avoid temporal information leakage and leakage from the test set to the training set.

Appendix B

The WTTE-RNN model provides a survival function arising from defining a Weibull distribution with the scale and shape parameters,

α

and

β

, outputted from the RNN model. The survival function, hazard function, and cumulative hazard are defined as follows:

\begin{array}{l} S (t| x) = e x p [{(- \frac{t}{α})}^{β}] \\ λ (t| x) = {(\frac{t}{α})}^{β - 1} \cdot \frac{β}{α} \\ Λ (t| x) = {(\frac{t}{α})}^{β} \end{array}

The loss function is defined as log-likelihood loss, derived from the Weibull distribution to account for censored data. Density at points is compared for uncensored data and probability over regions was considered for uncensored data. The optimization problem for the RNN in the continuous and discrete cases is defined as follows:

\begin{array}{l} \underset{w}{m a x i m i z e l o} g (L (w, y, u, x)) : = \sum_{t = 0}^{T} (u_{t} \cdot [β_{t} \cdot l o g (\frac{y_{t}}{α_{t}}) + \log (β_{t})] - {(\frac{y_{t}}{α_{t}})}^{β_{t}}) \\ \underset{w}{m a x i m i z e l o} g (L_{d} (w, y, u, x)) : = \sum_{t = 0}^{T} (u_{t} \cdot [e x p [{(\frac{y_{t} + 1}{α_{t}})}^{β_{t}} - {(\frac{y_{t}}{α_{t}})}^{β_{t}}] - 1] - {(\frac{y_{t} + 1}{α_{t}})}^{β_{t}}) \end{array}

where

t

is time steps of a sequence,

w

is parameters for

R N N, y_{t} \in [0, \infty)

is time to event at time step

t; u_{t} \in {0,1}

is a censoring indicator with 0 indicating right censoring and 1 otherwise; and

x

is feature vector. In our case, we discretized time to event into months.

For each individual, WTTE-RNN predicted an a and b describing the individual time to event distribution. Each patient was assigned all treatments:

i, j, k

etc. For each treatment

i

, we had an independent event function −

E (t | x) = 1 - S (t | x) = 1 - e x p [{(- \frac{t}{α})}^{β}]

.

We first passed a patient through the trained DL model with treatment

i

and again with treatment

j

. The event probabilities

E (t | x, t r t = i)

and

E (t | x, t r t = j)

(defined below) for each treatment at the same pre-defined milestone times were compared. If

E (t| x, t r t = i) > E (t | x, t r t = j)

, then treatment

i

led to a higher risk for the patient than treatment

j

; therefore, treatment

j

was deemed the more beneficial treatment option for the patient.

\begin{array}{l} E (t∣ x, t r t = i) = 1 - e x p [{(- \frac{t}{α_{i}})}^{β_{i}}] \\ E (t∣ x, t r t = j) = 1 - e x p [{(- \frac{t}{α_{j}})}^{β_{j}}] \end{array}

We used two gated recurrent unit (GRU) layers and the Adam optimizer. Other hyperparameters were tuned according to Table A1.

Table A1. Hyperparmeter tuning.

Hyperparameter	Range	ADT ¹ Cohort	Abi/Enza ² Cohort
Hidden units	(20, 200)	80	30
Learning rate	(0.0001, 0.01)	0.01	0.01
Batch size	(64, 1000)	500	200

¹ Androgen deprivation therapy. ² Abiraterone or enzalutamide.

Appendix C

Table A2 contains characteristics of patients from the ADT cohort recommended by the model to switch to abi/enza versus patients recommended to stay on ADT.

Table A2. ADT cohort characteristics.

	Overall	Recommended ADT	Recommended Abi/Enza
	(N = 3783)	(N = 3254)	(N = 529)	p-Value
Demographics
Race/Ethnicity, N (%)
Non-Hispanic White	2033 (56.2)	1760 (56.6)	273 (53.5)	<0.001
Black	1177 (32.5)	970 (31.2)	207 (40.6)
Hispanic	252 (7.0)	235 (7.6)	17 (3.3)
Others	158 (4.4)	145 (4.7)	13 (2.5)
Age at nmCRPC, median [IQR]	70.0 [65.0–78.0]	69.0 [64.0–75.0]	82.0 [75.0–86.0]	<0.001
BMI, mean (SD) †	29.6 (6.0)	30.0 (6.0)	27.4 (5.7)	<0.001
Clinical Characteristics
Gleason score, median [IQR] ‡	7.0 [7.0–8.0]	7.0 [7.0–8.0]	7.0 [7.0–8.0]	0.001
Disease characteristics at nmCRPC
PSA doubling time €				<0.001
>10 months	1162 (72.5)	994 (74.5)	168 (62.7)
≤10 months	441 (27.5)	341 (25.5)	100 (37.3)
Metastasis or death, N (%)				<0.001
No	2351 (62.1)	2129 (65.4)	222 (42.0)
Yes	1432 (37.9)	1125 (34.6)	307 (58.0)

Footnotes: † 2 missing values. ‡ 478 missing values. € 2180 missing values. IQR = interquartile range; SD = standard deviation; ADT = androgen deprivation therapy; Abi = abiraterone; Enza = enzalutamide.

Appendix D

Table A3 provides median survival times as well as the p-values for log-rank tests corresponding to differences in patients concordant/discordant with model vs. PSADT treatment recommendations.

Table A3. Survival times.

Model	Median Survival Time in Years		Log-Rank p-Value
Model	Concordant	Discordant	Log-Rank p-Value
WTTE-RNN ¹	6.01	2.31	<0.005
PSADT ² complete case	5.17	3.52	<0.005
PSADT MissForest imputed	5.32	5.29	0.64
PSADT 0 imputed	5.52	3.52	<0.005

¹ Weibull Time to Event Recurrent Neural Network. ² Prostate specific antigen doubling time.

References

Bray, F.; Laversanne, M.; Sung, H.; Ferlay, J.; Siegel, R.L.; Soerjomataram, I.; Jemal, A. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2024, 74, 229–263. [Google Scholar] [CrossRef]
Siegel, R.L.; Giaquinto, A.N.; Jemal, A. Cancer statistics, 2024. CA Cancer J. Clin. 2024, 74, 12–49. [Google Scholar] [CrossRef]
Spratt, D.E.; Zumsteg, Z.S.; Feng, F.Y.; Tomlins, S.A. Translational and clinical implications of the genetic landscape of prostate cancer. Nat. Rev. Clin. Oncol. 2016, 13, 597–610. [Google Scholar] [CrossRef] [PubMed]
Rosinha, A.; Rabaca, C.; Calais, F.; Pinto, J.M.; Barreira, J.V.; Fernandes, R.; Ramos, R.; Fialho, A.C.; Palma dos Reis, J. Improving the identification of high-risk non-metastatic castration-resistant prostate cancer patients in clinical practice. Front. Oncol. 2024, 13, 1266369. [Google Scholar] [CrossRef] [PubMed]
Ziaran, S.; Varchulova Novakova, Z.; Bohmer, D.; Danisovic, L. Biomarkers for determination prostate cancer: Implication for diagnosis and prognosis. Neoplasma 2015, 62, 683–691. [Google Scholar] [CrossRef] [PubMed]
Amling, C.L.; Bergstralh, E.J.; Blute, M.L.; Slezak, J.M.; Zincke, H. Defining prostate specific antigen progression after radical prostatectomy: What is the most appropriate cut point? J. Urol. 2001, 165, 1146–1151. [Google Scholar] [CrossRef]
Lee, W.R.; Hanks, G.E.; Hanlon, A. Increasing prostate-specific antigen profile following definitive radiation therapy for localized prostate cancer: Clinical observations. J. Clin. Oncol. 1997, 15, 230–238. [Google Scholar] [CrossRef]
Sharifi, N.; Dahut, W.L.; Steinberg, S.M.; Figg, W.D.; Tarassoff, C.; Arlen, P.; Gulley, J.L. A retrospective study of the time to clinical endpoints for advanced prostate cancer. BJU Int. 2005, 96, 985–989. [Google Scholar] [CrossRef]
Lowrance, W.T.; Murad, M.H.; Oh, W.K.; Jarrard, D.F.; Resnick, M.J.; Cookson, M.S. Castration-resistant prostate cancer: AUA guideline amendment 2018. J. Urol. 2018, 200, 1264–1272. [Google Scholar] [CrossRef]
Scher, H.I.; Morris, M.J.; Stadler, W.M.; Higano, C.; Basch, E.; Fizazi, K.; Antonarakis, E.S.; Beer, T.M.; Carducci, M.A.; Chi, K.N.; et al. Trial design and objectives for castration-resistant prostate cancer: Updated recommendations from the Prostate Cancer Clinical Trials Working Group 3. J. Clin. Oncol. 2016, 34, 1402–1418. [Google Scholar] [CrossRef]
Halwani, A.S.; Rasmussen, K.M.; Patil, V.; Li, C.C.; Yong, C.M.; Burningham, Z.; Gupta, S.; Narayanan, S.; Lin, S.-W.; Carroll, S.; et al. Real-world practice patterns in veterans with metastatic castration-resistant prostate cancer. In Urologic Oncology: Seminars and Original Investigations; Elsevier: Amsterdam, The Netherlands, 2020; pp. 1.e1–1.e10. [Google Scholar]
Halwani, A.S.; Patil, V.; Morreall, D.; Li, C.; Yong, C.; Appukkutan, S.; Partridge, J.; Jhaveri, J.; Rasmussen, K.M. Real-world treatment patterns among veterans with nonmetastatic castration-resistant prostate cancer (nmCRPC). J. Clin. Oncol. 2022, 40, e17042. [Google Scholar] [CrossRef]
Smith, M.R.; Kabbinavar, F.; Saad, F.; Hussain, A.; Gittelman, M.C.; Bilhartz, D.L.; Wynne, C.; Murray, R.; Zinner, N.R.; Schulman, C.; et al. Natural history of rising serum prostate-specific antigen in men with castrate nonmetastatic prostate cancer. J. Clin. Oncol. 2005, 23, 2918–2925. [Google Scholar] [CrossRef] [PubMed]
Arnold, P.; Penaloza-Ramos, M.C.; Adedokun, L.; Rees, S.; Lockhat, M.; Spary, L.; Watkins, A.; Gnanapragasam, V.; Crabb, S.J. Clinical characteristics and outcomes for patients with non-metastatic castration-resistant prostate cancer. Sci. Rep. 2021, 11, 22151. [Google Scholar] [CrossRef] [PubMed]
Smith, M.R.; Saad, F.; Oudard, S.; Shore, N.; Fizazi, K.; Sieber, P.; Tombal, B.; Damiao, R.; Marx, G.; Miller, K.; et al. Denosumab and bone metastasis–free survival in men with nonmetastatic castration-resistant prostate cancer: Exploratory analyses by baseline prostate-specific antigen doubling time. J. Clin. Oncol. 2013, 31, 3800–3806. [Google Scholar] [CrossRef] [PubMed]
Hussain, M.; Fizazi, K.; Saad, F.; Rathenborg, P.; Shore, N.; Ferreira, U.; Ivashchenko, P.; Demirhan, E.; Modelska, K.; Phung, D.; et al. Enzalutamide in men with nonmetastatic, castration-resistant prostate cancer. N. Engl. J. Med. 2018, 378, 2465–2474. [Google Scholar] [CrossRef]
Smith, M.R.; Saad, F.; Chowdhury, S.; Oudard, S.; Hadaschik, B.A.; Graff, J.N.; Olmos, D.; Mainwaring, P.N.; Lee, J.Y.; Uemura, H.; et al. Apalutamide treatment and metastasis-free survival in prostate cancer. N. Engl. J. Med. 2018, 378, 1408–1418. [Google Scholar] [CrossRef]
Fizazi, K.; Shore, N.; Tammela, T.L.; Ulys, A.; Vjaters, E.; Polyakov, S.; Jievaltas, M.; Luz, M.; Alekseev, B.; Kuss, I.; et al. Darolutamide in nonmetastatic, castration-resistant prostate cancer. N. Engl. J. Med. 2019, 380, 1235–1246. [Google Scholar] [CrossRef]
Fizazi, K.; Shore, N.D.; Tammela, T.; Ulys, A.; Vjaters, E.; Polyakov, S.; Jievaltas, M.; Luz, M.; Alekseev, B.; Kuss, I.; et al. Overall survival (OS) results of phase III ARAMIS study of darolutamide (DARO) added to androgen deprivation therapy (ADT) for nonmetastatic castration-resistant prostate cancer (nmCRPC). J. Clin. Oncol. 2020, 38, 5514. [Google Scholar] [CrossRef]
Saad, F.; Bogemann, M.; Suzuki, K.; Shore, N. Treatment of nonmetastatic castration-resistant prostate cancer: Focus on second-generation androgen receptor inhibitors. Prostate Cancer Prostatic Dis. 2021, 24, 323–334. [Google Scholar] [CrossRef]
Giordano, C.; Brennan, M.; Mohamed, B.; Rashidi, P.; Modave, F.; Tighe, P. Accessing artificial intelligence for clinical decision-making. Front. Digit. Health 2021, 3, 645232. [Google Scholar] [CrossRef]
Campanella, G.; Hanna, M.G.; Geneslaw, L.; Miraflor, A.; Werneck Krauss Silva, V.; Busam, K.J.; Brogi, E.; Reuter, V.E.; Klimstra, D.S.; Fuchs, T.J. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat. Med. 2019, 25, 1301–1309. [Google Scholar] [CrossRef]
Bulten, W.; Pinckaers, H.; van Boven, H.; Vink, R.; de Bel, T.; van Ginneken, B.; van der Laak, J.; de Kaa, C.H.-V.; Litjens, G. Automated gleason grading of prostate biopsies using deep learning. arXiv 2019, arXiv:1907.07980. [Google Scholar] [CrossRef]
Xie, W.; Reder, N.P.; Koyuncu, C.; Leo, P.; Hawley, S.; Huang, H.; Mao, C.; Postupna, N.; Kang, S.; Serafin, R.; et al. Prostate cancer risk stratification via nondestructive 3D pathology with deep learning–assisted gland analysis. Cancer Res. 2022, 82, 334–345. [Google Scholar] [CrossRef]
Almeida, G.; Tavares, J.M.R. Deep learning in radiation oncology treatment planning for prostate cancer: A systematic review. J. Med. Syst. 2020, 44, 179. [Google Scholar] [CrossRef] [PubMed]
Elmarakeby, H.A.; Hwang, J.; Arafeh, R.; Crowdis, J.; Gang, S.; Liu, D.; AlDubayan, S.H.; Salari, K.; Kregel, S.; Richter, C.; et al. Biologically informed deep neural network for prostate cancer discovery. Nature 2021, 598, 348–352. [Google Scholar] [CrossRef] [PubMed]
Hu, L.; Ji, J.; Li, F. Estimating heterogeneous survival treatment effect in observational data using machine learning. Stat. Med. 2021, 40, 4691–4713. [Google Scholar] [CrossRef] [PubMed]
Martinsson, E. WTTE-RNN: Weibull Time to Event Recurrent Neural Network a Model for Sequential Prediction of Time-to-Event in the Case of Discrete or Continuous Censored Data, Recurrent Events or Time-Varying Covariates. Master’s Thesis, University of Gothenburg, Goteborg, Sweden, 2017. [Google Scholar]
Mehltretter, J.; Fratila, R.; Benrimoh, D.; Kapelner, A.; Perlman, K.; Snook, E.; Israel, S.; Miresco, M.; Turecki, G. Differential treatment benefit prediction for treatment selection in depression: A deep learning analysis of STAR*D and CO-MED data. bioRxiv 2019, 4, 679779. [Google Scholar] [CrossRef]
Stekhoven, D.J.; Bühlmann, P. MissForest—Non-parametric missing value imputation for mixed-type data. Bioinformatics 2012, 28, 112–118. [Google Scholar] [CrossRef]
Kim, J.-Y.; Lee, Y.S.; Yu, J.; Park, Y.; Lee, S.K.; Lee, M.; Lee, J.E.; Kim, S.W.; Nam, S.J.; Park, Y.H.; et al. Deep learning-based prediction model for breast cancer recurrence using adjuvant breast cancer cohort in tertiary cancer center registry. Front. Oncol. 2021, 11, 596364. [Google Scholar] [CrossRef]
Li, W.; Cai, J.; Li, H.; Yu, F.; Ding, P. Estimation of Equipment Remaining Useful Life based on WTEE-RNN Method. In Proceedings of the 2022 2nd International Conference on Computational Modeling, Simulation and Data Analysis (CMSDA), Zhuhai, China, 2–4 December 2022; pp. 130–134. [Google Scholar]

Figure 1. Patient cohort flow diagram.

Figure 2. WTTE-RNN model:

X_{t_{1}}, X_{t_{2}}, \dots, X_{t_{n}}

are time-dependent features at time

t_{1}, \dots, t_{n}

, while

h_{0}, h_{1}, \dots, h_{n}

are hidden units passed in the gated recurrent unit (GRU). The outputs are scale and shape parameters,

α

and

β,

for a Weibull distribution that is used to construct a survival curve for the patient.

Figure 2. WTTE-RNN model:

X_{t_{1}}, X_{t_{2}}, \dots, X_{t_{n}}

are time-dependent features at time

t_{1}, \dots, t_{n}

, while

h_{0}, h_{1}, \dots, h_{n}

are hidden units passed in the gated recurrent unit (GRU). The outputs are scale and shape parameters,

α

and

β,

for a Weibull distribution that is used to construct a survival curve for the patient.

Figure 3. Variable importance plot for predicting death or metastasis within 2 years of nmCRPC for the abi/enza cohort (left) and ADT cohort (right). The word “treatment” here refers to treatment history—i.e., the frequency and duration of ADT treatments prior to nmCRPC (and thus prior to receiving abi/enza, for the abi/enza cohort), as well as any other treatments administered before nmCRPC.

Figure 4. Patient outcome concordant vs. discordant with the recommended treatment for model recommendation (left) and PSADT recommendation (right).

Table 1. Baseline characteristics.

	Overall	ADT	Abi/Enza
	(N = 5037)	(N = 3783)	(N = 1254)	p-Value
Demographics
Race/Ethnicity, N (%)
Non-Hispanic White	2828 (56.1)	2033 (53.7)	795 (63.4)	<0.001
Black	1477 (29.3)	1177 (31.1)	300 (23.9)
Hispanic	314 (6.2)	252 (6.7)	62 (4.9)
Others	418 (8.3)	321 (8.5)	97 (7.7)
Age at nmCRPC, median [IQR]	71.0 [65.0–78.0]	70.0 [65.0–78.0]	71.0 [67.0–79.0]	<0.001
BMI, mean (SD) †	29.8 (6.0)	29.6 (6.0)	30.4 (5.9)	<0.001
Clinical Characteristics
Gleason score, median [IQR] ‡	7.0 [7.0–8.0]	7.0 [7.0–8.0]	8.0 [7.0–9.0]	<0.001
Disease characteristics at nmCRPC
PSA doubling time				<0.001
>10 months	1368 (63.8)	1162 (72.5)	206 (38.1)
≤10 months	776 (36.2)	441 (27.5)	335 (61.9)
PSA doubling time imputed with MissForest, N (%)				<0.001
>10 months	3374 (67.0)	2733 (72.2)	641 (51.1)
≤10 months	1663 (33.0)	1050 (27.8)	613 (48.9)
Metastasis, N (%)				<0.001
No	4306 (85.5)	3442 (91.0)	864 (68.9)
Yes	731 (14.5)	341 (9.0)	390 (31.1)
Mortality, N (%)				<0.001
Alive	3195 (63.4)	2522 (66.7)	673 (53.7)
Deceased	1842 (36.6)	1261 (33.3)	581 (46.3)

Footnotes: † 2 missing values. ‡ 733 missing values. IQR = interquartile range; SD = standard deviation; ADT = androgen deprivation therapy; Abi = abiraterone; and Enza = enzalutamide.

Table 2. Progression free survival comparison results.

Recommendation Model	Data Cohort	Rate Improvement Mean 95% CI ¹	Percentage Change Mean 95% CI	Odds Ratio Mean 95% CI	Hazard Ratio Mean 95% CI
DL ³ model	All ADT ² patients	0.236 [0.231, 0.242]	47.32% [46.69%, 47.96%]	0.359 [0.352, 0.367]	0.44 [0.39, 0.50]
DL ³ model	PSADT ⁴ available	0.215 [0.208, 0.223]	44.17% [43.17%, 45.17%]	0.396 [0.384, 0.409]	0.48 [0.40, 0.58]
Imputed PSADT	Impute 0	0.133 [0.127, 0.139]	31.79% [30.72%, 32.86%]	0.559 [0.544, 0.573]	0.69 [0.59, 0.80]
Imputed PSADT	MissForest impute	0.016 [0.012, 0.012]	4.90% [3.66%, 6.15%]	0.932 [0.914, 0.949]	0.97 [0.87, 1.09]
PSADT	PSADT available	0.149 [0.143, 0.155]	35.38% [34.28%, 36.48%]	0.517 [0.503,0.531]	0.69 [0.58, 0.82]

¹ Confidence interval. ² Androgen deprivation therapy. ³ Deep learning. ⁴ Prostate-specific antigen doubling time.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, C.; Bohman, J.; Patil, V.; Mcshinsky, R.; Yong, C.; Burningham, Z.; Samore, M.; Halwani, A.S. Deep Learning Treatment Recommendations for Patients Diagnosed with Non-Metastatic Castration-Resistant Prostate Cancer Receiving Androgen Deprivation Treatment. BioMedInformatics 2025, 5, 42. https://doi.org/10.3390/biomedinformatics5030042

AMA Style

Li C, Bohman J, Patil V, Mcshinsky R, Yong C, Burningham Z, Samore M, Halwani AS. Deep Learning Treatment Recommendations for Patients Diagnosed with Non-Metastatic Castration-Resistant Prostate Cancer Receiving Androgen Deprivation Treatment. BioMedInformatics. 2025; 5(3):42. https://doi.org/10.3390/biomedinformatics5030042

Chicago/Turabian Style

Li, Chunyang, Julia Bohman, Vikas Patil, Richard Mcshinsky, Christina Yong, Zach Burningham, Matthew Samore, and Ahmad S. Halwani. 2025. "Deep Learning Treatment Recommendations for Patients Diagnosed with Non-Metastatic Castration-Resistant Prostate Cancer Receiving Androgen Deprivation Treatment" BioMedInformatics 5, no. 3: 42. https://doi.org/10.3390/biomedinformatics5030042

APA Style

Li, C., Bohman, J., Patil, V., Mcshinsky, R., Yong, C., Burningham, Z., Samore, M., & Halwani, A. S. (2025). Deep Learning Treatment Recommendations for Patients Diagnosed with Non-Metastatic Castration-Resistant Prostate Cancer Receiving Androgen Deprivation Treatment. BioMedInformatics, 5(3), 42. https://doi.org/10.3390/biomedinformatics5030042

Article Menu

Deep Learning Treatment Recommendations for Patients Diagnosed with Non-Metastatic Castration-Resistant Prostate Cancer Receiving Androgen Deprivation Treatment

Abstract

1. Introduction

2. Materials and Methods

3. Results

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A

Appendix B

Appendix C

Appendix D

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI