Abstract
Clinical management of prostate cancer is challenging because of its highly variable natural history and so there is a need for improved predictors of outcome in non-metastatic men at the time of diagnosis. In this study we calculated the model score from the leading clinical multivariable model, PREDICT prostate, and the poor prognosis DESNT molecular subtype, in a combined expression and clinical dataset that were taken from malignant tissue at prostatectomy (n = 359). Both PREDICT score (p < 0.0001, IQR HR = 1.59) and DESNT score (p < 0.0001, IQR HR = 2.08) were significant predictors for time to biochemical recurrence. A joint model combining the continuous PREDICT and DESNT score (p < 0.0001, IQR HR = 1.53 and 1.79, respectively) produced a significantly improved predictor than either model alone (p < 0.001). An increased probability of mortality after diagnosis, as estimated by PREDICT, was characterised by upregulation of cell-cycle related pathways and the downregulation of metabolism and cholesterol biosynthesis. The DESNT molecular subtype has distinct biological characteristics to those associated with the PREDICT model. We conclude that the inclusion of biological information alongside current clinical prognostic tools has the potential to improve the ability to choose the optimal treatment pathway for a patient.
1. Introduction
Prostate cancer is distressingly common (diagnosed in 48,487 of men in UK per year) but not frequently fatal (13% of male cancer deaths) [1]. The progression of prostate cancer is highly heterogeneous [2], and clinical management is challenging [3,4]. It is also estimated, that as many as 50–80% of PSA-detected prostate cancers are clinically irrelevant, that is, even without treatment, they would never have caused any symptoms [5]. This has confounded attempts to develop a consistent and reliable approach to identify aggressive disease. Radical treatment of early prostate cancer, with surgery or radiotherapy, can lead to life changing side-effects of treatment such as impotence or incontinence [6]. There is a need for improved predictors of outcome in non-metastatic men at the time of diagnosis.
One approach is to use the information that is already collected at the point of diagnosis and before treatment, to assess prognosis and the value of treatment. Thurtle et al. (2019) developed an approach, termed ‘PREDICT Prostate’, that modelled, at the time of diagnosis, prostate cancer specific mortality (PCSM) and non-prostate cancer mortality (NPCM) using separate multivariable Cox models within a competing risks framework [7]. The NPCM model utilises the variables age and comorbidity, while the PCSM model combines age, PSA, Gleason grade, clinical T stage, and proportion of positive biopsy cores at the time of diagnosis. The model shows good discrimination in large validation datasets from the UK (n = 3000; C-index = 0.84; 95% CI: 0.82–0.86) [7], Singapore (n = 2546; C-index = 0.84; 95% CI: 0.80–0.87) [7], Sweden (n = 69,206; C-index = 0.85; 95% CI 0.85–0.86) [8], and the United States of America (n = 171,942; C-index = 0.82; 95% CI 0.81–0.83) [9]. It has been endorsed by the National Institute for Health and Care Excellence (NICE) [10] and is available in a user-friendly web interface (https://prostate.predict.nhs.uk/ (accessed on 1 May 2022)). Another approach to improve prediction of outcome is to use additional novel biomarkers [11].
Within any single cancer disease type, sub-classification using molecular markers can be an important way to accurately determine prognosis, optimise treatment pathways, and help develop targeted drugs. In previous work, we have successfully identified a novel aggressive molecular subtype of human prostate cancer, called DESNT, that can predict outcome after radical surgery (prostatectomy) and is associated with metastasis. This was discovered by applying the Bayesian clustering method Latent Process Decomposition to transcriptome data [12,13,14]—this takes into account the heterogeneous composition of prostate cancer. Prostatectomy patients with most of their expression assigned to the DESNT type exhibit poor outcomes relative to other patients (p < 4.28 × 10−5; Log-rank test) and has been validated in eight independent transcriptome datasets. Cancers assigned to the DESNT group have an increased risk of developing metastasis (X2-test, p = 1.86 × 10−3) [13]. The amount of the DESNT signature is an independent prognostic predictor of time to biochemical recurrence (HR = 1.52, 95% CI = [1.36, 1.7], p = 9.0 × 10−14, Cox regression model) [13]. This framework was developed from samples taken at prostatectomy, but we have preliminary data to suggest it’s applicability to biopsies [15]. We are in the process of developing a diagnostic lab to utilise the DESNT framework as an accredited clinical test.
In this work we modelled whether adding the poor prognosis DESNT signature to the PREDICT Prostate algorithm has the potential to improve our ability to predict the overall progress of prostate cancer. Transcriptome data from tumour tissue collected at an initial treatment of proctectomy were used as a proxy for the information that could be gathered from cancerous biopsy tissue at the time of diagnosis. Secondary aims are to determine whether the PREDICT Prostate clinical model can predict disease prognosis after surgical treatment of prostate cancer; and find the similarities and differences in the genes and molecular pathways which drive a higher PREDICT score and characterise the DESNT molecular subtype.
2. Materials and Methods
2.1. Datasets and Filtering
Microarray datasets from prostate tissue were processed and normalised as described in Luca et al. (2020) (Table 1). In brief, Affymetrix microarray dataset was normalised using the RMA algorithm [16] or previous normalised values were used. Only probes corresponding to genes measured by all platforms were retained. The CamCap and CancerMap datasets have 40 patients in common and thus 20 of the common samples were excluded at random from each dataset. The ComBat algorithm [17] from the sva R package and quantile transformation, was used to mitigate study-specific effects. The ethical approvals obtained for each dataset are listed in the original publications.
Table 1.
Transcriptome datasets used. FF = Fresh Frozen.
The combined dataset was filtered to remove samples that were missing one or more of the clinical variables required for the Prostate PREDICT model (patient’s age, T-stage, PSA and the Gleason histological grade group). Only primary tumour tissue from the prostate were included. Duplicate samples were also removed. For the Stephenson dataset only Gleason sum was available, so 44 samples were removed that had a Gleason sum of seven. The resulting dataset consists of 359 samples.
2.2. R Implementation of the Prostate PREDICT Model
The Prostate PREDICT model was originally implemented in the language STRATA [7]. We have translated this to the freely available open-source R statistical programming language [21] and made the code available (https://doi.org/10.5281/zenodo.7248417 accessed on 25 October 2022). Our implementation of the Prostate PREDICT model was extensively verified by comparing the results produced by those of the PREDICT Prostate webpage tool (https://prostate.predict.nhs.uk/tool accessed on 1 May 2022) for a wide variety of inputs. The results were identical, for example, when age = 75, T-stage = 2, PSA = 25 and Gleason score = 4 + 3, the 10-year predicted survival from initial conservative management was 55% via the webpage tool and 0.549 in R. We also examined how the R version PREDICT results vary with clinical variables to ensure that they made logical sense.
As we are interested in reducing radical treatment in prostate cancer the results from the PREDICT model used initial conservative management as the treatment strategy rather than radical treatment. For the datasets used here, comorbidity (the patient had not experienced a hospital admission in the last 2 years for something other than prostate cancer) and detailed biopsy histopathology results were unavailable and so are set to zero, as is done in the online implementation when this information is unavailable. For each sample, the prostate cancer specific mortality probability (PCSM) at ten years after diagnosis (as a percentage) was calculated using as input the associated clinical variables age at diagnosis, PSA at diagnosis, T stage, and prostatectomy Gleason grade group (as a proxy for biopsy Gleason grade group). The non-prostate cancer mortality probability (NPCM, as a percentage) was calculated using age at diagnosis. The PREDICT score, the increase in probability of mortality at 10 years from having prostate cancer, was defined as NPCM-PCSM.
2.3. DESNT Score and Assignment
Latent Process Decomposition (LPD) was applied to the MSKCC dataset [18] to produce the DESNT framework model, exactly as described in Luca et al. (2020) [13]. This model was then applied to the other datasets using the OAS-LPD algorithm, a modified version of the LPD algorithm in which new sample(s) are decomposed into LPD signatures, without retraining the model. Again, as described in Luca et al. (2020). LPD is an unsupervised Bayesian approach which decomposes each sample’s expression into signature expression profiles of each molecular subtype. For each sample a score between 0 and 1 is given for each subtype which represents the proportion of a sample’s expression that is explained by the signature expression profile for that subtype. Here, the proportion of expression assigned to the DESNT subtype is termed the DESNT score and are the exact scores produced in previous work [13]. If the DESNT score is the largest score across the subtypes, the sample is considered a member of the DESNT subtype.
2.4. Differential Gene Expression Analysis
Differentially expressed genes were identified for each comparison using a moderated t-test implemented in the limma (v 3.52.0) R package [22] with a threshold of Benjamin-Hochberg false discovery rate < 0.05.
2.5. Functional Enrichment Analysis
Functional enrichment analysis was performed using the gProfiler2 (v0.2.1) [23] R package utilising the KEGG, RECTOME, and Gene Ontology database for biological process terms. The gSCS (Set Counts and Sizes) correction method was used to determine significantly enriched pathways and ontology terms with significance p < 0.05.
2.6. Statistical Tests
All analyses were performed in R (version 4.1.2) using default parameters unless otherwise stated. Survival analyses were performed using Cox proportional hazards regression models, the log-rank test, and Kaplan–Meier estimator, as implemented in the survival R package with biochemical recurrence after prostatectomy as the end point. Pairwise comparisons of Kaplan–Meier curves using Log-Rank test were performed using the SurvMiner (v 0.4.9), with p-values adjusted using the Benjamini-Hochberg multiple testing correction. All plots were created using ggplot2 (v 3.3.6). All statistical tests performed were two-sided non-parametric tests unless otherwise stated.
3. Results
3.1. Data Overview
We combined transcriptome data from malignant samples taken at an initial treatment of prostatectomy from four studies: the MSKCC [18], CancerMap [12], Stephenson [19] and CamCap [20] studies (Table 1). These were filtered to have results from one primary sample per patient with the required clinical information required for the Prostate PREDICT model (n = 359; Table 2). The proportion of expression assigned to the DESNT poor prognosis molecular subtype (DESNT score) were gathered from previous results [13]. For each sample, the prostate cancer specific mortality probability (PCSM) at ten years after diagnosis (as a percentage) was calculated using an implementation of the Prostate PREDICT model in R (see methods), under the assumption of initial conservative management, using as input the associated clinical variables: age at diagnosis, PSA at diagnosis, clinical T stage, and prostatectomy Gleason grade group (as a proxy for biopsy Gleason grade group). The equivalent expected non-prostate cancer mortality (NPCM) was calculated using age at diagnosis. The PREDICT score, the increase in probability of mortality at 10 years caused from having prostate cancer, was defined as NPCM-PCSM.
Table 2.
Summary of clinical variables of cohorts. BCR = Biochemical recurrence after prostatectomy as defined by two PSA measurements at values greater than or equal to 0.2 ng/mL. BCR/Follow up is time to biochemical recurrence or last clinical update. T-stage = clinical tumour stage. The PREDICT score, the percentage increase in probability of mortality at 10 years from having prostate cancer, is defined as the non-prostate cancer mortality minus prostate cancer specific mortality at 10 years. DESNT score is the proportion of expression assigned to the DESNT poor prognosis molecular subtype.
DESNT scores from our combined dataset had a median value of 0.09 and an interquartile range of 0.32. PREDICT scores had a median value of 5.84 and an interquartile range of 3.24. There was a weak correlation between DESNT score and PREDICT score (Figure 1A; rho = 0.21; p < 0.05; Spearmen’s correlation). The DESNT score is very variable with respect to the PREDICT score (Figure 1B). The PREDICT score showed a statistically significant increase in samples that were DESNT cancers, i.e., where the proportion assigned to the DESNT subtype was higher than all other subtypes in the framework (Figure 2; p < 0.001; Mann–Whitney U test; difference in medians = 1.93).
Figure 1.
The relationship between Prostate PREDICT score and DESNT score. (A) Scatter plot and distribution. (B) A waterfall plot showing how the DESNT score varies with PREDICT score. The horizontal line at 5.8 represents the median PREDICT score. The PREDICT score is the percentage increase in probability of mortality at 10 years from having prostate cancer defined as the non-prostate cancer mortality minus prostate cancer specific mortality at 10 years. DESNT score is the proportion of expression that is explained by the signature expression profile of the DESNT molecular subtype.
Figure 2.
Differences in PREDICT score between prostate cancer samples assigned to DESNT and those not. Samples assigned to DESNT are those where the largest proportion of expression is explained by the expression signature of the DESNT subtype. The PREDICT score is the increase in probability of mortality at 10 years caused by having prostate cancer defined as the non-prostate cancer mortality minus prostate cancer specific mortality at 10 years.
3.2. Predictive Ability of PREDICT and DESNT Score to Predict Time to Biochemical Recurrence
Both PREDICT score and DESNT score, when applied in separate models, have a significant association with time to biochemical recurrence (PREDICT: p < 0.0001, IQR HR = 1.59 [95% CI 1.43–1.76]; DESNT: p < 0.0001, IQR HR = 2.08 [95% CI 1.58–2.76]; Cox proportional hazards regression). A joint Cox proportional hazards model built with the two continuous independent variables PREDICT and DESNT score (p < 0.0001, IQR HR = 1.53 and 1.79, respectively; Table 3) was significantly better at predicting biochemical recurrence outcome than PREDICT score (p < 0.001; likelihood ratio test) or DESNT score (p < 0.001) alone. To illustrate this, samples were categorised into DESNT cancers or non-DESNT cancers, and upper PREDICT score or lower PREDICT score (split around the median; Table 4). A Kaplan-Meir plot shows clear delineation between each combination of groups (Figure 3; Log-rank p-value < 0.001). At five years, the estimated proportion that are biochemical recurrence free are 92% (Lower PREDICT score & Not DESNT), 65% (Upper PREDICT score & Not DESNT), 56% (Lower PREDICT score & DESNT), and 38% (Upper PREDICT score & DESNT). Pair-wise, all Kaplan–Meier curves are significantly different (p < 0.001; log-rank test; Benjamini-Hochberg adjusted p-values) apart from “Lower PREDICT score & DESNT” vs. “Upper PREDICT score & DESNT” (p = 0.18) and “Lower PREDICT score & DESNT” vs. “Upper PREDICT score & Not DESNT” (p = 0.53).
Table 3.
Summary of Cox proportional hazard model combining PREDICT score and DESNT score. Endpoint is time to biochemical recurrence.
Table 4.
Discretisation of samples into DESNT vs. non-DESNT (based on the dominant subtype expression signature) and upper and lower PREDICT score (split around the median PREDICT score).
Figure 3.
Kaplan–Meier plot showing the survival curves for samples grouped by DESNT and Prostate PREDICT model status. Endpoint is time to biochemical recurrence (BCR). Samples are divided into DESNT vs. non-DESNT and upper PREDICT vs. lower PREDICT (split around the median PREDICT score). The “At Risk” table below the plot shows the number of patients in each group, at the corresponding time point, that have not had a biochemical recurrence event and have longer follow up than that time. The “Events” table shows the cumulative number of biochemical recurrence events observed in a group at that time point.
3.3. Characterisation of the Genes and Biological Processes behind PREDICT and DESNT Scores
To biologically characterise the PREDICT score we compared the expression profiles from samples with the top 25% PREDICT score versus the bottom 25%. We found 451 genes to be significantly differentially expressed (287 downregulated; 164 upregulated; adjusted p-values < 0.05; adapted t-test; Table 5; Table S1). 162 pathways or ontological terms were found to be significantly enriched in upregulated genes and 74 with downregulated genes (p < 0.05; Table S2). This corresponded to 63 GO biological process terms, six KEGG pathways, and five Reactome pathways for downregulated genes, and 143 GO biological process terms, four KEGG pathways, and 15 Reactome pathways for downregulated genes (see Table 6 for enriched Reactome pathways).
Table 5.
The top ten differentially expressed genes between the samples with the 25% highest PREDICT scores versus the lowest 25% PREDICT scores, ranked by log2 fold change. p values adjusted for multiple testing correction using the Benjamini-Hochberg algorithm. Whether these genes overlap with the 51 significant differential expressed genes Luca et al. (2020) observed as characteristic of the DESNT subtype and the differentially expressed genes for DESNT found in this study are also shown. Full results in Table S1.
Table 6.
Reactome pathways found to be significantly enriched for significantly upregulated or downregulated genes for PREDICT high score samples versus low score samples. Direction = whether pathway is enriched for downregulated or upregulated genes. Full results including KEGG pathway and GO BP terms in Table S2.
4507 genes were found to be differentially expressed between DESNT vs. non-DESNT samples (1973 downregulated; 2534 upregulated; adjusted p-values < 0.05; adapted t-test; Table 7; Table S3). Of the 51 differential expressed genes Luca et al. (2020) observed as characteristic of the DESNT subtype across multiple datasets (49 down-regulated, two up-regulated), all of them were differentially expressed in this analysis and were altered in the same direction (Table S3). A much larger number of genes were identified here as the DESNT characteristic genes reported in Luca et al. are the overlap of differentially expressed genes from multiple different comparisons in independent cohorts. 449 pathways or ontological terms were found to be significantly enriched in upregulated genes and 1391 with downregulated genes (p < 0.05; Table S4). This corresponded to 1288 GO biological process terms, 58 KEGG pathways, and 45 Reactome pathways for downregulated genes, and 373 GO biological process terms, nine KEGG pathways, and 67 Reactome pathways for upregulated genes (see Table 8 for the top 10 enriched Reactome pathways).
Table 7.
The top ten differentially expressed genes between the samples classified as DESNT samples versus non-DESNT samples, ranked by log2 fold change. p values adjusted for multiple testing correction using the Benjamini-Hochberg algorithm. Whether these genes overlap with the 51 significant differential expressed genes Luca et al. (2020) observed as characteristic of the DESNT subtype and the differentially expressed genes for PREDICT score found in this study are also shown. Full results in Table S3.
Table 8.
The top 10 Reactome pathways found to be uniquely significantly enriched for significantly upregulated or downregulated genes for DESNT sample status, ranked by significance. Direction = whether pathway is enriched for downregulated or upregulated genes. Full results in Table S4.
Of the 451 genes found to be characteristic of PREDICT, the majority (78%) were also found to be differentially expressed in the same direction between DESNT vs. non-DESNT samples (Table S1), but only 8% of DESNT differentially expressed genes were found to be differentially expressed in the same direction between PREDICT score high vs. PREDICT score low samples. Only 24 out of 51 characteristic DESNT genes from Luca et al. (2020) were found to characterise PREDICT. 93 out of 236 (40%) enriched pathways/ontology terms were unique to the PREDICT Score (Table S2; 37% GO biological process terms, 80% KEGG pathways and 40% Reactome pathways). Similarly, 1697 out of 1840 (92%) enriched pathways/ontology terms were unique to DESNT (Table S4; 92% GO biological process terms, 97% KEGG pathways and 89% Reactome pathways). Taken together these results are suggestive that DESNT provides additional information to PREDICT based on the underlying biological processes.
4. Discussion
In this study we examined four large expression data sets that were taken from primary prostate cancer samples at prostatectomy for men with prostate cancer that had not received any other treatment. This data, along with relevant clinical data, was used as a proxy of the biological information that could be gathered from biopsies at the time of diagnosis. We calculated the model score from the leading clinical multivariable model, PREDICT prostate, and the poor prognosis DESNT molecular subtype. We showed the potential for the PREDICT Prostate clinical model to predict disease prognosis after surgical treatment of prostate cancer. We also found that by combining the DESNT score with the PREDICT score produced a significantly better predictor of outcome following prostatectomy. The return of prostate cancer after prostatectomy is an indication that micrometastases were present at the time of surgery [24]—it is estimated that up to 70% of patients have disseminated tumour cells after prostatectomy [25]. Therefore, poor treatment response at prostatectomy may give an indication of overall disease state. Our findings are important because it suggests that we can make a better-informed decision at the time of diagnosis of whether to perform radical treatment or not if molecular information is included.
For the first time the biological mechanisms behind an increased probability of mortality at ten years after diagnosis caused by prostate cancer (i.e., a higher PREDICT score) has been examined. The top 10 differentially expressed genes are the downregulation of ANPEP, CD38, SLC22A3, NPY, MSMB, MT1G, MT1M, KRT15, & SFRP4 and upregulation of COMP.
ANPEP was the top-ranked downregulated gene in both PREDICT and DESNT analyses. Aminopeptidase N (APN) is the enzyme encoded by ANPEP that belongs to a group of widely expressed ectopeptidases [26]. APN is multifunctional for the post-secretory processing of neuropeptides and regulating the access of these molecules to cellular receptors. The role of APN positively associated with intracellular signalling and has been shown to play an important role in metastasis of several malignancies, including prostate cancer through neoangiogenesis [27,28,29]. Sorenson et al. (2013) observed a significant (p < 0.001) downregulation of ANPEP expression in prostate cancer in comparison with non- malignant prostate tissue samples [30]. The authors concluded that negative APN immunoreactivity is a prognostic factor for patients harbouring clinically localised prostate cancer for both recurrence-free and cancer-specific survival endpoints.
CD38 has previously been reported as a marker of the luminal cells in human prostate [31]. Using CD38 as a marker, Liu et al. (2016) identified low expression of the gene in a progenitor-like subset of luminal cells within the human prostate that are capable of initiation of human prostate cancer in an in vivo tissue-regeneration assay [32]. They also demonstrated that luminal cells with low CD38 expression are associated with disease progression and poor survival outcome in prostate cancers.
Neuropeptide Y (NPY) is a gene involved in various physiological and homeostatic processes such as stress response. Liu et al. (2007) observed lower expression levels of NPY that were associated with more aggressive clinical behaviour in prostate cancer [33]. MT1G [34], MSMB [35], SLC22A3 [36], COMP [37] and KRT15 [38] have also been associated with aggressive and/or poor clinical outcome in prostate cancer.
Functional enrichment analysis identified many molecular pathways that were upregulated or downregulated in high PREDICT score samples. This included the upregulation of many cell cycle related pathways, a well-known hallmark of cancer [39] and the downregulation of metabolism and cholesterol biosynthesis. Consistent with this result, Rye et al. found robust and consistent downregulation of nearly all genes in the cholesterol synthesis pathway in prostate cancer [40].
In this study we have also shown that the DESNT molecular subtype has shared and distinct biological characteristics to the general aggressive phenotype picked up by the PREDICT prognosis model. A much larger number of differentially expressed genes and enriched pathways were detected. This suggests that samples assigned as DESNT have expression profiles that are more like each other than samples with similar PREDICT scores, and so there is greater statistically power to detect differences. Only 8% of DESNT differentially expressed genes were found to be differentially expressed in the same direction in PREDICT score high samples. There were also many distinct enriched pathways including the downregulation of signalling pathways and extracellular matrix organisation and upregulation of DNA replication and translation. The DESNT signature has a distinctive biological profile, which is further evidence that it is a valid molecular subtype.
This study has several limitations. Firstly, data comes from prostatectomy samples rather than biopsy samples at diagnosis. This confines the characteristics of the cohort and often Gleason score is upgraded at prostatectomy [41], although prostatectomy was the primary treatment closest to diagnosis for these patients and so is a reasonable proxy to use. Secondly, the full power of the PREDICT model could not be utilised as full diagnostic biopsy information was unavailable for these datasets. Thirdly, biochemical recurrence was used as the clinical endpoint whereas metastatic disease or cancer-specific death would be more informative—PREDICT was not developed or calibrated for biochemical relapse as an outcome hence its performance in this setting has not previously been assessed. Finally, compared to the tens of thousands that the PREDICT model has been validated in, the numbers are relatively low, however we have used robust methods to compensate for this and reported confidence intervals throughout. Despite these limitations, the results support the notion of the potential value of including biological measurements along with the clinical variables collected as part of the standard clinical pathway. Future studies where transcriptome data is generated from a large series of biopsies with good quality clinical data with long follow up would be welcomed.
There is a need for improved predictors of outcome in non-metastatic men at the time of diagnosis to allow the optimal treatment pathway to be chosen. The inclusion of biological information, in particular the DESNT poor prognosis molecular subtype, alongside the best-of-breed clinical prognostic tool, PREDICT prostate, has the potential to make this improvement. This combination has the potential to help avoid unnecessary treatments and life-altering side-effects and improve survival in prostate cancer patients.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/curroncol30010013/s1, Table S1: Significantly differentially expressed genes between the samples with the 25% highest PREDICT scores versus the lowest 25% PREDICT scores; Table S2: KEGG pathways, Reactome pathways, and Gene Ontology biological processes terms found to be significantly enriched for significantly upregulated or downregulated genes for PREDICT high score samples versus low score samples; Table S3: Significantly differentially expressed genes between the samples classified as DESNT samples versus non-DESNT samples; Table S4: KEGG pathways, Reactome pathways, and Gene Ontology biological processes terms found to be significantly enriched for significantly upregulated or downregulated genes for DESNT sample status.
Author Contributions
Conceptualization, D.S.B. and V.J.G.; methodology, D.S.B. and R.C.; software, D.S.B., L.W. and R.C.; formal analysis, D.S.B., L.W. and R.C.; investigation, D.S.B., L.W. and R.C.; resources, D.S.B. and V.J.G.; data curation, D.S.B., L.W. and R.C.; writing—original draft preparation, D.S.B. and L.W.; writing—review and editing, L.W., R.C., V.J.G., C.S.C., J.C. and D.S.B.; visualization, D.S.B. and L.W.; supervision, D.S.B., R.C., J.C. and C.S.C.; project administration, D.S.B., J.C. and C.S.C.; funding acquisition, D.S.B., J.C. and C.S.C. All authors have read and agreed to the published version of the manuscript. All authors critiqued the manuscript for important intellectual content.
Funding
This work was funded by the Bob Champion Cancer Trust, The Masonic Charitable Foundation, The King Family, The Hargrave Foundation and The University of East Anglia. We acknowledge support from Prostate Cancer Research, Movember, Prostate Cancer UK, The Big C Cancer Charity, Cancer Research UK and The Andy Ripley Memorial Fund.
Institutional Review Board Statement
Ethical review and approval were waived for this study due to only publicly available datasets being used. Ethical approval details for each dataset are available in the original publications.
Data Availability Statement
The datasets analysed during the current study are publicly available: MSKCC [18]: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE21034 (accessed on 1 May 2016); CancerMap [12]: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE94767 (accessed on 1 May 2016); Stephenson [19]: Data available from the corresponding author of this paper. CamCap [20]: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE70768 (accessed on 1 May 2016) and https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE70769 (accessed on 1 May 2016).
Acknowledgments
The authors would like to thank those men with prostate cancer and the subjects who have donated their time and samples to the data sets used in this study. Part of the research presented in this paper was carried out on the High Performance Computing Cluster supported by the Research and Specialist Computing Support service at the University of East Anglia.
Conflicts of Interest
C.S.C. and D.S.B. are co-inventors on a patent application from the University of East Anglia on the detection of DESNT prostate cancer. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
References
- Cancer Research UK Prostate cancer statistics. Available online: https://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/prostate-cancer (accessed on 17 August 2021).
- D’Amico, A.V.; Moul, J.; Carroll, P.R.; Sun, L.; Lubeck, D.; Chen, M.-H.; D’Amico, A.V.; Moul, J.; Carroll, P.R.; Sun, L.; et al. Cancer-Specific Mortality After Surgery or Radiation for Patients With Clinically Localized Prostate Cancer Managed During the Prostate-Specific Antigen Era. J. Clin. Oncol. 2003, 21, 2163–2172. [Google Scholar] [CrossRef] [PubMed]
- Attard, G.; Cooper, C.S.; de Bono, J.S. Steroid Hormone Receptors in Prostate Cancer: A Hard Habit to Break? Cancer Cell 2009, 16, 458–462. [Google Scholar] [CrossRef] [PubMed]
- Schiffmann, J.; Wenzel, P.; Salomon, G.; Budäus, L.; Schlomm, T.; Minner, S.; Wittmer, C.; Kraft, S.; Krech, T.; Steurer, S.; et al. Heterogeneity in D’Amico classification-based low-risk prostate cancer: Differences in upgrading and upstaging according to active surveillance eligibility. Urol. Oncol. 2015, 33, 329.e13–329.e19. [Google Scholar] [CrossRef]
- Hamdy, F.C.; Donovan, J.L.; Lane, J.A.; Mason, M.; Metcalfe, C.; Holding, P.; Davis, M.; Peters, T.J.; Turner, E.L.; Martin, R.M.; et al. 10-Year Outcomes after Monitoring, Surgery, or Radiotherapy for Localized Prostate Cancer. N. Engl. J. Med. 2016, 375, 1415–1424. [Google Scholar] [CrossRef]
- Chou, R.; Croswell, J.M.; Dana, T.; Bougatsos, C.; Blazina, I.; Fu, R. Review Annals of Internal Medicine Screening for Prostate Cancer: A Review of the Evidence for the U.S. Preventative Services Task Force. Ann. Intern. Med. 2011, 155, 762–771. [Google Scholar] [CrossRef]
- Thurtle, D.R.; Greenberg, D.C.; Lee, L.S.; Huang, H.H.; Pharoah, P.D.; Gnanapragasam, V.J. Individual prognosis at diagnosis in nonmetastatic prostate cancer: Development and external validation of the PREDICT Prostate multivariable model. PLoS Med. 2019, 16, e1002758. [Google Scholar] [CrossRef]
- Thurtle, D.; Bratt, O.; Stattin, P.; Pharoah, P.; Gnanapragasam, V. Comparative performance and external validation of the multivariable PREDICT Prostate tool for non-metastatic prostate cancer: A study in 69,206 men from Prostate Cancer data Base Sweden (PCBaSe). BMC Med. 2020, 18, 139. [Google Scholar] [CrossRef]
- Lee, C.; Light, A.; Alaa, A.; Thurtle, D.; Schaar, M.V.D.; Gnanapragasam, V.J. Application of a novel machine learning framework for predicting non-metastatic prostate cancer-specific mortality in men using the Surveillance, Epidemiology, and End Results (SEER) database. Lancet Digit. Health 2021, 3, e158–e165. [Google Scholar] [CrossRef]
- NICE. Prostate Cancer: Diagnosis and Management; National Institute for Health and Care Excellence: London, UK, 2019. [Google Scholar]
- Anceschi, U.; Tuderti, G.; Lugnani, F.; Biava, P.M.; Malossini, G.; Luciani, L.; Cai, T.; Marsiliani, D.; Filianoti, A.; Mattevi, D.; et al. Novel Diagnostic Biomarkers of Prostate Cancer: An Update. Curr. Med. Chem. 2019, 26, 1045–1058. [Google Scholar] [CrossRef]
- Luca, B.; Brewer, D.S.; Edwards, D.R.; Edwards, S.; Whitaker, H.C.; Merson, S.; Dennis, N.; Cooper, R.A.; Hazell, S.; Warren, A.Y.; et al. DESNT: A Poor Prognosis Category of Human Prostate Cancer. Eur. Urol. Focus 2018, 4, 842–850. [Google Scholar] [CrossRef]
- Luca, B.; Moulton, V.; Ellis, C.; Edwards, D.R.; Campbell, C.; Cooper, R.A.; Clark, J.; Brewer, D.S.; Cooper, C.S. A novel stratification framework for predicting outcome in patients with prostate cancer. Br. J. Cancer 2020, 122, 1467–1476. [Google Scholar] [CrossRef] [PubMed]
- Luca, B.-A.; Moulton, V.; Ellis, C.; Connell, S.P.; Brewer, D.S.; Cooper, C.S. Convergence of Prognostic Gene Signatures Suggests Underlying Mechanisms of Human Prostate Cancer Progression. Genes 2020, 11, 802. [Google Scholar] [CrossRef] [PubMed]
- Ellis, C. Using Latent Process Decomposition to Classify Prostate and Colorectal Cancers. Ph.D. Thesis, University of East Anglia, School of Computing Sciences, Norwich, UK, 2021. [Google Scholar]
- Irizarry, R.A.; Hobbs, B.; Collin, F.; Beazer-Barclay, Y.D.; Antonellis, K.J.; Scherf, U.; Speed, T.P. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 2003, 4, 249–264. [Google Scholar] [CrossRef]
- Johnson, W.E.; Li, C.; Rabinovic, A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostat. Oxf. Engl. 2007, 8, 118–127. [Google Scholar] [CrossRef]
- Taylor, B.S.; Schultz, N.; Hieronymus, H.; Gopalan, A.; Xiao, Y.; Carver, B.S.; Arora, V.K.; Kaushik, P.; Cerami, E.; Reva, B.; et al. Integrative Genomic Profiling of Human Prostate Cancer. Cancer Cell 2010, 18, 11–22. [Google Scholar] [CrossRef] [PubMed]
- Stephenson, A.J.; Smith, A.; Kattan, M.W.; Satagopan, J.; Reuter, V.E.; Scardino, P.T.; Gerald, W.L. Integration of gene expression profiling and clinical variables to predict prostate carcinoma recurrence after radical prostatectomy. Cancer 2005, 104, 290–298. [Google Scholar] [CrossRef]
- Ross-Adams, H.; Lamb, A.D.D.; Dunning, M.J.J.; Halim, S.; Lindberg, J.; Massie, C.M.M.; Egevad, L.A.A.; Russell, R.; Ramos-Montoya, A.; Vowler, S.L.L.; et al. Integration of copy number and transcriptomics provides risk stratification in prostate cancer: A discovery and validation cohort study. EBioMedicine 2015, 2, 1133–1144. [Google Scholar] [CrossRef]
- R Core Team. R: A Language and Environment for Statistical Computing. Available online: https://www.r-project.org/ (accessed on 7 April 2020).
- Ritchie, M.E.; Phipson, B.; Wu, D.; Hu, Y.; Law, C.W.; Shi, W.; Smyth, G.K. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015, 43, e47. [Google Scholar] [CrossRef]
- Kolberg, L.; Raudvere, U.; Kuzmin, I.; Vilo, J.; Peterson, H. gprofiler2—An R package for gene list functional enrichment analysis and namespace conversion toolset g:Profiler. F1000Research 2020, 9, ELIXIR-709. [Google Scholar] [CrossRef]
- Msaouel, P.; Pissimissis, N.; Halapas, A.; Koutsilieris, M. Mechanisms of bone metastasis in prostate cancer: Clinical implications. Best Pract. Res. Clin. Endocrinol. Metab. 2008, 22, 341–355. [Google Scholar] [CrossRef]
- Morgan, T.M.; Lange, P.H.; Porter, M.P.; Lin, D.W.; Ellis, W.J.; Gallaher, I.S.; Vessella, R.L. Disseminated Tumor Cells in Prostate Cancer Patients after Radical Prostatectomy and without Evidence of Disease Predicts Biochemical Recurrence. Clin. Cancer Res. 2009, 15, 677–683. [Google Scholar] [CrossRef] [PubMed]
- Carl-McGrath, S.; Lendeckel, U.; Ebert, M.; Röcken, C. Ectopeptidases in tumour biology: A review. Histol. Histopathol. 2006, 21, 1339–1353. [Google Scholar] [CrossRef] [PubMed]
- Menrad, A.; Speicher, D.; Wacker, J.; Herlyn, M. Biochemical and functional characterization of aminopeptidase N expressed by human melanoma cells. Cancer Res. 1993, 53, 1450–1455. [Google Scholar] [PubMed]
- Ishii, K.; Usui, S.; Sugimura, Y.; Yoshida, S.; Hioki, T.; Tatematsu, M.; Yamamoto, H.; Hirano, K. Aminopeptidase N regulated by zinc in human prostate participates in tumor cell invasion. Int. J. Cancer 2001, 92, 49–54. [Google Scholar] [CrossRef]
- Hashida, H.; Takabayashi, A.; Kanai, M.; Adachi, M.; Kondo, K.; Kohno, N.; Yamaoka, Y.; Miyake, M. Aminopeptidase N is involved in cell motility and angiogenesis: Its clinical significance in human colon cancer. Gastroenterology 2002, 122, 376–386. [Google Scholar] [CrossRef]
- Sørensen, K.D.; Abildgaard, M.O.; Haldrup, C.; Ulhøi, B.P.; Kristensen, H.; Strand, S.; Parker, C.; Høyer, S.; Borre, M.; Ørntoft, T.F. Prognostic significance of aberrantly silenced ANPEP expression in prostate cancer. Br. J. Cancer 2013, 108, 420–428. [Google Scholar] [CrossRef]
- Kramer, G.; Steiner, G.; Fodinger, D.; Fiebiger, E.; Rappersberger, C.; Binder, S.; Hofbauer, J.; Marberger, M. High Expression of a CD38-Like Molecule in Normal Prostatic Epithelium and its Differential Loss in Benign and Malignant Disease. J. Urol. 1995, 154, 1636–1641. [Google Scholar] [CrossRef]
- Liu, X.; Grogan, T.R.; Hieronymus, H.; Hashimoto, T.; Mottahedeh, J.; Cheng, D.; Zhang, L.; Huang, K.; Stoyanova, T.; Park, J.W.; et al. Low CD38 Identifies Progenitor-like Inflammation-Associated Luminal Cells that Can Initiate Human Prostate Cancer and Predict Poor Outcome. Cell Rep. 2016, 17, 2596–2606. [Google Scholar] [CrossRef]
- Liu, A.; Furusato, B.; Ravindranath, L.; Chen, Y.; Srikantan, V.; McLeod, D.G.; Petrovics, G.; Srivastava, S. Quantitative analysis of a panel of gene expression in prostate cancer--with emphasis on NPY expression analysis. J. Zhejiang Univ. Sci. B 2007, 8, 853–859. [Google Scholar] [CrossRef]
- Henrique, R.; Jerónimo, C.; Hoque, M.O.; Nomoto, S.; Carvalho, A.L.; Costa, V.L.; Oliveira, J.; Teixeira, M.R.; Lopes, C.; Sidransky, D. MT1G hypermethylation is associated with higher tumor stage in prostate cancer. Cancer Epidemiol Biomark. Prev. 2005, 14, 1274–1278. [Google Scholar] [CrossRef]
- Bergström, S.H.; Järemo, H.; Nilsson, M.; Adamo, H.H.; Bergh, A. Prostate tumors downregulate microseminoprotein-beta (MSMB) in the surrounding benign prostate epithelium and this response is associated with tumor aggressiveness. Prostate 2018, 78, 257–265. [Google Scholar] [CrossRef] [PubMed]
- Chen, L.; Hong, C.; Chen, E.C.; Yee, S.W.; Xu, L.; Almof, E.U.; Wen, C.; Fujii, K.; Johns, S.J.; Stryke, D.; et al. Genetic and Epigenetic Regulation of the Organic Cation Transporter 3, SLC22A3. Pharm. J. 2013, 13, 110–120. [Google Scholar] [CrossRef] [PubMed]
- Englund, E.; Canesin, G.; Papadakos, K.S.; Vishnu, N.; Persson, E.; Reitsma, B.; Anand, A.; Jacobsson, L.; Helczynski, L.; Mulder, H.; et al. Cartilage oligomeric matrix protein promotes prostate cancer progression by enhancing invasion and disrupting intracellular calcium homeostasis. Oncotarget 2017, 8, 98298–98311. [Google Scholar] [CrossRef]
- Zhong, P.; Shu, R.; Wu, H.; Liu, Z.; Shen, X.; Hu, Y. Low KRT15 expression is associated with poor prognosis in patients with breast invasive carcinoma. Exp. Ther. Med. 2021, 21, 305. [Google Scholar] [CrossRef]
- Hanahan, D. Hallmarks of Cancer: New Dimensions. Cancer Discov. 2022, 12, 31–46. [Google Scholar] [CrossRef] [PubMed]
- Rye, M.B.; Bertilsson, H.; Andersen, M.K.; Rise, K.; Bathen, T.F.; Drabløs, F.; Tessem, M.-B. Cholesterol synthesis pathway genes in prostate cancer are transcriptionally downregulated when tissue confounding is minimized. BMC Cancer 2018, 18, 478. [Google Scholar] [CrossRef]
- Corcoran, N.M.; Hong, M.K.H.; Casey, R.G.; Hurtado-Coll, A.; Peters, J.; Harewood, L.; Goldenberg, S.L.; Hovens, C.M.; Costello, A.J.; Gleave, M.E. Upgrade in Gleason score between prostate biopsies and pathology following radical prostatectomy significantly impacts upon the risk of biochemical recurrence. BJU Int. 2011, 108, E202–E210. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).