Next Article in Journal
Towards Accurate Microwave Characterization of Tissues: Sensing Depth Analysis of Open-Ended Coaxial Probes with Ex Vivo Rat Breast and Skin Tissues
Previous Article in Journal
D-karyo—A New Prenatal Rapid Screening Test Detecting Submicroscopic CNVs and Mosaicism
 
 
Comment published on 5 July 2021, see Diagnostics 2021, 11(7), 1213.
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optimized Identification of High-Grade Prostate Cancer by Combining Different PSA Molecular Forms and PSA Density in a Deep Learning Model

by
Francesco Gentile
1,2,*,
Matteo Ferro
2,3,
Bartolomeo Della Ventura
2,4,
Evelina La Civita
2,5,
Antonietta Liotti
2,5,
Michele Cennamo
5,
Dario Bruzzese
2,6,
Raffaele Velotta
2,4 and
Daniela Terracciano
2,5,*
1
Department of Experimental and Clinical Medicine, University Magna Graecia of Catanzaro, 88100 Catanzaro, Italy
2
ElicaDea, Spinoff of Federico II University, 80131 Naples, Italy
3
Division of Urology, European Institute of Oncology (IEO), IRCCS, Via Ripamonti 435, 20141 Milan, Italy
4
Department of Physics “Ettore Pancini”, University of Naples “Federico II”, Via Cintia 26 Ed. G, 80126 Naples, Italy
5
Department of Translational Medical Sciences, University of Naples “Federico II”, 80131 Naples, Italy
6
Department of Public Health, University of Naples “Federico II”, 80131 Naples, Italy
*
Authors to whom correspondence should be addressed.
Diagnostics 2021, 11(2), 335; https://doi.org/10.3390/diagnostics11020335
Submission received: 3 February 2021 / Revised: 10 February 2021 / Accepted: 15 February 2021 / Published: 18 February 2021
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)

Abstract

:
After skin cancer, prostate cancer (PC) is the most common cancer among men. The gold standard for PC diagnosis is based on the PSA (prostate-specific antigen) test. Based on this preliminary screening, the physician decides whether to proceed with further tests, typically prostate biopsy, to confirm cancer and evaluate its aggressiveness. Nevertheless, the specificity of the PSA test is suboptimal and, as a result, about 75% of men who undergo a prostate biopsy do not have cancer even if they have elevated PSA levels. Overdiagnosis leads to unnecessary overtreatment of prostate cancer with undesirable side effects, such as incontinence, erectile dysfunction, infections, and pain. Here, we used artificial neuronal networks to develop models that can diagnose PC efficiently. The model receives as an input a panel of 4 clinical variables (total PSA, free PSA, p2PSA, and PSA density) plus age. The output of the model is an estimate of the Gleason score of the patient. After training on a dataset of 190 samples and optimization of the variables, the model achieved values of sensitivity as high as 86% and 89% specificity. The efficiency of the method can be improved even further by training the model on larger datasets.

1. Introduction

After skin cancer, prostate cancer (PC) is the most common among men [1,2]. The gold standard for the diagnosis of PC is based on the PSA (prostate-specific antigen) test [3,4]. Since PSA is produced by normal and malignant cells of the prostate gland, it is an organ-specific but not a cancer-specific biomarker. Based on this preliminary health screening, the physician decides on whether to proceed with further tests, typically prostate biopsy, to confirm or reject the initial hypothesis of a tumor, evaluate its aggressiveness, and assess its susceptibility to develop into a malignant cancer. Any elevation of the PSA above the cut-off is a cause for concern [3,4]. Nevertheless, various factors can cause a man’s PSA level to fluctuate, and a PSA test may either yield false-positive results or overdiagnosis of prostate cancer. Misinterpretation of diagnostic tests can expose patients to unnecessary treatments, including surgery and radiation therapy, with undesirable side effects, such as incontinence, erectile dysfunction, infections, and pain. Only about 25% of men who have a biopsy due to an elevated PSA level truly have cancer. The guidelines of the European Association of Urology (EAU-ESTRO-SIOG) for PC diagnosis recommend measuring PSA concentration and conduct a digital rectal examination (DRE) [5].
However, DRE has low sensitivity and PSA has low specificity, so the final diagnosis depends on biopsy. Consequently, a high rate of overdiagnosis and potentially overtreatment is observed and there is a strong clinical need of laboratory tests able to provide useful information to choose the best treatment option for each patient [6,7]. The increasing age of the worldwide population highlights this challenge as relevant, particularly in younger men who are at highest risk of the fearsome side effects of overtreatment. Therefore, it is mandatory to find diagnostic tools able to avoid unnecessary biopsy without missing clinically relevant PC.
We developed a deep learning algorithm that can identify aggressive forms of PC from a panel of 4 clinical variables plus the patient age. The levels of (i) total PSA (tPSA), (ii) free PSA (fPSA), (iii) the isoform-2 of proPSA (p2PSA), and (iv) PSA density were determined in 222 patients using automated immunometric assays described in the Methods of this article. The hypothesis that a similar set of variables plus age may represent most of the information on the state of PC is based on previous clinical studies and reports [8,9,10,11,12,13,14,15,16]. Thus, a model containing those variables can potentially diagnose PC with maximum achievable accuracy. Histological examination of samples has been used to validate the predictions of the algorithms.
The choice of using artificial neuronal networks (ANNs) to interpret data is motivated by the complexity of the problem. The number of potential variations that can result from a combination of 5 variables extracted by several different patients is very large. Describing such extremely complex data sets with deterministic models may be infeasible. A deterministic model may fail to keep up with the challenges that complex biological systems pose [17]. Contrary to deterministic models, artificial neural networks can discover the laws that determine the relationship among variables by analyzing the output of the system [18]. ANNs are statistical models partially modeled on biological neural networks. They are capable of modeling and processing nonlinear relationships between inputs and outputs in parallel. Architecturally, ANNs are modeled using layers of artificial neurons, or computational units able to receive input and apply an activation function along with a threshold to determine if messages are passed along, and these ultimately determine the output. In a simple model, the first layer is the input layer, followed by one hidden layer, and lastly by an output layer. Each layer can contain one or more neurons. Deep neuronal networks have more hidden layers, more neurons per layer, and more paths between neurons, endowing the networks with increased abstraction and problem-solving capabilities.
The proposed model could expedite the diagnostic pathway of PC patients by triaging high-grade cancers. Such a model predicts a biopsy Gleason score, which is the most powerful prognostic predictor for PC patients.

2. Methods

2.1. Study Population

Before prostate biopsy (minimum 16 cores), 437 subjects were enrolled in a prospective observational study, approved by the hospital ethics committee. Blood specimens were collected according to predetermined standard operating procedure [19]. Participants provided written approved consent. Ethical approval for this study was given by the institutional Ethics Committee of the University of Naples Federico II (118/20). Among these, 222 met eligibility criteria for this study: age over 50 years, no prior prostate surgery and biopsy, no bacterial acute or chronic prostatitis, no use of 5-α reductase inhibitors, PSA values between 2 and 20 ng/mL availability of serum samples and corresponding clinical data, and completion of at least a 16-core template biopsy after enrollment. The training cohort included 179 PC patients and the evaluation cohort 43. Figure 1 shows the summary of included patients. Table 1 shows descriptive characteristics of the study population.

2.2. Biomarker Measurement

Participants had blood drawn before DRE at each visit. Whole blood was allowed to clot before serum was separated by centrifugation. Serum aliquots were stored at −80 °C until samples were processed, according to Semjonow et al. [20]. Specimens were analyzed in a blinded fashion for PSA, fPSA, and p2PSA using a Access2 Immunoassay System analyzer (Beckman Coulter, Brea, CA, USA) calibrated against the WHO standard for PSA and fPSA. The analytical performance of the measurements assessed with control materials (Beckman Coulter) showed values within the allowed recommended limits. PSA density was calculated as PSA/prostate volume. Prostate volume was measured by transrectal ultrasound (TRUS). Patients underwent prostate biopsies according to a standardized institutional saturation scheme, which consisted of at least 16 needle biopsy cores obtained under transrectal ultrasound (TRUS) guidance. Primary and secondary Gleason scores were assigned by a single genitourinary pathologist blinded to the biomarker values, according to the 2005 consensus conference of the International Society of Urological Pathology definitions [21].

2.3. The Deep Learning Model

We used a deep-learning algorithm to interpret data and identify aggressive forms of cancer. The algorithm was based on artificial neural networks (ANN) with an input layer, an output layer, and 7 hidden layers in cascade: i.e., a (1) linear layer, (2) normalization layer, (3) hyperbolic tangent non-linear layer, (4) linear layer, (5) normalization layer, (6) hyperbolic tangent non-linear layer, (7) linear layer. Thus, the hidden layers are given by the repetitions of two fundamental units. The input layer was here represented by a set of 4 clinical variables (total PSA, free PSA, precursor of PSA, and PSA density) plus age. The output layer was a vector of 6 integer values ranging from 5 to 10, representing the hypothesized Gleason score of the cancer. In the network, we used 3 neurons for each of the considered variables in each of the layers. The algorithm was implemented in Mathematica. The Mathematica NetInitialize() function was used to instantiate the learning machine. The argument of the function NetInitialize() is the entire sequence of layers that constitute the neural networks model, which are depicted in Figure 2b and here reported for convenience of the reader: NetChain[(Net0, LinearLayer(12), BatchNormalizationLayer(), ElementwiseLayer(Tanh), LinearLayer(6), BatchNormalizationLayer(), ElementwiseLayer(Tanh), LinearLayer(6)]. The command NetChain implies that all the layers are concatenated together. The values between parentheses for each layer represent the number of neurons in that layer, which is identical to that of the immediately preceding layer if left blank. Moreover, LinearLayer, BatchNormalizationLayer, and ElementwiseLayer(Tanh) are the linear, normalization, and non-linear layer, respectively, the latter being based on the hyperbolic tangent function tanh. In this sequence, Net0 is the model input, i.e., the values of concentration of tPSA, fPSA, p2PSA, and PSA density linked together in one layer. The algorithm was trained upon a dataset of 179 samples setting an epoch number of 4000 samples and a batch size of 1024. Thus, the network was trained such that it visited each example 4000 times while processing up to 1024 examples simultaneously. The values of total PSA, free PSA, precursor of PSA, PSA density, and age were concatenated to create a single feature set for each sample. In the tests, the error loss always converged to a value of steady state (10−2), indicating that the model achieved optimal performance for the considered values of model parameters. The algorithm was validated against a dataset of 43 samples. We measured the performance of the model using 5 different metrics: (i) accuracy, (ii) precision, (iii) sensitivity, (iv) specificity, and (v) efficiency, where (i) accuracy is the proportion of correct predictions among the total number of examined cases, (ii) precision is the complement to 100 of the average percentage error of the prediction, (iii) sensitivity is the proportion of positive results that are true positives, and (iv) specificity is the proportion of negative results that are true negatives. We define efficiency (v) the mathematical average of accuracy, precision, sensitivity, and specificity.

3. Results

3.1. The Deep Learning Model

The GS against the tPSA, fPSA, p2PSA, PSA density, and age is reported in Figure 2a. These sets of labelled data encode the relationship between the concentration of 4 different biomarkers, plus age, and the Gleason score previously determined through solid biopsy. Remarkably, none of those variables (individually) seem to correlate effectively with the GS. For all the cases considered in Figure 2a, values of r2 statistics far from 1 indicate that data do not match with the predictions of simple linear models. ANNs can find hidden patterns in the data and help to formulate a model that deciphers the relationship between clinical variables and the GS. To interpret data, we used the network reported in Figure 2b. The input of the model is a list of values representing the concentration of target biomarkers in blood, and age. The number of variables used as an input was varied to verify their effect on the predictions of the model. The full form of the model uses 5 variables (4 biomarker-derivative variables plus age). The minimum number of variables used in the model was 2, chosen as the combinations of the initial set of 5 variables taking 2 elements at a time. The model achieved maximum performance for an initial set of 4 clinical variables tPSA, fPSA, p2PSA, and PSA density, i.e., the configuration reported in Figure 2b. Adding the variable age did not significantly improve the analysis. Results of the model as a function of the number of input variables are shown in the remainder of the paper. In all cases, the output at the last layer is an array of 6 values, from 5 to 10, representing the Gleason score associated with a specific input. The neural network was trained on a database of 190 samples. Then, the model was benchmarked against a validation set of 43 samples.

3.2. Best-Case Performance of the Model

Figure 3a reports the prediction of the model compared to the validation dataset for a basis of 4 input variables (tPSA, fPSA, p2PSA, and PSA density). In 22 out 43 cases, the prediction of the model template matches with the measured GS. In 19 cases, the difference between the estimate of the model and the true value of GS is of 1 grade. In 2 cases, the GS assigned by the model is 2 points higher than the true value of GS determined by solid biopsy. Thus, for this configuration the accuracy of the model is ~51%, while the precision is ~93%. Here, we define accuracy as the ratio between the correct predictions and the total number of examined cases, while precision is the complement to 100 of the average percentage errors of the prediction. We then evaluated the performance of the model in identifying aggressive forms of PC. We used = 7 as a cut off value, so that any GSGSco indicates the onset of aggressive PC (Figure 3b). We calculated the sensitivity (specificity) of the model test as the proportion of the true positives (negatives) correctly identified. For this choice of input variables, we found that the sensitivity of the model is ~86%, while the specificity is ~74%.

3.3. Simulations Predict Patterns of Variables Associated with Aggressive Forms of PC

We then used the model to simulate the more probable GS resulting from values of tPSA, fPSA, p2PSA, and PSA density varying over pathological intervals. The values of concentration of tPSA, fPSA, p2PSA, and PSA density were varied between 2.11–17.62 ng/mL (tPSA), 0.1–4.43 ng/mL (fPSA), 0.080–99.12 pg/mL (p2PSA), 0.04–0.9 ng/mL2 (PSA density). In this space of variables, we determined those patterns that are associated with aggressive (GS ≥ 7) and non-aggressive (GS < 7) forms of PC. Results are reported in Figure 4. We represent each combination of 4 variables in the diagrams as disks where the spatial coordinates indicate the concentration of p2PSA and tPSA, the radius of the disk is proportional to the concentration of fPSA, while the color space encodes information about the PSA density. This graphical representation enables us to visualize at a glance several different combinations of clinical variables and the way through which they are correlated to the GS of PC. Simulations help to identify 3 domains where the variables have different effects.
In the low p2PSA concentration range (0–20 pg/mL), aggressive forms of PC are correlated to elevated values of tPSA (10–20 ng/mL), sufficiently high values of fPSA (>2 ng/mL), and low values of PSA density (<0.3 ng/mL2). For intermediate values of p2PSA concentration (20–70 pg/mL), aggressive forms of PC are correlated to larger values of fPSA (>3 ng/mL) and larger values of PSA density (>0.3 ng/mL2), while tPSA seems to have minor effects. For high values of p2PSA concentration (>70 pg/mL), aggressive forms of PC are associated with lower values of tPSA (<10 ng/mL), low values of fPSA (<2 ng/mL), and low values of PSA density (<0.3 ng/mL2).
The heterogonous nature of data and the markedly non-linear way in which they interact to yield different values of GS reinforce the view that a deep learning approach is necessary to diagnose aggressive forms of PC.

3.4. Optimizing Performance: Varying Inputs to the Model

To understand the effects of the input variables we examined how the number and type of variable influences the performance of the model. We developed several different versions of the neuronal networks in which the input variables to the networks were chosen from different combinations of tPSA, fPSA, p2PSA, PSA density, and age. Then, for each combination we evaluated the output of the model in terms of accuracy, precision, sensitivity, specificity, and efficiency, defined as the mathematical average of the 4 last parameters. Results are reported in Figure 5. For notational convenience, in the following we will abbreviate each variable with its initials (tPSA: T, fPSA: F, p2PSA: P, PSA density: D, age: A) so that we will indicate specific combinations of biomarkers with a code.
We observe that accuracy (55%) and precision (93%) are optimized for the set of 4 variables (TFPD). For these estimators, the performance of the model built on an initial set of 3 variables (TFP) is comparable to that measured for 4, with an accuracy of 54% and a precision of 91%. As expected, any combination of 2 variables performs significantly worse than the 3 or 4 variable configuration, with estimated values of accuracy and precision of 45% and 89% for TF, that is the best result among the combinations of T, F, and P. Moreover, the accuracy and precision of the model falls to 39% and 90%, respectively, considering the full set of 5 variables (TFPDA). Considering sensitivity, the combination of 4 variables (TFPD) achieves again the best result, with a value of sensitivity as high as 86%. However, values of sensitivity reported for several other configurations are equivalent to that found for TFPD, with 72% for TFP and 83% for TFPDA. Remarkably, the model scored values of sensitivity equal to 80% and 82% for TP and FP, respectively.
Moving to specificity, we recorded the highest value (89%) for the set of 3 variables TFP, followed by TFPD (74%) and TFPDA (73%). The values of specificity are significantly lower for TF (57%), TP (43%), and FP (47%).
The efficiency (e), found as the average of accuracy, precision, sensitivity, and specificity, is a measure of the goodness of the model evaluated all over the other metrics. Values of e indicate that the model achieves maximum performance using as input variables TFPD or TFP, for which e = 0.78, with a marked increment to the model’s simplest version, that receives as an input 2 sole variables.

4. Discussion

Diagnosis and treatment of PC based only on PSA and DRE can lead to overdiagnosis and consequently to overtreatment of indolent cancers [22,23]. Despite the promising results of several new circulating and urinary biomarkers [24] and risk calculators [25], at present no specific biomarker is available that accurately points to biopsy. In order to reduce the side effects of overtreatment, there is a growing need of new diagnostic approach to choose the next step in the clinical management of patients with suspected PC.
Once the presence of malignancy has been established, patients with low-risk cancer have the option of active surveillance (AS) combined with deferred treatment. Such a strategy allows the avoidance of early invasive treatment (radical prostatectomy or radiation therapy) and the associated complications which are harmful for their quality of life. In fact, in 81% of cases patients develop erectile dysfunction, in 17% urinary incontinence, and in 12% bowel dysfunction [26].
The current gold standard to select patients for AS is the use of the D’Amico/NCCN and EAU risk stratification systems or Partin’s table. However, these criteria were developed more than a decade ago when some biomarkers such as −2proPSA were not available. Thus, adding new biomarkers which are better associated with high-grade PC could be an attractive strategy to identify new tools for patient risk stratification.
In the current study we report on the impact of a model including PSA, fPSA, p2PSA, and PSAD on the detection of high-grade PC in an initial biopsy cohort of men aged ≥ 50 years, a representative study population for the impact on the quality of life of possible side effects caused by overtreatment. Total, free, and -2proPSA are used to calculate PHI and there is evidence for PHI and -2proPSA being associated with an increased risk for a positive biopsy and for a Gleason score of ≥7 [27].
In our present study, applying a deep learning strategy, we demonstrated that the addition of PSA density to the three different PSA molecular forms included in PHI calculation showed the highest accuracy and precision for the identification of high-grade PC (GS ≥ 7). Our findings confirmed the relevance of PSA molecular forms included in PHI calculation and also highlight the role played by prostate volume in the clinical impact of PC biomarkers. The accuracy and precision obtained for the ANN model including PSA density demonstrated higher values compared to the model when prostate volume was excluded.
Certainly, further investigations are needed to develop new models specifically related to PC aggressiveness to increase current accuracy. However, prostate volume seems to play a key role in the interpretation of laboratory tests for PC [28,29,30]. So, circulating biomarkers related to prostate volume (i.e., PSA and phi density) are worthy of further study. Phi density (phi/prostate volume) outperformed phi in the identification of clinically significant PC in three different single-center studies [31,32,33].
The PSA test is part of routine clinical practice and the measurement of prostate volume is simple; therefore, the application of PSAD to identify high-grade PC would be possible. Accordingly, the recently developed Stockholm-3 model also includes PSA density to obtain the best accuracy to predict clinically significant PC and several recent reports suggest the additional value of PSA density to detect high-risk PC [34,35,36]. Moreover, the addition of PSAD to the criteria used to select patients for active surveillance reduced the rate of biopsy reclassification [37,38].
ANNs have been used to predict prostate cancer outcomes [39,40], but very few models based on circulating biomarkers for predicting the grade of PC are available [41]. Computational intelligence approaches are based on mathematical models able to deal with the uncertainty typically found in the clinical data used for PC prognosis and diagnosis. On this basis, these algorithms represent a suitable platform to develop new strategies for diagnosing and staging PC. For example, not every PC patient will exhibit abnormal results in all biomarkers, so combinations can lead to better diagnostic and prognostic performance [42].
Other authors have explored the potential of this approach. Some researchers [43,44] developed an ANN model for the prognosis of PC, potentially useful to select the patients for biopsy. Shariat et al. [45] suggested that prediction tools can provide help during PC patient clinical decision-making.
Finally, there is evidence that ANN models can be comparable to logistic regression models in predicting PC with clinical significance [46]. Notably, to be available for large populations the panel of reliable biomarkers has to be detected rapidly and at low cost, so development of biosensors [47] together with the use of computational-approach-based models shows relevant potential to face the challenge of selecting patients with clinically significant PC.
There are several limitations to this study. Firstly, it includes data of one population from one area located in Southern Italy, therefore results may not be extrapolated to worldwide population before an external validation. Secondly, our study includes only patients with tPSA between 2 and 20 ng/mL and we did not evaluate how the model performs in patients with tPSA values higher than 20 ng/mL, who are at highest risk of PC. In addition, we consider as outcome GS defined at biopsy, not at radical prostatectomy, thus leaving the identification of high-grade PC under the influence of biopsy sampling error.
However, our present study underlines the promising role of ANN models for the prediction of PC aggressiveness. Currently, the proposed model has been implemented as a research tool, and once prospective validation on a large study population is conducted, the tool will be a simple to use application which can be widely accessible. Such a tool will take the laboratory test results (i.e., PSA, fPSA, p2PSA, and PSAD) of an individual patient and predict his likelihood of having high-grade PC. Therefore, it could aid the clinical decision-making process. In fact, while being more expensive in first-line testing, it could be a tool that may help to reduce costs for the whole clinical management of PC patients, including repeated biopsies, surgery, and drugs for post-surgical complications.
Ongoing work is applying the proposed model to a larger dataset and continuing the search of novel algorithms for predicting aggressive disease. The validation of this model needs to be confirmed on large cohorts prospectively enrolled to assess whether it plays a role in survival, cancer-specific mortality, or metastatic progression.

5. Conclusions

In conclusion, in this first reported cohort of men with initial prostate biopsies, an ANN model including PSA, fPSA, p2PSA, and PSA density showed itself to be a good predictor of high-grade biopsy outcome. The investigated model may minimise the risk of missing aggressive cancers, while reducing the number of unnecessary biopsies in men with potentially low-grade disease. The use of our model may have potential in decision making when choosing a treatment strategy that best matches disease aggressiveness and treatment invasiveness.

Author Contributions

Conceptualization, F.G. and D.T.; methodology, F.G.; software, F.G.; validation, F.G., R.V. and D.T.; formal analysis, F.G., D.B.; investigation, F.G.; resources, M.F., D.T.; data curation, M.F., B.D.V., E.L.C., A.L., M.C.; writing—original draft preparation, F.G.; writing—review and editing, D.T.; visualization, R.V.; supervision, F.G., R.V., D.T.; project administration, M.F., D.T.; funding acquisition, F.G., R.V., D.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Ethical approval for this study was given by the institutional Ethics Committee of the University of Naples Federico II (118/20).

Informed Consent Statement

Participants provided written approved consent.

Data Availability Statement

The data that support the findings of this study are available on request from the corresponding authors (F.G., D.T.).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Sung, H.; Ferlay, J.; Siegel, R.L.; Laversanne, M.; Soerjomataram, I.; Jemal, A.; Bray, F. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2021, 1–41. [Google Scholar] [CrossRef]
  2. Siegel, R.L.; Miller, K.D.; Fuchs, H.E.; Jemal, A. Cancer Statistics, 2021. CA Cancer J. Clin. 2021, 71, 7–33. [Google Scholar] [CrossRef] [PubMed]
  3. Barry, M.J. Prostate-Specific–Antigen Testing for Early Diagnosis of Prostate Cancer. N. Engl. J. Med. 2001, 344, 1373–1377. [Google Scholar] [CrossRef]
  4. Lilja, H.; Ulmert, D.; Vickers, A.J. Prostate-specific antigen and prostate cancer: Prediction, detection and monitoring. Nat. Rev. Cancer 2008, 8, 268–278. [Google Scholar] [CrossRef]
  5. Mottet, N.; van den Bergh, R.C.N.; Briers, E.; Van den Broeck, T.; Cumberbatch, M.G.; De Santis, M.; Fanti, S.; Fossati, N.; Gandaglia, G.; Gillessen, S.; et al. EAU-EANM-ESTRO-ESUR-SIOG Guidelines on Prostate Cancer-2020 Update. Part 1: Screening, Diagnosis, and Local Treatment with Curative Intent. Eur. Urol. 2020. [Google Scholar] [CrossRef]
  6. Dall’Era, M.A.; Albertsen, P.C.; Bangma, C.; Carroll, P.R.; Carter, H.B.; Cooperberg, M.R.; Freedland, S.J.; Klotz, L.H.; Parker, C.; Soloway, M.S. Active surveillance for prostate cancer: A systematic review of the literature. Eur. Urol. 2012, 62, 976–983. [Google Scholar] [CrossRef] [Green Version]
  7. Tokudome, S.; Ando, R.; Koda, Y. Discoveries and application of prostate-specific antigen, and some proposals to optimize prostate cancer screening. Cancer Manag. Res. 2016, 8, 45–47. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Cantiello, F.; Russo, G.I.; Ferro, M.; Cicione, A.; Cimino, S.; Favilla, V.; Perdonà, S.; Bottero, D.; Terracciano, D.; De Cobelli, O.; et al. Prognostic accuracy of Prostate Health Index and urinary Prostate Cancer Antigen 3 in predicting pathologic features after radical prostatectomy. Urol. Oncol. 2015, 33, 15–23. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Dayyani, F.; Zurita, A.J.; Nogueras-González, G.M.; Slack, R.; Millikan, R.E.; Araujo, J.C.; Gallick, G.E.; Logothetis, C.J.; Corn, P.G. The combination of serum insulin, osteopontin, and hepatocyte growth factor predicts time to castration-resistant progression in androgen dependent metastatic prostate cancer- an exploratory study. BMC Cancer 2016, 16, 721. [Google Scholar] [CrossRef] [Green Version]
  10. Ferro, M.; Bruzzese, D.; Perdonà, S.; Marino, A.; Mazzarella, C.; Perruolo, G.; D’Esposito, V.; Cosimato, V.; Buonerba, C.; Di Lorenzo, G.; et al. Prostate Health Index (Phi) and Prostate Cancer Antigen 3 (PCA3) significantly improve prostate cancer detection at initial biopsy in a total PSA range of 2–10 ng/mL. PLoS ONE 2013, 8, e67687. [Google Scholar] [CrossRef]
  11. Ferro, M.; Bruzzese, D.; Perdonà, S.; Mazzarella, C.; Marino, A.; Sorrentino, A.; Di Carlo, A.; Autorino, R.; Di Lorenzo, G.; Buonerba, C.; et al. Predicting prostate biopsy outcome: Prostate health index (phi) and prostate cancer antigen 3 (PCA3) are useful biomarkers. Clin. Chim. Acta 2012, 413, 1274–1278. [Google Scholar] [CrossRef]
  12. Ferro, M.; Lucarelli, G.; Bruzzese, D.; Di Lorenzo, G.; Perdonà, S.; Autorino, R.; Cantiello, F.; La Rocca, R.; Busetto, G.M.; Cimmino, A.; et al. Low serum total testosterone level as a predictor of upstaging and upgrading in low-risk prostate cancer patients meeting the inclusion criteria for active surveillance. Oncotarget 2017, 8, 18424–18434. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Ferro, M.; Lucarelli, G.; Bruzzese, D.; Perdonà, S.; Mazzarella, C.; Perruolo, G.; Marino, A.; Cosimato, V.; Giorgio, E.; Tagliamonte, V.; et al. Improving the prediction of pathologic outcomes in patients undergoing radical prostatectomy: The value of prostate cancer antigen 3 (PCA3), prostate health index (phi) and sarcosine. Anticancer Res. 2015, 35, 1017–1023. [Google Scholar]
  14. Gong, D.; Wang, Y.; Wang, Y.; Chen, X.; Chen, S.; Wang, R.; Liu, L.; Duan, C.; Luo, S. Extensive serum cytokine analysis in patients with prostate cancer. Cytokine 2020, 125, 154810. [Google Scholar] [CrossRef] [PubMed]
  15. Perdonà, S.; Bruzzese, D.; Ferro, M.; Autorino, R.; Marino, A.; Mazzarella, C.; Perruolo, G.; Longo, M.; Spinelli, R.; Di Lorenzo, G.; et al. Prostate Health Index (phi) and Prostate Cancer Antigen 3 (PCA3) Significantly Improve Diagnostic Accuracy in Patients Undergoing Prostate Biopsy. Prostate 2013, 73, 227–235. [Google Scholar] [CrossRef]
  16. Terracciano, D.; Bruzzese, D.; Ferro, M.; Mazzarella, C.; Di Lorenzo, G.; Altieri, V.; Mariano, A.; Macchia, V.; Di Carlo, A. Preoperative insulin-like growth factor-binding protein-3 (IGFBP-3) blood level predicts gleason sum upgrading. Prostate 2012, 72, 100–107. [Google Scholar] [CrossRef]
  17. Barabási, A.-L. The network takeover. Nat. Phys. 2012, 8, 14–16. [Google Scholar] [CrossRef]
  18. Topol, E.J. High-performance medicine: The convergence of human and artificial intelligence. Nat. Med. 2019, 25, 44–56. [Google Scholar] [CrossRef]
  19. Tuck, M.K.; Chan, D.W.; Chia, D.; Godwin, A.K.; Grizzle, W.E.; Krueger, K.E.; Rom, W.; Sanda, M.; Sorbara, L.; Stass, S.; et al. Standard operating procedures for serum and plasma collection: Early detection research network consensus statement standard operating procedure integration working group. J. Proteome Res. 2009, 8, 113–117. [Google Scholar] [CrossRef] [Green Version]
  20. Semjonow, A.; Kopke, T.; Eltze, E.; Pepping-Schefers, B.; Burgel, H.; Darte, C. Pre-analytical in-vitro stability of (−2)proPSA in blood and serum. Clin. Biochem. 2010, 43, 926–928. [Google Scholar] [CrossRef] [PubMed]
  21. Epstein, J.I.; Allsbrook, W.C., Jr.; Amin, M.B.; Egevad, L.L.; Committee, I.G. The 2005 International Society of Urological Pathology (ISUP) Consensus Conference on Gleason Grading of Prostatic Carcinoma. Am. J. Surg. Pathol. 2005, 29, 1228–1242. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Tan, G.H.; Nason, G.; Ajib, K.; Woon, D.T.S.; Herrera-Caceres, J.; Alhunaidi, O.; Perlis, N. Smarter screening for prostate cancer. World, J. Urol. 2019, 37, 991–999. [Google Scholar] [CrossRef]
  23. Osses, D.F.; Roobol, M.J.; Schoots, I.G. Prediction Medicine: Biomarkers, Risk Calculators and Magnetic Resonance Imaging as Risk Stratification Tools in Prostate Cancer Diagnosis. Int. J. Mol. Sci. 2019, 20, 1637. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Ferro, M.; De Cobelli, O.; Lucarelli, G.; Porreca, A.; Busetto, G.M.; Cantiello, F.; Damiano, R.; Autorino, R.; Musi, G.; Vartolomei, M.D.; et al. Beyond PSA: The Role of Prostate Health Index (phi). Int. J. Mol. Sci. 2020, 21, 1184. [Google Scholar] [CrossRef] [Green Version]
  25. Trottier, G.; Roobol, M.J.; Lawrentschuk, N.; Bostrom, P.J.; Fernandes, K.A.; Finelli, A.; Chadwick, K.; Evans, A.; van der Kwast, T.H.; Toi, A.; et al. Comparison of risk calculators from the Prostate Cancer Prevention Trial and the European Randomized Study of Screening for Prostate Cancer in a contemporary Canadian cohort. Bju. Int. 2011, 108, E237–E244. [Google Scholar] [CrossRef] [PubMed]
  26. Wilt, T.J.; Brawer, M.K.; Jones, K.M.; Barry, M.J.; Aronson, W.J.; Fox, S.; Gingrich, J.R.; Wei, J.T.; Gilhooly, P.; Grob, B.M.; et al. Radical prostatectomy versus observation for localized prostate cancer. N. Engl. J. Med. 2012, 367, 203–213. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Heidegger, I.; Klocker, H.; Pichler, R.; Pircher, A.; Prokop, W.; Steiner, E.; Ladurner, C.; Comploj, E.; Lunacek, A.; Djordjevic, D.; et al. ProPSA and the Prostate Health Index as predictive markers for aggressiveness in low-risk prostate cancer-results from an international multicenter study. Prostate Cancer Prostatic. Dis. 2017, 20, 271–275. [Google Scholar] [CrossRef]
  28. Benson, M.C.; Whang, I.S.; Pantuck, A.; Ring, K.; Kaplan, S.A.; Olsson, C.A.; Cooner, W.H. Prostate specific antigen density: A means of distinguishing benign prostatic hypertrophy and prostate cancer. J. Urol. 1992, 147, 815–816. [Google Scholar] [CrossRef]
  29. Benson, M.C.; Whang, I.S.; Olsson, C.A.; McMahon, D.J.; Cooner, W.H. The use of prostate specific antigen density to enhance the predictive value of intermediate levels of serum prostate specific antigen. J. Urol. 1992, 147, 817–821. [Google Scholar] [CrossRef]
  30. Filella, X.; Foj, L.; Alcover, J.; Auge, J.M.; Molina, R.; Jimenez, W. The influence of prostate volume in prostate health index performance in patients with total PSA lower than 10 mug/L. Clin. Chim. Acta 2014, 436, 303–307. [Google Scholar] [CrossRef]
  31. Tosoian, J.J.; Druskin, S.C.; Andreas, D.; Mullane, P.; Chappidi, M.; Joo, S.; Ghabili, K.; Mamawala, M.; Agostino, J.; Carter, H.B.; et al. Prostate Health Index density improves detection of clinically significant prostate cancer. Bju Int. 2017, 120, 793–798. [Google Scholar] [CrossRef] [Green Version]
  32. Druskin, S.C.; Tosoian, J.J.; Young, A.; Collica, S.; Srivastava, A.; Ghabili, K.; Macura, K.J.; Carter, H.B.; Partin, A.W.; Sokoll, L.J.; et al. Combining Prostate Health Index density, magnetic resonance imaging and prior negative biopsy status to improve the detection of clinically significant prostate cancer. Bju Int. 2018, 121, 619–626. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Lopes Vendrami, C.; McCarthy, R.J.; Chatterjee, A.; Casalino, D.; Schaeffer, E.M.; Catalona, W.J.; Miller, F.H. The Utility of Prostate Specific Antigen Density, Prostate Health Index, and Prostate Health Index Density in Predicting Positive Prostate Biopsy Outcome is Dependent on the Prostate Biopsy Methods. Urology 2019, 129, 153–159. [Google Scholar] [CrossRef]
  34. Aminsharifi, A.; Howard, L.; Wu, Y.; De Hoedt, A.; Bailey, C.; Freedland, S.J.; Polascik, T.J. Prostate Specific Antigen Density as a Predictor of Clinically Significant Prostate Cancer When the Prostate Specific Antigen is in the Diagnostic Gray Zone: Defining the Optimum Cutoff Point Stratified by Race and Body Mass Index. J. Urol. 2018, 200, 758–766. [Google Scholar] [CrossRef] [PubMed]
  35. Nordstrom, T.; Akre, O.; Aly, M.; Gronberg, H.; Eklund, M. Prostate-specific antigen (PSA) density in the diagnostic algorithm of prostate cancer. Prostate Cancer Prostatic Dis. 2018, 21, 57–63. [Google Scholar] [CrossRef] [PubMed]
  36. Jue, J.S.; Barboza, M.P.; Prakash, N.S.; Venkatramani, V.; Sinha, V.R.; Pavan, N.; Nahar, B.; Kanabur, P.; Ahdoot, M.; Dong, Y.; et al. Re-examining Prostate-specific Antigen (PSA) Density: Defining the Optimal PSA Range and Patients for Using PSA Density to Predict Prostate Cancer Using Extended Template Biopsy. Urology 2017, 105, 123–128. [Google Scholar] [CrossRef]
  37. Jin, B.S.; Kang, S.H.; Kim, D.Y.; Oh, H.G.; Kim, C.I.; Moon, G.H.; Kwon, T.G.; Park, J.S. Pathological upgrading in prostate cancer patients eligible for active surveillance: Does prostate-specific antigen density matter? Korean J. Urol. 2015, 56, 624–629. [Google Scholar] [CrossRef] [Green Version]
  38. Ha, Y.S.; Yu, J.; Salmasi, A.H.; Patel, N.; Parihar, J.; Singer, E.A.; Kim, J.H.; Kwon, T.G.; Kim, W.J.; Kim, I.Y. Prostate-specific antigen density toward a better cutoff to identify better candidates for active surveillance. Urology 2014, 84, 365–371. [Google Scholar] [CrossRef] [PubMed]
  39. Ecke, T.H.; Bartel, P.; Hallmann, S.; Koch, S.; Ruttloff, J.; Cammann, H.; Lein, M.; Schrader, M.; Miller, K.; Stephan, C. Outcome prediction for prostate cancer detection rate with artificial neural network (ANN) in daily routine. Urol. Oncol. 2012, 30, 139–144. [Google Scholar] [CrossRef]
  40. Stephan, C.; Buker, N.; Cammann, H.; Meyer, H.A.; Lein, M.; Jung, K. Artificial neural network (ANN) velocity better identifies benign prostatic hyperplasia but not prostate cancer compared with PSA velocity. BMC Urol. 2008, 8, 10. [Google Scholar] [CrossRef] [Green Version]
  41. Boegemann, M.; Stephan, C.; Cammann, H.; Vincendeau, S.; Houlgatte, A.; Jung, K.; Blanchet, J.S.; Semjonow, A. The percentage of prostate-specific antigen (PSA) isoform (−2)proPSA and the Prostate Health Index improve the diagnostic accuracy for clinically relevant prostate cancer at initial and repeat biopsy compared with total PSA and percentage free PSA in men aged ≤65 years. Bju Int. 2016, 117, 72–79. [Google Scholar] [CrossRef]
  42. Munteanu, V.C.; Munteanu, R.A.; Gulei, D.; Schitcu, V.H.; Petrut, B.; Berindan Neagoe, I.; Achimas Cadariu, P.; Coman, I. PSA Based Biomarkers, Imagistic Techniques and Combined Tests for a Better Diagnostic of Localized Prostate Cancer. Diagnostics 2020, 10, 806. [Google Scholar] [CrossRef] [PubMed]
  43. Schwarzer, G.; Schumacher, M. Artificial neural networks for diagnosis and prognosis in prostate cancer. Semin Urol. Oncol. 2002, 20, 89–95. [Google Scholar] [CrossRef] [PubMed]
  44. Snow, P.B.; Smith, D.S.; Catalona, W.J. Artificial neural networks in the diagnosis and prognosis of prostate cancer: A pilot study. J. Urol. 1994, 152, 1923–1926. [Google Scholar] [CrossRef]
  45. Shariat, S.F.; Kattan, M.W.; Vickers, A.J.; Karakiewicz, P.I.; Scardino, P.T. Critical review of prostate cancer predictive tools. Future Oncol. 2009, 5, 1555–1584. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Tsao, C.W.; Liu, C.Y.; Cha, T.L.; Wu, S.T.; Sun, G.H.; Yu, D.S.; Chen, H.I.; Chang, S.Y.; Chen, S.C.; Hsu, C.Y. Artificial neural network for predicting pathological stage of clinically localized prostate cancer in a Taiwanese population. J. Chin. Med. Assoc. 2014, 77, 513–518. [Google Scholar] [CrossRef] [Green Version]
  47. Dejous, C.; Krishnan, U.M. Sensors for diagnosis of prostate cancer: Looking beyond the prostate specific antigen. Biosens. Bioelectron. 2020, 173, 112790. [Google Scholar] [CrossRef]
Figure 1. Summary of included patients.
Figure 1. Summary of included patients.
Diagnostics 11 00335 g001
Figure 2. Gleason score reported as a function of several different clinical variables and age used in this study (a). The artificial neural network (ANN) used to analyze data and build a predictive model of PC aggressiveness. The network receives as an input the values of the 4 clinical variables of the study. These variables are combined together through successive layers and transmitted to an output layer which determines the grade of the cancer (b).
Figure 2. Gleason score reported as a function of several different clinical variables and age used in this study (a). The artificial neural network (ANN) used to analyze data and build a predictive model of PC aggressiveness. The network receives as an input the values of the 4 clinical variables of the study. These variables are combined together through successive layers and transmitted to an output layer which determines the grade of the cancer (b).
Diagnostics 11 00335 g002
Figure 3. The prediction of the AI model on a cohort of 43 patients used as a validation set compared to the true values of GS measured from those patients (a). Using a cut-off value of 7, the model achieved 86% sensitivity and 74% specificity (b).
Figure 3. The prediction of the AI model on a cohort of 43 patients used as a validation set compared to the true values of GS measured from those patients (a). Using a cut-off value of 7, the model achieved 86% sensitivity and 74% specificity (b).
Diagnostics 11 00335 g003
Figure 4. Visual representation of the output of the AI model as a function of the values of the clinical variables tPSA, fPSA, p2PSA, and PSA density. In the upper (lower) row, we report the combination of variables associated with aggressive (non-aggressive) forms of PC, with GS ≥ 7 (GS < 7).
Figure 4. Visual representation of the output of the AI model as a function of the values of the clinical variables tPSA, fPSA, p2PSA, and PSA density. In the upper (lower) row, we report the combination of variables associated with aggressive (non-aggressive) forms of PC, with GS ≥ 7 (GS < 7).
Diagnostics 11 00335 g004
Figure 5. We measured the performance of the model using 4 different metrics, i.e., accuracy (A), sensitivity (Se), specificity (Sp), and precision (P). The error (e) is the mean of these 4 metrics. We report in the diagrams the values of A, Se, Sp, P, and e determined for the AI model receiving as an input different combinations of the clinical variables used in the study plus age. The model achieved maximum efficiency using as input variables: tPSA, fPSA, p2PSA, and PSA density.
Figure 5. We measured the performance of the model using 4 different metrics, i.e., accuracy (A), sensitivity (Se), specificity (Sp), and precision (P). The error (e) is the mean of these 4 metrics. We report in the diagrams the values of A, Se, Sp, P, and e determined for the AI model receiving as an input different combinations of the clinical variables used in the study plus age. The model achieved maximum efficiency using as input variables: tPSA, fPSA, p2PSA, and PSA density.
Diagnostics 11 00335 g005
Table 1. Characteristics of the study population.
Table 1. Characteristics of the study population.
Patients (n)222
Mean (range, age, years)64 (50–73)
DRE negative/positive n (%)143/79
Median (95% CI) prostate volume (mL)50 (45–60)
Gleason score < 796/126
tPSA ng/mL6.35 (4.48–8.38)
fPSA ng/mL0.89 (0.7–1.3)
p2PSA (pg/mL)20.29 (14.45–29.52)
PSAD (ng/mL2)0.13 (0.09–0.17)
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Gentile, F.; Ferro, M.; Della Ventura, B.; La Civita, E.; Liotti, A.; Cennamo, M.; Bruzzese, D.; Velotta, R.; Terracciano, D. Optimized Identification of High-Grade Prostate Cancer by Combining Different PSA Molecular Forms and PSA Density in a Deep Learning Model. Diagnostics 2021, 11, 335. https://doi.org/10.3390/diagnostics11020335

AMA Style

Gentile F, Ferro M, Della Ventura B, La Civita E, Liotti A, Cennamo M, Bruzzese D, Velotta R, Terracciano D. Optimized Identification of High-Grade Prostate Cancer by Combining Different PSA Molecular Forms and PSA Density in a Deep Learning Model. Diagnostics. 2021; 11(2):335. https://doi.org/10.3390/diagnostics11020335

Chicago/Turabian Style

Gentile, Francesco, Matteo Ferro, Bartolomeo Della Ventura, Evelina La Civita, Antonietta Liotti, Michele Cennamo, Dario Bruzzese, Raffaele Velotta, and Daniela Terracciano. 2021. "Optimized Identification of High-Grade Prostate Cancer by Combining Different PSA Molecular Forms and PSA Density in a Deep Learning Model" Diagnostics 11, no. 2: 335. https://doi.org/10.3390/diagnostics11020335

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop