Next Article in Journal
The Protective Role of Feruloylserotonin in LPS-Induced HaCaT Cells
Next Article in Special Issue
Chemometrics Approaches in Forced Degradation Studies of Pharmaceutical Drugs
Previous Article in Journal
Cytotoxic Polyketides from a Deep-Sea Sediment Derived Fungus Diaporthe phaseolorum FS431
Previous Article in Special Issue
Identification of Metabolites of Eupatorin in Vivo and in Vitro Based on UHPLC-Q-TOF-MS/MS
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Untargeted Metabolomic Profile for the Detection of Prostate Carcinoma—Preliminary Results from PARAFAC2 and PLS–DA Models

1
Dipartimento di Chimica, Università degli Studi di Torino, Via P. Giuria 7, 10125 Torino, Italy
2
Centro Regionale Antidoping e di Tossicologia “A. Bertinaria”, Regione Gonzole 10/1, 10043 Orbassano, Italy
3
Division of Urology, San Luigi Gonzaga Hospital and University of Torino, 10043 Orbassano, Italy
4
Department of Food Science, Faculty of Science, University of Copenhagen, Rolighedsvej 30, 1958 Frederiksberg, Denmark
*
Author to whom correspondence should be addressed.
Molecules 2019, 24(17), 3063; https://doi.org/10.3390/molecules24173063
Submission received: 24 July 2019 / Revised: 20 August 2019 / Accepted: 20 August 2019 / Published: 22 August 2019

Abstract

:
Prostate-specific antigen (PSA) is the main biomarker for the screening of prostate cancer (PCa), which has a high sensibility (higher than 80%) that is negatively offset by its poor specificity (only 30%, with the European cut-off of 4 ng/mL). This generates a large number of useless biopsies, involving both risks for the patients and costs for the national healthcare systems. Consequently, efforts were recently made to discover new biomarkers useful for PCa screening, including our proposal of interpreting a multi-parametric urinary steroidal profile with multivariate statistics. This approach has been expanded to investigate new alleged biomarkers by the application of untargeted urinary metabolomics. Urine samples from 91 patients (43 affected by PCa; 48 by benign hyperplasia) were deconjugated, extracted in both basic and acidic conditions, derivatized with different reagents, and analyzed with different gas chromatographic columns. Three-dimensional data were obtained from full-scan electron impact mass spectra. The PARADISe software, coupled with NIST libraries, was employed for the computation of PARAFAC2 models, the extraction of the significative components (alleged biomarkers), and the generation of a semiquantitative dataset. After variables selection, a partial least squares–discriminant analysis classification model was built, yielding promising performances. The selected biomarkers need further validation, possibly involving, yet again, a targeted approach.

Graphical Abstract

1. Introduction

Prostate cancer (PCa) is the most common non-skin cancer in men [1,2] and the second most frequently diagnosed malignancy in males worldwide [3]. The first biomarker for PCa detection was prostatic acid phosphatase (PAP), which was introduced in the 1930s [1]. In the 1980s, PAP was replaced by prostate-specific antigen (PSA) [1,4], a secreted protein encoded by a prostate-specific gene and member of the tissue kallikrein family [1], which is produced almost exclusively in the prostate [5,6]. After the introduction of PSA, more men were diagnosed with PCa, with the majority having the early-stage, clinically indolent form of the disease. However, a large number of patients affected by a benign pathology, such as inflammation or hyperplasia, exhibited abnormal PSA values, which lead to the execution of useless biopsies and demonstrate the low specificity of this biomarker [1,6]. This phenomenon was generally designated as “overdiagnosis” or “overtreatment” [1,4,7,8].
Considerable effort has been devoted to improving the PSA-test performance, including the introduction of PSA density, PSA velocity (and doubling time), the dosage of free or complexed PSA, and the quantitation of its isoforms [1,2]. A combination of these parameters yields the Prostate Health Index (PHI) [3].
Meanwhile, intensive research has been devoted to the search for different biomarkers, mainly by applying omics methods (e.g., genomics, proteomics, transcriptomics, and metabolomics) [1], and several authors have reviewed the emerging biomarkers, among which the most prominent are the urinary prostate cancer antigen 3 (PCA3) [1,3,5] and transmembrane protease, serine 2 (TMPRSS2-ERG) (sometimes combined together) [1,3,5]. Alpha-methylacyl-CoA Racemase (AMACR) demonstrated high sensitivities and specificities on prostate biopsy, but it is not suitable for non-invasive detection in urine [1,5]. Increased diagnostic performances were obtained by the serum dosage of human kallikrein-related peptidase 2 (KLK2) in combination with total and free PSA [5]. An evolution of the application of kallikreins consists in a blood measurement of the four existing isoforms which, combined with clinical information, allows the probability calculation of PCa incidence [3]. Significantly increased levels of prostasomes were found in blood samples from patients with PCa [9], while elevated levels of urinary sarcosine were found to be associated with aggressive forms of prostate cancer [1].
Studies conducted in the 1970s and 1980s highlighted the correlation between increased urinary excretion of polyamines (i.e., spermine, spermidine, and putrescine) and several types of cancer [10,11]. However, anomalous oxidative degradation reactions of these polyamines resulted in low concentrations of these biomarkers in approximately 20% of the patients, leading to false-negative prediction and consequently limiting their application as diagnostic biomarkers [10].
The correlation between altered steroidal biosynthesis and PCa is well known [12,13,14]. For this reason, in a previous study, we carried out a targeted analysis of urine samples, addressed to a large panel of androgens, including testosterone and its principal phase I metabolites. The multivariate statistical interpretation of these steroid profiles produced satisfactory results in terms of sensitivity, specificity, and area under the curve (AUC) [15].
In this study, the search for new urinary biomarkers was undertaken by using untargeted methods. In perspective, emerging biomarkers could possibly be combined with the most discriminating steroid biomarkers to improve their screening performances further, without altering the inherent simplicity of the instrumental procedure. In fact, the ideal biomarker should be cheap to determine, non-invasive, easily accessible, and quickly quantifiable [1,2]. Taking into account the abovementioned considerations, gas chromatography–electron impact mass spectrometry (GC–EIMS) would give a more suitable solution than the other commonly used analytical techniques to provide a three-dimensional pattern for untargeted analysis. Urine was chosen as the election matrix, as it is easily available in large volumes and involves non-invasive sampling.

2. Materials and Methods

2.1. Chemicals and Reagents

Tert-butyl methyl ether (TBME), ethyl acetate, dithioerythritol, ammonium iodide (NH4I), N-Methyl-N-(trimethylsilyl)trifluoroacetamide (TMSTFA), and trifluoroacetic anhydride (TFAA) were provided by Sigma-Aldrich (Milan, Italy). β-glucuronidase from Escherichia coli was purchased from Roche Life Science (Indianapolis, IN, USA). Ultra-pure water was obtained using a Milli-Q® UF-Plus apparatus (Millipore, Bedford, MA, USA).

2.2. Samples Collection

The subjects involved in this study were recruited in the ambulatory of the Department of Urology at the San Luigi Hospital of Orbassano (TO, Italy), after approval of the protocol by the reference Ethical Committee (protocol number 0019267). A total of 91 subjects were enrolled, including 43 affected by prostate carcinoma (PCa, confirmed by a positive biopsy) and 48 diagnosed with benign prostatic hyperplasia (BPH, with a PSA lower than the European cut-off of 4 ng/mL or with a PSA above the threshold but a negative biopsy result). In a previous study, the progressive modification of the urinary steroidal profile with age was investigated [16]. From this study, we decided to enroll only individuals older than 60 years, when the bias effect due to aging became negligible [16]. Moreover, since ethnicity represents another important bias factor, only Caucasian individuals were recruited. Finally, diabetes, other carcinoma, metabolic diseases, and therapies suspected to alter the urinary steroid profile (such as steroid therapy) were considered as exclusion criteria.
Body mass index (BMI), alcohol consumption, medical therapy, digital rectal examination, PSA value, and biopsy Gleason Score (GS) were recorded. In detail, the group’s mean age and standard deviation was 70 ± 10 years for BPH and 70 ± 8 years for PCa. BMI was within the range of normality for all individuals (between 18.5 and 25), and PSA was 3.8 ± 2.3 ng/mL for BPH and 11.0 ± 9.5 ng/mL for PCa. The PCa class was distributed as low risk (GS = 3 + 3, 15 patients), middle risk (GS = 3 + 4 and 4 + 3, 21 patients), and high risk (GS = 4 + 4 and 4 + 5, seven patients).

2.3. Sample Treatment and GC–MS Analysis

Firstly, the protein components of the urinary samples were precipitated by centrifugation at 4000 rpm for 5 min. Two aliquots (A and B) of 5 mL each were taken from each sample. The urine pH was adjusted between 6.8 and 7.4 by adding 2 mL of phosphate buffer and a few drops of NaOH 1M or HCl 1M whenever necessary. Enzymatic hydrolysis of the glucuronide metabolites was conducted with 100 µL of β-glucuronidase from Escherichia coli (equivalent to 83 enzymatic units) by heating it in the oven for 1 h. After cooling to room temperature, the two aliquots were subjected to different liquid–liquid extraction (LLE) with 5 mL of TBME each, at basic (pH ≥ 10) and acid (pH ≤ 1) conditions, respectively, obtained by the introduction of some drops of NaOH 1M and HCl 1M. Both aliquots were dried under a gentle nitrogen stream at room temperature. The dried aliquot A was derivatized using 50 µL TFAA at 65 °C for 1 h. Then, the solvent was dried and the residue was dissolved in 50 µL TBME and injected into the GC–MS. The chromatographic separation was achieved with a J&W Scientific HP-5, 17 m × 0.2 mm (i.d.) × 0.33 µm (f.t.) capillary column. The oven temperature was programmed as follows: The starting temperature of 90 °C was held for 1 min. Then, the temperature of 180 °C was reached with a rate of 30 °C/min and held for 7 min. A final heating rate of 15 °C/min was applied until the temperature of 325 °C was reached (held for 3 min). The chromatographic run lasted 22.20 min.
Aliquot B was derivatized using 50 µL of TMSTFA/NH4I/dithioerythritol (1.000:2:4 v/w/w), at 70 °C for 30 min and then injected into a GC–MS equipped with a J&W Scientific HP-1, 17 m × 0.2 mm (i.d.) × 0.11 µm (f.t.) capillary column. The oven temperature was programmed to heat up from 120 to 177 °C at a rate of 70 °C/min, and from 177 to 236 °C at a rate of 5 °C/min. A final heating rate of 30 °C/min was applied until the temperature of 315 °C was reached. The chromatographic run lasted 18.25 min. Both the runs were performed in full-scan mode, in the interval 40–650 m/z at a scan rate of 2.28 scans/s.
Because the samples were analyzed in five analytical sections performed on different days, it was important to monitor the occurrence of a data structure due to the different analytical sections. The exploratory unsupervised data analysis can serve to this scope, and principal component analysis (PCA) was employed. No clustering or trend related to the day of the analysis was detected.

2.4. Statistical Analysis

The main steps of the statistical analysis are reported in Figure 1.

2.5. Pre-Treatment of the Raw Data

The .AIA files of the chromatographic runs were downloaded using the software ChemStation®.
The PARADISe version 3 software was employed to convert the files in a form suitable for MATLAB (extension .mat). The alignment procedure, both propaedeutic and mandatory for the following steps of data analysis, was executed over the three-way (samples × retention time × m/z) array of size 91 × 3099 × 612 (over 172 million data) and 91 × 1640 × 652 (over 97 million data) for the trifluoroacetyl (TFA) and trimethylsilyl (TMS) derivatives, respectively. The correlation optimized warping (COW) was performed along the retention time and the m/z dimensions [17]. The two matrices were segmented along the retention time dimension to improve the performances of the COW algorithm, and for each slice the computation was iterated until a visually satisfying result was obtained. Lastly, to improve the visualization of the data, the baseline was subtracted. It is important to highlight that the latter computational step only served to improve the data visualization by the operator, because PARAFAC2 is able to recognize the baseline and the noise components, allowing their automatic exclusion [18,19,20,21].

2.6. PARAFAC2 Models Computation and Molecular Identification

The aligned dataset was analyzed in the PARADISe software, to proceed with the computation of PARAFAC2 models; the operating procedure consists in the manual identification of intervals along the chromatogram (with each interval ideally containing approximately one peak). The PARAFAC2 models were built introducing the non-negativity constraint and performing 10,000 iterations for interval [18,19,20,21].
Within the software, the operator can label the components as (i) baseline, (ii) noise, or (iii) compounds. All the mass spectra of the components belonging to the third category are automatically compared with the NIST database. A report is produced, including the relative concentrations of the detected compounds (assuming a uniform response factor of 1), and the n (number subjectively chosen by the user) most likely identifications for each compound. Finally, the relative concentrations were normalized using the urinary creatinine values.

2.7. Classification Models

The dataset composed by the relative concentrations of the detected metabolites for each sample was used to perform partial least squares–discriminant analysis (PLS–DA) [22], classifying the samples into having prostate carcinoma or not. Firstly, the dataset was log10 transformed (with the aim of achieving a more even distribution of each of the variables) and autoscaled. Then, the variable importance in projection (VIP) method (using a threshold of 1) [23] and genetic algorithms (GAs) [24] were run to select the most relevant variables. The reduced dataset was finally used to build the PLS–DA classification model. The model was then validated using the repeated double cross-validation (dCV) approach [25]. The PLS_Toolbox version 8.5 (Eigenvector Research, Inc., Manson, WA, USA) was used to perform this part of the analysis [26].

3. Results and Discussion

The preliminary PARAFAC2 model extracted a total of 329 relevant compounds (184 from the chromatographic run after TMS derivatization and 145 after TFA derivatization). Of these, 89 were selected using the VIP algorithm, and a further 58 substances were discarded by one cycle of GAs. The final dataset, consisting in a 91 × 32 (subjects × variables) matrix, was employed to build a PLS–DA classification model. Due to the heterogeneity of the patients enrolled (in terms of pathology staging, prostatic volume, and PSA values), the model was validated using repeated double cross-validation (30 repetitions were performed) [25] instead of the standard external validation, in which the use of a limited and heterogeneous population may result in significant bias. The plot reporting the Y-value predicted in cross-validation (CV) in one of the several classification models produced during the repeated double cross-validation process is shown in Figure 2A. The corresponding receiver operating characteristic (ROC) curve is depicted in Figure 2B. The high values of the area under the curve (AUC) for both the estimated and cross-validated ROCs are an indicator of high performances and robustness of the model. In detail, using a discriminating Y-value of 0.5, the model provides 92.5 ± 2.2% sensitivity and 88.7 ± 3.9% specificity for the cancer-affected population. On average, misclassification occurred on about 3 ± 2 patients affected by carcinoma out of 43 and 5 ± 2 patients with hyperplasia out of 48.
Of the 32 compounds selected by the dedicated algorithms to build the model, 17 were not found in the available NIST libraries, while for seven other compounds, the identification provided by automatic spectral matching was deemed incorrect. On the other hand, manual mass spectra interpretation was made difficult by the structural similarity of many candidate biomarkers, as well as the effect of the derivatizations, that introduced functional groups (e.g., −Si(CH3)3) yielding prevalent fragment ions in the spectrum. The mass spectra of the 32 metabolites are provided as Supplementary Materials (Supplementary Figure S1). Table 1 reports the eight identified compounds, accompanied by their Human Metabolome Database (HMDB) and Kyoto Encyclopedia of Genes and Genomes (KEGG) identification numbers, when available. Since the PARADISe output provides only a rough semiquantitative report based on the total ion current (TIC) without any external calibration, the real physiological concentration of each metabolite could not be evaluated. However, these absolute TIC values can be evaluated in relative terms to provide an averaged qualitative comparison between the two populations for all the analytes. The overexpression and underexpression of these metabolites allegedly linked to the occurrence of PCa are reported in Table 1.
It is interesting to note that among the eight identified compounds, five (63%) are involved in steroidal biosynthesis, confirming their potential in the detection and diagnosis of PCa. Similarly, Choi et al. found elevated levels of 16-hydroxy-dehydroepiandrosterone, epiandrosterone, etiocholanolone, and pregnanetriol in patients diagnosed with papillary thyroid carcinoma [27]. The first steroid appears to also be overexpressed in the present case for patients with PCa, but pregnanetriol was underexpressed in the same patients and epiandrosterone was found in comparable concentrations in the two populations. Dehydroepiandrosterone is involved in the expression of insulin-like growth factor 1, whose dysregulation is implicated in certain colon, liver, prostate, and breast cancers [28]. This observation may justify the inclusion of 16-hydroxy-dehydroepiandrosterone among the potential biomarkers for PCa. Pregnanetriol, together with 5 β-pregnanediol, is also known to be dysregulated in adrenal syndromes, such as adrenal tumors or Cushing’s syndrome [29,30]. Increased androsterone levels were found in a cohort of PCa-affected individuals within a multivariate investigation of the urinary steroidal profile, and the present findings are in accordance with our previous study [16]. Other steroids that proved useful to discriminate PCa from BPH [27] were possibly overlooked in the present untargeted selection because of their low concentration in urine.
The overexpression of serotonin and its biomarkers (among which, 5-hydroxyindoleacetic acid) represents a potential urinary biomarker for neuroblastic and carcinoid tumors [31]. While there is no evidence in the literature of an association between 5-hydroxyindoleacetic acid and PCa, the present data suggest such a hypothesis, as its overexpression is clearly evident in the PCa-affected population considered.
Phytoestrogens are a class of substances accredited to prevent the onset of cancer [32,33]. In accordance with this hypothesis, enterodiol is underexpressed in the present PCa population.
Among the 32 selected biomarkers, different contributions to the overall discrimination achieved by the PLS–DA model (Figure 1) were expected. A rough estimation of the relative importance of these biomarkers is expressed by their selectivity ratios [34], reported in Figure 3. Nine biomarkers exhibit selectivity ratios higher that 0.1, while, for five others, values between 0.07 and 0.1 were found. Interestingly, out of the nine biomarkers with the highest selectivity ratio, eight are underexpressed in the PCa patients, apparently suggesting them as protective substances. The expression of the 14 metabolites is represented in the form of boxplots in Figure 4. Tentative PLS–DA models were built with only these 9 and 14 biomarkers, but their overall efficiency significantly dropped with respect to the model of 32 biomarkers, demonstrating that the relative contribution of the remaining biomarkers is not negligible. In particular, the specificity index was considerably reduced in the models of 9 biomarkers and 14 biomarkers, while the sensitivity score remained relatively high.
Further testing was also conducted on the six biomarkers showing comparable mean intensity for the two populations. One variable at a time was removed, and a new classification model was computed using a simple cross-validation with each reduced dataset. Five of the seven new models yielded decreased sensitivity and specificity, while the other two models provided comparable performance, substantially confirming the choice of the 32 biomarker model.

4. Conclusions

The preliminary results reported in the present study support the premise that GC–MS tridimensional data can be profitably exploited in untargeted metabolomics studies devoted to prostatic carcinoma diagnosis. Compared to the more resource-demanding ultra-high-performance liquid chromatography–tandem mass spectrometer (UHPLC–MS/MS) and ultra-high-performance liquid chromatography–high-resolution mass spectrometry (UHPLC–HRMS) approaches frequently presented in the literature, GC–MS offers comparable chromatographic resolution and structured spectroscopic information, as is generated by the fragment ion pattern typical of electron impact ionization. On these complex data arrays, the ultimate performance in the extraction of crucial information relies on the software purposely adopted and PARAFAC2 combined with VIP and GA methods of variables selection proved to produce highly efficient models of class discrimination, allowing us to distinguish prostatic carcinoma from benign hyperplasia with good sensitivity and specificity scores.
A common limitation of untargeted metabolomics methods, including the present one, is that the most abundant components of the screened samples are preferentially isolated as potential biomarkers with respect to minor constituents, possibly present at trace levels. This explains the differences in the selected biomarkers with respect to the targeted approach that we previously tested [27]. On the other hand, complementary sets of biomarkers are extracted and then evaluated from targeted and untargeted approaches, to be subsequently combined to achieve optimal performance. More work has to be done on large populations of PCa-affected patients and controls to confirm the present findings, and further effort is necessary to reveal the identity of the most valuable biomarkers and possibly confirm their real value as interesting biomarkers by univariate statistics. Despite these limitations to be overcome in the subsequent investigations, the strategy adopted in the present study, based on non-invasive urine sampling, cheap instrumentation, and advanced data treatment by PARADISe software, appears to be extremely promising in PCa screening.

Supplementary Materials

The following are available online.

Author Contributions

All the Authors participated to the preliminary study design and planning. Recruitment, participation criteria, clinical evaluations, F.P.; Development of the analytical method, E.A. (Eleonora Amante) and A.S.; Sample processing, E.A. (Eleonora Amante); Method validation, E.A. (Eleonora Amante), E.A. (Eugenio Alladio), and M.V.; Chemometrics and statistical analysis, R.B., E.A. (Eleonora Amante), and E.A. (Eugenio Alladio); Mass spectra interpretation, M.V., E.A. (Eleonora Amante); Writing—Original Draft Preparation, E.A. (Eleonora Amante); Writing—Review & Editing, M.V. All Authors read and approved the final manuscript.

Funding

This research was funded by the Italian “Ministero dell’Istruzione, dell’Università e della Ricerca” within a PRIN 2017 call for bids (Research Projects of Relevant National Interest—grant 2017Y2PAB8).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Prenser, J.R.; Rubin, M.A.; Wei, J.T.; Chinnaiyan, A.M. Beyond PSA: The next generation of prostate cancer biomarkers. Sci. Transl. Med. 2012, 4, 127rv3. [Google Scholar]
  2. Velonas, V.M.; Woo, H.H.; Dos Remedios, C.G.; Assinder, S.J. Current status of biomarkers for prostate cancer. Int. J. Mol. Sci. 2013, 14, 11034–11060. [Google Scholar] [CrossRef] [PubMed]
  3. Hendriks, R.J.; Van Oort, I.M.; Schalken, J.A. Blood-based and urinary prostate cancer biomarkers: A review and comparison of novel biomarkers for detection and treatment decisions. Prostate Cancer Prostatic Dis. 2017, 20, 12–19. [Google Scholar] [CrossRef] [PubMed]
  4. Etzioni, R.; Penson, D.F.; Legler, J.M.; Di Tommaso, D.; Boer, R.; Gann, P.H.; Feuer, E.J. Overdiagnosis due to prostate-specific antigen screening: Lessons from U.S. prostate cancer incidence trends. J. Natl. Cancer Inst. 2002, 94, 981–990. [Google Scholar] [CrossRef] [PubMed]
  5. Sardana, G.; Dowell, B.; Diamandis, E.P. Emerging biomarkers for the diagnosis and prognosis of prostate cancer. Clin. Chem. 2008, 54, 1951–1960. [Google Scholar] [CrossRef] [PubMed]
  6. Kramer, B.S.; Brown, M.L.; Prorok, P.C.; Potosky, A.L.; Gohagan, J.K. Prostate cancer screening: What we know and what we need to know. Ann. Intern. Med. 1993, 119, 914–923. [Google Scholar] [CrossRef] [PubMed]
  7. Moyer, V.A. Screening for Prostate Cancer: U.S. Preventive Services Task Force Recommendation Statement. Ann. Intern. Med. 2012, 157, 120–134. [Google Scholar] [CrossRef] [Green Version]
  8. Gigerenzer, G.; Mata, J.; Frank, R. Public knowledge of benefits of breast and prostate cancer screening in Europe. J. Natl. Cancer Inst. 2009, 101, 1216–1220. [Google Scholar] [CrossRef]
  9. Tavoosidana, G.; Ronquist, G.; Darmanis, S.; Yan, J.; Carlsson, L.; Wu, D.; Conze, T.; Ek, P.; Semjonow, A.; Eltze, E.; et al. Multiple recognition assay reveals prostasomes as promising plasma biomarkers for prostate cancer. Expert Rev. Anticancer Ther. 2011, 11, 1341–1343. [Google Scholar] [CrossRef]
  10. Bachrach, U. Polyamines and cancer: Minireview article. Amino Acids 2004, 26, 307–309. [Google Scholar] [CrossRef]
  11. Schipper, R.G.; Romijn, J.C.; Cuijpers, V.M.; Verhofstad, A.A. Polyamines and prostatic cancer. Biochem. Soc. Trans. 2003, 31, 375–380. [Google Scholar] [CrossRef] [PubMed]
  12. Lévesque, E.; Huang, S.P.; Audet-Walsh, E.; Lacombe, L.; Bao, B.Y.; Fradet, Y.; Laverdière, I.; Rouleau, M.; Huang, C.Y.; Yu, C.C.; et al. Molecular markers in key steroidogenic pathways, circulating steroid levels, and prostate cancer progression. Clin. Cancer Res. 2013, 19, 699–709. [Google Scholar] [CrossRef] [PubMed]
  13. Gnanapragasam, V.J.; Robson, C.N.; Leung, H.Y.; E Neal, D. Androgen receptor signalling in the prostate. BJU Int. 2000, 86, 1001–1013. [Google Scholar] [CrossRef] [PubMed]
  14. Kelloff, G.J.; Lieberman, R.; Steele, V.E.; Boone, C.W.; Lubet, R.A.; Kopelovich, L.; Malone, W.A.; Crowell, J.A.; Higley, H.R.; Sigman, C.C. Agents, biomarkers, and cohorts for chemopreventive agent development in prostate cancer. Urology 2001, 57, 46–51. [Google Scholar] [CrossRef]
  15. De Luca, S.; Fiori, C.; Manfredi, M.; Amante, E. Preliminary results of prospective evaluation of urinary endogenous steroid profile and prostatic carcinoma-induced deviation. J. Urol. 2019, 201, e263–e264. [Google Scholar] [CrossRef]
  16. Amante, E.; Alladio, E.; Salomone, A.; Vincenti, M.; Marini, F.; Alleva, G.; De Luca, S.; Porpiglia, F. Correlation between chronological and physiological age of males from their multivariate urinary endogenous steroid profile and prostatic carcinoma-induced deviation. Steroids 2018, 139, 10–17. [Google Scholar] [CrossRef] [PubMed]
  17. Tomasi, G.; Berg, F.V.D.; Andersson, C. Correlation optimized warping and dynamic time warping as preprocessing methods for chromatographic data. J. Chemom. 2004, 18, 231–241. [Google Scholar] [CrossRef]
  18. Johnsen, L.G.; Skou, P.B.; Khakimov, B.; Bro, R. Gas chromatography—Mass spectrometry data processing made easy. J. Chromatogr. A 2017, 1503, 57–64. [Google Scholar] [CrossRef]
  19. Amigo, J.M.; Skov, T.; Bro, R.; Coello, J.; Maspoch, S. Solving GC-MS problems with PARAFAC2. TrAC Trends Anal. Chem. 2008, 27, 714–725. [Google Scholar] [CrossRef]
  20. Bro, R.; Andersson, C.A.; Kiers, H.A.L. PARAFAC2—Part II. Modeling chromatographic data with retention time shifts. J. Chemom. 1999, 13, 295–309. [Google Scholar] [CrossRef]
  21. Amigo, J.M.; Popielarz, M.J.; Callejón, R.M.; Morales, M.L.; Troncoso, A.M.; Petersen, M.A.; Toldam-Andersen, T.B.; Morales, M.L. Comprehensive analysis of chromatographic data by using PARAFAC2 and principal components analysis. J. Chromatogr. A 2010, 1217, 4422–4429. [Google Scholar] [CrossRef] [PubMed]
  22. Ballabio, D.; Consonni, V. Classification tools in chemistry. Part 1: Linear models. PLS-DA. Anal. Methods 2013, 5, 3790–3798. [Google Scholar] [CrossRef]
  23. Wold, S.; Johansson, E.; Cocchi, M. PLS: Partial Least Squares Projections to Latent Structures. In 3D QSAR in Drug Design: Theory, Methods and Applications; KLUWER ESCOM Science Publisher: Heidelberg, Germany, 1993; pp. 523–550. [Google Scholar]
  24. Zou, W.; Tolstikov, V.V. Probing genetic algorithms for feature selection in comprehensive metabolic profiling approach. Rapid Commun. Mass Spectrom. 2008, 22, 1312–1324. [Google Scholar] [CrossRef] [PubMed]
  25. Filzmoser, P.; Liebmann, B.; Varmuza, K. Repeated double cross validation. J. Chemom. 2009, 23, 160–171. [Google Scholar] [CrossRef]
  26. Wise, B.; Gallagher, N.; Bro, R. PLS_Toolbox 8.5; Eigenvector Research, Inc.: Manson, WA, USA, 2017; Available online: http://eigenvector.com/software/pls-toolbox/ (accessed on 22 August 2019).
  27. Choi, M.H.; Moon, J.-Y.; Cho, S.-H.; Chung, B.C.; Lee, E.J. Metabolic alteration of urinary steroids in pre- and post-menopausal women, and men with papillary thyroid carcinoma. BMC Cancer 2011, 11, 342. [Google Scholar] [CrossRef] [PubMed]
  28. Miller, K.K.M. The Biological Actions of Dehydroepiandrosterone. Drug Metab. Rev. 2006, 38, 89–116. [Google Scholar]
  29. Arlt, W.; Biehl, M.; Taylor, A.E.; Hahner, S.; Libé, R.; Hughes, B.A.; Schneider, P.; Smith, D.J.; Stiekema, H.; Krone, N.; et al. Urine steroid metabolomics as a biomarker tool for detecting malignancy in adrenal tumors. J. Clin. Endocrinol. Metab. 2011, 96, 3775–3784. [Google Scholar] [CrossRef] [PubMed]
  30. Arlt, W.; Stewart, P.M. Adrenal corticosteroid biosynthesis, metabolism, and action. Endocrinol. Metab. Clin. N. Am. 2005, 34, 293–313. [Google Scholar] [CrossRef] [PubMed]
  31. Lionetto, L.; Lostia, A.M.; Stigliano, A.; Cardelli, P.; Simmaco, M. HPLC-mass spectrometry method for quantitative detection of neuroendocrine tumor markers: Vanillylmandelic acid, homovanillic acid and 5-hydroxyindoleacetic acid. Clin. Chim. Acta 2008, 398, 53–56. [Google Scholar] [CrossRef]
  32. Stephens, F.O.; Unit, O. Phytoestrogens and prostate cancer: Possible preventive role. Med. J. Aust. 1997, 167, 138–140. [Google Scholar] [CrossRef]
  33. Hedelin, M.; Klint, Å.; Chang, E.T.; Bellocco, R.; Johansson, J.E.; Andersson, S.O.; Heinonen, S.M.; Adlercreutz, H.; Adami, H.O.; Grönberg, H.; et al. Dietary phytoestrogen, serum enterolactone and risk of prostate cancer: The Cancer Prostate Sweden Study (Sweden). Cancer Causes Control 2006, 17, 169–180. [Google Scholar] [CrossRef] [PubMed]
  34. Rajalahti, T.; Arneberg, R.; Berven, F.S.; Myhr, K.-M.; Ulvik, R.J.; Kvalheim, O.M. Biomarker discovery in mass spectral profiles by means of selectivity ratio plot. Chemom. Intell. Lab. Syst. 2009, 95, 35–48. [Google Scholar] [CrossRef]
Sample Availability: Samples of the compounds are available from the authors.
Figure 1. Statistical analysis workflow.
Figure 1. Statistical analysis workflow.
Molecules 24 03063 g001
Figure 2. Y-value predicted in cross-validation (CV) (A) and receiver operating characteristic (ROC) curves (B) of one of the several classification models built during the repeated double cross-validation procedure.
Figure 2. Y-value predicted in cross-validation (CV) (A) and receiver operating characteristic (ROC) curves (B) of one of the several classification models built during the repeated double cross-validation procedure.
Molecules 24 03063 g002
Figure 3. Selectivity ratio of the 32 selected features. The variables above the threshold of 0.1 are reported in green, and the ones between the thresholds 0.07–0.1 are reported in red.
Figure 3. Selectivity ratio of the 32 selected features. The variables above the threshold of 0.1 are reported in green, and the ones between the thresholds 0.07–0.1 are reported in red.
Molecules 24 03063 g003
Figure 4. Boxplot, in logarithmic (base-10) scale, of the 14 compounds above the selectivity ratio threshold of 0.07 (see Figure 3).
Figure 4. Boxplot, in logarithmic (base-10) scale, of the 14 compounds above the selectivity ratio threshold of 0.07 (see Figure 3).
Molecules 24 03063 g004
Table 1. List of the 32 selected metabolites. The kind of derivatization, retention time, and expression in prostate cancer (PCa)-affected individuals are reported. Moreover, metabolites with a putative identification are accompanied by the match score with NIST library and the relative identification (ID) number in the Human Metabolome Database (HMDB) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. The mass spectra of the 32 metabolites are reported in Supplementary Materials—Figure S1.
Table 1. List of the 32 selected metabolites. The kind of derivatization, retention time, and expression in prostate cancer (PCa)-affected individuals are reported. Moreover, metabolites with a putative identification are accompanied by the match score with NIST library and the relative identification (ID) number in the Human Metabolome Database (HMDB) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. The mass spectra of the 32 metabolites are reported in Supplementary Materials—Figure S1.
CompoundDerivatizationRetention Time (min)Match with NISTHMDB IDKEGG IDExpression in PCa Patients
TMS derivatives15-Hydroxyindoleacetic acidTMS5.26893HMDB0000763C05635overexpression
2Unknown 1TMS5.86---overexpression
3Unknown 2TMS7.44---underexpression
4AndrosteroneTMS8.14912HMDB0000031C00523overexpression
516-HydroxydehydroisoandrosteroneTMS9.23888HMDB0000352C05139overexpression
6Unknown 3TMS9.84---comparable
7Unknown 4TMS10.31---underexpression
8Unknown 5TMS10.61---underexpression
9Unknown 6TMS11.29---underexpression
10Unknown 7TMS11.32---comparable
11EnterodiolTMS12.19826HMDB0005056C18166underexpression
125β-pregnanediolTMS12.53853HMDB0005943Not availableunderexpression
13Unknown 8TMS13.6---overexpression
14Unknown 9TMS13.67---comparable
15PregnanetriolTMS13.73904HMDB0006070Not availableunderexpression
16Unknown 10TMS14.03---underexpression
17Unknown 11TMS14.50---underexpression
18Unknown 12TMS14.53---underexpression
19Unknown 13TMS14.6---overexpression
20Unknown 14TMS14.66---underexpression
21Unknown 15TMS15.04---underexpression
TFA derivatives22Unknown 16TFA1.63---underexpression
23Unknown 17TFA1.71---comparable
24Vanillyl alcoholTFA3.37860HMDB0032012C06317overexpression
25Unknown 18TFA4.97---comparable
26Unknown 19TFA5.71---underexpression
27Unknown 20TFA3.32---underexpression
28EpiandrosteroneTFA15.61925HMDB0000365C07635comparable
29Unknown 21TFA16.32---underexpression
30Unknown 22TFA17.87---underexpression
31Unknown 23TFA18.11---overexpression
32Unknown 24TFA18.24---underexpression

Share and Cite

MDPI and ACS Style

Amante, E.; Salomone, A.; Alladio, E.; Vincenti, M.; Porpiglia, F.; Bro, R. Untargeted Metabolomic Profile for the Detection of Prostate Carcinoma—Preliminary Results from PARAFAC2 and PLS–DA Models. Molecules 2019, 24, 3063. https://doi.org/10.3390/molecules24173063

AMA Style

Amante E, Salomone A, Alladio E, Vincenti M, Porpiglia F, Bro R. Untargeted Metabolomic Profile for the Detection of Prostate Carcinoma—Preliminary Results from PARAFAC2 and PLS–DA Models. Molecules. 2019; 24(17):3063. https://doi.org/10.3390/molecules24173063

Chicago/Turabian Style

Amante, Eleonora, Alberto Salomone, Eugenio Alladio, Marco Vincenti, Francesco Porpiglia, and Rasmus Bro. 2019. "Untargeted Metabolomic Profile for the Detection of Prostate Carcinoma—Preliminary Results from PARAFAC2 and PLS–DA Models" Molecules 24, no. 17: 3063. https://doi.org/10.3390/molecules24173063

Article Metrics

Back to TopTop