Next Article in Journal
In Vitro Metabolic Fate of the Synthetic Cannabinoid Receptor Agonists QMPSB and QMPCB (SGT-11) Including Isozyme Mapping and Esterase Activity
Next Article in Special Issue
Metabolomics Signature of Plasma Renin Activity and Linkage with Blood Pressure Response to Beta Blockers and Thiazide Diuretics in Hypertensive European American Patients
Previous Article in Journal
Comparison of Osteosarcoma Aggregated Tumour Models with Human Tissue by Multimodal Mass Spectrometry Imaging
Previous Article in Special Issue
Metabolomics Studies in Psoriatic Disease: A Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Dysregulated Alanine as a Potential Predictive Marker of Glioma—An Insight from Untargeted HRMAS-NMR and Machine Learning Data

1
School of Biochemistry and Biotechnology, University of the Punjab, Lahore 54590, Pakistan
2
Riphah College of Rehabilitation and Allied Health Sciences, Riphah International University, Lahore 54770, Pakistan
3
Department of Data Science, Punjab University College of Information Technology, University of the Punjab, Lahore 54590, Pakistan
4
Punjab Institute of Neurosciences (PINS), Lahore General Hospital, Lahore 54000, Pakistan
5
Departments of Radiology and Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02129, USA
*
Author to whom correspondence should be addressed.
Both authors contributed equally.
Metabolites 2021, 11(8), 507; https://doi.org/10.3390/metabo11080507
Submission received: 29 May 2021 / Revised: 22 July 2021 / Accepted: 28 July 2021 / Published: 1 August 2021
(This article belongs to the Special Issue Biomarkers and Human Blood Metabolites)

Abstract

:
Metabolic alterations play a crucial role in glioma development and progression and can be detected even before the appearance of the fatal phenotype. We have compared the circulating metabolic fingerprints of glioma patients versus healthy controls, for the first time, in a quest to identify a panel of small, dysregulated metabolites with potential to serve as a predictive and/or diagnostic marker in the clinical settings. High-resolution magic angle spinning nuclear magnetic resonance spectroscopy (HRMAS-NMR) was used for untargeted metabolomics and data acquisition followed by a machine learning (ML) approach for the analyses of large metabolic datasets. Cross-validation of ML predicted NMR spectral features was done by statistical methods (Wilcoxon-test) using JMP-pro16 software. Alanine was identified as the most critical metabolite with potential to detect glioma with precision of 1.0, recall of 0.96, and F1 measure of 0.98. The top 10 metabolites identified for glioma detection included alanine, glutamine, valine, methionine, N-acetylaspartate (NAA), γ-aminobutyric acid (GABA), serine, α-glucose, lactate, and arginine. We achieved 100% accuracy for the detection of glioma using ML algorithms, extra tree classifier, and random forest, and 98% accuracy with logistic regression. Classification of glioma in low and high grades was done with 86% accuracy using logistic regression model, and with 83% and 79% accuracy using extra tree classifier and random forest, respectively. The predictive accuracy of our ML model is superior to any of the previously reported algorithms, used in tissue- or liquid biopsy-based metabolic studies. The identified top metabolites can be targeted to develop early diagnostic methods as well as to plan personalized treatment strategies.

1. Introduction

Malignant brain tumors such as glioma and glioblastoma multiforme (GBM), arising from the glial cells of the central nervous system (CNS), are among the most lethal forms of human cancers. Their aggressive nature, infiltrating growth, and a two-fold blood–brain and blood–brain–tumor barrier (unlike most other cancers) make them both difficult to diagnose early and challenging to treat [1,2,3]. Therefore, despite surgical resection and advanced multimodal treatment modalities available currently such as biodegradable medicated wafer implants, postoperative concomitant chemo-radiation therapy (Stupp regimens), maintenance adjuvant chemotherapy, Optune®—tumor treating electric fields, immuno-chemotherapy—and so on, the recurrence and mortality rates of malignant gliomas are significantly higher than other solid and hematological malignancies; the median overall survival of the patients is <1.2 years [3,4,5,6,7,8].
Although our understanding of the intricate molecular networks and/or their cross-talks that initiate the successive, yet aggressive series of proliferative events in gliomas is very limited, there is increasing evidence suggesting that a complex interplay of chromosomal alterations (gene-gene fusions), genetic aberrations (point mutations), and epigenetic modifications (methylations) contributes to the tumor biogenesis [3,9,10]. Besides this, the underlying concealed perturbations in several finely tuned signaling (such as p53, VEGF, ErbB, RTK, Akt) and metabolic pathways (for instance, choline, central-carbon, and glutamine) also drive and accelerate the metastatic capabilities of gliomas [10,11,12,13]. Several research groups have demonstrated that, in order to perform (i) harvesting of energy and replenishing the ‘nutrient (glucose) sink’ for a continuous, unchecked cellular proliferation; (ii) epithelial-to-mesenchymal transition (EMT); (iii) enhanced cell migration; and iv) increased angiogenesis, the cancer cells establish a preference for glycolytic metabolism—a so-called Warburg effect [13,14,15]. This metabolic reprogramming/shift from the oxidative phosphorylation to glycolytic pathway coupled with concomitant upregulated expression of the solute carrier 2 (SLC2) family of transporters (e.g., glucose transporter GLUT1, GLUT3, GLUT4) is the central hallmark of malignant gliomas [13,15,16]. While on one hand, it exerts a selective pressure for tumor survival in the tumor microenvironment, at the same time, it drastically perturbs the metabolic influx of glucose within the cell cytoplasm and in circulation (more importantly, blood plasma, which is supplying nutrients to the entire body).
Given that the pronounced alterations in metabolic pathways and their corresponding metabolites manifest during the course of ‘tumor initiation-to-progression’ and/or in response to the treatment in different cancers, we have compared the metabolic fingerprints of glioma patients versus healthy controls in a quest to identify a panel of small, dysregulated, circulating metabolites with the potential to serve as a predictive- and/or diagnostic markers in the clinical settings. One of the most advanced and sophisticated techniques, high resolution magic angle spinning nuclear magnetic resonance spectroscopy (HRMAS-NMR), has been used for untargeted metabolomics and data acquisition in conjunction with the machine learning (ML) approach to analyze the large metabolic datasets. HRMAS-NMR is considered to be advantageous over the latest mass spectrometric methods thanks to its high reproducibility without the need for any pre-measurement treatment, robustness, non-destructive nature, minimum sample preparation time, and ability to provide the qualitative as well as the quantitative fingerprints of all measurable metabolites in a single experiment [17,18,19,20]. However, the data obtained from HRMAS-NMR are usually complex; raw NMR signal processing and targeted analysis pipeline often require an expert to reveal the biological insights. We have thus combined the HRMAS-NMR with ML-based data mining algorithms to assist in the traditional chemo-metric analysis. This, to the best of our best knowledge, is the first study that systematically, yet in less time and with minimized human biasness/intervention, assesses the plethora of small circulating metabolites in blood, both qualitatively and quantitatively, and presents a sensitive marker demonstrating the highest glioma-predictive accuracy (precision 1.0; recall 0.96; F1 measure 0.98) ever reported.

2. Results

2.1. Clinicopathological Characteristics of Study Group

Untargeted metabolic profiling was performed by HRMAS-NMR using plasma samples (n = 42) obtained from low-grade (LGG, n = 9) and high-grade glioma (HGG, n = 17) patients along with healthy controls (n = 16). The primary demographics and clinical characteristics of enrolled patients are shown in Table 1. Briefly, 60% of the enrolled patients were males, while 40% were females, with the average age of patients being 38 years. Out of the 93 initially registered LGG and HGG cases, only those patients were included in the study who were confirmed by histopathological analysis of tumor tissue and were graded according to the World Health Organization (WHO) classification of brain tumors. The glioma patients having any metabolic disorder (i.e., diabetes mellitus), hypertension, liver (hepatitis/liver cirrhosis), and/or cardiovascular disease were excluded from the study.

2.2. Correlating Circulating Metabolomic Profiles for Glioma Detection

The representative HRMAS-NMR spectra obtained from the plasma samples of LGG, HGG, and the healthy control groups are shown as Figure 1 (peaks of important metabolites are labelled). The curve fitting of spectra from 4.66 to 0.50 ppm identified the resonance peaks in 251 regions out of the 417 spectral regions (the remaining 166 regions mostly contained zeros), which were selected for further analysis after applying the feature selection techniques.
Analysis of glioma versus control groups (LGG + HGG = 26, healthy control = 16) using the two ML algorithms, extra tree classifier and random forest, identified 104 and 158 spectral regions, respectively, that differentiated the glioma samples from the healthy controls with 100% accuracy (F1-measure: 1.00). Both models gave maximum accuracy, but the number of identified spectral regions was too high to be supportive in routine analyses; therefore, we used the third ML model, i.e., logistic regression (Figure 2A). Interestingly, the predictive accuracy of this model with single spectral region was 98% (F1-measure: 0.98). The aggregate confusion matrix and fivefold cross-validations applied to estimate the prediction error/evaluate the performance of each fitted model revealed only one false negative prediction with logistic regression, whereas none were revealed by extra tree classifier and random forest. The performance of all ML models used for glioma detection and classification is summarized in Table 2.
The logistic regression model was regarded as best as it could differentiate glioma samples from the healthy controls with 98% accuracy using a minimum number of spectral regions (n = 1). The spectral regions identified by other two ML models were arranged in descending order of importance and the top 30 were selected for metabolite identification. Major contributing metabolites (n = 24) corresponding to these 30 spectral regions were searched from the available literature and databases and are summarized in Table 3. The single spectral region identified by the logistic regression model (1.47) was unveiled to be ‘alanine’ and it was also found to be significantly decreased in glioma cases. Other top dysregulated metabolites for glioma versus control cases included methionine, N-acetylaspartate (NAA), valine, glutamine, γ-aminobutyric acid (GABA), serine, α-glucose, lactate, and arginine.

2.3. Correlating Circulating Metabolomic Profiles for Glioma Classification (LGG/HGG)

A second set of ML analyses was performed to identify the spectral regions that may potentially differentiate and classify low- (grade I–II) and high-grade (grade III–IV) gliomas ((total 25 samples in two groups (LGG = 9, HGG = 16)). Logistic regression presented the highest F1-measure (0.86) and identified 92 spectral regions, whereas extra tree classifier and random forest presented 0.83 and 0.79 F1-measures, while identifying 107 and 88 spectral regions, respectively. The top 30 spectral regions contributing towards detection and classification of glioma are listed in Table 3 in descending order of importance. Their corresponding dysregulated metabolites’ search came up with 22 metabolites that included choline, NAA, valine, succinate, GABA, creatine, isoleucine, glutamine, fatty acids, and taurine, among others. The diagnostic ability of each predictive model used was further checked by ROC plots (Figure 2B), which showed high sensitivity of all three models for glioma versus control analyses, with the area under the curve (AUC) being 0.99. For the second set of analyses (classification of glioma in low- and high-grades), the logistic regression model proved to be more sensitive (AUC = 0.86) in comparison with extra tree classifier (AUC = 0.76) and random forest (AUC = 0.78). Additional analysis performed using a separate test dataset of oligodendroglioma patients revealed the usefulness of ML models. Extra tree classifier and random forest detected the presence of tumor by high accuracy (F1-measure = 0.91). Logistics regression detected the tumor by selecting only one feature (i.e., alanine) and scored an F1-measure value of 0.75 (Supplementary Data: Table S2). The overall results point towards the usefulness of ML-assisted circulating metabolite profiling for both the detection and classification of gliomas.

2.4. Identification of Statistically Significant Dysregulated Metabolites and Glioma-Associated Pathways

The ML-predicted top 30 spectral regions and corresponding dysregulated metabolites along with their statistical significance (p-value) and fold-change value (log2FC), in both the ‘glioma versus control’ and ‘LGG versus HGG’ comparisons, are listed in Table 3. Altogether, statistical analysis identified 98 spectral regions with a p-value < 0.05 for glioma versus control groups. Interestingly, except 3.72 and 3.47 ppm regions (the contributing metabolite for both spectral regions is β-glucose), all top 30 ML-predicted spectral regions were found to be statistically significant, adding to the predictive accuracy of ML analyses (Table 3 shaded boxes). For the LGG versus HGG group, however, only 19 ML-predicted spectral regions were regarded as statistically significant (p < 0.05).
Dysregulated metabolites contributing to significant spectral regions for both groups of analysis and common in two comparison sets are summarized in Figure 3. Alanine was the only metabolite detected by all the three ML models for glioma detection. Other important metabolites for glioma detection included methionine, phenylalanine, glutathione, serine, glucose, lactate, aspartate, NAA, and tryptophan. For glioma classification in low and high grades, choline, p-choline, succinate, creatine, taurine, glycine, proline, and scylloinositol were regarded as the key metabolites. The fold-change and mean intensity values for selected dysregulated metabolites, in all three study groups, are presented in Figure 4 in the form of heatmaps, volcano, and box and whisker plots.
Pathway analyses were performed by selecting the 24 metabolites identified for glioma detection and 22 metabolites for glioma classification separately. Major pathways found to be dysregulated in both analysis groups are shown in Table 4. The highest impact generating pathways for glioma detection were found to be associated with alanine/aspartate/glutamate metabolism (impact = 0.7) followed by D-glutamine/D-glutamate metabolism (impact = 0.5) and phenylalanine/tyrosine/tryptophan anabolism (impact = 0.5). Metabolic pathways that created a maximum impact for glioma classification were linked with glutamine/glutamate metabolism (impact = 0.5) and the alanine/aspartate/glutamate metabolism (impact = 0.48), as shown in Figure 5.

3. Discussion

Tumor development in the glial cells (the cells that typically modulate the brain microenvironment, homeostasis, and neurochemical balance in the CNS) is a complex process, often accompanied by reprogramming of key metabolic pathways and the rewiring of central carbon metabolism. Several studies have shown that the survival and growth of tumor cells is dependent on prompt availability of nutrients/metabolic fuels, which in turn is linked with activation and/or deactivation of several pathways, especially those of carbohydrate (e.g., glucose) and amino acids (e.g., glutamine, glutamate) metabolism [10,11,15,21,22]. As metabolic alterations in tumor cells have a pronounced impact on the profile of circulating metabolites, these may serve as an attractive target for non-invasive diagnosis of disease and prognosis of therapeutic response, without essentially going through the conventional MRI, CT scans, and invasive tissue biopsy/histopathology analyses. However, how these metabolic perturbations manifest in the context of different malignancies like gliomas (in particular, GBMs) is a formidable challenge for the oncologists and clinicians.
Previously, a targeted analysis of plasma metabolites using mass spectrometric methods identified 18 statistically significant metabolites that could differentiate LGG from the HGG samples [23,24,25]. Another study reported three plasma metabolites that were found to be associated with survival in GBM patients [26,27]. Yet, to the best of our knowledge, no study has reported liquid biopsy-based detection of glioma-associated metabolic signatures combining the untargeted HRMAS-NMR with ML. ML-based data mining algorithms, often conducted in an unsupervised manner, can reduce the human biasness and minimize analysis time, while processing the large datasets obtained from the metabolomics experiments. ML models have strong potential to be used in biomarker research as they can identify the best features contributing to a particular phenotype, classify diseases, and predict possible outcomes while utilizing the prior experiences obtained from the training datasets [28,29].
In the present study, using untargeted HRMAS-NMR spectroscopy and applying the ML linear logistic regression model, we interestingly identified a single metabolic marker “alanine” that could differentiate glioma from non-glioma samples with the highest predictive accuracy ever reported (precision 1.0; recall 0.96; F1 measure 0.98). Alanine is a glucogenic amino acid, which, following enzymatic conversion to pyruvate, enters the metabolic mainstream to provide energy and replenish the nutrient sink for rapidly proliferating tumor cells [30] Interestingly, alanine was the only top-scorer metabolite that was identified as a potential metabolic marker by all three ML models for glioma detection (Table 3; Figure 3 and Figure 4A,C). Other potential candidate metabolites that we identified for liquid biopsy-based detection and/or classification of gliomas include essential, non-essential, and branched-chain amino acids (e.g., Glu, Gln, Met, Leu, Ile, Val, Arg, Thr); neurotransmitters (NAA, GABA); fatty acids; and D-glucose (Figure 3).
Whereas the aggregate confusion matrix of all the ML algorithms (Table 2) classified the 16 HGG cases with 95–100% accuracy, it misclassified over 50% of LGG cases. In particular, logistic regression accurately predicted and classified all HGG samples, but, out of 09 LGG cases, it misclassified 05 as HGG. Extra tree classifier and random forest misclassified 05 and 07 LGG patients as HGG, respectively, and one HGG patient (the same patient in both models) as an LGG case. This LGG misclassification could primarily be the result of the small number of LGG cases (n = 9) used to train/test the ML algorithms. However, a deeper appreciation of the clinical record of the LGG subjects revealed that all misclassified LGGs were isocitrate dehydrogenase (IDH) wild-type cases. Studies have shown that tumors with IDH wild-type genes (also called IDH-negative) tend to progress far more aggressively to grade IV GBM and demonstrate poorer prognosis than their IDH mutant counterparts [31]. We speculated that, by analyzing the spatial distribution of metabolic changes, within the brain tissue and/or circulation using ML-assisted HRMAS-NMRS, one may predict the probability of LGG transition to HGG; however, further studies are warranted to validate the hypothesis.
Altered glutamate/glutamine metabolism is the hallmark of several cancers and, here, we report a significant decrease in plasma glutamine concentration in glioma samples. Glutamate/glutamine cycle in the brain is highly regulated; glutamate is a central amino acid required for neurotransmission, it acts as synaptic excitatory neurotransmitter, and accounts for more than 80% synapses in the brain [15,32]. After neurotransmission, glutamate in synaptic cleft is taken up by the astrocytes, metabolized to glutamine by an enzyme glutamine synthetase, and transported to neurons where it converts back to glutamate, thus completing the glutamine/glutamate cycle [32,33]. Compared with normal cells, the cancer cells are known to metabolize glutamine at higher levels to meet the ATP demands for biosynthesis of required proteins, lipids, and nucleic acids [15,34,35]. Glutamate homeostasis is found to be dysregulated in brain tumors, especially gliomas, and there are studies that report a higher concentration of glutamine in the tumor tissues [36,37]. In particular, the glial cells with IDH mutations have been identified for taking up high levels of glutamine to produce higher levels of 2-hydroxyglutarate [31,37,38]. This increased uptake of glutamine by glial cells may be responsible for lowering its levels in plasma (Figure 4A–C), hence its potential role as an attractive, non-invasive biomarker for detection and/or classification of gliomas.
Branched-chain amino acids (leucine, isoleucine, and valine) are utilized by the tumor cells for de novo synthesis of glutamate, which is important to fulfil the increased demand for glutamate [39,40]. Arginine, a semi-essential amino acid, was also found to be dysregulated in both comparisons, thus significantly contributing to glioma detection. Methionine is a vital amino acid; many different cancer cells have been found to be dependent on it for their growth. An increased requirement for exogenous methionine during cancer development is analogous to the high demands of cancer cells for glucose [41]. A study has reported it to be essential for colony formation and survival of glioma cells [42]. Methionine levels in the present study were found to be significantly decreased in the plasma of glioma patients, which indicates increased expense of this amino acid in the tumor microenvironment (Figure 4).
Lactate is another important metabolite detectable in brain tumors or areas of ischemic injury because of anaerobic glycolysis. High-grade tumors of adults show more prominent lactate peak as compared with low-grade tumors, while all pediatric brain tumors exhibit high lactate levels. Creatine (Cr) along with phosphocreatine is used normally in energy metabolism. Its decreased levels in tumor cells are likely to be the result of its consumption to replenishing energy under conditions when oxidative phosphorylation and anaerobic glycolysis fail to meet the high energy demands of progressive tumors. Lipids are associated with necrosis and are present in higher amounts in glioblastoma, lymphoma, abscess, or other areas of destruction of myelinated white matter and, subsequently, decrease in circulation [43]. Our results substantiate the previous studies signifying the role of dysregulated plasma metabolic profiling in detection and/or classification of gliomas [23,41,42,43,44].

4. Materials and Methods

4.1. Study Population and Sample Collection

The study population is comprised of 42 subjects that include both low-grade (LGG; n = 9) and high-grade glioma patients (HGG; n = 17) along with 16 healthy controls. The study design was duly approved by the Advanced Studies and Research Board (ASRB; 2402-3/2018), University of the Punjab, Lahore. LGG and HGG patients from Punjab Institute of Neurosciences (PINS) (diagnosed by expert oncologists using magnetic resonance imaging (MRI) or computerized tomography (CT) scan, undergoing surgical resection of the tumor) were recruited during March 2018–2019. Clinical and demographic data of the enrolled patients were obtained from medical record of the hospital. Patients’ recruitment and sample collection followed the principals of the Declaration of Helsinki for research involving human beings. Written informed consent from all the study subjects was obtained prior to drawing their blood sample.

4.2. Ex Vivo HRMAS-NMR

Peripheral blood (3 cc) from each patient (fasting state) was collected in Li-heparin tubes, centrifuged (300× g, 10 min) to prepare plasma within an hour of collection, and preserved in sterile tubes at −80 °C, as 200 µL aliquots, until further analyses.
The HRMAS-NMR analyses were performed on Bruker AVANCE 600 MHz spectrometer (Billerica, MA, USA) equipped with a triple nucleus (1 H, 13 C, 31 P) HRMAS probes. The spectra were acquired by adding 10 µL plasma sample in a 4 mm zirconia rotor with 12 μL Kel-F inserts; 2 µL D2O (Sigma Aldrich, St. Louis, MO, USA) with reference trimethylsilylpropanoic acid (TSP) was added for field locking. For all measurements, 4 °C temperature, 3600 ± 2 Hz spin-rate (to keep the rotation sidebands out of the acquisition window), and a rotor-synchronized Carr-Purcell Meibom-Gill (CPMG) pulse sequence (to function as a T2-filter) were used. To reduce the metabolites’ variation, each spectrum was recorded at 4 °C with and without continuous wave water suppression, with spectrometer frequency centered on water resonance [45].

4.3. Pre-Processing of the Spectral Data

Acorn NMR-Nuts (Livermore, CA, USA) software was used to process the acquired HRMAS-NMR spectra. Free induction decay (FID) file of each spectrum was subjected to a successive chain of commands to perform baseline correction (bc), exponential multiplication (em, 0.5 Hz line broadening), zero filling (zf), rotation control (rc = 68), Fourier transformation (ft), spectra reversing (sr), and automatic phasing (ap). All spectra were processed and aligned against TSP at 0 ppm that placed creatine (Cre) methyl resonance at 3.026 ppm. Peak intensities between the spectral regions 4.66 and 0.5 ppm were curve fitted to the accuracy of 0.01 ppm, and normalized to the relative spectral intensity (Rel_Int) according to the formula below.
( % Re l _ I n t m , s ) = ( E x p _ I n t m , s ) 100 / i = 1 251 ( E x p _ I n t i , s )
where (Exp_Intm,s) represents the experimental intensity value for spectral regions m (m = 1, 2, 3, …251) and samples s (s = 1, 2, 3, ……42), and i = 1 251 ( E x p _ I n t i , s ) represents the sum of intensity values measured for all selected 251 spectral regions, for each of the 42 samples.

4.4. ML-Assisted Data Analysis

Both linear (logistic regression) and non-linear (random forest, extra tree classifier) ML classification methods were used for the accurate prediction of the ‘spectral regions’ essentially linked with the glioma and non-glioma groups. The dataset was normalized by applying Z-score normalization followed by log scaling. Feature selection technique, involving analysis of variance (ANOVA), feature importance, and recursive feature elimination methods, was implemented to reduce the dimensionality of large dataset and to select the best spectral regions associated with the disease. The HRMAS data initially contained 417 spectral regions for analysis. However, after initial selection, only 251 regions were found to have a comparable numeric value; features containing more zero and/or missing values were all excluded from the dataset. Further ML analyses were performed with only those spectral regions that have had the highest impact on the response variable.
To ‘train’ and ‘test’ the dataset for the best possible predictions of glioma versus non-glioma controls, a comprehensive ML approach was followed wherein a wide choice of hyper-parameters was used for each model using GridSearchCV. A fivefold cross-validation was performed at each step to avoid overfitting problem in the dataset. In particular, the data were divided into five equal parts, 80% of which was used to train the ML model, while the remaining 20% was used to test the model function, reducing the chances of overfitting thenceforth. Different evaluation metrics, such as accuracy, recall, precision, and F1-measure (chosen for its best presentations in cases of the multiclass problems of imbalanced data), were computed for each model using the formulas:
Precision = True positive/True positive + False positive
Recall = True positive/True positive + False negative
F1-measure= 2 × Precision × Recall/Precision + Recall
The analyses were repeated for LGG versus HGG cases, and the highest value of F1 measure was recorded by looping through every model, feature, and hyper-parameters. The diagnostic potential of each predictive model was determined by the receiver operating characteristic curve (ROC) plots. A separate test dataset comprising six plasma samples from oligodendroglioma patients was also included to evaluate the tumor versus non-tumor predictive accuracy of ML models.

4.5. Statistical Analysis

Statistical analysis, performed using JMP-pro16 software, were used to further validate the spectral regions identified by ML algorithms. Shapiro–Wilk non-parametric test was applied to check the normality of the given dataset, while the Wilcoxon test to find out the statistically significant spectral regions for glioma versus control and LGG versus HGG groups, separately. A comprehensive platform MetaboAnalyst 5.0 (available at: https://www.metaboanalyst.ca/MetaboAnalyst/home.xhtml) (accessed on 13 July 2021) was used to calculate the spectral fold change (FC; mean value of change) between the two groups.

4.6. Metabolite’s Identification and Pathway Analysis

Literature search and publicly available databases such as Human Metabolome Database (HMDB), Biological Magnetic Resonance Databank (BMRB), and MetaboMiner and MetaboHunter, were used for the identification/assignment of metabolites to the corresponding top 30 spectral regions, predicted by the ML models. Further, the metabolic pathways dysregulated during the tumor progression were identified using MetaboAnalyst 5.0.
Metabolites of interest, quantified by selected distinctive unbiased NMR signals and identified to be important after ML-based analyses, were used as the input matrix. The pathway impact was calculated as the sum of the important measures of the matched metabolites normalized by the sum of the important measures of all metabolites in each pathway.

5. Conclusions

In conclusion, the advanced HRMAS-NMR in conjunction with ML algorithms may serve as a pathway to predict the glioma risk, occurrence, and/or grading easily, reliably, and accurately. The dysregulated circulating metabolite, alanine, may be tested non-invasively using routine spectrophotometric methods in the future, enabling real-time monitoring or objective evaluation of tumor progression, treatment response, and more individualized prognostication. The double-blind validation studies with a larger number of plasma samples from patients at different stages and grades of glioma are, however, necessary to cross-verify the reliability and accuracy of ML models and add support to our findings.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/metabo11080507/s1, Table S1: List of important spectral regions identified by two sets of ML analysis, Table S2: Results of test dataset (oligodedroglioma vs control) analysis using ML models.

Author Contributions

All authors have made substantial contribution in this work. S.F. performed experiments on HRMAS-NMR, raw data analysis, and wrote the initial draft. R.A. assisted in sample collection, processing, tagging, and literature search. The data analysis using ML algorithms was performed by Z.N., while F.B. provided support during ML data compilation process. A.A. facilitated the blood sampling and histopathological examinations. L.L.C. provided lab facilities and extended guidance to S.F. in NMR data handling, and reviewed the draft manuscript. The project conception, study design, overall research supervision, and critical review of the draft manuscript for intellectual inputs were done by S.S. All authors have read and agreed to the published version of the manuscript.

Funding

The authors acknowledge the partial funding support from the Higher Education Commission (HEC) and University of the Punjab, Lahore, Pakistan. S.F. was a recipient of HEC-IRSIP fellowship from Pakistan at the Massachusetts General Hospital, Harvard Medical School, USA. HRMAS-NMR measurements were made possible by USA NIH grants: S10OD023406 and R21CA243255, and by MGH Martinos Center for Biomedical Imaging. The funding agencies have no role in the conception or study design of the project.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Advanced Studies and Research Board (ASRB), University of the Punjab.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

The authors are thankful to the glioma patients, paramedical staff, and surgeons (General Hospital, Lahore) who volunteered their services and samples for this research.

Conflicts of Interest

The authors of the manuscript declare no potential conflicts.

References

  1. Ohgaki, H.; Kleihues, P. Epidemiology and etiology of gliomas. Acta Neuropathol. 2005, 109, 93–108. [Google Scholar] [CrossRef] [PubMed]
  2. Rasmussen, B.K.; Hansen, S.; Laursen, R.J.; Kosteljanetz, M.; Schultz, H.; Nørgård, B.M.; Guldberg, R.; Gradel, K.O. Epidemiology of glioma: Clinical characteristics, symptoms, and predictors of glioma patients grade I–IV in the the Danish Neuro-Oncology Registry. J. Neurooncol. 2017, 135, 571–579. [Google Scholar] [CrossRef] [PubMed]
  3. Diwanji, T.P.; Engelman, A.; Snider, J.W.; Mohindra, P. Epidemiology, diagnosis, and optimal management of glioma in adolescents and young adults. Adolesc. Health. Med. Ther. 2017, 8, 99. [Google Scholar] [CrossRef] [Green Version]
  4. Davis, M. GBM treatment overview. Clin. J. Oncol. Nurs. 2016, 20, 1–14. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Fernandes, C.; Costa, A.; Osório, L.; Lago, R.C.; Linhares, P.; Carvalho, B.; Caeiro, C. Current standards of care in glioblastoma therapy. Exon Publ. 2017, 197–241. [Google Scholar]
  6. Stupp, R.; Taillibert, S.; Kanner, A.A.; Kesari, S.; Steinberg, D.M.; Toms, S.A.; Taylor, L.P.; Lieberman, F.; Silvani, A.; Fink, K.L. Maintenance therapy with tumor-treating fields plus temozolomide vs temozolomide alone for glioblastoma: A randomized clinical trial. JAMA 2015, 314, 2535–2543. [Google Scholar] [CrossRef]
  7. Stupp, R.; Mason, W.P.; van den Bent, M.J.; Weller, M.; Fisher, B.; Taphoorn, M.J.; Belanger, K.; Brandes, A.A.; Marosi, C.; Bogdahn, U. Radiotherapy plus concomitant and adjuvant temozolomide for glioblastoma. N. Engl. J. Med. 2005, 352, 987–996. [Google Scholar] [CrossRef] [PubMed]
  8. Chen, C.; Zuo, W.; Yang, P.; Zhang, Y. Anti-PD-1, anti-VEGF, and temozolomide therapy in a patient with recurrent glioblastoma: A case report. J. Int. Med. Res. 2020, 48. [Google Scholar] [CrossRef]
  9. Wang, L.; Yekula, A.; Muralidharan, K.; Small, J.L.; Rosh, Z.S.; Kang, K.M.; Carter, B.S.; Balaj, L. Novel Gene Fusions in Glioblastoma Tumor Tissue and Matched Patient Plasma. Cancers 2020, 12, 1219. [Google Scholar] [CrossRef]
  10. Bi, J.; Chowdhry, S.; Wu, S.; Zhang, W.; Masui, K.; Mischel, P.S. Altered cellular metabolism in gliomas—an emerging landscape of actionable co-dependency targets. Nat. Rev. Cancer 2020, 20, 57–70. [Google Scholar] [CrossRef] [PubMed]
  11. Lyssiotis, C.A.; Nagrath, D. Metabolic reprogramming and vulnerabilities in cancer. Cancer 2020, 12, 90. [Google Scholar] [CrossRef] [Green Version]
  12. Ward, P.S.; Thompson, C.B. Metabolic reprogramming: A cancer hallmark even warburg did not anticipate. Cancer Cell 2012, 21, 297–308. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Chen, J.Q.; Russo, J. Dysregulation of glucose transport, glycolysis, TCA cycle and glutaminolysis by oncogenes and tumor suppressors in cancer cells. Biochim. Biophys. Acta 2012, 1826, 370–384. [Google Scholar] [CrossRef] [PubMed]
  14. McCann, C.; Kerr, E.M. Metabolic reprogramming: A friend or foe to cancer therapy? Cancers (Basel) 2021, 13, 3351. [Google Scholar] [CrossRef] [PubMed]
  15. Venneti, S.; Thompson, C.B. Metabolic Reprogramming in Brain Tumors. Annu. Rev. Pathol. 2017, 12, 515–545. [Google Scholar] [CrossRef] [PubMed]
  16. Guda, M.R.; Labak, C.M.; Omar, S.I.; Asuthkar, S.; Airala, S.; Tuszynski, J.; Tsung, A.J.; Velpula, K.K. GLUT1 and TUBB4 in glioblastoma could be efficacious targets. Cancers 2019, 11, 1308. [Google Scholar] [CrossRef] [Green Version]
  17. Cheng, L.; Ma, M.; Becerra, L.; Ptak, T.; Tracey, I.; Lackner, A.; Gonzalez, R. Quantitative neuropathology by high resolution magic angle spinning proton magnetic resonance spectroscopy. Proc. Natl. Acad. Sci. USA 1997, 94, 6408–6413. [Google Scholar] [CrossRef] [Green Version]
  18. Cheng, L.L.; Chang, I.-W.; Louis, D.N.; Gonzalez, R.G. Correlation of high-resolution magic angle spinning proton magnetic resonance spectroscopy with histopathology of intact human brain tumor specimens. Cancer Res. 1998, 58, 1825–1832. [Google Scholar]
  19. Dietz, C.; Ehret, F.; Palmas, F.; Vandergrift, L.A.; Jiang, Y.; Schmitt, V.; Dufner, V.; Habbel, P.; Nowak, J.; Cheng, L.L. Applications of high-resolution magic angle spinning MRS in biomedical studies II-Human diseases. NMR Biomed. 2017, 30. [Google Scholar] [CrossRef]
  20. Cheng, L.L.; Anthony, D.C.; Comite, A.R.; Black, P.M.; Tzika, A.A.; Gonzalez, R.G. Quantification of microheterogeneity in glioblastoma multiforme with ex vivo high-resolution magic-angle spinning (HRMAS) proton magnetic resonance spectroscopy. Neuro-oncology 2000, 2, 87–95. [Google Scholar] [CrossRef] [PubMed]
  21. Purohit, V.; Simeone, D.M.; Lyssiotis, C.A. Metabolic regulation of redox balance in cancer. Cancers 2019, 11, 955. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Natarajan, S.K.; Venneti, S. Glutamine metabolism in brain tumors. Cancers 2019, 11, 1628. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Zhao, H.; Heimberger, A.B.; Lu, Z.; Wu, X.; Hodges, T.R.; Song, R.; Shen, J. Metabolomics profiling in plasma samples from glioma patients correlates with tumor phenotypes. Oncotarget 2016, 7, 20486. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Ahmed, K.A.; Chinnaiyan, P. Applying metabolomics to understand the aggressive phenotype and identify novel therapeutic targets in glioblastoma. Metabolites 2014, 4, 740–750. [Google Scholar] [CrossRef] [Green Version]
  25. Jothi, J.; Vanisree, A.J.; Krishnaswamy, R. Metabolic variations among low grade and high grade gliomas–profiling by 1h nmr spectroscopy. J. Proteome Res. 2020. [Google Scholar] [CrossRef]
  26. Shen, J.; Song, R.; Hodges, T.R.; Heimberger, A.B.; Zhao, H. Identification of metabolites in plasma for predicting survival in glioblastoma. Mol. Carcinog. 2018, 57, 1078–1084. [Google Scholar] [CrossRef]
  27. Feng, S.; Liu, Y. Metabolomics of glioma. Adv. Exp. Med. Bio. 2021, 1280, 261–276. [Google Scholar] [CrossRef]
  28. Heinemann, J. Microbial Metabolomics; Springer: Berlin/Heidelberg, Germany, 2019; pp. 287–299. [Google Scholar] [CrossRef]
  29. Alakwaa, F.M.; Chaudhary, K.; Garmire, L.X. Deep learning accurately predicts estrogen receptor status in breast cancer metabolomics data. J. Proteome Res. 2018, 17, 337–347. [Google Scholar] [CrossRef]
  30. Ijare, O.; Baskin, D.; Pichumani, K. Cbmt-01. Alanine Fuels Energy Metabolism of Glioblastoma Cells. Neuro Oncol. 2019, 21 (Suppl. 6), vi32. [Google Scholar] [CrossRef]
  31. Waitkus, M.S.; Diplas, B.H.; Yan, H. Biological role and therapeutic potential of IDH mutations in cancer. Cancer Cell 2018, 34, 186–195. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Jiang, J.; Srivastava, S.; Zhang, J. Starve cancer cells of glutamine: Break the spell or make a hungry monster? Cancers 2019, 11, 804. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Márquez, J.; Alonso, F.J.; Matés, J.M.; Segura, J.A.; Martín-Rufián, M.; Campos-Sandoval, J.A. Glutamine addiction in gliomas. Neurochem. Res. 2017, 42, 1735–1746. [Google Scholar] [CrossRef] [PubMed]
  34. DeBerardinis, R.J.; Mancuso, A.; Daikhin, E.; Nissim, I.; Yudkoff, M.; Wehrli, S.; Thompson, C.B. Beyond aerobic glycolysis: Transformed cells can engage in glutamine metabolism that exceeds the requirement for protein and nucleotide synthesis. Proc. Natl. Acad. Sci. USA 2007, 104, 19345–19350. [Google Scholar] [CrossRef] [Green Version]
  35. Wise, D.R.; Thompson, C.B. Glutamine addiction: A new therapeutic target in cancer. Trends Biochem. Sci. 2010, 35, 427–433. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Robert, S.M.; Sontheimer, H. Glutamate transporters in the biology of malignant gliomas. Cell. Mol. Life. Sci. 2014, 71, 1839–1854. [Google Scholar] [CrossRef] [Green Version]
  37. Daye, D.; Wellen, K.E. Metabolic Reprogramming in Cancer: Unraveling the Role of Glutamine in Tumorigenesis; Elsevier: Amsterdam, The Netherlands, 2012; pp. 362–369. [Google Scholar] [CrossRef]
  38. Kallenberg, K.; Bock, H.C.; Helms, G.; Jung, K.; Wrede, A.; Buhk, J.-H.; Giese, A.; Frahm, J.; Strik, H.; Dechent, P. Untreated glioblastoma multiforme: Increased myo-inositol and glutamine levels in the contralateral cerebral hemisphere at proton MR spectroscopy. Radiology 2009, 253, 805–812. [Google Scholar] [CrossRef]
  39. Zhang, B.; Chen, Y.; Shi, X.; Zhou, M.; Bao, L.; Hatanpaa, K.J.; Patel, T.; DeBerardinis, R.J.; Wang, Y.; Luo, W. Regulation of branched-chain amino acid metabolism by hypoxia-inducible factor in glioblastoma. Cell. Mol. Life. Sci. 2020, 1–12. [Google Scholar] [CrossRef]
  40. Ananieva, E.A.; Wilkinson, A.C. Branched-chain amino acid metabolism in cancer. Curr. Opin. Clin. Nutr. Metab. Care. 2018, 21, 64. [Google Scholar] [CrossRef] [Green Version]
  41. Palanichamy, K.; Thirumoorthy, K.; Kanji, S.; Gordon, N.; Singh, R.; Jacob, J.R.; Sebastian, N.; Litzenberg, K.T.; Patel, D.; Bassett, E. Methionine and kynurenine activate oncogenic kinases in glioblastoma, and methionine deprivation compromises proliferation. Clin. Cancer Res. 2016, 22, 3513–3523. [Google Scholar] [CrossRef] [Green Version]
  42. Phillips, M.M.; Sheaff, M.T.; Szlosarek, P.W. Targeting arginine-dependent cancers with arginine-degrading enzymes: Opportunities and challenges. Cancer Res. Treat. 2013, 45, 251. [Google Scholar] [CrossRef] [PubMed]
  43. Wright, A.J.; Fellows, G.A.; Griffiths, J.R.; Wilson, M.; Bell, B.A.; Howe, F.A. Ex-vivo HRMAS of adult brain tumours: Metabolite quantification and assignment of tumour biomarkers. Mol. Cancer 2010, 9, 66. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Hoffman, R.M.; Stern, P.H.; Coalson, D.W.; Wallace, C.D.; Erbe, R.W. Methionine Dependence of Cancer and Aging; Springer: Berlin/Heidelberg, Germany, 2019; pp. 13–26. [Google Scholar] [CrossRef]
  45. Tilgner, M.; Vater, T.S.; Habbel, P.; Cheng, L.L. High-Resolution Magic Angle Spinning (HRMAS) NMR Methods in Metabolomics. Methods Mol. Biol. 2019, 2037, 49–67. [Google Scholar] [CrossRef]
Figure 1. Comparison of high-resolution magic angle spinning nuclear magnetic resonance spectroscopy (HRMAS-MRS) spectra of healthy (red), low-grade glioma (LGG, blue), and high-grade glioma (HGG, green) study subjects. Important peaks are labelled.
Figure 1. Comparison of high-resolution magic angle spinning nuclear magnetic resonance spectroscopy (HRMAS-MRS) spectra of healthy (red), low-grade glioma (LGG, blue), and high-grade glioma (HGG, green) study subjects. Important peaks are labelled.
Metabolites 11 00507 g001
Figure 2. (A). Mean score plots displaying the number of features (x-axis) and the corresponding F1 measures (y-axis) obtained for each of the three ML algorithms applied (i.e., extra tree classifier, logistic regression & random forest) for glioma vs control (ac) and LGG vs HGG analysis (df), respectively. (B). Receiver operating curve (ROC) plots depicting predictive accuracy of the three ML algorithms for glioma vs control (ac) and LGG vs HGG (df) groups, respectively. X-axis shows false positive rate while Y-axis indicates true positive rate on a scale of 0 to 1.0.
Figure 2. (A). Mean score plots displaying the number of features (x-axis) and the corresponding F1 measures (y-axis) obtained for each of the three ML algorithms applied (i.e., extra tree classifier, logistic regression & random forest) for glioma vs control (ac) and LGG vs HGG analysis (df), respectively. (B). Receiver operating curve (ROC) plots depicting predictive accuracy of the three ML algorithms for glioma vs control (ac) and LGG vs HGG (df) groups, respectively. X-axis shows false positive rate while Y-axis indicates true positive rate on a scale of 0 to 1.0.
Metabolites 11 00507 g002
Figure 3. Summary of dysregulated metabolites identified by the two sets of ML analysis i.e., glioma vs control and LGG vs HGG. Common metabolites can potentially be used for both detection and classification of gliomas.
Figure 3. Summary of dysregulated metabolites identified by the two sets of ML analysis i.e., glioma vs control and LGG vs HGG. Common metabolites can potentially be used for both detection and classification of gliomas.
Metabolites 11 00507 g003
Figure 4. (A) Volcano plot representing fold change in the dysregulated metabolites for glioma vs control group. Left and right panels of the plot respectively indicate the down- and upregulated spectral regions. Alanine (1.47) was identified as most significant metabolite with highest value of –log10(p). (B). Heatmap of top 30 spectral regions identified for glioma vs control group. Class 1 represents the glioma samples while class 2 the control group. Each colored cell corresponds to normalized intensity value of selected feature. The difference(s) between the two groups can be visualized by change in color. Upregulated and downregulated spectral regions can be seen clearly in two classes. (C) Box and Whisker plots of two groups (1=glioma, 2=control) representing mean change in intensity values of the selected metabolites. Notch of the plot indicates the median value while yellow diamond shows mean value of the feature in the group.
Figure 4. (A) Volcano plot representing fold change in the dysregulated metabolites for glioma vs control group. Left and right panels of the plot respectively indicate the down- and upregulated spectral regions. Alanine (1.47) was identified as most significant metabolite with highest value of –log10(p). (B). Heatmap of top 30 spectral regions identified for glioma vs control group. Class 1 represents the glioma samples while class 2 the control group. Each colored cell corresponds to normalized intensity value of selected feature. The difference(s) between the two groups can be visualized by change in color. Upregulated and downregulated spectral regions can be seen clearly in two classes. (C) Box and Whisker plots of two groups (1=glioma, 2=control) representing mean change in intensity values of the selected metabolites. Notch of the plot indicates the median value while yellow diamond shows mean value of the feature in the group.
Metabolites 11 00507 g004
Figure 5. Pathway analysis performed by MetaboAnalyst using metabolites identified for glioma vs control group (A) and the LGG vs HGG group (B).
Figure 5. Pathway analysis performed by MetaboAnalyst using metabolites identified for glioma vs control group (A) and the LGG vs HGG group (B).
Metabolites 11 00507 g005
Table 1. Demographics and clinical characteristics of study subjects.
Table 1. Demographics and clinical characteristics of study subjects.
CharacteristicsLGG
(grade I–II)
HGG
(grade III–IV)
Healthy ControlTotal
No. of Subjects (n)4 + 5 = 91 + 16 = 171642
Mean Age (Years)33 ± 1743 ± 1634 ± 1338 ± 16
Gender
  Male07110725
  Female02060917
Headache
  Yes0513018
  No04041624
Epileptic Seizures
  Yes0307010
  No06101632
Gait/Balance Changes
  Yes0513018
  No04041624
Neurologic Deficits
  Yes0715022
  No02021620
Reduced Vision
  Yes0308011
  No06091631
Cancer History
  Yes0103004
  No08141638
Table 2. Performance and confusion matrices of machine learning (ML) algorithms applied for the detection and classification of glioma in the study subjects.
Table 2. Performance and confusion matrices of machine learning (ML) algorithms applied for the detection and classification of glioma in the study subjects.
Sample TypeAlgorithm
Applied
Confusion Matrices of ML Algorithms 1No. of Features Identified 2GroupPrecisionRecallF1-Measure
Glioma vs. Control
(n = 42)
Extra Tree Classifier[16      0]104Control1.001.001.00
[0      26]Tumor1.001.001.00
Logistic Regression[16      0]01Control0.941.000.97
[1      25]Tumor1.000.960.98
Random
Forest
[16      0]158Control1.001.001.00
[0      26]Tumor1.001.001.00
LGG vs. HGG (n = 25)Extra Tree Classifier[4      5]107LGG0.800.440.57
[1      15]HGG0.750.940.83
Logistic Regression[4      5]92LGG1.000.440.62
[0      16]HGG0.761.000.86
Random
Forest
[2      7]88LGG0.670.220.33
[1      15]HGG0.680.940.79
1 Key used: [true negative false positive]; [false negative true positive]; 2 details of features/spectral regions identified in each case are provided in the Supplementary Material as Table S1.
Table 3. Top 30 spectral regions and their contributing metabolites identified by both sets of ML analysis given in descending order of importance along with their statistical analysis.
Table 3. Top 30 spectral regions and their contributing metabolites identified by both sets of ML analysis given in descending order of importance along with their statistical analysis.
Group 1: Glioma vs Control
ML AnalysisStatistical Validation
FeatureImportanceCorresponding metabolitep-valuelog2(FC)
Model: Logistics Regression
1.471.0Alanine<0.0001–3.7744
Model: Extra Tree Classifier
2.550.0642β-Alanine<0.0001−2.5717
2.120.0524Methionine<0.0001−3.3458
3.10.0491Phenylalanine<0.0001−7.3967
3.070.0415X<0.0001−2.5445
2.690.0405NAA<0.0001−4.7668
1.470.0383Alanine<0.0001−3.7744
0.980.0299Valine<0.0001−2.7959
1.710.028Leucine<0.0001−3.5926
3.130.025Glutathione0.0013−5.7592
2.450.0222Glutamine<0.0001−3.0169
1.880.022GABA0.0028−1.1886
3.950.0204Serine0.0003−2.4757
3.160.0182Alanine<0.0001−5.5148
3.690.0179α-glucose<0.0001−1.3382
1.440.0175Deoxycholic acid<0.000110.114
3.180.0168Taurocholic acid<0.0001−4.2704
4.110.0161Lactate<0.00012.7555
1.720.0157Arginine0.0003−2.6918
3.580.0149Threonine<0.0001−6.6301
2.350.0148Glutamate0.0022−1.4198
3.940.0143Serine<0.0001−2.3249
3.140.014Spermidine0.0119−4.8047
3.880.014Aspartic acid<0.00011.5565
2.090.0133Glutamate<0.0001−1.859
3.210.0126N-Acetylcholine0.0004−4.3221
1.040.0126Valine<0.0001−1.8092
1.010.0124Isoleucine0.00023.8182
1.890.0121GABA0.001−2.2659
3.680.0118α-glucose<0.00014.8319
1.990.0117Isoleucine0.0006−1.2174
Model: Random Forest
2.120.0491Methionine<0.0001−3.3458
0.980.0488Valine<0.0001−2.7959
3.160.0438Alanine<0.0001−5.5148
4.110.0344Lactate<0.00012.7555
1.470.0316Alanine<0.0001−3.7744
3.530.03Myoinositol0.00031.0134
1.450.025Isoleucine0.00022.9475
1.440.0209Deoxycholic acid<0.000110.114
1.990.0195Isoleucine0.0006−1.2174
2.690.019NAA<0.0001−4.7668
2.320.0172Glutamate0.00032.1726
2.130.0172Glutamine<0.0001−3.4879
2.450.0165Glutamine<0.0001−3.0169
3.660.0157Isoleucine0.00504.9598
3.720.015β-glucose0.3716NA
1.30.0146Fatty acids<0.0001−1.3971
3.940.0144Serine<0.0001−2.3249
3.690.0142α-glucose<0.0001−1.3382
1.010.0142Isoleucine0.00023.8182
0.920.0139Isoleucine0.00022.3071
2.090.0133Glutamate<0.0001−1.859
3.880.0117Aspartate<0.00011.5565
3.590.0114L-Valine0.001‘−6.9614
Age0.0107NANANA
1.170.0105X0.006−2.41
1.880.0105GABA0.0028−1.1886
3.460.0097β-glucose0.0178NA
3.220.0093Arginine0.0191NA
3.580.0091Threonine<0.0001−6.6301
3.470.0084β-glucose0.7007NA
Group 2: LGG vs HGG
ML AnalysisStatistical Validation
FeatureImportanceCorresponding metabolitep-valuelog2(FC)
Model: Logistics Regression
3.510.2059Choline0.030NA
2.010.1116NAA0.0411.239
3.20.0719P-Choline0.127NA
1.020.0531Valine0.040NA
3.480.0523β-Glucose0.040NA
2.390.0442Succinate/Malate0.040−1.1293
1.680.0441L-Arginine0.2271.678
1.820.0435X0.0151.4804
2.40.0416Succinate0.092NA
2.310.0386X0.207−1.647
3.50.0324NAA0.133−2.3824
2.30.0311GABA0.133−3.0367
1.840.026X0.0541.6947
3.530.026Myoinositol0.871NA
1.260.0236Isoleucine0.064−2.4499
1.990.0171Isoleucine0.039NA
2.450.0156L-Glutamine0.195NA
0.880.015Fatty acid0.239−2.0748
3.910.0149Creatine0.0541.2722
1.210.0125X0.0543.6568
2.250.0122Fatty acid0.2062.6199
0.90.012Fatty acid0.0411.6497
3.260.0028Taurine0.0091.1339
1.830X0.388−1.791
1.630X0.195NA
1.860GABA0.2061.9942
1.970Isoleucine0.1821.679
1.470Alanine0.6403.2255
1.870GABA0.195NA
1.880GABA0.195NA
Model: Extra Tree Classifier
3.60.0637Valine0.0151.5337
1.820.0352X0.0151.4804
2.50.0265NAA0.0151.3842
1.40.0254X0.0711.5535
2.40.0254Succinate0.0921.1227
1.970.0247Isoleucine0.1821.679
2.040.0244Glutamate0.0301.4984
0.990.0237Isoleucine0.1041.2242
2.530.0232X0.0152.0478
3.260.0232Taurine0.0091.1339
2.080.0211Glutamate0.036NA
3.590.0207Threonine0.053NA
0.910.0186Fatty acids0.0111.6453
0.980.017Valine0.2491.7767
1.990.0167Isoleucine0.0391.429
0.90.0167Fatty acids0.0411.6497
3.330.0164Scyllo inositol0.0151.5207
2.430.0162Glutamine0.222NA
1.220.016X0.103NA
3.910.0159Creatine0.0541.2722
2.720.0154NAA0.1362.2715
2.840.0149X0.0541.2673
3.510.0144Choline0.030
0.850.0143Tauro-cholicacid0.726−1.0208
1.680.0137Leucine0.2271.678
3.540.0131Glycine0.0501.1865
1.360.0127Fatty acids0.519NA
3.420.0124Taurine/Proline0.031NA
1.050.0123Valine0.0971.5722
2.320.0119Glutamate0.026NA
Model: Random Forest
3.60.0637Valine0.0151.5337
1.820.0352X0.0151.4804
2.50.0265NAA0.0151.3842
1.40.0254X0.0711.5535
2.40.0254Succinate0.0921.1227
1.970.0247Isoleucine0.1821.679
2.040.0244Glutamate0.0301.4984
0.990.0237Isoleucine0.1041.2242
2.530.0232X0.0152.0478
3.260.0232Taurine0.0091.1339
2.080.0211Glutamate0.036NA
3.590.0207Threonine0.053NA
0.910.0186Fatty acids0.0111.6453
0.980.017Valine0.2491.7767
1.990.0167Isoleucine0.0391.429
0.90.0167Fatty acids0.041NA
3.330.0164scyllo inositol0.0151.5207
2.430.0162Glutamine0.222NA
1.220.016X0.103NA
3.910.0159Creatine0.0541.2722
2.720.0154NAAG0.1362.2715
2.840.0149X0.0541.2673
3.510.0144Choline0.030NA
0.850.0143Tauro-cholicacid0.726−1.0208
1.680.0137Leucine0.2271.678
3.540.0131Glycine0.0501.1865
1.360.0127Fatty acids0.519NA
3.420.0124Taurine/Proline0.031NA
1.050.0123Valine0.0971.5722
2.320.0119Glutamate0.026NA
Key used: NAA = N-Acetylaspartate; X = Unknown metabolite; NA = Not available.
Table 4. List of metabolic pathways dysregulated for two comparison sets.
Table 4. List of metabolic pathways dysregulated for two comparison sets.
Sr.#PathwayGlioma vs. ControlLGG vs. HGG
HitsRawPFDRImpactHitsRawPFDRImpact
1Alanine, aspartate, and glutamate metabolism06//282.13 × 10−68.34 × 10−50.7075406//284.57 × 10−71.92 × 10−50.48398
2D-Glutamine and D-glutamate metabolism2//60.003320.039840.52//60.002070.019370.5
3Arginine biosynthesis4//144.00 × 10−50.000840.192893//140.000520.007390.19289
4Glutathione metabolism3//280.008290.087080.282812//280.0445310.12939
5Aminoacyl-tRNA biosynthesis13//488.36 × 10−157.02 × 10−130.1666710//482.30 × 10−111.93 × 10−90
6Glycine, serine, and threonine metabolism2//330.090680.400880.217074//330.000530.007390.24577
7Arginine and proline metabolism4//380.00230.03860.201725//386.13 × 10−50.001290.25763
8Phenylalanine, tyrosine, and tryptophan biosynthesis1//40.060570.314970.5NA
9Tryptophan metabolism1//410.4770510.14305NA
10Phenylalanine metabolism1//100.144880.553160.35714NA
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Firdous, S.; Abid, R.; Nawaz, Z.; Bukhari, F.; Anwer, A.; Cheng, L.L.; Sadaf, S. Dysregulated Alanine as a Potential Predictive Marker of Glioma—An Insight from Untargeted HRMAS-NMR and Machine Learning Data. Metabolites 2021, 11, 507. https://doi.org/10.3390/metabo11080507

AMA Style

Firdous S, Abid R, Nawaz Z, Bukhari F, Anwer A, Cheng LL, Sadaf S. Dysregulated Alanine as a Potential Predictive Marker of Glioma—An Insight from Untargeted HRMAS-NMR and Machine Learning Data. Metabolites. 2021; 11(8):507. https://doi.org/10.3390/metabo11080507

Chicago/Turabian Style

Firdous, Safia, Rizwan Abid, Zubair Nawaz, Faisal Bukhari, Ammar Anwer, Leo L. Cheng, and Saima Sadaf. 2021. "Dysregulated Alanine as a Potential Predictive Marker of Glioma—An Insight from Untargeted HRMAS-NMR and Machine Learning Data" Metabolites 11, no. 8: 507. https://doi.org/10.3390/metabo11080507

APA Style

Firdous, S., Abid, R., Nawaz, Z., Bukhari, F., Anwer, A., Cheng, L. L., & Sadaf, S. (2021). Dysregulated Alanine as a Potential Predictive Marker of Glioma—An Insight from Untargeted HRMAS-NMR and Machine Learning Data. Metabolites, 11(8), 507. https://doi.org/10.3390/metabo11080507

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop