Next Article in Journal
The Key Role of Purine Metabolism in the Folate-Dependent Phenotype of Autism Spectrum Disorders: An In Silico Analysis
Previous Article in Journal
Oxidative Stress Modulation by Cameroonian Spice Extracts in HepG2 Cells: Involvement of Nrf2 and Improvement of Glucose Uptake
Previous Article in Special Issue
Profiling of Chlorogenic Acids from Bidens pilosa and Differentiation of Closely Related Positional Isomers with the Aid of UHPLC-QTOF-MS/MS-Based In-Source Collision-Induced Dissociation
Open AccessArticle

Pathway-Activity Likelihood Analysis and Metabolite Annotation for Untargeted Metabolomics Using Probabilistic Modeling

Department of Computer Science, Tufts University, Medford, MA 02155, USA
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Metabolites 2020, 10(5), 183; https://doi.org/10.3390/metabo10050183
Received: 7 March 2020 / Revised: 19 April 2020 / Accepted: 27 April 2020 / Published: 3 May 2020
(This article belongs to the Special Issue Metabolomics Tools to Accelerate Natural Product Discovery)
Motivation: Untargeted metabolomics comprehensively characterizes small molecules and elucidates activities of biochemical pathways within a biological sample. Despite computational advances, interpreting collected measurements and determining their biological role remains a challenge. Results: To interpret measurements, we present an inference-based approach, termed Probabilistic modeling for Untargeted Metabolomics Analysis (PUMA). Our approach captures metabolomics measurements and the biological network for the biological sample under study in a generative model and uses stochastic sampling to compute posterior probability distributions. PUMA predicts the likelihood of pathways being active, and then derives probabilistic annotations, which assign chemical identities to measurements. Unlike prior pathway analysis tools that analyze differentially active pathways, PUMA defines a pathway as active if the likelihood that the path generated the observed measurements is above a particular (user-defined) threshold. Due to the lack of “ground truth” metabolomics datasets, where all measurements are annotated and pathway activities are known, PUMA is validated on synthetic datasets that are designed to mimic cellular processes. PUMA, on average, outperforms pathway enrichment analysis by 8%. PUMA is applied to two case studies. PUMA suggests many biological meaningful pathways as active. Annotation results were in agreement to those obtained using other tools that utilize additional information in the form of spectral signatures. Importantly, PUMA annotates many measurements, suggesting 23 chemical identities for metabolites that were previously only identified as isomers, and a significant number of additional putative annotations over spectral database lookups. For an experimentally validated 50-compound dataset, annotations using PUMA yielded 0.833 precision and 0.676 recall. View Full-Text
Keywords: machine learning; inference; untargeted metabolomics; biological network; metabolic model machine learning; inference; untargeted metabolomics; biological network; metabolic model
Show Figures

Figure 1

MDPI and ACS Style

Hosseini, R.; Hassanpour, N.; Liu, L.-P.; Hassoun, S. Pathway-Activity Likelihood Analysis and Metabolite Annotation for Untargeted Metabolomics Using Probabilistic Modeling. Metabolites 2020, 10, 183.

Show more citation formats Show less citations formats
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Search more from Scilit
 
Search
Back to TopTop