Metabolites 2012, 2(4), 775-795; doi:10.3390/metabo2040775
Review

A Guideline to Univariate Statistical Analysis for LC/MS-Based Untargeted Metabolomics-Derived Data

Maria Vinaixa 1,2,3,*, Sara Samino 1,3, Isabel Saez 4,5, Jordi Duran 2,4, Joan J. Guinovart 2,4,5 and Oscar Yanes 1,2,3,*
1 Metabolomics Platform, Campus Sescelades, Edifici N2, Rovira i Virgili University, Tarragona 43007, Spain
2 Spanish Biomedical Research Center in Diabetes and Associated Metabolic Disorders (CIBERDEM), Passeig Bonanova 69, Barcelona 08017, Spain
3 Institut d’Investigació Biomédica Pere Virgili (IISPV), C/Sant Llorenç, 21, Reus 43201, Spain
4 Institute for Research in Biomedicine (IRB Barcelona), Barcelona 08028, Spain
5 Department of Biochemistry and Molecular Biology, University of Barcelona, Barcelona 08028, Spain
* Authors to whom correspondence should be addressed.
Received: 2 August 2012; in revised form: 2 October 2012 / Accepted: 10 October 2012 / Published: 18 October 2012
(This article belongs to the Special Issue Analytical Techniques in Metabolomics)
Abstract: Several metabolomic software programs provide methods for peak picking, retention time alignment and quantification of metabolite features in LC/MS-based metabolomics. Statistical analysis, however, is needed to discover which features are significantly altered between samples. Metabolites can then be unequivocally identified by comparing the retention time and MS/MS data of a model compound with those of the altered feature of interest in the research sample. This paper presents a comprehensive overview of a statistical analysis workflow for ranking the relevant metabolite features that will be selected for further MS/MS experiments. We focus on univariate data analysis applied in parallel to all detected features. The characteristics and challenges of this analysis are discussed and illustrated using four different real LC/MS untargeted metabolomic datasets. Using these high-dimensional LC/MS datasets, we demonstrate the influence of meeting or violating the mathematical assumptions on which univariate statistical tests rely. Issues in data analysis such as determination of sample size, analytical variation, the assumptions of normality and homoscedasticity, and correction for multiple testing are discussed and illustrated in the context of our four untargeted LC/MS working examples.
Keywords: univariate; metabolomics; mass spectrometry
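
As an illustration of the multiple-testing issue raised in the abstract, the following minimal sketch adjusts a set of per-feature p-values with the Benjamini-Hochberg false discovery rate procedure. The choice of this particular procedure, the function name `benjamini_hochberg`, the 0.05 threshold and the p-values themselves are assumptions made for illustration; the abstract does not state which correction the article applies.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, so that adjusted values
    # never increase as the raw p-value decreases
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical p-values, one per metabolite feature (illustrative only)
p_values = [0.001, 0.008, 0.039, 0.041, 0.60]
q_values = benjamini_hochberg(p_values)

# Features still flagged as altered at an assumed 5% false discovery rate
significant = [q <= 0.05 for q in q_values]
print(list(zip(p_values, q_values, significant)))
```

In this toy case four of the five raw p-values fall below 0.05, but only two features remain flagged after adjustment, which is the kind of effect a per-feature univariate workflow has to account for when thousands of features are tested in parallel.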

Cite This Article

MDPI and ACS Style

Vinaixa, M.; Samino, S.; Saez, I.; Duran, J.; Guinovart, J.J.; Yanes, O. A Guideline to Univariate Statistical Analysis for LC/MS-Based Untargeted Metabolomics-Derived Data. Metabolites 2012, 2, 775-795.

AMA Style

Vinaixa M, Samino S, Saez I, Duran J, Guinovart JJ, Yanes O. A Guideline to Univariate Statistical Analysis for LC/MS-Based Untargeted Metabolomics-Derived Data. Metabolites. 2012; 2(4):775-795.

Chicago/Turabian Style

Vinaixa, Maria; Samino, Sara; Saez, Isabel; Duran, Jordi; Guinovart, Joan J.; Yanes, Oscar. 2012. "A Guideline to Univariate Statistical Analysis for LC/MS-Based Untargeted Metabolomics-Derived Data." Metabolites 2, no. 4: 775-795.

Metabolites EISSN 2218-1989 Published by MDPI AG, Basel, Switzerland