Next Article in Journal
MolNetEnhancer: Enhanced Molecular Networks by Integrating Metabolome Mining and Annotation Tools
Next Article in Special Issue
Metabolomics Analytics Workflow for Epidemiological Research: Perspectives from the Consortium of Metabolomics Studies (COMETS)
Previous Article in Journal
1H-NMR Metabolomics Identifies Significant Changes in Metabolism over Time in a Porcine Model of Severe Burn and Smoke Inhalation
Previous Article in Special Issue
A Single Visualization Technique for Displaying Multiple Metabolite–Phenotype Associations

Statistical Workflow for Feature Selection in Human Metabolomics Data

Department of Statistics, University of Florida, Gainesville, FL 32611, USA
Cardiovascular Division, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA
Departments of Medicine & Pharmacology, University of California San Diego, La Jolla, CA 92093, USA
Preventive Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
National Institute for Health and Welfare, FI 00271 Helsinki, Finland
Department of Medicine, Turku University Hospital and Univesity of Turku, FI 20521 Turrku, Finland
Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
Framingham Heart Study, Framingham, MA 01701, USA
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Metabolites 2019, 9(7), 143;
Received: 30 April 2019 / Revised: 3 July 2019 / Accepted: 10 July 2019 / Published: 12 July 2019
(This article belongs to the Special Issue Metabolomics in Epidemiological Studies)
High-throughput metabolomics investigations, when conducted in large human cohorts, represent a potentially powerful tool for elucidating the biochemical diversity underlying human health and disease. Large-scale metabolomics data sources, generated using either targeted or nontargeted platforms, are becoming more common. Appropriate statistical analysis of these complex high-dimensional data will be critical for extracting meaningful results from such large-scale human metabolomics studies. Therefore, we consider the statistical analytical approaches that have been employed in prior human metabolomics studies. Based on the lessons learned and collective experience to date in the field, we offer a step-by-step framework for pursuing statistical analyses of cohort-based human metabolomics data, with a focus on feature selection. We discuss the range of options and approaches that may be employed at each stage of data management, analysis, and interpretation and offer guidance on the analytical decisions that need to be considered over the course of implementing a data analysis workflow. Certain pervasive analytical challenges facing the field warrant ongoing focused research. Addressing these challenges, particularly those related to analyzing human metabolomics data, will allow for more standardization of as well as advances in how research in the field is practiced. In turn, such major analytical advances will lead to substantial improvements in the overall contributions of human metabolomics investigations. View Full-Text
Keywords: statistical methods; large-scale metabolomics; high-dimensional data statistical methods; large-scale metabolomics; high-dimensional data
Show Figures

Figure 1

MDPI and ACS Style

Antonelli, J.; Claggett, B.L.; Henglin, M.; Kim, A.; Ovsak, G.; Kim, N.; Deng, K.; Rao, K.; Tyagi, O.; Watrous, J.D.; Lagerborg, K.A.; Hushcha, P.V.; Demler, O.V.; Mora, S.; Niiranen, T.J.; Pereira, A.C.; Jain, M.; Cheng, S. Statistical Workflow for Feature Selection in Human Metabolomics Data. Metabolites 2019, 9, 143.

AMA Style

Antonelli J, Claggett BL, Henglin M, Kim A, Ovsak G, Kim N, Deng K, Rao K, Tyagi O, Watrous JD, Lagerborg KA, Hushcha PV, Demler OV, Mora S, Niiranen TJ, Pereira AC, Jain M, Cheng S. Statistical Workflow for Feature Selection in Human Metabolomics Data. Metabolites. 2019; 9(7):143.

Chicago/Turabian Style

Antonelli, Joseph, Brian L. Claggett, Mir Henglin, Andy Kim, Gavin Ovsak, Nicole Kim, Katherine Deng, Kevin Rao, Octavia Tyagi, Jeramie D. Watrous, Kim A. Lagerborg, Pavel V. Hushcha, Olga V. Demler, Samia Mora, Teemu J. Niiranen, Alexandre C. Pereira, Mohit Jain, and Susan Cheng. 2019. "Statistical Workflow for Feature Selection in Human Metabolomics Data" Metabolites 9, no. 7: 143.

Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

Back to TopTop