Next Article in Journal
A Novel Insight into Functional Divergence of the MST Gene Family in Rice Based on Comprehensive Expression Patterns
Previous Article in Journal
Chicken Embryonic-Stem Cells Are Permissive to Poxvirus Recombinant Vaccine Vectors
Previous Article in Special Issue
IPCT: Integrated Pharmacogenomic Platform of Human Cancer Cell Lines and Tissues
Article Menu
Issue 3 (March) cover image

Export Article

Open AccessReview

Challenges in the Integration of Omics and Non-Omics Data

Genetic and Molecular Epidemiology Group, Spanish National Cancer Research Centre (CNIO), and CIBERONC, Melchor Fernández Almagro 3, 28029 Madrid, Spain
Biosciences Department, University of Vic—Central University of Catalonia, Carrer de la Laura 13, 08570 Vic, Spain
Authors to whom correspondence should be addressed.
Genes 2019, 10(3), 238;
Received: 2 January 2019 / Revised: 5 March 2019 / Accepted: 14 March 2019 / Published: 20 March 2019
(This article belongs to the Special Issue Systems Analytics and Integration of Big Omics Data)
PDF [618 KB, uploaded 27 March 2019]
  |     |  


Omics data integration is already a reality. However, few omics-based algorithms show enough predictive ability to be implemented into clinics or public health domains. Clinical/epidemiological data tend to explain most of the variation of health-related traits, and its joint modeling with omics data is crucial to increase the algorithm’s predictive ability. Only a small number of published studies performed a “real” integration of omics and non-omics (OnO) data, mainly to predict cancer outcomes. Challenges in OnO data integration regard the nature and heterogeneity of non-omics data, the possibility of integrating large-scale non-omics data with high-throughput omics data, the relationship between OnO data (i.e., ascertainment bias), the presence of interactions, the fairness of the models, and the presence of subphenotypes. These challenges demand the development and application of new analysis strategies to integrate OnO data. In this contribution we discuss different attempts of OnO data integration in clinical and epidemiological studies. Most of the reviewed papers considered only one type of omics data set, mainly RNA expression data. All selected papers incorporated non-omics data in a low-dimensionality fashion. The integrative strategies used in the identified papers adopted three modeling methods: Independent, conditional, and joint modeling. This review presents, discusses, and proposes integrative analytical strategies towards OnO data integration. View Full-Text
Keywords: data integration; omics data; genomics; RNA expression; non-omics data; clinical data; epidemiological data; challenges; integrative analytics; joint modeling data integration; omics data; genomics; RNA expression; non-omics data; clinical data; epidemiological data; challenges; integrative analytics; joint modeling

Graphical abstract

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Share & Cite This Article

MDPI and ACS Style

López de Maturana, E.; Alonso, L.; Alarcón, P.; Martín-Antoniano, I.A.; Pineda, S.; Piorno, L.; Calle, M.L.; Malats, N. Challenges in the Integration of Omics and Non-Omics Data. Genes 2019, 10, 238.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics



[Return to top]
Genes EISSN 2073-4425 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top