Next Article in Journal
Using an Interactive Lattice Boltzmann Solver in Fluid Mechanics Instruction
Previous Article in Journal
A Discrete Approach to Meshless Lagrangian Solid Modeling
Previous Article in Special Issue
Anomalous Diffusion within the Transcriptome as a Bio-Inspired Computing Framework for Resilience
Article Menu

Export Article

Open AccessArticle
Computation 2017, 5(3), 34; doi:10.3390/computation5030034

Tensor-Based Semantically-Aware Topic Clustering of Biomedical Documents

1
Department of Informatics, Ionian University, Tsirigoti Square 7, Kerkyra 49100, Greece
2
Computer Engineering and Informatics Department, University of Patras, Patras 26504, Greece
*
Author to whom correspondence should be addressed.
Received: 28 April 2017 / Revised: 27 June 2017 / Accepted: 14 July 2017 / Published: 18 July 2017
View Full-Text   |   Download PDF [353 KB, uploaded 21 July 2017]   |  

Abstract

Biomedicine is a pillar of the collective, scientific effort of human self-discovery, as well as a major source of humanistic data codified primarily in biomedical documents. Despite their rigid structure, maintaining and updating a considerably-sized collection of such documents is a task of overwhelming complexity mandating efficient information retrieval for the purpose of the integration of clustering schemes. The latter should work natively with inherently multidimensional data and higher order interdependencies. Additionally, past experience indicates that clustering should be semantically enhanced. Tensor algebra is the key to extending the current term-document model to more dimensions. In this article, an alternative keyword-term-document strategy, based on scientometric observations that keywords typically possess more expressive power than ordinary text terms, whose algorithmic cornerstones are third order tensors and MeSH ontological functions, is proposed. This strategy has been compared against a baseline using two different biomedical datasets, the TREC (Text REtrieval Conference) genomics benchmark and a large custom set of cognitive science articles from PubMed. View Full-Text
Keywords: humanistic data; higher order data; medical information retrieval; topic clustering; PubMed; MeSH Ontology; tensor algebra; tucker factorization humanistic data; higher order data; medical information retrieval; topic clustering; PubMed; MeSH Ontology; tensor algebra; tucker factorization
Figures

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. (CC BY 4.0).

Scifeed alert for new publications

Never miss any articles matching your research from any publisher
  • Get alerts for new papers matching your research
  • Find out the new papers from selected authors
  • Updated daily for 49'000+ journals and 6000+ publishers
  • Define your Scifeed now

SciFeed Share & Cite This Article

MDPI and ACS Style

Drakopoulos, G.; Kanavos, A.; Karydis, I.; Sioutas, S.; G. Vrahatis, A. Tensor-Based Semantically-Aware Topic Clustering of Biomedical Documents. Computation 2017, 5, 34.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Computation EISSN 2079-3197 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top