Next Article in Journal
Creating a Multimodal Translation Tool and Testing Machine Translation Integration Using Touch and Voice
Previous Article in Journal
IGR Token-Raw Material and Ingredient Certification of Recipe Based Foods Using Smart Contracts
Article Menu

Export Article

Open AccessArticle
Informatics 2019, 6(1), 12; https://doi.org/10.3390/informatics6010012

Improvement in the Efficiency of a Distributed Multi-Label Text Classification Algorithm Using Infrastructure and Task-Related Data

Department of Cybernetics and Artificial Intelligence, Technical University Košice, Letná 9/A, 040 01 Košice, Slovakia
*
Author to whom correspondence should be addressed.
Received: 2 January 2019 / Revised: 27 February 2019 / Accepted: 9 March 2019 / Published: 18 March 2019
  |  
PDF [2304 KB, uploaded 19 March 2019]
  |  

Abstract

Distributed computing technologies allow a wide variety of tasks that use large amounts of data to be solved. Various paradigms and technologies are already widely used, but many of them are lacking when it comes to the optimization of resource usage. The aim of this paper is to present the optimization methods used to increase the efficiency of distributed implementations of a text-mining model utilizing information about the text-mining task extracted from the data and information about the current state of the distributed environment obtained from a computational node, and to improve the distribution of the task on the distributed infrastructure. Two optimization solutions are developed and implemented, both based on the prediction of the expected task duration on the existing infrastructure. The solutions are experimentally evaluated in a scenario where a distributed tree-based multi-label classifier is built based on two standard text data collections. View Full-Text
Keywords: text classification; multi-label classification; distributed text-mining; task assignment; resource optimization; grid computing text classification; multi-label classification; distributed text-mining; task assignment; resource optimization; grid computing
Figures

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).
SciFeed

Share & Cite This Article

MDPI and ACS Style

Sarnovsky, M.; Olejnik, M. Improvement in the Efficiency of a Distributed Multi-Label Text Classification Algorithm Using Infrastructure and Task-Related Data. Informatics 2019, 6, 12.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Informatics EISSN 2227-9709 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top