Next Article in Journal
A Soft Body Physics Simulator with Computational Offloading to the Cloud
Next Article in Special Issue
A Mobile Acquisition System and a Method for Hips Sway Fluency Assessment
Previous Article in Journal / Special Issue
Towards Expert-Based Speed–Precision Control in Early Simulator Training for Novice Surgeons
Article Menu

Export Article

Open AccessArticle
Information 2018, 9(12), 317; https://doi.org/10.3390/info9120317

LICIC: Less Important Components for Imbalanced Multiclass Classification

1
Georgia Institute of Technology, Atlanta, GA 30332, USA
2
Dipartimento di Informatica, Università degli studi di Bari, 70121 Bari, Italy
*
Author to whom correspondence should be addressed.
Received: 22 October 2018 / Revised: 19 November 2018 / Accepted: 6 December 2018 / Published: 9 December 2018
(This article belongs to the Special Issue eHealth and Artificial Intelligence)
Full-Text   |   PDF [3647 KB, uploaded 9 December 2018]   |  

Abstract

Multiclass classification in cancer diagnostics, using DNA or Gene Expression Signatures, but also classification of bacteria species fingerprints in MALDI-TOF mass spectrometry data, is challenging because of imbalanced data and the high number of dimensions with respect to the number of instances. In this study, a new oversampling technique called LICIC will be presented as a valuable instrument in countering both class imbalance, and the famous “curse of dimensionality” problem. The method enables preservation of non-linearities within the dataset, while creating new instances without adding noise. The method will be compared with other oversampling methods, such as Random Oversampling, SMOTE, Borderline-SMOTE, and ADASYN. F1 scores show the validity of this new technique when used with imbalanced, multiclass, and high-dimensional datasets. View Full-Text
Keywords: imbalanced learn; smote; SVM; KPCA; kernel; class imbalance imbalanced learn; smote; SVM; KPCA; kernel; class imbalance
Figures

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).
SciFeed

Share & Cite This Article

MDPI and ACS Style

Dentamaro, V.; Impedovo, D.; Pirlo, G. LICIC: Less Important Components for Imbalanced Multiclass Classification. Information 2018, 9, 317.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Information EISSN 2078-2489 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top