Next Article in Journal
Improved LMD, Permutation Entropy and Optimized K-Means to Fault Diagnosis for Roller Bearings
Previous Article in Journal
Correlation between the Increasing Conductivity of Aqueous Solutions of Cation Chlorides with Time and the “Salting-Out” Properties of the Cations
Article Menu

Export Article

Open AccessArticle
Entropy 2016, 18(3), 69; doi:10.3390/e18030069

A Comparison of Four Approaches to Discretization Based on Entropy

1
Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, KS 66045, USA
2
Department of Expert Systems and Artificial Intelligence, University of Information Technology and Management, Rzeszow 35-225, Poland
This paper is an extended version of our paper published in Pattern Recognition and Machine Intelligence (PReMI). In proceedings of the 6th International Conference on PReMI, Warsaw, Poland, 30 June–3 July 2015.
*
Author to whom correspondence should be addressed.
Academic Editor: Andreas Holzinger
Received: 25 November 2015 / Revised: 7 January 2016 / Accepted: 14 February 2016 / Published: 25 February 2016
View Full-Text   |   Download PDF [235 KB, uploaded 25 February 2016]

Abstract

We compare four discretization methods, all based on entropy: the original C4.5 approach to discretization, two globalized methods, known as equal interval width and equal frequency per interval, and a relatively new method for discretization called multiple scanning using the C4.5 decision tree generation system. The main objective of our research is to compare the quality of these four methods using two criteria: an error rate evaluated by ten-fold cross-validation and the size of the decision tree generated by C4.5. Our results show that multiple scanning is the best discretization method in terms of the error rate and that decision trees generated from datasets discretized by multiple scanning are simpler than decision trees generated directly by C4.5 or generated from datasets discretized by both globalized discretization methods. View Full-Text
Keywords: data mining; discretization; numerical attributes; entropy data mining; discretization; numerical attributes; entropy
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. (CC BY 4.0).

Scifeed alert for new publications

Never miss any articles matching your research from any publisher
  • Get alerts for new papers matching your research
  • Find out the new papers from selected authors
  • Updated daily for 49'000+ journals and 6000+ publishers
  • Define your Scifeed now

SciFeed Share & Cite This Article

MDPI and ACS Style

Grzymala-Busse, J.W.; Mroczek, T. A Comparison of Four Approaches to Discretization Based on Entropy. Entropy 2016, 18, 69.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Entropy EISSN 1099-4300 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top