A Comparison of Four Approaches to Discretization Based on Entropy†
AbstractWe compare four discretization methods, all based on entropy: the original C4.5 approach to discretization, two globalized methods, known as equal interval width and equal frequency per interval, and a relatively new method for discretization called multiple scanning using the C4.5 decision tree generation system. The main objective of our research is to compare the quality of these four methods using two criteria: an error rate evaluated by ten-fold cross-validation and the size of the decision tree generated by C4.5. Our results show that multiple scanning is the best discretization method in terms of the error rate and that decision trees generated from datasets discretized by multiple scanning are simpler than decision trees generated directly by C4.5 or generated from datasets discretized by both globalized discretization methods. View Full-Text
Share & Cite This Article
Grzymala-Busse, J.W.; Mroczek, T. A Comparison of Four Approaches to Discretization Based on Entropy. Entropy 2016, 18, 69.
Grzymala-Busse JW, Mroczek T. A Comparison of Four Approaches to Discretization Based on Entropy. Entropy. 2016; 18(3):69.Chicago/Turabian Style
Grzymala-Busse, Jerzy W.; Mroczek, Teresa. 2016. "A Comparison of Four Approaches to Discretization Based on Entropy." Entropy 18, no. 3: 69.
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.