Next Article in Journal
Anomalous Advection-Dispersion Equations within General Fractional-Order Derivatives: Models and Series Solutions
Next Article in Special Issue
Patent Keyword Extraction Algorithm Based on Distributed Representation for Patent Classification
Previous Article in Journal
Adaptive Diagnosis for Rotating Machineries Using Information Geometrical Kernel-ELM Based on VMD-SVD
Previous Article in Special Issue
Cross Entropy Method Based Hybridization of Dynamic Group Optimization Algorithm
Open AccessArticle

Using Entropy in Web Usage Data Preprocessing

by Michal Munk 1 and Lubomir Benko 2,*
1
Department of Informatics, Constantine the Philosopher University in Nitra, Tr. A. Hlinku 1, 949 74 Nitra, Slovakia
2
Institute of System Engineering and Informatics, University of Pardubice, Studentska 95, 532 10 Pardubice, Czech Republic
*
Author to whom correspondence should be addressed.
Entropy 2018, 20(1), 67; https://doi.org/10.3390/e20010067
Received: 30 November 2017 / Revised: 10 January 2018 / Accepted: 13 January 2018 / Published: 22 January 2018
(This article belongs to the Special Issue Entropy-based Data Mining)
The paper is focused on an examination of the use of entropy in the field of web usage mining. Entropy creates an alternative possibility of determining the ratio of auxiliary pages in the session identification using the Reference Length method. The experiment was conducted on two different web portals. The first log file was obtained from a course of virtual learning environment web portal. The second log file was received from the web portal with anonymous access. A comparison of the results of entropy estimation of the ratio of auxiliary pages and a sitemap estimation of the ratio of auxiliary pages showed that in the case of sitemap abundance, entropy could be a full-valued substitution for the estimate of the ratio of auxiliary pages. View Full-Text
Keywords: data preprocessing; information entropy; web usage mining; session identification; Reference Length data preprocessing; information entropy; web usage mining; session identification; Reference Length
Show Figures

Figure 1

MDPI and ACS Style

Munk, M.; Benko, L. Using Entropy in Web Usage Data Preprocessing. Entropy 2018, 20, 67.

Show more citation formats Show less citations formats
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Back to TopTop