Open Access Article
Entropy 2018, 20(1), 67

Using Entropy in Web Usage Data Preprocessing

Department of Informatics, Constantine the Philosopher University in Nitra, Tr. A. Hlinku 1, 949 74 Nitra, Slovakia
Institute of System Engineering and Informatics, University of Pardubice, Studentska 95, 532 10 Pardubice, Czech Republic
Author to whom correspondence should be addressed.
Received: 30 November 2017 / Revised: 10 January 2018 / Accepted: 13 January 2018 / Published: 22 January 2018
(This article belongs to the Special Issue Entropy-based Data Mining)


This paper examines the use of entropy in web usage mining. Entropy offers an alternative way to determine the ratio of auxiliary pages for session identification using the Reference Length method. The experiment was conducted on two different web portals. The first log file was obtained from a course in a virtual learning environment web portal; the second came from a web portal with anonymous access. A comparison of the entropy-based and sitemap-based estimates of the ratio of auxiliary pages showed that, in the case of sitemap abundance, entropy could be a full-valued substitute for estimating the ratio of auxiliary pages.
Keywords: data preprocessing; information entropy; web usage mining; session identification; Reference Length
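The two building blocks named in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation. It assumes the exponential-distribution form of the Reference Length method commonly described in the web usage mining literature, where the cutoff time is C = -ln(1 - p) / λ, p is the ratio of auxiliary pages, and λ is the reciprocal of the mean time spent on a page. The abstract does not state how the entropy value is turned into an estimate of p, so that step is deliberately left out and p is supplied by the caller. All function and parameter names are hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(pages):
    """Shannon entropy (in bits) of the empirical page-visit distribution."""
    counts = Counter(pages)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def reference_length_cutoff(times, aux_ratio):
    """Cutoff time C = -ln(1 - p) / lambda, with lambda = 1 / mean(times).

    Assumption: time spent on auxiliary pages is exponentially distributed.
    aux_ratio (p) must come from an external estimate, e.g. a sitemap.
    """
    if not 0.0 < aux_ratio < 1.0:
        raise ValueError("aux_ratio must lie strictly between 0 and 1")
    rate = 1.0 / (sum(times) / len(times))
    return -math.log(1.0 - aux_ratio) / rate


def split_sessions(visits, cutoff):
    """Split (page, seconds) visits into sessions.

    A page viewed longer than the cutoff is treated as a content page
    and closes the current session; shorter views count as auxiliary.
    """
    sessions, current = [], []
    for page, seconds in visits:
        current.append(page)
        if seconds > cutoff:
            sessions.append(current)
            current = []
    if current:  # trailing auxiliary pages with no closing content page
        sessions.append(current)
    return sessions
```

In this sketch a larger ratio of auxiliary pages gives a longer cutoff, so fewer page views are classified as content pages and the identified sessions become longer.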


This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

MDPI and ACS Style

Munk, M.; Benko, L. Using Entropy in Web Usage Data Preprocessing. Entropy 2018, 20, 67.


Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers.

Entropy, EISSN 1099-4300, published by MDPI AG, Basel, Switzerland