Emerging Pattern-Based Clustering of Web Users Utilizing a Simple Page-Linked Graph
AbstractWeb usage mining is a popular research area in data mining. With the extensive use of the Internet, it is essential to learn about the favorite web pages of its users and to cluster web users in order to understand the structural patterns of their usage behavior. In this paper, we propose an efficient approach to determining favorite web pages by generating large web pages, and emerging patterns of generated simple page-linked graphs. We identify the favorite web pages of each user by eliminating noise due to overall popular pages, and by clustering web users according to the generated emerging patterns. Afterwards, we label the clusters by using Term Frequency-Inverse Document Frequency (TF-IDF). In the experiments, we evaluate the parameters used in our proposed approach, discuss the effect of the parameters on generating emerging patterns, and analyze the results from clustering web users. The results of the experiments prove that the exact patterns generated in the emerging-pattern step eliminate the need to consider noise pages, and consequently, this step can improve the efficiency of subsequent mining tasks. Our proposed approach is capable of clustering web users from web log data. View Full-Text
Share & Cite This Article
Yu, X.; Li, M.; Kim, K.A.; Chung, J.; Ryu, K.H. Emerging Pattern-Based Clustering of Web Users Utilizing a Simple Page-Linked Graph. Sustainability 2016, 8, 239.
Yu X, Li M, Kim KA, Chung J, Ryu KH. Emerging Pattern-Based Clustering of Web Users Utilizing a Simple Page-Linked Graph. Sustainability. 2016; 8(3):239.Chicago/Turabian Style
Yu, Xiuming; Li, Meijing; Kim, Kyung A.; Chung, Jimoon; Ryu, Keun H. 2016. "Emerging Pattern-Based Clustering of Web Users Utilizing a Simple Page-Linked Graph." Sustainability 8, no. 3: 239.
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.