Next Article in Journal
Forecasting Electricity Consumption Using an Improved Grey Prediction Model
Previous Article in Journal
Construction of Complex Network with Multiple Time Series Relevance
Previous Article in Special Issue
LOD for Data Warehouses: Managing the Ecosystem Co-Evolution
Article Menu

Export Article

Open AccessArticle
Information 2018, 9(8), 203; https://doi.org/10.3390/info9080203

Chinese Microblog Topic Detection through POS-Based Semantic Expansion

1
School of Information, Beijing Wuzi University, Beijing 101149, China
2
National Center for Materials Service Safety, University of Science and Technology Beijing, Beijing 100083, China
*
Author to whom correspondence should be addressed.
Received: 25 June 2018 / Revised: 25 July 2018 / Accepted: 8 August 2018 / Published: 10 August 2018
(This article belongs to the Special Issue Semantics for Big Data Integration)
Full-Text   |   PDF [2896 KB, uploaded 10 August 2018]   |  

Abstract

A microblog is a new type of social media for information publishing, acquiring, and spreading. Finding the significant topics of a microblog is necessary for popularity tracing and public opinion following. This paper puts forward a method to detect topics from Chinese microblogs. Since traditional methods showed low performance on a short text from a microblog, we put forward a topic detection method based on the semantic description of the microblog post. The semantic expansion of the post supplies more information and clues for topic detection. First, semantic features are extracted from a microblog post. Second, the semantic features are expanded according to a thesaurus. Here TongYiCi CiLin is used as the lexical resource to find words with the same meaning. To overcome the polysemy problem, several semantic expansion strategies based on part-of-speech are introduced and compared. Third, an approach to detect topics based on semantic descriptions and an improved incremental clustering algorithm is introduced. A dataset from Sina Weibo is employed to evaluate our method. Experimental results show that our method can bring about better results both for post clustering and topic detection in Chinese microblogs. We also found that the semantic expansion of nouns is far more efficient than for other parts of speech. The potential mechanism of the phenomenon is also analyzed and discussed. View Full-Text
Keywords: Chinese microblogs; semantic expansion; short text; topic detection Chinese microblogs; semantic expansion; short text; topic detection
Figures

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).
SciFeed

Share & Cite This Article

MDPI and ACS Style

Ding, L.; Sun, B.; Shi, P. Chinese Microblog Topic Detection through POS-Based Semantic Expansion. Information 2018, 9, 203.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Information EISSN 2078-2489 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top