Open Access Article

Incorporating Background Checks with Sentiment Analysis to Identify Violence Risky Chinese Microblogs

1 School of Electronic Engineering and Automation, Civil Aviation University of China, Tianjin 300300, China
2 Honeywell Technology Solutions China, Beijing 100015, China
* Author to whom correspondence should be addressed.
This paper is an extended version of our paper published in the IEEE 17th International Conference on Information Reuse and Integration, Pittsburgh, PA, USA, 28–30 July 2016; pp. 463–468.
Future Internet 2019, 11(9), 200; https://doi.org/10.3390/fi11090200
Received: 26 July 2019 / Revised: 4 September 2019 / Accepted: 5 September 2019 / Published: 19 September 2019
(This article belongs to the Special Issue Semantic Web Technologies for Sentiment Analysis)
With the spread of Web 2.0 technology, more and more people express their attitudes and opinions on the Internet. Radical ideas, rumors, terrorism, and violent content are also propagated online, causing several incidents of social panic in China every year. In fact, most of this content is joking or emotional catharsis, so detecting it with conventional techniques usually incurs a high false alarm rate. To address this problem, this paper introduces a technique that combines sentiment analysis with background checks. State-of-the-art sentiment analysis usually depends on training datasets from a specific topic area. Unfortunately, for some domains, such as detecting speech that carries a risk of violence, no definitive training data exist. In particular, topic-independent sentiment analysis of short Chinese text has rarely been reported in the literature. In this paper, the violence risk of Chinese microblogs is calculated from multiple perspectives. First, a lexicon-based method is used to retrieve violence-related microblogs, and a similarity-based method is then used to extract sentiment words. Semantic rules and emoticons are employed to obtain the sentiment polarity and sentiment strength of the short texts. Second, the activity risk is calculated from the characteristics of the part-of-speech (PoS) sequence and from semantic rules, and a threshold is set to identify key users. Finally, the risk is confirmed by the historical posts of the key users and the opinions of their friend circles. The experimental results show that the proposed approach outperforms the support vector machine (SVM) method on a topic-independent corpus and can effectively reduce the false alarm rate.
Keywords: sentiment analysis; violence risk; topic independent; semantic similarity; semantic rules
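The staged pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the lexicons, weights, thresholds, and the names `User`, `post_risk`, and `confirmed_risky` are all hypothetical, the tokens are English placeholders standing in for segmented Chinese text, and the similarity-based sentiment word extraction and PoS-sequence features are omitted. The sketch only shows how a lexicon filter, rule-based sentiment scoring, a threshold, and a background check might be chained to suppress false alarms.

```python
from dataclasses import dataclass, field

# Hypothetical toy resources; a real system would use Chinese lexicons.
VIOLENCE_LEXICON = {"bomb", "kill", "attack"}
SENTIMENT_LEXICON = {"hate": -1.0, "angry": -0.8, "love": 1.0, "happy": 0.8}
NEGATIONS = {"not", "never"}
INTENSIFIERS = {"very": 1.5, "really": 1.5}
EMOTICONS = {":)": 0.5, ":(": -0.5, ":D": 0.7}


@dataclass
class User:
    name: str
    history: list = field(default_factory=list)  # earlier posts, as token lists
    friends: list = field(default_factory=list)  # friend-circle posts, as token lists


def is_violence_related(tokens):
    """Lexicon-based retrieval: keep posts containing a violence term."""
    return any(tok in VIOLENCE_LEXICON for tok in tokens)


def sentiment_score(tokens):
    """Sum word and emoticon polarities, applying two simple semantic rules:
    an intensifier scales the next sentiment word, a negation flips it."""
    score = 0.0
    for i, tok in enumerate(tokens):
        if tok in EMOTICONS:
            score += EMOTICONS[tok]
            continue
        if tok not in SENTIMENT_LEXICON:
            continue
        value = SENTIMENT_LEXICON[tok]
        prev = tokens[i - 1] if i > 0 else None
        if prev in INTENSIFIERS:
            value *= INTENSIFIERS[prev]
        elif prev in NEGATIONS:
            value = -value
        score += value
    return score


def post_risk(tokens):
    """Risk of one post: 0 unless violence-related, else negative sentiment strength."""
    if not is_violence_related(tokens):
        return 0.0
    return max(0.0, -sentiment_score(tokens))


def confirmed_risky(user, tokens, post_threshold=0.5, background_threshold=0.3):
    """Flag a post only if it exceeds the threshold AND the user's background
    (history plus friend circle) is also risky on average. Averaging is an
    assumption made for this sketch."""
    if post_risk(tokens) < post_threshold:
        return False
    background = user.history + user.friends
    if not background:
        return True  # no background available; fall back to the post alone
    mean_risk = sum(post_risk(p) for p in background) / len(background)
    return mean_risk >= background_threshold


post = ["really", "hate", "them", "will", "attack"]
joker = User("joker", history=[["happy", ":)"]], friends=[["love", ":D"]])
risky = User("risky", history=[["angry", "attack"], ["hate", "kill", ":("]])
print(confirmed_risky(joker, post), confirmed_risky(risky, post))
```

In this toy setup the same post is treated differently for the two users because only one of them has a background that is itself violence-related and negative, which is the intuition behind using background checks to reduce false alarms.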
MDPI and ACS Style

Jia, Y.-F.; Li, S.; Wu, R. Incorporating Background Checks with Sentiment Analysis to Identify Violence Risky Chinese Microblogs. Future Internet 2019, 11, 200.

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers.
