Open Access Article

Predicting Rogue Content and Arabic Spammers on Twitter

by Adel R. Alharbi 1,* and Amer Aljaedi 2,*
1 Department of Computer Engineering, University of Tabuk, Tabuk 71491, Saudi Arabia
2 Department of Information Technology, University of Tabuk, Tabuk 71491, Saudi Arabia
* Authors to whom correspondence should be addressed.
Future Internet 2019, 11(11), 229; https://doi.org/10.3390/fi11110229
Received: 4 October 2019 / Revised: 26 October 2019 / Accepted: 28 October 2019 / Published: 30 October 2019
(This article belongs to the Section Cybersecurity)
Twitter is one of the most popular online social networks for spreading propaganda and messages in the Arab region. Spammers are now creating rogue accounts to distribute adult content, which Arabic norms and cultures prohibit, through Arabic tweets. Arab governments face a major challenge in detecting these accounts. Researchers have extensively studied English spam on online social networks, while to date, social network spam in other languages has been completely ignored. In our previous study, we estimated that rogue and spam content accounted for approximately three quarters of all content with Arabic trending hashtags in Saudi Arabia. This alarming rate, supported by independent concurrent estimates, highlights the urgent need to develop adaptive spam detection methods. In this work, we collected a pure data set from spam accounts producing Arabic tweets. We applied lightweight feature engineering based on rogue content and user profiles. The 47 generated features were analyzed, and the best features were selected. Our performance results show that the random forest classification algorithm with 16 features performs best, with accuracy rates greater than 90%.
Keywords: Arabic text classification; online social network; Twitter; machine learning; spam detection; rogue contents
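
The abstract refers to lightweight feature engineering based on rogue content and user profiles. As a rough illustration of what such features might look like, the following minimal Python sketch computes a handful of cheap content and profile features for a single tweet. The `Tweet` record, its field names, and the specific features are hypothetical and chosen for illustration only; they are not the 47 features used in the article.

```python
import re
from dataclasses import dataclass
from typing import Dict


@dataclass
class Tweet:
    """Hypothetical record layout; field names are assumptions, not from the article."""
    text: str
    followers: int
    following: int
    account_age_days: int


URL_RE = re.compile(r"https?://\S+")
HASHTAG_RE = re.compile(r"#\w+")           # \w is Unicode-aware, so Arabic hashtags match
MENTION_RE = re.compile(r"@\w+")
ARABIC_RE = re.compile(r"[\u0600-\u06FF]")  # basic Arabic Unicode block


def extract_features(tweet: Tweet) -> Dict[str, float]:
    """Compute a few illustrative content and profile features for one tweet."""
    text = tweet.text
    non_space = [ch for ch in text if not ch.isspace()]
    arabic_chars = len(ARABIC_RE.findall(text))
    return {
        # Content-based features
        "num_urls": float(len(URL_RE.findall(text))),
        "num_hashtags": float(len(HASHTAG_RE.findall(text))),
        "num_mentions": float(len(MENTION_RE.findall(text))),
        "text_length": float(len(text)),
        "arabic_char_ratio": arabic_chars / len(non_space) if non_space else 0.0,
        # Profile-based features (+1 avoids division by zero)
        "follower_ratio": tweet.followers / (tweet.following + 1),
        "account_age_days": float(tweet.account_age_days),
    }
```

A feature dictionary of this kind could then be turned into a numeric vector and passed to a feature selection step and a classifier such as a random forest; which features actually matter is an empirical question that the article addresses with its own data.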

MDPI and ACS Style

Alharbi, A.R.; Aljaedi, A. Predicting Rogue Content and Arabic Spammers on Twitter. Future Internet 2019, 11, 229.

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers.
