Next Article in Journal
Electrical Response of the Spinel ZnAl2O4 and Its Application in the Detection of Propane Gas
Next Article in Special Issue
Forecasting Taxi Demands Using Generative Adversarial Networks with Multi-Source Data
Previous Article in Journal
Nonlinear Dynamic Response of Ropeway Roller Batteries via an Asymptotic Approach
Article

An Improved Multiple Features and Machine Learning-Based Approach for Detecting Clickbait News on Social Networks

1
College of Computer Science and Engineering, Taibah University, Medina 42353, Saudi Arabia
2
Institute for Artificial Intelligence and Big Data, Universiti Malaysia Kelantan, City Campus, Pengkalan Chepa, Kota Bharu 16100, Kelantan, Malaysia
3
College of Computer Science and Engineering, University of Ha’il, Ha’il 81481, Saudi Arabia
4
Department of Information Technology, College of Computer, Qassim University, Buraydah 51452, Saudi Arabia
5
Department of Computer Science, College of Applied Sciences, Taiz University, Taiz 6803, Yemen
6
School of Science and Technology, Nottingham Trent University, Nottingham NG11 8NS, UK
*
Authors to whom correspondence should be addressed.
Academic Editors: Giovanni Randazzo, Anselme Muzirafuti and Dimitrios S. Paraforos
Appl. Sci. 2021, 11(20), 9487; https://doi.org/10.3390/app11209487
Received: 10 September 2021 / Revised: 30 September 2021 / Accepted: 6 October 2021 / Published: 13 October 2021
(This article belongs to the Special Issue Future Transportation of People and Goods)
The widespread usage of social media has led to the increasing popularity of online advertisements, which have been accompanied by a disturbing spread of clickbait headlines. Clickbait dissatisfies users because the article content does not match their expectation. Detecting clickbait posts in online social networks is an important task to fight this issue. Clickbait posts use phrases that are mainly posted to attract a user’s attention in order to click onto a specific fake link/website. That means clickbait headlines utilize misleading titles, which could carry hidden important information from the target website. It is very difficult to recognize these clickbait headlines manually. Therefore, there is a need for an intelligent method to detect clickbait and fake advertisements on social networks. Several machine learning methods have been applied for this detection purpose. However, the obtained performance (accuracy) only reached 87% and still needs to be improved. In addition, most of the existing studies were conducted on English headlines and contents. Few studies focused specifically on detecting clickbait headlines in Arabic. Therefore, this study constructed the first Arabic clickbait headline news dataset and presents an improved multiple feature-based approach for detecting clickbait news on social networks in Arabic language. The proposed approach includes three main phases: data collection, data preparation, and machine learning model training and testing phases. The collected dataset included 54,893 Arabic news items from Twitter (after pre-processing). Among these news items, 23,981 were clickbait news (43.69%) and 30,912 were legitimate news (56.31%). This dataset was pre-processed and then the most important features were selected using the ANOVA F-test. Several machine learning (ML) methods were then applied with hyper-parameter tuning methods to ensure finding the optimal settings. Finally, the ML models were evaluated, and the overall performance is reported in this paper. The experimental results show that the Support Vector Machine (SVM) with the top 10% of ANOVA F-test features (user-based features (UFs) and content-based features (CFs)) obtained the best performance and achieved 92.16% of detection accuracy. View Full-Text
Keywords: ANOVA-test; clickbait news; feature selection; social network ANOVA-test; clickbait news; feature selection; social network
Show Figures

Figure 1

MDPI and ACS Style

Al-Sarem, M.; Saeed, F.; Al-Mekhlafi, Z.G.; Mohammed, B.A.; Hadwan, M.; Al-Hadhrami, T.; Alshammari, M.T.; Alreshidi, A.; Alshammari, T.S. An Improved Multiple Features and Machine Learning-Based Approach for Detecting Clickbait News on Social Networks. Appl. Sci. 2021, 11, 9487. https://doi.org/10.3390/app11209487

AMA Style

Al-Sarem M, Saeed F, Al-Mekhlafi ZG, Mohammed BA, Hadwan M, Al-Hadhrami T, Alshammari MT, Alreshidi A, Alshammari TS. An Improved Multiple Features and Machine Learning-Based Approach for Detecting Clickbait News on Social Networks. Applied Sciences. 2021; 11(20):9487. https://doi.org/10.3390/app11209487

Chicago/Turabian Style

Al-Sarem, Mohammed, Faisal Saeed, Zeyad G. Al-Mekhlafi, Badiea A. Mohammed, Mohammed Hadwan, Tawfik Al-Hadhrami, Mohammad T. Alshammari, Abdulrahman Alreshidi, and Talal S. Alshammari 2021. "An Improved Multiple Features and Machine Learning-Based Approach for Detecting Clickbait News on Social Networks" Applied Sciences 11, no. 20: 9487. https://doi.org/10.3390/app11209487

Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Back to TopTop