Open Access Article

Online Multilingual Hate Speech Detection: Experimenting with Hindi and English Social Media

School of Electronic Engineering and Computer Science, Queen Mary University of London, London E1 4NS, UK
* Author to whom correspondence should be addressed.
Information 2021, 12(1), 5; https://doi.org/10.3390/info12010005
Received: 23 November 2020 / Revised: 16 December 2020 / Accepted: 18 December 2020 / Published: 22 December 2020
(This article belongs to the Special Issue Natural Language Processing for Social Media)
The last two decades have seen an exponential increase in the use of the Internet and social media, which has changed basic human interaction. This has led to many positive outcomes, but it has also brought risks and harms. The volume of harmful content online, such as hate speech, is too large for humans to moderate manually, and interest within the academic community in automated means of hate speech detection has grown accordingly. In this study, we analyse six publicly available datasets by combining them into a single homogeneous dataset. Having classified their content into three classes (abusive, hateful or neither), we create a baseline model and improve its performance scores using various optimisation techniques. After attaining a competitive performance score, we create a tool that identifies and scores a page with an effective metric in near-real-time and feeds the results back to re-train our model. We demonstrate the competitive performance of our multilingual model on two languages, English and Hindi, where it achieves comparable or superior performance to most monolingual models.
Keywords: social media; hate speech; text classification
MDPI and ACS Style

Vashistha, N.; Zubiaga, A. Online Multilingual Hate Speech Detection: Experimenting with Hindi and English Social Media. Information 2021, 12, 5. https://doi.org/10.3390/info12010005


