Next Article in Journal
Multi-View Graph Contrastive Neural Networks for Session-Based Recommendation
Previous Article in Journal
United Prediction of Travel Modes and Purposes in Travel Chains Based on Multitask Learning Deep Neural Networks
Previous Article in Special Issue
Multi-Scale TsMixer: A Novel Time-Series Architecture for Predicting A-Share Stock Index Futures
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Multi-Domain Controversial Text Detection Based on a Machine Learning and Deep Learning Stacked Ensemble

1
School of Architecture and Design, Beijing Jiaotong University, Beijing 100044, China
2
School of Economics and Management, Beijing Jiaotong University, Beijing 100044, China
3
Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
*
Author to whom correspondence should be addressed.
Mathematics 2025, 13(9), 1529; https://doi.org/10.3390/math13091529
Submission received: 9 April 2025 / Revised: 30 April 2025 / Accepted: 3 May 2025 / Published: 6 May 2025
(This article belongs to the Special Issue Machine Learning Methods and Mathematical Modeling with Applications)

Abstract

Due to the rapid proliferation of social media and online reviews, the accurate identification and classification of controversial texts has emerged as a significant challenge in the field of natural language processing. However, traditional text-classification methodologies frequently encounter critical limitations, such as feature sensitivity and inadequate generalization capabilities. This results in a notably suboptimal performance when confronted with diverse controversial content. To address these substantial limitations, this paper proposes a novel controversial text-detection framework based on stacked ensemble learning to enhance the accuracy and robustness of text classification. Firstly, considering the multidimensional complexity of textual features, we integrate comprehensive feature engineering, i.e., encompassing word frequency, statistical metrics, sentiment analysis, and comment tree structure features, as well as advanced feature selection methodologies, particularly lassonet, i.e., a neural network with feature sparsity, to effectively address dimensionality challenges while enhancing model interpretability and computational efficiency. Secondly, we design a two-tier stacked ensemble architecture, which not only combines the strengths of multiple machine learning algorithms, e.g., gradient-boosted decision tree (GBDT), random forest (RF), and extreme gradient boosting (XGBoost), with deep learning models, e.g., gated recurrent unit (GRU) and long short-term memory (LSTM), but also implements the support vector machine (SVM) for efficient meta-learning. Furthermore, we systematically compare three hyperparameter optimization algorithms, including the sparrow search algorithm (SSA), particle swarm optimization (PSO), and Bayesian optimization (BO). The experimental results demonstrate that the SSA exhibits a superior performance in exploring high-dimensional parameter spaces. Extensive experimentation across diverse topics and domains also confirms that our proposed methodology significantly outperforms the state-of-the-art approaches.
Keywords: controversial text detection; machine learning; deep learning; ensemble learning; hyperparameter optimization; feature engineering controversial text detection; machine learning; deep learning; ensemble learning; hyperparameter optimization; feature engineering

Share and Cite

MDPI and ACS Style

Liu, J.; Liu, Z.; Li, Q.; Kong, W.; Li, X. Multi-Domain Controversial Text Detection Based on a Machine Learning and Deep Learning Stacked Ensemble. Mathematics 2025, 13, 1529. https://doi.org/10.3390/math13091529

AMA Style

Liu J, Liu Z, Li Q, Kong W, Li X. Multi-Domain Controversial Text Detection Based on a Machine Learning and Deep Learning Stacked Ensemble. Mathematics. 2025; 13(9):1529. https://doi.org/10.3390/math13091529

Chicago/Turabian Style

Liu, Jiadi, Zhuodong Liu, Qiaoqi Li, Weihao Kong, and Xiangyu Li. 2025. "Multi-Domain Controversial Text Detection Based on a Machine Learning and Deep Learning Stacked Ensemble" Mathematics 13, no. 9: 1529. https://doi.org/10.3390/math13091529

APA Style

Liu, J., Liu, Z., Li, Q., Kong, W., & Li, X. (2025). Multi-Domain Controversial Text Detection Based on a Machine Learning and Deep Learning Stacked Ensemble. Mathematics, 13(9), 1529. https://doi.org/10.3390/math13091529

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop