MDPI - Publisher of Open Access Journals

17 pages, 811 KB

Open AccessArticle

A Hybrid Feature-Weighting and Resampling Model for Imbalanced Sentiment Analysis in User Game Reviews

by Thao-Trang Huynh-Cam, Long-Sheng Chen, Hsuan-Jung Huang and Hsiu-Chia Ko

Mathematics 2026, 14(8), 1273; https://doi.org/10.3390/math14081273 - 11 Apr 2026

Viewed by 269

Sentiment analysis of online game reviews has increasingly become important in understanding player experiences and supporting data-driven game development. However, research in this domain has continuously faced two unresolved challenges: (1) the extreme imbalance between positive and negative feedback, and (2) the inefficiency [...] Read more.

Sentiment analysis of online game reviews has increasingly become important in understanding player experiences and supporting data-driven game development. However, research in this domain has continuously faced two unresolved challenges: (1) the extreme imbalance between positive and negative feedback, and (2) the inefficiency of existing feature-weighting schemes in capturing sentiment signals embedded in informal gaming discourses. Prior works demonstrated that negative feedback—though a few in number are highly influential—usually contain richer emotional content and longer textual structures; yet, prevailing classification models often perform poorly for these minorities (i.e., negative feedback). Numerous studies explored multimodal imbalance issues, class imbalance in cross-lingual ABSA (Aspect-Based Sentiment Analysis), reinforcement-learning-based architectures for imbalanced extraction tasks, and oversampling strategies like SMOTE (Synthetic Minority Over-sampling Technique) variants. Few investigations specifically addressed imbalanced sentiment classification in the contexts of online game reviews, where user-generated content exhibits unique lexical, structural, and emotional characteristics. To address these gaps, this study integrated TF-IDF (Term Frequency-Inverse Document Frequency), VADER (Valence Aware Dictionary and Sentiment Reasoner) lexicon features, and IGM (Inverse Gravity Moment) weightings with advanced oversampling methods such as ADASYN (Adaptive Synthetic Sampling Approach for Imbalanced Learning) and Borderline-SMOTE to improve the detection of minority sentiment classes. Ensemble models, including XGBoost (Extreme Gradient Boosting) and LightGBM (Light Gradient-Boosting Machine), were further employed to enhance the robustness of imbalance. Using a large-scale dataset of Steam game reviews, the proposed framework demonstrated substantial improvement in identifying negative sentiments, addressing a critical limitation in the existing computational game-analysis literature, and advancing the modeling for detecting the emotion-rich but imbalance-prone user feedback. Full article

(This article belongs to the Special Issue IIHMSP: Intelligent Information Hiding and Multimedia Signal Processing)

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (64)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI