MDPI - Publisher of Open Access Journals

15 pages, 774 KB

Open AccessArticle

Revisiting the Role of Label Smoothing in Enhanced Text Sentiment Classification

by Shijing Si, Yijie Gao, Haixia Sun, Yugui Zhang and Hua Luo

Electronics 2026, 15(10), 1984; https://doi.org/10.3390/electronics15101984 - 7 May 2026

Viewed by 400

Label smoothing is a widely used technique in various domains, such as text classification, image classification and speech recognition, known for effectively combating model overfitting. However, there is little fine-grained analysis on how label smoothing enhances text sentiment classification. To fill in the [...] Read more.

Label smoothing is a widely used technique in various domains, such as text classification, image classification and speech recognition, known for effectively combating model overfitting. However, there is little fine-grained analysis on how label smoothing enhances text sentiment classification. To fill in the gap, this article performs a set of in-depth analyses on eight datasets for text sentiment classification and three deep learning architectures: TextCNN, BERT, and RoBERTa, under two learning schemes: training from scratch and fine-tuning. By tuning the smoothing parameters, we can achieve improved performance on almost all datasets for each model architecture. Specifically, our experiments demonstrate that label smoothing improves accuracy by 0.5–2.3 percent across different architectures, with the best results achieved using smoothing parameters

λ \in [0.01, 0.1]

for three-class datasets and

λ \in [0.01, 0.15]

for binary-class datasets. We further investigate the benefits of label smoothing, finding that label smoothing can accelerate the convergence of deep models by 15–30 percent and make samples of different labels easily distinguishable. Additionally, we provide comprehensive analysis including macro-F1, precision, and recall metrics to ensure robust evaluation across datasets with varying class distributions. Full article

(This article belongs to the Special Issue New Trends in Machine Learning, System and Digital Twins)

► Show Figures

Figure 1

31 pages, 4372 KB

Open AccessArticle

Text-Anchored Residual Cross-Modal Fusion for Multimodal Sentiment Analysis: A Unified and Protocol-Aware Evaluation on MVSA-Single

by Kosala Natarajan and Nirmalrani Vairaperumal

Appl. Sci. 2026, 16(9), 4514; https://doi.org/10.3390/app16094514 - 4 May 2026

Viewed by 586

Abstract

Multimodal sentiment analysis aims to infer sentiment polarity by jointly modeling textual and visual information. Despite recent advances in pretrained language and vision encoders, sentiment prediction from social media posts remains challenging because textual and visual modalities are often weakly aligned, semantically noisy, [...] Read more.

Multimodal sentiment analysis aims to infer sentiment polarity by jointly modeling textual and visual information. Despite recent advances in pretrained language and vision encoders, sentiment prediction from social media posts remains challenging because textual and visual modalities are often weakly aligned, semantically noisy, and unevenly informative. Recent studies have emphasized the importance of fine-grained cross-modal fusion, stronger pretrained visual representations, and strategies for reducing modality bias in MVSA-style benchmarks. In this work, we present a systematic implementation-driven study of multimodal sentiment classification on MVSA-Single. We first construct a clean three-class sentiment-consistent subset and then implement a wide set of baselines, including text-only DistilBERT, image-only ResNet18, simple multimodal fusion, gated fusion, residual fusion, multi-task contrastive fusion, DINOv2-based fusion, and attention bottleneck fusion. Building on these experiments, we propose a semantic cross-modal fusion architecture that combines a RoBERTa text encoder with a CLIP vision encoder through cross-attention, allowing textual representations to selectively attend to sentiment-relevant visual signals. On the clean 2592-sample subset, the proposed model achieved the best overall performance, reaching 82.63% validation accuracy, 79.62% test accuracy and 79.42 weighted F1, outperforming all other implemented baselines under the same experimental pipeline and dataset setting. To improve comparability with prior MVSA-Single studies, we additionally reconstructed a broader processed setting from the 4511-sample HDF5 version and aligned 4318 text–image pairs with original image files. On this harder protocol-matched setting, the same model achieved 72.69% test accuracy and 70.66 weighted F1, revealing a substantial performance gap caused by dataset construction and residual multimodal noise. These findings show that strong cross-modal semantic alignment contributes more to robust multimodal sentiment prediction than simply increasing architectural complexity and that CLIP-based visual semantics are more beneficial than DINOv2 in our text–image sentiment setting. Full article

► Show Figures

Figure 1

27 pages, 1862 KB

Open AccessArticle

A Fine-Grained Sentiment Classification Metric for Dynamic E-Commerce Content Relationships

by Ahad AlQabasani and Hana Al-Nuaim

Information 2026, 17(5), 419; https://doi.org/10.3390/info17050419 - 27 Apr 2026

Viewed by 551

Abstract

E-commerce web content is dynamic and diverse, necessitating continuous monitoring and adaptation. This presents researchers with the challenge of discovering methods to improve delivered services. Hence, integrating natural language processing (NLP), Machine Learning (ML), Deep Learning (DL), and sentiment analysis (SA) provides businesses [...] Read more.

E-commerce web content is dynamic and diverse, necessitating continuous monitoring and adaptation. This presents researchers with the challenge of discovering methods to improve delivered services. Hence, integrating natural language processing (NLP), Machine Learning (ML), Deep Learning (DL), and sentiment analysis (SA) provides businesses with robust frameworks to utilize customer feedback and enhance decision-making. Therefore, we introduce a novel dataset collection methodology that captures the dynamic relationships between e-commerce web content and consumer sentiment. Additionally, we introduce a novel, real-consumer-based quality metric on product content through FG-CSrP, extending SA into a new Fine-Grained Consumer Sentiment related to the Product. We evaluated our dataset using baseline models: Deep Neural Network (DNN), Long Short-Term Memory (LSTM), DistilBERT, and twelve automatically optimized models created by AutoGluon-Tabular across three scenarios, each with varying feature inputs (numerical, textual, and both). We then applied Explainable Artificial Intelligence (XAI) to the DNN model to explain feature importance in prediction. Our findings showed that LightGBMXT outperformed the other models in two out of three scenarios, and XAI interpretations highlighted the significant role of vendor-provided web content details in consumer sentiment. Overall, our approach provides actionable insights that can help vendors improve e-commerce strategies and strengthen customer engagement. Full article

(This article belongs to the Section Information Applications)

► Show Figures

Figure 1

18 pages, 2038 KB

Open AccessArticle

DCANet: Diffusion-Coded Attention Network for Cross-Domain Semantic Noise Mitigation and Multi-Scale Context Fusion

by Xiao Han, Chunhua Wang, Weijian Fan, Zishuo Niu, Jing Gui and Shijia Yu

Electronics 2026, 15(8), 1667; https://doi.org/10.3390/electronics15081667 - 16 Apr 2026

Viewed by 362

Abstract

Neural language models have achieved remarkable progress in semantic representation learning. However, cross-domain representation learning still suffers from prominent semantic noise propagation issues. Existing methods still face challenges in cross-domain semantic modeling, including limited robustness across different semantic granularities, difficulty in separating transferable [...] Read more.

Neural language models have achieved remarkable progress in semantic representation learning. However, cross-domain representation learning still suffers from prominent semantic noise propagation issues. Existing methods still face challenges in cross-domain semantic modeling, including limited robustness across different semantic granularities, difficulty in separating transferable semantics from task-irrelevant semantic interference, and insufficient adaptability to specialized scenarios. These issues may reduce feature discriminability in fine-grained semantic tasks and complex application settings. To address these problems, we propose the Diffusion-Coded Attention Network (DCANet), a novel cross-domain representation learning architecture with three synergistic core modules: a multi-granular parallel diffusion masking mechanism for cross-scale context fusion via stochastic path activation, an implicit semantic encoder that distills domain-invariant patterns into adaptive bias codes via shared latent manifolds, and a self-correcting attention topology realizing dynamic semantic purification via closed-loop interactions between local features and global bias states. Extensive evaluations are conducted on nine well-recognized benchmark datasets to verify DCANet’s effectiveness and reliability. Experimental results show that DCANet attains state-of-the-art results on the majority of the benchmark datasets, with significant accuracy improvements on text classification and sentiment analysis tasks. Full article

(This article belongs to the Section Artificial Intelligence)

► Show Figures

Figure 1

23 pages, 1461 KB

Open AccessArticle

A Computational Analysis of Emotions and Topics in YouTube Discourse on Sora

by Ayse Ocal

Appl. Sci. 2026, 16(5), 2519; https://doi.org/10.3390/app16052519 - 5 Mar 2026

Viewed by 1021

Abstract

As generative artificial intelligence (AI) technologies become increasingly present in creative and professional domains, examining public discourse surrounding these tools is important for understanding their broader social implications. This study conducts a two-part analysis of the initial public reaction to Sora, the generative [...] Read more.

As generative artificial intelligence (AI) technologies become increasingly present in creative and professional domains, examining public discourse surrounding these tools is important for understanding their broader social implications. This study conducts a two-part analysis of the initial public reaction to Sora, the generative video model developed by OpenAI, by analyzing 23,543 English-language comments posted on YouTube between February and April 2024. Rather than relying on traditional positive–negative sentiment classifications, this study integrates fine-grained emotion detection with topic modeling to examine the relationship between emotions and topics in the discourse. Based on the residual analysis, the overall association between topics and emotions was weak; however, certain topics were associated with specific emotions. For instance, ethical discussions were more likely to be associated with sadness and anger, artistic settings were associated with fear, and benchmark discussions were associated with joy. Methodologically, this study utilizes an emotion–topic coupling through residual deviation with a hierarchical LDA-BERTopic approach, bringing together computational modeling and theories of emotion. This study provides a structured and theory-based way to explore the affective and thematic patterns in the public’s discourse surrounding Sora. Full article

(This article belongs to the Section Computing and Artificial Intelligence)

► Show Figures

Figure 1

28 pages, 1521 KB

Open AccessArticle

Image–Text Sentiment Analysis Based on Dual-Path Interaction Network with Multi-Level Consistency Learning

by Zhi Ji, Chunlei Wu, Qinfu Xu and Yixiang Wu

Electronics 2026, 15(3), 581; https://doi.org/10.3390/electronics15030581 - 29 Jan 2026

Viewed by 683

Abstract

With the continuous evolution of social media, users are increasingly inclined to express their personal emotions on digital platforms by integrating information presented in multiple modalities. Within this context, research on image–text sentiment analysis has garnered significant attention. Prior research efforts have made [...] Read more.

With the continuous evolution of social media, users are increasingly inclined to express their personal emotions on digital platforms by integrating information presented in multiple modalities. Within this context, research on image–text sentiment analysis has garnered significant attention. Prior research efforts have made notable progress by leveraging shared emotional concepts across visual and textual modalities. However, existing cross-modal sentiment analysis methods face two key challenges: Previous approaches often focus excessively on fusion, resulting in learned features that may not achieve emotional alignment; traditional fusion strategies are not optimized for sentiment tasks, leading to insufficient robustness in final sentiment discrimination. To address the aforementioned issues, this paper proposes a Dual-path Interaction Network with Multi-level Consistency Learning (DINMCL). It employs a multi-level feature representation module to decouple the global and local features of both text and image. These decoupled features are then fed into the Global Congruity Learning (GCL) and Local Crossing-Congruity Learning (LCL) modules, respectively. GCL models global semantic associations using Crossing Prompter, while LCL captures local consistency in fine-grained emotional cues across modalities through cross-modal attention mechanisms and adaptive prompt injection. Finally, a CLIP-based adaptive fusion layer integrates the multi-modal representations in a sentiment-oriented manner. Experiments on the MVSA_Single, MVSA_Multiple, and TumEmo datasets with baseline models such as CTMWA and CLMLF demonstrate that DINMCL significantly outperforms mainstream models in sentiment classification accuracy and F1-score and exhibits strong robustness when handling samples containing highly noisy symbols. Full article

(This article belongs to the Special Issue AI-Driven Image Processing: Theory, Methods, and Applications)

► Show Figures

Figure 1

15 pages, 1527 KB

Open AccessArticle

Learning Complementary Representations for Targeted Multimodal Sentiment Analysis

by Binfen Ding, Jieyu An and Yumeng Lei

Computers 2026, 15(1), 52; https://doi.org/10.3390/computers15010052 - 13 Jan 2026

Viewed by 652

Abstract

Targeted multimodal sentiment classification is frequently impeded by the semantic sparsity of social media content, where text is brief and context is implicit. Traditional methods that rely on direct concatenation of textual and visual features often fail to resolve the ambiguity of specific [...] Read more.

Targeted multimodal sentiment classification is frequently impeded by the semantic sparsity of social media content, where text is brief and context is implicit. Traditional methods that rely on direct concatenation of textual and visual features often fail to resolve the ambiguity of specific targets due to a lack of alignment between modalities. In this paper, we propose the Complementary Description Network (CDNet) to bridge this informational gap. CDNet incorporates automatically generated image descriptions as an additional semantic bridge, in contrast to methods that handle text and images as distinct streams. The framework enhances the input representation by directly translating visual content into text, allowing for more accurate interactions between the opinion target and the visual narrative. We further introduce a complementary reconstruction module that functions as a regularizer, forcing the model to retain deep semantic cues during fusion. Empirical results on the Twitter-2015 and Twitter-2017 benchmarks confirm that CDNet outperforms existing baselines. The findings suggest that visual-to-text augmentation is an effective strategy for compensating for the limited context inherent in short texts. Full article

(This article belongs to the Section AI-Driven Innovations)

► Show Figures

Figure 1

30 pages, 1062 KB

Open AccessArticle

Context-Aware Emotion Gating and Modulation for Fine-Grained Sentiment Classification

by Anupama Udayangani Gunathilaka Thennakoon Mudiyanselage, Jinglan Zhang and Yeufeng Li

Mach. Learn. Knowl. Extr. 2026, 8(1), 9; https://doi.org/10.3390/make8010009 - 31 Dec 2025

Cited by 1 | Viewed by 1259

Abstract

Fine-grained sentiment analysis requires a deep understanding of emotional intensity in the text to distinguish subtle shifts in polarity, such as moving from positive to more positive or from negative to more negative, and to clearly separate emotionally neutral statements from polarized expressions, [...] Read more.

Fine-grained sentiment analysis requires a deep understanding of emotional intensity in the text to distinguish subtle shifts in polarity, such as moving from positive to more positive or from negative to more negative, and to clearly separate emotionally neutral statements from polarized expressions, especially in short or contextually sparse texts such as social media posts. While recent advances combine deep semantic encoding with context-aware architectures, such as Bidirectional Long Short-Term Memory (BiLSTM) and Convolutional Neural Networks (CNNs), many models still struggle to detect nuanced emotional cues, particularly in short texts, due to the limited contextual information, subtle polarity shifts, and overlapping affective expressions, which ultimately hinder performance and reduce a model’s ability to make fine-grained sentiment distinctions. To address this challenge, we propose an Emotion- Aware Bidirectional Gating Network (Electra-BiG-Emo) that improves sentiment classification and subtle sentiment differentiation by learning contextual emotion representations and refining them with auxiliary emotional signals. Our model employs an asymmetric gating mechanism within a BiLSTM to dynamically capture both early and late contextual semantics. The gates are temperature-controlled, enabling adaptive modulation of emotion priors, derived from Reddit post datasets to enhance context-aware emotion representation. These soft emotional signals are reweighted based on context, enabling the model to amplify or suppress emotions in the presence of an ambiguous context. This approach advances fine-grained sentiment understanding by embedding emotional awareness directly into the learning process. Ablation studies confirm the complementary roles of semantic encoding, context modeling, and emotion modulation. Further our approach achieves competitive performance on Sem- Val 2017 Task 4c, Twitter US Airline, and SST5 datasets compared with state-of-the-art methods, particularly excelling in detecting subtle emotional variations and classifying short, semantically sparse texts. Gating and modulation analyses reveal that emotion-aware gating enhances interpretability and reinforces the value of explicit emotion modeling in fine-grained sentiment tasks. Full article

(This article belongs to the Section Data)

► Show Figures

Figure 1

29 pages, 2471 KB

Open AccessArticle

MISA-GMC: An Enhanced Multimodal Sentiment Analysis Framework with Gated Fusion and Momentum Contrastive Modality Relationship Modeling

by Zheng Du, Yapeng Wang, Xu Yang, Sio-Kei Im and Zhiwen Wang

Mathematics 2026, 14(1), 115; https://doi.org/10.3390/math14010115 - 28 Dec 2025

Cited by 1 | Viewed by 1775

Abstract

Multimodal sentiment analysis jointly exploits textual, acoustic, and visual signals to recognize human emotions more accurately than unimodal models. However, real-world data often contain noisy or partially missing modalities, and naive fusion may allow unreliable signals to degrade overall performance. To address this, [...] Read more.

Multimodal sentiment analysis jointly exploits textual, acoustic, and visual signals to recognize human emotions more accurately than unimodal models. However, real-world data often contain noisy or partially missing modalities, and naive fusion may allow unreliable signals to degrade overall performance. To address this, we propose an enhanced framework named MISA-GMC, a lightweight extension of the widely used MISA backbone that explicitly accounts for modality reliability. The core idea is to adaptively reweight modalities at the sample level while regularizing cross-modal representations during training. Specifically, a reliability-aware gated fusion module down-weights unreliable modalities, and two auxiliary training-time regularizers (momentum contrastive learning and a lightweight correlation graph) help stabilize and refine multimodal representations without adding inference-time overhead. Experiments on three benchmark datasets—CMU-MOSI, CMU-MOSEI, and CH-SIMS—demonstrate the effectiveness of MISA-GMC. For instance, on CMU-MOSI, the proposed model improves 7-class accuracy from 43.29 to 45.92, reduces the mean absolute error (MAE) from 0.785 to 0.712, and increases the Pearson correlation coefficient (Corr) from 0.764 to 0.795. This indicates more accurate fine-grained sentiment prediction and better sentiment-intensity estimation. On CMU-MOSEI and CH-SIMS, MISA-GMC also achieves consistent gains over MISA and strong baselines such as LMF, ALMT, and MMIM across both classification and regression metrics. Ablation studies and missing-modality experiments further verify the contribution of each component and the robustness of MISA-GMC under partial-modality settings. Full article

(This article belongs to the Special Issue Applications of Machine Learning and Pattern Recognition)

► Show Figures

Figure 1

24 pages, 1197 KB

Open AccessArticle

A Multi-Scale Feature Fusion Linear Attention Model for Movie Review Sentiment Analysis

by Zi Jiang and Chengjun Xu

Big Data Cogn. Comput. 2025, 9(12), 325; https://doi.org/10.3390/bdcc9120325 - 18 Dec 2025

Cited by 1 | Viewed by 1300

Abstract

Sentiment classification is a key technique for analyzing the emotional tendency of user reviews and is of great significance to movie recommendation systems. However, existing methods often face challenges in practical applications due to complex model structures, low computational efficiency, or difficulties in [...] Read more.

Sentiment classification is a key technique for analyzing the emotional tendency of user reviews and is of great significance to movie recommendation systems. However, existing methods often face challenges in practical applications due to complex model structures, low computational efficiency, or difficulties in balancing local details with global contextual features. To address these issues, this paper proposes a Multi-Scale Feature Fusion Linear Attention model (MSFFLA). The model consists of three core modules: the BERT Encoder module for extracting basic semantic features; the Parallel Multi-scale Feature Extraction module (PMFE), which employs multi-branch dilated convolutions to accurately capture local fine-grained features; and the Global Multi-scale Linear Feature Extraction module (MGLFE), which introduces a Multi-Scale Linear Attention mechanism (MSLA) to efficiently model global contextual dependencies with approximately linear computational complexity. Extensive experiments were conducted on three public datasets: SST-2, Amazon Reviews, and MR. The results show that compared to the state-of-the-art BERT-CondConv model, our model achieves improvements in accuracy and F1-Score by 1.8% and 0.4%, respectively, on the SST-2 dataset, and by 1.5% and 0.3% on the Amazon Reviews dataset. This study not only validates the effectiveness of the proposed model but also provides an efficient and lightweight solution for sentiment classification tasks in movie recommendation systems, demonstrating promising practical application prospects. Full article

► Show Figures

Figure 1

34 pages, 11286 KB

Open AccessEditor’s ChoiceArticle

Degradation of Multi-Task Prompting Across Six NLP Tasks and LLM Families

by Federico Di Maio and Manuel Gozzi

Electronics 2025, 14(21), 4349; https://doi.org/10.3390/electronics14214349 - 6 Nov 2025

Cited by 2 | Viewed by 3003

Abstract

This study investigates how increasing prompt complexity affects the performance of Large Language Models (LLMs) across multiple Natural Language Processing (NLP) tasks. We introduce an incremental evaluation framework where six tasks—JSON formatting, English-Italian translation, sentiment analysis, emotion classification, topic extraction, and named entity [...] Read more.

This study investigates how increasing prompt complexity affects the performance of Large Language Models (LLMs) across multiple Natural Language Processing (NLP) tasks. We introduce an incremental evaluation framework where six tasks—JSON formatting, English-Italian translation, sentiment analysis, emotion classification, topic extraction, and named entity recognition—are progressively combined within a single prompt. Six representative open-source LLMs from different families (Llama 3.1 8B, Gemma 3 4B, Mistral 7B, Qwen3 4B, Granite 3.1 3B, and DeepSeek R1 7B) were systematically evaluated using local inference environments to ensure reproducibility. Results show that performance degradation is highly architecture-dependent: while Qwen3 4B maintained stable performance across all tasks, Gemma 3 4B and Granite 3.1 3B exhibited severe collapses in fine-grained semantic tasks. Interestingly, some models (e.g., Llama 3.1 8B and DeepSeek R1 7B) demonstrated positive transfer effects, improving in certain tasks under multitask conditions. Statistical analyses confirmed significant differences across models for structured and semantic tasks, highlighting the absence of a universal degradation rule. These findings suggest that multitask prompting resilience is shaped more by architectural design than by model size alone, and they motivate adaptive, model-specific strategies for prompt composition in complex NLP applications. Full article

(This article belongs to the Special Issue Artificial Intelligence-Driven Emerging Applications)

► Show Figures

Figure 1

24 pages, 2394 KB

Open AccessArticle

Extracting Emotions from Customer Reviews Using Text Mining, Large Language Models and Fine-Tuning Strategies

by Simona-Vasilica Oprea and Adela Bâra

J. Theor. Appl. Electron. Commer. Res. 2025, 20(3), 221; https://doi.org/10.3390/jtaer20030221 - 1 Sep 2025

Cited by 5 | Viewed by 4648

Abstract

User-generated content, such as product and app reviews, offers more than just sentiment. It provides a rich spectrum of emotional expression that reveals users’ experiences, frustrations and expectations. Traditional sentiment analysis, which typically classifies text as positive or negative, lacks the nuance needed [...] Read more.

User-generated content, such as product and app reviews, offers more than just sentiment. It provides a rich spectrum of emotional expression that reveals users’ experiences, frustrations and expectations. Traditional sentiment analysis, which typically classifies text as positive or negative, lacks the nuance needed to fully understand the emotional drivers behind customer feedback. In this research, we focus on fine-grained emotion classification using core emotions. By identifying specific emotions rather than sentiment polarity, we enable more actionable insights for e-commerce and app development, supporting strategies such as feature refinement, marketing personalization and proactive customer engagement. We leverage the Hugging Face Emotions dataset and adopt a two-phase modeling approach. In the first phase, we use a pre-trained DistilBERT model as a feature extractor and evaluate multiple classical classifiers (Logistic Regression, Support Vector Classifier, Random Forest) to establish performance baselines. In the second phase, we fine-tune the DistilBERT model end-to-end using the Hugging Face Trainer API, optimizing classification performance through task-specific adaptation. Training is tracked using the Weights & Biases (wandb) API. Comparative analysis highlights the substantial performance gains from fine-tuning, particularly in capturing informal or noisy language typical in user reviews. The final fine-tuned model is applied to a dataset of customers’ reviews, identifying the dominant emotions expressed. Our results demonstrate the practical value of emotion-aware analytics in uncovering the underlying “why” behind user sentiment, enabling more empathetic decision-making across product design, customer support and user experience (UX) strategy. Full article

► Show Figures

Figure 1

27 pages, 8196 KB

Open AccessArticle

Enhancing Electric Vehicle Charging Infrastructure Planning with Pre-Trained Language Models and Spatial Analysis: Insights from Beijing User Reviews

by Yanxin Hou, Peipei Wang, Zhuozhuang Yao, Xinqi Zheng and Ziying Chen

ISPRS Int. J. Geo-Inf. 2025, 14(9), 325; https://doi.org/10.3390/ijgi14090325 - 24 Aug 2025

Cited by 4 | Viewed by 2346

Abstract

With the growing adoption of electric vehicles, optimizing the user experience of charging infrastructure has become critical. However, extracting actionable insights from the vast number of user reviews remains a significant challenge, impeding demand-driven operational planning for charging stations and degrading the user [...] Read more.

With the growing adoption of electric vehicles, optimizing the user experience of charging infrastructure has become critical. However, extracting actionable insights from the vast number of user reviews remains a significant challenge, impeding demand-driven operational planning for charging stations and degrading the user experience. This study leverages three pre-trained language models to perform sentiment classification and multi-level topic identification on 168,129 user reviews from Beijing, facilitating a comprehensive understanding of user feedback. The experimental results reveal significant task-model specialization: RoBERTa-WWM excels in sentiment analysis (accuracy = 0.917) and fine-grained topic identification (Micro-F1 = 0.844), making it ideal for deep semantic extraction. Conversely, ELECTRA, after sufficient training, demonstrates a strong aptitude for coarse-grained topic summarization, highlighting its strength in high-level semantic generalization. Notably, the models offer capabilities beyond simple classification, including autonomous label normalization and the extraction of valuable information from comments with low information density. Furthermore, integrating textual and spatial analyses revealed striking patterns. We identified an urban–rural emotional gap—suburban users are more satisfied despite fewer facilities—and used geographically weighted regression (GWR) to quantify the spatial differences in the factors affecting user satisfaction in Beijing’s districts. We identified three types of areas requiring differentiated strategies, as follows: the northwestern region is highly sensitive to equipment quality, the central urban area has a complex relationship between supporting facilities and satisfaction, and the emerging adoption area is more sensitive to accessibility and price factors. These findings offer a data-driven framework for charging infrastructure planning, enabling operators to base decisions on real-world user feedback and tailor solutions to specific local contexts. Full article

► Show Figures

Figure 1

37 pages, 5086 KB

Open AccessArticle

Global Embeddings, Local Signals: Zero-Shot Sentiment Analysis of Transport Complaints

by Aliya Nugumanova, Daniyar Rakhimzhanov and Aiganym Mansurova

Informatics 2025, 12(3), 82; https://doi.org/10.3390/informatics12030082 - 14 Aug 2025

Cited by 2 | Viewed by 3668

Abstract

Public transport agencies must triage thousands of multilingual complaints every day, yet the cost of training and serving fine-grained sentiment analysis models limits real-time deployment. The proposed “one encoder, any facet” framework therefore offers a reproducible, resource-efficient alternative to heavy fine-tuning for domain-specific [...] Read more.

Public transport agencies must triage thousands of multilingual complaints every day, yet the cost of training and serving fine-grained sentiment analysis models limits real-time deployment. The proposed “one encoder, any facet” framework therefore offers a reproducible, resource-efficient alternative to heavy fine-tuning for domain-specific sentiment analysis or opinion mining tasks on digital service data. To the best of our knowledge, we are the first to test this paradigm on operational multilingual complaints, where public transport agencies must prioritize thousands of Russian- and Kazakh-language messages each day. A human-labelled corpus of 2400 complaints is embedded with five open-source universal models. Obtained embeddings are matched to semantic “anchor” queries that describe three distinct facets: service aspect (eight classes), implicit frustration, and explicit customer request. In the strict zero-shot setting, the best encoder reaches 77% accuracy for aspect detection, 74% for frustration, and 80% for request; taken together, these signals reproduce human four-level priority in 60% of cases. Attaching a single-layer logistic probe on top of the frozen embeddings boosts performance to 89% for aspect, 83–87% for the binary facets, and 72% for end-to-end triage. Compared with recent fine-tuned sentiment analysis systems, our pipeline cuts memory demands by two orders of magnitude and eliminates task-specific training yet narrows the accuracy gap to under five percentage points. These findings indicate that a single frozen encoder, guided by handcrafted anchors and an ultra-light head, can deliver near-human triage quality across multiple pragmatic dimensions, opening the door to low-cost, language-agnostic monitoring of digital-service feedback. Full article

(This article belongs to the Special Issue Practical Applications of Sentiment Analysis)

► Show Figures

Figure 1

25 pages, 2838 KB

Open AccessArticle

BHE+ALBERT-Mixplus: A Distributed Symmetric Approximate Homomorphic Encryption Model for Secure Short-Text Sentiment Classification in Teaching Evaluations

by Jingren Zhang, Siti Sarah Maidin and Deshinta Arrova Dewi

Symmetry 2025, 17(6), 903; https://doi.org/10.3390/sym17060903 - 7 Jun 2025

Viewed by 1297

Abstract

This study addresses the sentiment classification of short texts in teaching evaluations. To mitigate concerns regarding data security in cloud-based sentiment analysis and to overcome the limited feature extraction capacity of traditional deep-learning methods, we propose a distributed symmetric approximate homomorphic hybrid sentiment [...] Read more.

This study addresses the sentiment classification of short texts in teaching evaluations. To mitigate concerns regarding data security in cloud-based sentiment analysis and to overcome the limited feature extraction capacity of traditional deep-learning methods, we propose a distributed symmetric approximate homomorphic hybrid sentiment classification model, denoted BHE+ALBERT-Mixplus. To enable homomorphic encryption of non-polynomial functions within the ALBERT-Mixplus architecture—a mixing-and-enhancement variant of ALBERT—we introduce the BHE (BERT-based Homomorphic Encryption) algorithm. The BHE establishes a distributed symmetric approximation workflow, constructing a cloud–user symmetric encryption framework. Within this framework, simplified computations and mathematical approximations are applied to handle non-polynomial operations (e.g., GELU, Softmax, and LayerNorm) under the CKKS homomorphic-encryption scheme. Consequently, the ALBERT-Mixplus model can securely perform classification on encrypted data without compromising utility. To improve feature extraction and enhance prediction accuracy in sentiment classification, ALBERT-Mixplus incorporates two core components: 1. A meta-information extraction layer, employing a lightweight pre-trained ALBERT model to capture extensive general semantic knowledge and thereby bolster robustness to noise. 2. A hybrid feature-extraction layer, which fuses a bidirectional gated recurrent unit (BiGRU) with a multi-scale convolutional neural network (MCNN) to capture both global contextual dependencies and fine-grained local semantic features across multiple scales. Together, these layers enrich the model’s deep feature representations. Experimental results on the TAD-2023 and SST-2 datasets demonstrate that BHE+ALBERT-Mixplus achieves competitive improvements in key evaluation metrics compared to mainstream models, despite a slight increase in computational overhead. The proposed framework enables secure analysis of diverse student feedback while preserving data privacy. This allows marginalized student groups to benefit equally from AI-driven insights, thereby embodying the principles of educational equity and inclusive education. Moreover, through its innovative distributed encryption workflow, the model enhances computational efficiency while promoting environmental sustainability by reducing energy consumption and optimizing resource allocation. Full article

(This article belongs to the Section Computer)

► Show Figures

Figure 1

Search Results (39)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (39)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI