MDPI - Publisher of Open Access Journals

14 pages, 238 KiB

Open AccessArticle

Magic at the Crossroads: Moral Dissonance and Repair in the Wizarding World

by Ulugbek Ochilov

Humanities 2025, 14(7), 148; https://doi.org/10.3390/h14070148 - 14 Jul 2025

Viewed by 394

The Harry Potter fandom community around the world prefers a universe of wizards and witches that includes all people, but also has concerns about the author’s perspective regarding gender identity. This disjunction paralyzes the cultural reader with moral confusion, which is a danger [...] Read more.

The Harry Potter fandom community around the world prefers a universe of wizards and witches that includes all people, but also has concerns about the author’s perspective regarding gender identity. This disjunction paralyzes the cultural reader with moral confusion, which is a danger to their emotional investment in the text. Although scholars have analyzed this phenomenon using fragmented prisms, such as social media activism, cognitive engagement, translation, pedagogy, and fan creativity, there is no unifying model that can be used to understand why reading pleasure endures. This article aims to fill this gap by examining these strands of research in a divergent manner, adopting a convergent mixed-methods study approach. Based on neurocognitive (EEG) values, cross-cultural focus groups, social media analysis, and corpus linguistics, we outline the terrain of reader coping mechanisms. We identify separate fan fractions and examine the corresponding practices. The results are summarized by proposing a model called the MDRL (Moral dissonance repair loop) which is a theoretical model that shows how translation smoothing, pedagogical reframing and fan-based re-moralization interact with one another in creating a system that enables the reader to be collectively able to obtain their relations with the text back to a manageable point and continue being engaged. This model makes a theoretical contribution to new areas in the study of fans, moral psychology, and cognitive literature. Full article

(This article belongs to the Special Issue World Mythology and Its Connection to Nature and/or Ecocriticism)

32 pages, 1881 KiB

Open AccessArticle

LLM and Pattern Language Synthesis: A Hybrid Tool for Human-Centered Architectural Design

by Bruno Postle and Nikos A. Salingaros

Buildings 2025, 15(14), 2400; https://doi.org/10.3390/buildings15142400 - 9 Jul 2025

Viewed by 467

Abstract

This paper combines Christopher Alexander’s pattern language with generative AI into a hybrid design framework. The result is a narrative synthesis that can be useful for informed project design. Advanced large language models (LLMs) enable the real-time synthesis of design patterns, making complex [...] Read more.

This paper combines Christopher Alexander’s pattern language with generative AI into a hybrid design framework. The result is a narrative synthesis that can be useful for informed project design. Advanced large language models (LLMs) enable the real-time synthesis of design patterns, making complex architectural choices accessible and comprehensible to stakeholders without specialized architectural knowledge. A lightweight, web-based tool lets project teams rapidly assemble context-specific subsets of Alexander’s 253 patterns, reducing a traditionally unwieldy 1166-page corpus to a concise, shareable list. Demonstrated through a case study of a university department building, this method results in environments that are psychologically welcoming, fostering health, productivity, and emotional well-being. LLMs translate these curated patterns into vivid experiential narratives—complete with neuroscientifically informed ornamentation. LLMs produce representative images from the verbal narrative, revealing a surprisingly traditional design that was never input as a prompt. Two separate LLMs (for cross-checking) then predict the pattern-generated design to catalyze improved productivity as compared to a standard campus building. By bridging abstract design principles and concrete human experience, this approach democratizes architectural planning grounded on Alexander’s human-centered, participatory ethos. Full article

(This article belongs to the Section Architectural Design, Urban Science, and Real Estate)

► Show Figures

Figure 1

24 pages, 1664 KiB

Open AccessReview

A Comprehensive Review of Multimodal Emotion Recognition: Techniques, Challenges, and Future Directions

by You Wu, Qingwei Mi and Tianhan Gao

Biomimetics 2025, 10(7), 418; https://doi.org/10.3390/biomimetics10070418 - 27 Jun 2025

Viewed by 1766

Abstract

This paper presents a comprehensive review of multimodal emotion recognition (MER), a process that integrates multiple data modalities such as speech, visual, and text to identify human emotions. Grounded in biomimetics, the survey frames MER as a bio-inspired sensing paradigm that emulates the [...] Read more.

This paper presents a comprehensive review of multimodal emotion recognition (MER), a process that integrates multiple data modalities such as speech, visual, and text to identify human emotions. Grounded in biomimetics, the survey frames MER as a bio-inspired sensing paradigm that emulates the way humans seamlessly fuse multisensory cues to communicate affect, thereby transferring principles from living systems to engineered solutions. By leveraging various modalities, MER systems offer a richer and more robust analysis of emotional states compared to unimodal approaches. The review covers the general structure of MER systems, feature extraction techniques, and multimodal information fusion strategies, highlighting key advancements and milestones. Additionally, it addresses the research challenges and open issues in MER, including lightweight models, cross-corpus generalizability, and the incorporation of additional modalities. The paper concludes by discussing future directions aimed at improving the accuracy, explainability, and practicality of MER systems for real-world applications. Full article

(This article belongs to the Special Issue Intelligent Human–Robot Interaction: 4th Edition)

► Show Figures

Figure 1

20 pages, 3062 KiB

Open AccessEditor’s ChoiceArticle

Cognitive Networks and Text Analysis Identify Anxiety as a Key Dimension of Distress in Genuine Suicide Notes

by Massimo Stella, Trevor James Swanson, Andreia Sofia Teixeira, Brianne N. Richson, Ying Li, Thomas T. Hills, Kelsie T. Forbush and David Watson

Big Data Cogn. Comput. 2025, 9(7), 171; https://doi.org/10.3390/bdcc9070171 - 27 Jun 2025

Viewed by 611

Abstract

Understanding the mindset of people who die by suicide remains a key research challenge. We map conceptual and emotional word–word co-occurrences in 139 genuine suicide notes and in reference word lists, an Emotional Recall Task, from 200 individuals grouped by high/low depression, anxiety, [...] Read more.

Understanding the mindset of people who die by suicide remains a key research challenge. We map conceptual and emotional word–word co-occurrences in 139 genuine suicide notes and in reference word lists, an Emotional Recall Task, from 200 individuals grouped by high/low depression, anxiety, and stress levels on DASS-21. Positive words cover most of the suicide notes’ vocabulary; however, co-occurrences in suicide notes overlap mostly with those produced by individuals with low anxiety (Jaccard index of 0.42 for valence and 0.38 for arousal). We introduce a “words not said” method: It removes every word that corpus A shares with a comparison corpus B and then checks the emotions of “residual” words in

A - B

. With no leftover emotions, A and B are similar in expressing the same emotions. Simulations indicate this method can classify high/low levels of depression, anxiety and stress with 80% accuracy in a balanced task. After subtracting suicide note words, only the high-anxiety corpus displays no significant residual emotions. Our findings thus pin anxiety as a key latent feature of suicidal psychology and offer an interpretable language-based marker for suicide risk detection. Full article

► Show Figures

Figure 1

18 pages, 4253 KiB

Open AccessArticle

The Emotional Landscape of Technological Innovation: A Data-Driven Case Study of ChatGPT’s Launch

by Lowri Williams and Pete Burnap

Informatics 2025, 12(3), 58; https://doi.org/10.3390/informatics12030058 - 22 Jun 2025

Viewed by 728

Abstract

The rapid development and deployment of artificial intelligence (AI) technologies have sparked intense public interest and debate. While these innovations promise to revolutionise various aspects of human life, it is crucial to understand the complex emotional responses they elicit from potential adopters and [...] Read more.

The rapid development and deployment of artificial intelligence (AI) technologies have sparked intense public interest and debate. While these innovations promise to revolutionise various aspects of human life, it is crucial to understand the complex emotional responses they elicit from potential adopters and users. Such findings can offer crucial guidance for stakeholders involved in the development, implementation, and governance of AI technologies like OpenAI’s ChatGPT, a large language model (LLM) that garnered significant attention upon its release, enabling more informed decision-making regarding potential challenges and opportunities. While previous studies have employed data-driven approaches towards investigating public reactions to emerging technologies, they often relied on sentiment polarity analysis, which categorises responses as positive or negative. However, this binary approach fails to capture the nuanced emotional landscape surrounding technological adoption. This paper overcomes this limitation by presenting a comprehensive analysis for investigating the emotional landscape surrounding technology adoption by using the launch of ChatGPT as a case study. In particular, a large corpus of social media texts containing references to ChatGPT was compiled. Text mining techniques were applied to extract emotions, capturing a more nuanced and multifaceted representation of public reactions. This approach allows the identification of specific emotions such as excitement, fear, surprise, and frustration, providing deeper insights into user acceptance, integration, and potential adoption of the technology. By analysing this emotional landscape, we aim to provide a more comprehensive understanding of the factors influencing ChatGPT’s reception and potential long-term impact. Furthermore, we employ topic modelling to identify and extract the common themes discussed across the dataset. This additional layer of analysis allows us to understand the specific aspects of ChatGPT driving different emotional responses. By linking emotions to particular topics, we gain a more contextual understanding of public reaction, which can inform decision-making processes in the development, deployment, and regulation of AI technologies. Full article

(This article belongs to the Section Big Data Mining and Analytics)

► Show Figures

Figure 1

24 pages, 2269 KiB

Open AccessArticle

This Is the Way People Are Negative Anymore: Mapping Emotionally Negative Affect in Syntactically Positive Anymore Through Sentiment Analysis of Tweets

by Christopher Strelluf and Thomas T. Hills

Languages 2025, 10(6), 136; https://doi.org/10.3390/languages10060136 - 10 Jun 2025

Viewed by 1086

Abstract

The adverb anymore is standardly a negative polarity item (NPI), which must be licensed by triggers of non-positive polarity. Some Englishes also allow anymore in positive-polarity clauses. Linguists have posited that this non-polarity anymore (NPAM) carries a feature of negative affect. However, this [...] Read more.

The adverb anymore is standardly a negative polarity item (NPI), which must be licensed by triggers of non-positive polarity. Some Englishes also allow anymore in positive-polarity clauses. Linguists have posited that this non-polarity anymore (NPAM) carries a feature of negative affect. However, this claim is based on elicited judgments, and linguists have argued that respondents cannot reliably evaluate NPAM via conscious judgment. To solve this problem, we employ sentiment analysis to examine the relationship between NPAM and negative affect in a Twitter corpus. Using two complementary sentiment analytic frameworks, we demonstrate that words occurring with NPAM have lower valence, higher arousal, and lower dominance than words occurring with NPI-anymore. Broadly, this confirms NPAM’s association with negative affect in natural-language productions. We additionally identify inter- and intra-regional differences in affective dimensions, as well as variability across different types of NPI trigger, showing that the relationship between negative affect and NPAM is not monolithic dialectally, syntactically, or semantically. The project demonstrates the utility of sentiment analysis for examining emotional characteristics of low-frequency variables, providing a new tool for dialectology, micro-syntax, and variationist sociolinguistics. Full article

(This article belongs to the Special Issue Linguistics of Social Media)

► Show Figures

Figure 1

26 pages, 433 KiB

Open AccessReview

Hyperarousal, Dissociation, Emotion Dysregulation and Re-Experiencing—Towards Understanding Molecular Aspects of PTSD Symptoms

by Aleksandra Brzozowska and Jakub Grabowski

Int. J. Mol. Sci. 2025, 26(11), 5216; https://doi.org/10.3390/ijms26115216 - 29 May 2025

Viewed by 1354

Abstract

Approximately 70% of people will experience a traumatic event in their lifetime, but post-traumatic stress disorder (PTSD) will only develop in 3.9% and complex post-traumatic stress disorder (CPTSD) in 1–8% of the population worldwide, although in some countries (e.g., Poland and Northern Ireland) [...] Read more.

Approximately 70% of people will experience a traumatic event in their lifetime, but post-traumatic stress disorder (PTSD) will only develop in 3.9% and complex post-traumatic stress disorder (CPTSD) in 1–8% of the population worldwide, although in some countries (e.g., Poland and Northern Ireland) it will develop in a much higher percentage. Stress-related disorders have a complex pathogenesis involving neurophysiological, genetic, epigenetic, neuroendocrine and environmental factors. This article reviews the current state of knowledge on the molecular aspects of selected PTSD symptoms: hypervigilance, re-experiencing, emotion dysregulation and dissociation, i.e., the symptoms with strong neurobiological components. Among analysed susceptibility factors are specific gene polymorphisms (e.g., FKBP5, COMT, CHRNA5, CRHR1, 5-HTTLPR, ADCY8 and DRD2) and their interactions with the environment, changes in the HPA axis, adrenergic hyperactivity and disturbances in the activity of selected anatomical structures (including the amygdala, prefrontal cortex, corpus callosum, anterior cingulate gyrus and hippocampus). It is worth noting that therapeutic methods with proven effectiveness in PTSD (TF-CBT and EMDR) have a substantial neurobiological rationale. Molecular aspects seem crucial when searching for effective screening/diagnostic methods and new potential therapeutic options. Full article

(This article belongs to the Special Issue Molecular Mechanisms Underlying Vulnerability to Stress-Related Disorders)

21 pages, 6196 KiB

Open AccessArticle

Building a Gender-Bias-Resistant Super Corpus as a Deep Learning Baseline for Speech Emotion Recognition

by Babak Abbaschian and Adel Elmaghraby

Sensors 2025, 25(7), 1991; https://doi.org/10.3390/s25071991 - 22 Mar 2025

Viewed by 582

Abstract

The focus on Speech Emotion Recognition has dramatically increased in recent years, driven by the need for automatic speech-recognition-based systems and intelligent assistants to enhance user experience by incorporating emotional content. While deep learning techniques have significantly advanced SER systems, their robustness concerning [...] Read more.

The focus on Speech Emotion Recognition has dramatically increased in recent years, driven by the need for automatic speech-recognition-based systems and intelligent assistants to enhance user experience by incorporating emotional content. While deep learning techniques have significantly advanced SER systems, their robustness concerning speaker gender and out-of-distribution data has not been thoroughly examined. Furthermore, standards for SER remain rooted in landmark papers from the 2000s, even though modern deep learning architectures can achieve comparable or superior results to the state of the art of that era. In this research, we address these challenges by creating a new super corpus from existing databases, providing a larger pool of samples. We benchmark this dataset using various deep learning architectures, setting a new baseline for the task. Additionally, our experiments reveal that models trained on this super corpus demonstrate superior generalization and accuracy and exhibit lower gender bias compared to models trained on individual databases. We further show that traditional preprocessing techniques, such as denoising and normalization, are insufficient to address inherent biases in the data. However, our data augmentation approach effectively shifts these biases, improving model fairness across gender groups and emotions and, in some cases, fully debiasing the models. Full article

(This article belongs to the Special Issue Emotion Recognition and Cognitive Behavior Analysis Based on Sensors)

► Show Figures

Graphical abstract

13 pages, 825 KiB

Open AccessArticle

Religious Passions and Prayer Channeling Divine Disclosure: The Testimony of the Fourth-Century Syrian Fathers

by Miklós Vassányi

Religions 2025, 16(3), 305; https://doi.org/10.3390/rel16030305 - 27 Feb 2025

Viewed by 547

Abstract

In this paper, I investigate what role religious emotions, passions, and prayer may play in what may be called the religion of the heart in early Syrian theology, especially in the several works of ’Aphrahaṭ, Saint Ephrem, and the anonymous collection titled the [...] Read more.

In this paper, I investigate what role religious emotions, passions, and prayer may play in what may be called the religion of the heart in early Syrian theology, especially in the several works of ’Aphrahaṭ, Saint Ephrem, and the anonymous collection titled the Book of Steps, all from the fourth century. In the original Syriac sources, one may see how in this vast corpus, religious emotions and prayer act as a corridor channeling the believer’s striving for the disclosure of the mysteries surrounding God. It will also be shown that while these sources advocate in unison for the heart’s function in connecting the believer with God, their respective interpretive theological contexts are different. These points are substantiated by virtue of a number of citations from the original texts. Full article

(This article belongs to the Special Issue Varieties of Revelation: Scripture, Theology, and Philosophy in the Perspective of Divine Disclosure)

28 pages, 5903 KiB

Open AccessArticle

Anthropological Insights into Emotion Semantics in Intangible Cultural Heritage Museums: A Case Study of Eastern Sichuan, China

by Jiaman Li, Maoen He, Zi Yang and Kin Wai Michael Siu

Electronics 2025, 14(5), 891; https://doi.org/10.3390/electronics14050891 - 24 Feb 2025

Cited by 2 | Viewed by 1328

Abstract

The preservation of intangible cultural heritage (ICH) has transitioned from “static” and “living” approaches to a “digital ecosystem”, becoming a significant topic of anthropological research. This study, adopting an anthropological perspective, integrates sentiment semantic analysis with user identity classification to propose the Identity [...] Read more.

The preservation of intangible cultural heritage (ICH) has transitioned from “static” and “living” approaches to a “digital ecosystem”, becoming a significant topic of anthropological research. This study, adopting an anthropological perspective, integrates sentiment semantic analysis with user identity classification to propose the Identity and Sentiment-Centered Framework for Intangible Cultural Heritage (ISC-ICH). Drawing on four types of ICH museums in Eastern Sichuan, China—Nanchong Langzhong Wang Shadow Puppetry Museum, Bazhong Pingchang Fanshan Jiaozi Base, Guang’an Eastern Sichuan Folk Museum, and Dazhou ICH Exhibition Hall—as case studies, this research analyzes the core factors contributing to the audience’s sense of local identity, including its composition, emotional needs, and cultural interaction. The findings reveal that: (1) “Explorers” and “Experience Seekers” constitute the primary audience groups, with their emotional evaluations closely tied to cultural depth and interactivity. (2) The digital transformation of ICH museums faces challenges such as resource limitations, festival-centric phenomena, the rise of “internet celebrity” trends, and technological homogenization. This paper introduces a culturally tailored corpus and a comprehensive evaluation framework, highlighting the dynamic interaction between ICH and its audience. Additionally, it proposes effective digital strategies to enhance the social and cultural identity of ICH museums in peripheral regions. Full article

(This article belongs to the Special Issue Metaverse and Digital Twins, 2nd Edition)

► Show Figures

Figure 1

20 pages, 1828 KiB

Open AccessArticle

Research on Sentiment Analysis of Online Public Opinion Based on RoBERTa–BiLSTM–Attention Model

by Jiangao Deng and Yue Liu

Appl. Sci. 2025, 15(4), 2148; https://doi.org/10.3390/app15042148 - 18 Feb 2025

Cited by 4 | Viewed by 2272

Abstract

Public opinion comments are important for the public to express their emotions and demands. Accordingly, identifying the public emotions contained in comments and taking corresponding countermeasures according to the changes in the emotions are of great theoretical and practical significance to online public [...] Read more.

Public opinion comments are important for the public to express their emotions and demands. Accordingly, identifying the public emotions contained in comments and taking corresponding countermeasures according to the changes in the emotions are of great theoretical and practical significance to online public opinion management. This study took a public opinion event at a college as an example. Firstly, the microblogs and comment data related to the event were crawled with Python coding, and pre-processing operations such as cleaning, word splitting, and de-noising were carried out; then, the stage of public opinion was divided into phases based on the daily public opinion sound volume, Baidu index, and key time points of the event. Secondly, for sentiment analysis, a supplementary sentiment dictionary of the event was constructed based on the SO-PMI algorithm and merged with the commonly used sentiment dictionary to pre-annotate the sentiment corpus; then, the RoBERTa–BiLSTM–Attention model was constructed to classify the sentiment of microblog comments; after that, four evaluation indexes were selected and ablation experiments were set up to verify the performance of the model. Finally, based on the results of the sentiment classification, we drew public opinion trends and sentiment evolution graphs for analysis. The results showed that the supplementary dictionary constructed based on the SO-PMI algorithm significantly improved the pre-labelling accuracy. The RoBERTa–BiLSTM–Attention model achieved 91.56%, 90.87%, 91.07%, and 91.17% in accuracy, precision, recall, and F1-score, respectively. The situation notification, expert response, regulatory dynamics, and secondary public opinion will trigger significant fluctuations in the volume of public opinion and public sentiment. Full article

► Show Figures

Figure 1

21 pages, 1753 KiB

Open AccessArticle

Explainable Deep Learning for COVID-19 Vaccine Sentiment in Arabic Tweets Using Multi-Self-Attention BiLSTM with XLNet

by Asmaa Hashem Sweidan, Nashwa El-Bendary, Shereen A. Taie, Amira M. Idrees and Esraa Elhariri

Big Data Cogn. Comput. 2025, 9(2), 37; https://doi.org/10.3390/bdcc9020037 - 10 Feb 2025

Viewed by 1117

Abstract

The COVID-19 pandemic has generated a vast corpus of online conversations regarding vaccines, predominantly on social media platforms like X (formerly known as Twitter). However, analyzing sentiment in Arabic text is challenging due to the diverse dialects and lack of readily available sentiment [...] Read more.

The COVID-19 pandemic has generated a vast corpus of online conversations regarding vaccines, predominantly on social media platforms like X (formerly known as Twitter). However, analyzing sentiment in Arabic text is challenging due to the diverse dialects and lack of readily available sentiment analysis resources for the Arabic language. This paper proposes an explainable Deep Learning (DL) approach designed for sentiment analysis of Arabic tweets related to COVID-19 vaccinations. The proposed approach utilizes a Bidirectional Long Short-Term Memory (BiLSTM) network with Multi-Self-Attention (MSA) mechanism for capturing contextual impacts over long spans within the tweets, while having the sequential nature of Arabic text constructively learned by the BiLSTM model. Moreover, the XLNet embeddings are utilized to feed contextual information into the model. Subsequently, two essential Explainable Artificial Intelligence (XAI) methods, namely Local Interpretable Model-Agnostic Explanations (LIME) and SHapley Additive exPlanations (SHAP), have been employed for gaining further insights into the features’ contributions to the overall model performance and accordingly achieving reasonable interpretation of the model’s output. Obtained experimental results indicate that the combined XLNet with BiLSTM model outperforms other implemented state-of-the-art methods, achieving an accuracy of 93.2% and an F-measure of 92% for average sentiment classification. The integration of LIME and SHAP techniques not only enhanced the model’s interpretability, but also provided detailed insights into the factors that influence the classification of emotions. These findings underscore the model’s effectiveness and reliability for sentiment analysis in low-resource languages such as Arabic. Full article

(This article belongs to the Special Issue Application of Deep Learning and Convolution Neural Networks for Social Healthcare)

► Show Figures

Figure 1

31 pages, 1304 KiB

Open AccessArticle

Characterising Student Teachers’ Noticing Habits in Technology-Enhanced Dialogic Reflection

by Yanna Li

Educ. Sci. 2024, 14(12), 1393; https://doi.org/10.3390/educsci14121393 - 19 Dec 2024

Cited by 1 | Viewed by 1154

Abstract

This research responds to the increasing call to hone Student Teachers’ (ST) skill of selective attention or professional noticing as an essential early step toward developing noticing in the classroom environment. Furthering the line of research on teacher noticing via videos, this study [...] Read more.

This research responds to the increasing call to hone Student Teachers’ (ST) skill of selective attention or professional noticing as an essential early step toward developing noticing in the classroom environment. Furthering the line of research on teacher noticing via videos, this study aims to identify the distinguishing features of STs’ noticing in a Corpus Linguistics approach and enhance our understanding of STs’ habitual ways of thinking, doing, and feeling in using videos to reflect collaboratively. Participants were 40 final-year STs majoring in English Language Education and five tutors from the same university. During their 8 weeks of professional practicum, STs recorded their classes and reflected on their practice using the Self-Evaluation of Teacher Talk through Video Enhanced Observation (SETTVEO) tag set and as part of online professional learning communities. A 200,000-word Corpus of Dialogic Reflection (CoDR) was constructed and analysed using the #LancsBox 6.0 tool. Findings highlight novice teachers’ unconsciousness or problematisation of their personal agency, haste in proposing alternative practices, and cognitive or emotional dissonance when they analyse their own videos in group settings. This study has implications for the guidance needed in technology-enhanced dialogic reflection. It suggests how teacher educators could tailor their support to the reflective and professional needs of novice teachers for a more productive, transformative reflection and teacher-learning experience. Full article

(This article belongs to the Special Issue Technology and Language Teacher Education)

► Show Figures

Figure 1

14 pages, 769 KiB

Open AccessArticle

Speech Emotion Recognition Using Multi-Scale Global–Local Representation Learning with Feature Pyramid Network

by Yuhua Wang, Jianxing Huang, Zhengdao Zhao, Haiyan Lan and Xinjia Zhang

Appl. Sci. 2024, 14(24), 11494; https://doi.org/10.3390/app142411494 - 10 Dec 2024

Cited by 1 | Viewed by 1467

Abstract

Speech emotion recognition (SER) is important in facilitating natural human–computer interactions. In speech sequence modeling, a vital challenge is to learn context-aware sentence expression and temporal dynamics of paralinguistic features to achieve unambiguous emotional semantic understanding. In previous studies, the SER method based [...] Read more.

Speech emotion recognition (SER) is important in facilitating natural human–computer interactions. In speech sequence modeling, a vital challenge is to learn context-aware sentence expression and temporal dynamics of paralinguistic features to achieve unambiguous emotional semantic understanding. In previous studies, the SER method based on the single-scale cascade feature extraction module could not effectively preserve the temporal structure of speech signals in the deep layer, downgrading the sequence modeling performance. To address these challenges, this paper proposes a novel multi-scale feature pyramid network. The enhanced multi-scale convolutional neural networks (MSCNNs) significantly improve the ability to extract multi-granular emotional features. Experimental results on the IEMOCAP corpus demonstrate the effectiveness of the proposed approach, achieving a weighted accuracy (WA) of 71.79% and an unweighted accuracy (UA) of 73.39%. Furthermore, on the RAVDESS dataset, the model achieves an unweighted accuracy (UA) of 86.5%. These results validate the system’s performance and highlight its competitive advantage. Full article

(This article belongs to the Special Issue Selected Papers from CCF 39th China Computer Application Conference (CCF NCCA 2024))

► Show Figures

Figure 1

24 pages, 3815 KiB

Open AccessArticle

A Multi-Level Embedding Framework for Decoding Sarcasm Using Context, Emotion, and Sentiment Feature

by Maryam Khanian Najafabadi, Thoon Zar Chi Ko, Saman Shojae Chaeikar and Nasrin Shabani

Electronics 2024, 13(22), 4429; https://doi.org/10.3390/electronics13224429 - 12 Nov 2024

Cited by 3 | Viewed by 1550

Abstract

Sarcasm detection in text poses significant challenges for traditional sentiment analysis, as it often requires an understanding of context, word meanings, and emotional undertones. For example, in the sentence “I totally love working on Christmas holiday”, detecting sarcasm depends on capturing the contrast [...] Read more.

Sarcasm detection in text poses significant challenges for traditional sentiment analysis, as it often requires an understanding of context, word meanings, and emotional undertones. For example, in the sentence “I totally love working on Christmas holiday”, detecting sarcasm depends on capturing the contrast between affective words and their context. Existing methods often focus on single-embedding levels, such as word-level or affective-level, neglecting the importance of multi-level context. In this paper, we propose SAWE (Sentence, Affect, and Word Embeddings), a framework that combines sentence-level, affect-level, and context-dependent word embeddings to improve sarcasm detection. We use pre-trained transformer models SBERT and RoBERTa, enhanced with a bidirectional GRU and self-attention, alongside SenticNet to extract affective words. The combined embeddings are processed through a CNN and classified using a multilayer perceptron (MLP). SAWE is evaluated on two benchmark datasets, Sarcasm Corpus V2 (SV2) and Self-Annotated Reddit Corpus 2.0 (SARC 2.0), outperforming previous methods, particularly on long texts, with a 4.2% improvement on F1-Score for SV2. Our results emphasize the importance of multi-level embeddings and contextual information in detecting sarcasm, demonstrating a new direction for future research. Full article

(This article belongs to the Special Issue Signal and Image Processing Applications in Artificial Intelligence)

► Show Figures

Figure 1

Search Results (97)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (97)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI