MDPI - Publisher of Open Access Journals

18 pages, 4935 KB

Open AccessArticle

Forensic Analysis for Source Camera Identification from EXIF Metadata

by Pengpeng Yang, Chen Zhou, Daniele Baracchi, Dasara Shullani, Yaobin Zou and Alessandro Piva

J. Imaging 2026, 12(3), 110; https://doi.org/10.3390/jimaging12030110 - 4 Mar 2026

Viewed by 257

Source camera identification on smartphones constitutes a fundamental task in multimedia forensics, providing essential support for applications such as image copyright protection, illegal content tracking, and digital evidence verification. Numerous techniques have been developed for this task over the past decades. Among existing [...] Read more.

Source camera identification on smartphones constitutes a fundamental task in multimedia forensics, providing essential support for applications such as image copyright protection, illegal content tracking, and digital evidence verification. Numerous techniques have been developed for this task over the past decades. Among existing approaches, Photo-Response Non-Uniformity (PRNU) has been widely recognized as a reliable device-specific fingerprint and has demonstrated remarkable performance in real-world applications. Nevertheless, the rapid advancement of computational photography technologies has introduced significant challenges: modern devices often exhibit anomalous behaviors under PRNU-based analysis. For instance, images captured by different devices may exhibit unexpected correlations, while images captured by the same device can vary substantially in their PRNU patterns. Current approaches are incapable of automatically exploring the underlying causes of these anomalous behaviors. To address this limitation, we propose a simple yet effective forensic analysis framework leveraging Exchangeable Image File Format (EXIF) metadata. Specifically, we represent EXIF metadata as type-aware word embeddings to preserve contextual information across tags. This design enables visual interpretation of the model’s decision-making process and provides complementary insights for identifying the anomalous behaviors observed in modern devices. Extensive experiments conducted on three public benchmark datasets demonstrate that the proposed method not only achieves state-of-the-art performance for source camera identification but also provides valuable insights into anomalous device behaviors. Full article

(This article belongs to the Section Biometrics, Forensics, and Security)

► Show Figures

Figure 1

27 pages, 1280 KB

Open AccessArticle

Enhancing Causal Text Detection Using Uncertainty-Weighted Machine Learning Ensembles

by Sivachandra K B, Neethu Mohan, Mithun Kumar Kar, Sikha O K and Sachin Kumar S

Informatics 2026, 13(3), 37; https://doi.org/10.3390/informatics13030037 - 2 Mar 2026

Viewed by 382

Abstract

Causal inference in text data has been a demanding objective in the field of natural language processing, mainly due to the intrinsic ambiguity and context sensitivity inherent in data, inducing uncertainty. Diminishing this uncertainty is essential in identifying reliable causal connections and advancing [...] Read more.

Causal inference in text data has been a demanding objective in the field of natural language processing, mainly due to the intrinsic ambiguity and context sensitivity inherent in data, inducing uncertainty. Diminishing this uncertainty is essential in identifying reliable causal connections and advancing predictive consistency. In this research, we introduce an uncertainty-aware ensemble architecture that combines multiple text embedding schemes with both linear and nonlinear classifiers to boost causal text detection. Both sparse and neural-level embeddings were employed, and then combined it with an ensemble weighting approach based on two uncertainty estimation techniques, namely entropy-based and KL divergence-based. Unlike conventional ensemble methods with uniform or fixed voting strategies, our approach assigns weights inversely proportional to classifier uncertainty, ensuring that confident models exert greater influence on the final decisions. Our results show that TF-IDF, through its effective word frequency weighting scheme, consistently outperforms other embedding techniques, achieving better performance across both linear and nonlinear classifiers on both datasets (News Corpus and CausalLM–Adjective group). The experimental results show that our uncertainty-aware ensemble approach enhances both calibration and confidence predictions. Entropy-based weighting improves confidence in the case of linear classifiers with accuracy, F1-score, entropy and prediction confidence values of 94.3%, 94.0%, 0.382 and 0.774, respectively, while in the case of nonlinear classifiers the KL divergence-based weighting acquires a better performance with an accuracy of 97.6%, F1-score of 97.2%, KL Mean value of around 0.055 and LogLoss of 0.221. Full article

(This article belongs to the Section Machine Learning)

► Show Figures

Figure 1

27 pages, 2619 KB

Open AccessArticle

Defamiliarization Attack: Literary Theory Enabled Discussion of LLM Safety

by Bibin Babu, Iana Agafonova, Sebastian Biedermann and Ivan Yamshchikov

Electronics 2026, 15(5), 1047; https://doi.org/10.3390/electronics15051047 - 2 Mar 2026

Viewed by 384

Abstract

This paper introduces a multi-turn large language model (LLM) jailbreaking attack called Defamiliarization, in which malicious queries are embedded within ostensibly harmless narratives. By reframing requests in “unmarked” contexts, LLMs can be coerced into producing undesirable outputs. A range of scenarios is documented, [...] Read more.

This paper introduces a multi-turn large language model (LLM) jailbreaking attack called Defamiliarization, in which malicious queries are embedded within ostensibly harmless narratives. By reframing requests in “unmarked” contexts, LLMs can be coerced into producing undesirable outputs. A range of scenarios is documented, from planning ethically dubious actions to selectively overlooking critical events in literary texts, thereby exposing the limitations of alignment strategies predicated on detecting trigger words or semantic cues. Rather than substituting vocabulary, defamiliarization manipulates context and presentation, highlighting vulnerabilities that cannot be addressed by token-level fixes alone. Beyond demonstrating the effectiveness of defamiliarization as an attack strategy, evidence is presented of a systematic relationship between model scale and susceptibility. Experiments reveal that smaller-parameter models are significantly easier to manipulate using defamiliarized prompts. This finding raises important concerns regarding the growing popularity of lightweight, locally hosted LLMs, which are favored for their lower computational requirements but may lack alignment safeguards. A more holistic approach to LLM safety is advocated—one that incorporates insights from literary theory, ethics, and user experience—treating these models as interpretive agents. By doing so, defenses against covert manipulations can be strengthened and AI systems can remain aligned with human values. Full article

(This article belongs to the Special Issue Multimedia Security in the Era of Artificial Intelligence Generated Content)

► Show Figures

Figure 1

28 pages, 806 KB

Open AccessArticle

Modeling Intelligent Judgment Formation in Public Digital Services: Cognitive and Social Pathways from a Structural Equation Perspective

by Kungwan Laovirojjanakul, Charuay Savithi and Arisaphat Suttidee

Sustainability 2026, 18(5), 2373; https://doi.org/10.3390/su18052373 - 28 Feb 2026

Viewed by 227

Abstract

This study examines intelligent judgment formation in blockchain-based public digital wallet systems within smart city environments. Drawing on an integrated framework that combines cognitive evaluation, social influence, and trust–risk appraisal, this research conceptualizes intelligent decision-making as a socially embedded and contextually enacted evaluative [...] Read more.

This study examines intelligent judgment formation in blockchain-based public digital wallet systems within smart city environments. Drawing on an integrated framework that combines cognitive evaluation, social influence, and trust–risk appraisal, this research conceptualizes intelligent decision-making as a socially embedded and contextually enacted evaluative process rather than a fixed cognitive attribute. A structural equation modeling approach is employed to analyze the interrelationships among perceived usefulness, perceived ease of use, subjective norms, social electronic word of mouth, trust–risk appraisal, attitude, and behavioral intention. The findings indicate that socially distributed information signals play a dominant role in shaping evaluative integration and decision readiness, while cognitive and institutional appraisals operate primarily through mediated pathways. The results suggest that intelligent action in public digital service ecosystems emerges from the coordinated interaction of usability perception, institutional confidence, and socially calibrated information flows. These findings contribute to theoretical extensions of technology acceptance models in public governance contexts and offer implications for the design of socially responsive digital service infrastructures. Full article

► Show Figures

Figure 1

22 pages, 392 KB

Open AccessReview

Word Sense Disambiguation with Wikipedia Entities: A Survey of Entity Linking Approaches

by Michael Angelos Simos and Christos Makris

Entropy 2026, 28(2), 236; https://doi.org/10.3390/e28020236 - 18 Feb 2026

Viewed by 410

Abstract

The inference of unstructured text semantics is a crucial preprocessing task for NLP and AI applications. Word sense disambiguation and entity linking tasks resolve ambiguous terms within unstructured text corpora to senses from a predefined knowledge source. Wikipedia has been one of the [...] Read more.

The inference of unstructured text semantics is a crucial preprocessing task for NLP and AI applications. Word sense disambiguation and entity linking tasks resolve ambiguous terms within unstructured text corpora to senses from a predefined knowledge source. Wikipedia has been one of the most popular sources due to its completeness, high link density, and multi-language support. In the context of chatbot-mediated consumption of information in recent years through implicit disambiguation and semantic representations in LLMs, Wikipedia remains an invaluable source and reference point. This survey covers methodologies for entity linking with Wikipedia, including early systems based on hyperlink statistics and semantic relatedness, methods using graph inference problem formalizations and graph label propagation algorithms, neural and contextual methods based on sense embeddings and transformers, and multimodal, cross-lingual, and cross-domain settings. Moreover, we cover semantic annotation workflows that facilitate the scaled-up use of Wikipedia-centric entity linking. We also provide an overview of the available datasets and evaluation measures. We discuss challenges such as partial coverage, NIL concepts, the level of sense definition, combining WSD and large-scale language models, as well as the complementary use of Wikidata. Full article

(This article belongs to the Special Issue Information Theoretic Learning with Its Applications)

► Show Figures

Figure 1

21 pages, 1728 KB

Open AccessArticle

Cyberbullying Detection Based on Hybrid Neural Networks and Multi-Feature Fusion

by Junkuo Cao, Yunpeng Xiong, Weiquan Wang and Guolian Chen

Information 2026, 17(2), 205; https://doi.org/10.3390/info17020205 - 16 Feb 2026

Viewed by 312

Abstract

Cyberbullying demonstrates notable metaphorical and contextual traits, characterized by a high-dimensional sparse semantic space and dynamic evolution. Pre-trained models utilize extensive textual data for learning and employ transformer-based word vector generation techniques to accurately capture intricate semantics and nuanced syntax in text. However, [...] Read more.

Cyberbullying demonstrates notable metaphorical and contextual traits, characterized by a high-dimensional sparse semantic space and dynamic evolution. Pre-trained models utilize extensive textual data for learning and employ transformer-based word vector generation techniques to accurately capture intricate semantics and nuanced syntax in text. However, although a single pre-trained model demonstrates strong performance in contextual modeling, it still faces challenges including inadequate feature representation and limited generalization capability in classifying cyberbullying texts. This study proposes a cyberbullying detection model employing BERT-BiGRU-CNN (BBGC) to address this issue. The BBGC model initially employs BERT to produce word embeddings, subsequently inputs them into a BiGRU layer to acquire sequence features, and finally utilizes a CNN for the extraction of local features. The features derived from BERT, BiGRU, and CNN are integrated, followed by the application of the softmax function to yield the final outcome of cyberbullying detection. Experimental findings indicate that the BBGC fusion model surpasses individual pre-trained models in the task of detecting cyberbullying text. Furthermore, in comparison to hybrid neural network models utilizing RoBERTa, ALBERT, DistilBERT and other pre-trained models, the BBGC model demonstrates considerable advantages. Full article

(This article belongs to the Special Issue Artificial Intelligence, Generative AI and Large Language Models: Transforming Technology and Society)

► Show Figures

Figure 1

34 pages, 7022 KB

Open AccessArticle

Quantitative Perceptual Analysis of Feature-Space Scenarios in Network Media Evaluation Using Transformer-Based Deep Learning: A Case Study of Fuwen Township Primary School in China

by Yixin Liu, Zhimin Li, Lin Luo, Simin Wang, Ruqin Wang, Ruonan Wu, Dingchang Xia, Sirui Cheng, Zejing Zou, Xuanlin Li, Yujia Liu and Yingtao Qi

Buildings 2026, 16(4), 714; https://doi.org/10.3390/buildings16040714 - 9 Feb 2026

Cited by 1 | Viewed by 402

Abstract

Against the dual backdrop of the rural revitalization strategy and the pursuit of high-quality, balanced urban–rural education, optimizing rural campus spaces has emerged as an important lever for addressing educational resource disparities and improving pedagogical quality. However, conventional evaluation of campus space optimization [...] Read more.

Against the dual backdrop of the rural revitalization strategy and the pursuit of high-quality, balanced urban–rural education, optimizing rural campus spaces has emerged as an important lever for addressing educational resource disparities and improving pedagogical quality. However, conventional evaluation of campus space optimization faces two systemic dilemmas. First, top-down decision-making often neglects the authentic needs of diverse stakeholders and place-based knowledge, resulting in spatial interventions that lose regional distinctiveness. Second, routine public participation is constrained by geographical barriers, time costs, and sample-size limitations, which can amplify professional cognitive bias and impede comprehensive feedback formation. The compounded effect of these challenges contributes to a disconnect between spatial optimization outcomes and perceived needs, thereby constraining the distinctive development of rural educational spaces. To address these constraints, this study proposes a novel method that integrates regional spatial feature recognition with digital media-based public perception assessment. At the data collection and ethical governance level, the study strictly adheres to platform compliance and academic ethics. A total of 12,800 preliminary comments were scraped from major social media platforms (e.g., Douyin, Dianping, and Xiaohongshu) and processed through a three-stage screening workflow—keyword screening–rule-based filtering–manual verification—to yield 8616 valid records covering diverse public groups across China. All user-identifying information was fully anonymized to ensure lawful use and privacy protection. At the analytical modeling level, we develop a Transformer-based deep learning system that leverages multi-head attention mechanisms to capture implicit spatial-sentiment features and metaphorical expressions embedded in review texts. Evaluation on an independent test set indicates a classification accuracy of 89.2%, aligning with balanced and stable scoring performance. Robustness is further strengthened by introducing an equal-weight alternative strategy and conducting stability checks to indicate the consistency of model outputs across weighting assumptions. At the scenario interpretation level, we combine grounded-theory coding with semantic network analysis to establish a three-tier spatial analysis framework—macro (landscape pattern/hydro-topological patterns), meso (architectural interface), and micro (teaching scenes/pedagogical scenarios)—and incorporate an interpretive stakeholder typology (tourists, residents, parents, and professional groups) to systematically identify and quantify key features shaping public spatial perception. Findings show that, at the macro level, naturally integrated scenarios—such as “campus–farmland integration” and “mountain–water embeddedness”—exhibit high affective association, aligning with the “mountain-water-field-village” spatial sequence logic and suggesting broad public endorsement of ecological campus concepts, whereas vernacular settlement-pattern scenarios receive relatively low attention due to cognitive discontinuities. At the meso level, innovative corridor strategies (e.g., framed vistas and expanded corridor spaces) strengthen the building–nature interaction and suggest latent value in stimulating exploratory spatial experience. At the micro level, place-based practice-oriented teaching scenes (e.g., intangible cultural heritage handcraft and creative workshops) achieve higher scores, aligning with the compatibility of vernacular education’s “differential esthetics,” while urban convergence-oriented interdisciplinary curriculum scenes suggest an interpretive gap relative to public expectations. These results indicate an embedded relationship between public perception and regional spatial features, which is further shaped by a multi-actor governance process—characterized by “Government + Influencers + Field Study”—that mediates how rural educational spaces are produced, communicated, and interpreted in digital environments. The study’s innovative value lies in integrating sociological theories (e.g., embeddedness) with deep learning techniques to fill the regional and multi-actor perspective gap in rural campus POE and to promote a methodological shift from “experience-based induction” toward a “data-theory” dual-drive model. The findings provide inferential evidence for rural campus renewal and optimization; the methodological pipeline is transferable to small-scale rural primary schools with media exposure and salient regional ecological characteristics, and it offers a new pathway for incorporating digital media-driven public perception feedback into planning and design practice. The research methodology of this study consists of four sequential stages, which are implemented in a systematic and progressive manner: First, data collection was conducted: Python and the Octopus Collector were used to crawl online comment data related to Fuwen Township Central Primary School, strictly complying with the user agreements of the Douyin, Dianping, and Xiaohongshu platforms. Second, semantic preprocessing was performed: The evaluation content was segmented to generate word frequency statistics and semantic networks; qualitative analysis was conducted using Origin software, and quantitative translation was realized via Sankey diagrams. Third, spatial scene coding was carried out: Combined with a spatial characteristic identification system, a macro–meso–micro three-tier classification system for spatial scene characteristics was constructed to encode and quantitatively express the textual content. Finally, sentiment quantification and correlation analysis was implemented: A deep learning model based on the Transformer framework was employed to perform sentiment quantification scoring for each comment; Sankey diagrams were used to quantitatively correlate spatial scenes with sentiment tendencies, thereby exploring the public’s perceptual associations with the architectural spatial environment of rural campuses. Full article

(This article belongs to the Section Architectural Design, Urban Science, and Real Estate)

► Show Figures

Figure 1

25 pages, 2294 KB

Open AccessArticle

SiAraSent: From Features to Deep Transformers for Large-Scale Arabic Sentiment Analysis

by Omar Almousa, Yahya Tashtoush, Anas AlSobeh, Plamen Zahariev and Omar Darwish

Big Data Cogn. Comput. 2026, 10(2), 49; https://doi.org/10.3390/bdcc10020049 - 3 Feb 2026

Viewed by 452

Abstract

Sentiment analysis of Arabic text, particularly on social media platforms, presents a formidable set of unique challenges that stem from the language’s complex morphology, its numerous dialectal variations, and the frequent and nuanced use of emojis to convey emotional context. This paper presents [...] Read more.

Sentiment analysis of Arabic text, particularly on social media platforms, presents a formidable set of unique challenges that stem from the language’s complex morphology, its numerous dialectal variations, and the frequent and nuanced use of emojis to convey emotional context. This paper presents SiAraSent, a hybrid framework that integrates traditional text representations, emoji-aware features, and deep contextual embeddings based on Arabic transformers. Starting from a strong and fully interpretable baseline built on Term Frequency–Inverse Definition Frequency (TF–IDF)-weighted character and word N-grams combined with emoji embeddings, we progressively incorporate SinaTools for linguistically informed preprocessing and AraBERT for contextualized encodings. The framework is evaluated on a large-scale dataset of 58,751 Arabic tweets labeled for sentiment polarity. Our design works within four experimental configurations: (1) a baseline traditional machine learning architecture that employs TF-IDF, N-grams, and emoji features with an Support Vector Machine (SVM) classifier; (2) an Large-language Model (LLM) feature extraction approach that leverages deep contextual embeddings from the pre-trained AraBERT model; (3) a novel hybrid fusion model that concatenates traditional morphological features, AraBERT embeddings, and emoji-based features into a high-dimensional vector; and (4) a fully fine-tuned AraBERT model specifically adapted for the sentiment classification task. Our experiments demonstrate the remarkable efficacy of our proposed framework, with the fine-tuned AraBERT architecture achieving an accuracy of 93.45%, a significant 10.89% improvement over the best traditional baseline. Full article

(This article belongs to the Special Issue Advances in Natural Language Processing and Text Mining: 2nd Edition)

► Show Figures

Figure 1

31 pages, 1573 KB

Open AccessArticle

Generalised Cross-Dialectal Arabic Question Answering Through Adaptive Code-Mixed Data Augmentation

by Maha Jarallah Althobaiti

Information 2026, 17(2), 139; https://doi.org/10.3390/info17020139 - 1 Feb 2026

Viewed by 341

Abstract

Modern Standard Arabic (MSA) and the many regional dialects differ substantially in vocabulary, morphology, and pragmatic usage. Most available annotated resources are in MSA, and zero-shot transfer from MSA to dialectal tasks suffers a large performance drop. This paper addresses generalised cross-dialectal Arabic [...] Read more.

Modern Standard Arabic (MSA) and the many regional dialects differ substantially in vocabulary, morphology, and pragmatic usage. Most available annotated resources are in MSA, and zero-shot transfer from MSA to dialectal tasks suffers a large performance drop. This paper addresses generalised cross-dialectal Arabic question answering (QA), where the context and the question are written in different Arabic varieties. We propose a training-free augmentation framework that generates code-mixed questions to bridge lexical gaps across Arabic varieties. The method produces semantically faithful, balanced code-mixed questions through the following two-stage procedure: lexicon-based partial substitution with semantic similarity and substitution-rate constraints, followed by fallback neural machine translation with word-level alignment when needed. We also introduce automated multidialectal lexicon construction using machine translation, embedding-based alignment, and semantic checks. We carry out our evaluation in a zero-shot setting, where the model is fine-tuned only on MSA and then tested on dialectal inputs using ArDQA, covering five Arabic varieties and three domains (SQuAD, Vlogs, and Narratives). Experiments show consistent improvements under context-question dialect mismatch as follows: +1.09 F1/+0.87 EM on SQuAD, +1.54/+1.25 on Vlogs, and +2.75/+2.27 on Narratives, with the largest gains for Maghrebi questions in Narratives (+12.13 F1/+8.45 EM). These results show that our method improves zero-shot cross-dialectal transfer without fine-tuning or retraining. Full article

(This article belongs to the Special Issue Natural Language Processing (NLP) with Applications and Natural Language Understanding (NLU), 2nd Edition)

► Show Figures

Graphical abstract

24 pages, 2163 KB

Open AccessArticle

KFF-Transformer: A Human–AI Collaborative Framework for Fine-Grained Argument Element Identification

by Xuxun Cai, Jincai Yang, Meng Zheng and Jianping Zhu

Appl. Sci. 2026, 16(3), 1451; https://doi.org/10.3390/app16031451 - 31 Jan 2026

Viewed by 406

Abstract

With the rapid development of intelligent computing and artificial intelligence, there is an increasing demand for efficient, interpretable, and interactive frameworks for fine-grained text analysis. In the field of argument mining, existing approaches are often constrained by sentence-level processing, limited exploitation of key [...] Read more.

With the rapid development of intelligent computing and artificial intelligence, there is an increasing demand for efficient, interpretable, and interactive frameworks for fine-grained text analysis. In the field of argument mining, existing approaches are often constrained by sentence-level processing, limited exploitation of key linguistic markers, and a lack of human–AI collaborative mechanisms, which restrict both recognition accuracy and computational efficiency. To address these challenges, this paper proposes KFF-Transformer, a computing-oriented human–AI collaborative framework for fine-grained argument element identification based on Toulmin’s model. The framework first employs an automatic key marker mining algorithm to expand a seed set of expert-labeled linguistic cues, significantly enhancing coverage and diversity. It then employs a lightweight deep learning architecture that combines BERT for contextual token encoding with a BiLSTM network enhanced by an attention mechanism to perform word-level classification of the six Toulmin elements. This approach leverages enriched key markers as critical features, enhancing both accuracy and interpretability. It should be noted that while our framework leverages BERT—a Transformer-based encoder—for contextual representation, the core sequence labeling module is based on BiLSTM and does not implement a standard Transformer block. Furthermore, a human-in-the-loop interaction mechanism is embedded to support real-time user correction and adaptive system refinement, improving robustness and practical usability. Experiments conducted on a dataset of 180 English argumentative essays demonstrate that KFF-Transformer identifies key markers in 1145 sentences and achieves an accuracy of 72.2% and an F1-score of 66.7%, outperforming a strong baseline by 3.7% and 2.8%, respectively. Moreover, the framework reduces processing time by 18.9% on CPU and achieves near-real-time performance of approximately 3.3 s on GPU. These results validate that KFF-Transformer effectively integrates linguistically grounded reasoning, efficient deep learning, and interactive design, providing a scalable and trustworthy solution for intelligent argument analysis in real-world educational applications. Full article

(This article belongs to the Special Issue Application of Smart Learning in Education)

► Show Figures

Figure 1

29 pages, 2186 KB

Open AccessArticle

Insights for Curriculum-Oriented Instruction of Programming Paradigms for Non-Computer Science Majors: Survey and Public Q&A Evidence

by Ji-Hye Oh and Hyun-Seok Park

Appl. Sci. 2026, 16(3), 1191; https://doi.org/10.3390/app16031191 - 23 Jan 2026

Viewed by 317

Abstract

This study examines how different programming paradigms are associated with learning experiences and cognitive challenges as encountered by non-computer science novice learners. Using a case-study approach situated within specific instructional contexts, we integrate survey data from undergraduate students with large-scale public question-and-answer data [...] Read more.

This study examines how different programming paradigms are associated with learning experiences and cognitive challenges as encountered by non-computer science novice learners. Using a case-study approach situated within specific instructional contexts, we integrate survey data from undergraduate students with large-scale public question-and-answer data from Stack Overflow to explore paradigm-related difficulty patterns. Four instructional contexts—C, Java, Python, and Prolog—were examined as pedagogical instantiations of imperative, object-oriented, functional-style, and logic-based paradigms using text clustering, word embedding models, and interaction-informed complexity metrics. The analysis identifies distinct patterns of learning challenges across paradigmatic contexts, including difficulties related to low-level memory management in C-based instruction, abstraction and design reasoning in object-oriented contexts, inference-driven reasoning in Prolog-based instruction, and recursion-related challenges in functional-style programming tasks. Survey responses exhibit tendencies that are broadly consistent with patterns observed in public Q&A data, supporting the use of large-scale community-generated content as a complementary source for learner-centered educational analysis. Based on these findings, the study discusses paradigm-aware instructional implications for programming education tailored to non-major learners within comparable educational settings. The results provide empirical support for differentiated instructional approaches and offer evidence-informed insights relevant to curriculum-oriented teaching and future research on adaptive learning systems. Full article

► Show Figures

Figure 1

12 pages, 216 KB

Open AccessBrief Report

Enhancing Interactive Teaching for the Next Generation of Nurses: Generative-AI-Assisted Design of a Full-Day Professional Development Workshop

by Su-I Hou

Informatics 2026, 13(1), 11; https://doi.org/10.3390/informatics13010011 - 15 Jan 2026

Viewed by 537

Abstract

Introduction: Nursing educators and clinical leaders face persistent challenges in engaging the next generation of nurses, often characterized by short attention spans, frequent phone use, and underdeveloped communication skills. This article describes the design and delivery of a full-day interactive teaching workshop for [...] Read more.

Introduction: Nursing educators and clinical leaders face persistent challenges in engaging the next generation of nurses, often characterized by short attention spans, frequent phone use, and underdeveloped communication skills. This article describes the design and delivery of a full-day interactive teaching workshop for nursing faculty, senior clinical nurses, and nurse leaders, developed using a design-thinking approach supported by generative AI. Methods: The workshop comprised four thematic sessions: (1) Learning styles across generations, (2) Interactive teaching methods, (3) Application of interactive teaching strategies, and (4) Lesson planning and transfer. Generative AI was used during planning to create icebreakers, discussion prompts, clinical teaching scenarios, and application templates. Design decisions emphasized low-tech, low-prep strategies suitable for spontaneous clinical teaching, thereby reducing barriers to adoption. Activities included emoji-card introductions, quick generational polls, colored-paper reflections, portable whiteboard brainstorming, role plays, fishbowl discussions, gallery walks, and movement-based group exercises. Participants (N = 37) were predominantly female (95%) and represented multiple generations of X, Y, and Z. Mid- and end-of-workshop reflection prompts were embedded within Sessions 2 and 4, with participants recording their responses on colored papers, which were then compiled into a single Word document for thematic analysis. Results: Thematic analysis of 59 mid- and end-workshop reflections revealed six interconnected themes, grouped into three categories: (1) engagement and experiential learning, (2) practical applicability and generational awareness, and (3) facilitation, environment, and motivation. Participants emphasized the workshop’s lively pace and hands-on design. Experiencing strategies firsthand built confidence for application, while generational awareness encouraged reflection on adapting methods for younger learners. The facilitator’s passion, personable approach, and structured use of peer learning created a psychologically safe and motivating climate, leaving participants recharged and inspired to integrate interactive methods. Discussion: The workshop illustrates how AI-assisted, design-thinking-driven professional development can model effective strategies for next-generation learners. When paired with skilled facilitation, AI-supported planning enhances engagement, fosters reflective practice, and promotes immediate transfer of interactive strategies into diverse teaching settings. Full article

(This article belongs to the Special Issue Generative AI in Higher Education: Applications, Implications, and Future Directions)

30 pages, 1553 KB

Open AccessArticle

Combining User and Venue Personality Proxies with Customers’ Preferences and Opinions to Enhance Restaurant Recommendation Performance

by Andreas Gregoriades, Herodotos Herodotou, Maria Pampaka and Evripides Christodoulou

AI 2026, 7(1), 19; https://doi.org/10.3390/ai7010019 - 9 Jan 2026

Viewed by 564

Abstract

Recommendation systems are popular information systems that help consumers manage information overload. Whilst personality has been recognised as an important factor influencing consumers’ choice, it has not yet been fully exploited in recommendation systems. This study proposes a restaurant recommendation approach that integrates [...] Read more.

Recommendation systems are popular information systems that help consumers manage information overload. Whilst personality has been recognised as an important factor influencing consumers’ choice, it has not yet been fully exploited in recommendation systems. This study proposes a restaurant recommendation approach that integrates customer personality traits, opinions and preferences, extracted either directly from online review platforms or derived from electronic word of mouth (eWOM) text using information extraction techniques. The proposed method leverages the concept of venue personality grounded in personality–brand congruence theory, which posits that customers are more satisfied with brands whose personalities align with their own. A novel model is introduced that combines fine-tuned BERT embeddings with linguistic features to infer users’ personality traits from the text of their reviews. Customers’ preferences are identified using a custom named-entity recogniser, while their opinions are extracted through structural topic modelling. The overall framework integrates neural collaborative filtering (NCF) features with both directly observed and derived information from eWOM to train an extreme gradient boosting (XGBoost) regression model. The proposed approach is compared to baseline collaborative filtering methods and state-of-the-art neural network techniques commonly used in industry. Results across multiple performance metrics demonstrate that incorporating personality, preferences and opinions significantly improves recommendation performance. Full article

► Show Figures

Figure 1

12 pages, 1115 KB

Open AccessCommunication

Linguistic Influence on Multidimensional Word Embeddings: Analysis of Ten Languages

by Anna V. Aleshina, Andrey L. Bulgakov, Yanliang Xin and Larisa S. Skrebkova

Computation 2026, 14(1), 16; https://doi.org/10.3390/computation14010016 - 9 Jan 2026

Viewed by 403

Abstract

Understanding how linguistic typology shapes multilingual embeddings is important for cross-lingual NLP. We examine static MUSE word embedding for ten diverse languages (English, Russian, Chinese, Arabic, Indonesian, German, Lithuanian, Hindi, Tajik and Persian). Using pairwise cosine distances, Random Forest classification, and UMAP visualization, [...] Read more.

Understanding how linguistic typology shapes multilingual embeddings is important for cross-lingual NLP. We examine static MUSE word embedding for ten diverse languages (English, Russian, Chinese, Arabic, Indonesian, German, Lithuanian, Hindi, Tajik and Persian). Using pairwise cosine distances, Random Forest classification, and UMAP visualization, we find that language identity and script type largely determine embedding clusters, with morphological complexity affecting cluster compactness and lexical overlap connecting clusters. The Random Forest model predicts language labels with high accuracy (≈98%), indicating strong language-specific patterns in embedding space. These results highlight script, morphology, and lexicon as key factors influencing multilingual embedding structures, informing linguistically aware design of cross-lingual models. Full article

(This article belongs to the Special Issue Recent Advances on Computational Linguistics and Natural Language Processing)

► Show Figures

Figure 1

22 pages, 957 KB

Open AccessArticle

A Hybrid Deep Learning Model Based on Local and Global Features for Amazon Product Reviews: An Optimal ALBERT-Cascade CNN Approach

by Israa Mustafa Abbas, İsmail Atacak, Sinan Toklu, Necaattin Barışçı and İbrahim Alper Doğru

Appl. Sci. 2026, 16(1), 25; https://doi.org/10.3390/app16010025 - 19 Dec 2025

Viewed by 812

Abstract

Natural Language Processing (NLP) is a valuable technology and business topic as it helps turn data into useful information with the spread of digital information. Nevertheless, there are some difficulties in its use, including the language’s complexity and the data quality. To address [...] Read more.

Natural Language Processing (NLP) is a valuable technology and business topic as it helps turn data into useful information with the spread of digital information. Nevertheless, there are some difficulties in its use, including the language’s complexity and the data quality. To address these challenges, in this study, the researchers first performed a series of ablation experiments on 14 models derived from various variations in Deep Learning (DL) methods, including A Lite BERT (ALBERT) together with Convolutional Neural Networks (CNNs), Long Short-Term Memory (LSTM), Bidirectional LSTM (BiLSTM), Max Pooling layer, and attention mechanism. Subsequently, they proposed an ALBERT-cascaded CNN hybrid model as an effective method to overcome the related challenges by evaluating the performance results obtained from these models. In the proposed model, a transformer architecture with parallel processing capability for both word and subword tokenization is used in addition to creating contextualized word embeddings. Local and global feature extraction was also performed using two 1-D CNN blocks before classification to improve the model performance. The model was optimized using an advanced hyperparameter optimization tool called OPTUNA. The findings of the experiment conducted with the proposed model were obtained based on Amazon Fashion 2023 data under 5-fold cross-validation conditions. The experimental results demonstrate that the proposed hybrid model exhibits good performance with average scores of 0.9308 (accuracy), 0.9296 (F1 score), 0.9412 (precision), 0.9182 (recall), and 0.9797 (AUC) in the validation dataset, and scores of 0.9313, 0.9305, 0.9414, 0.9199, and 0.9800 in the test dataset. In addition, comparisons of the model with models in studies using similar datasets support the experimental results and reveal that it can be used as a competitive approach for solving the problems encountered in the NLP field. Full article

(This article belongs to the Special Issue Applied Artificial Intelligence and Data Science)

► Show Figures

Figure 1

Search Results (644)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (644)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI