Next Article in Journal
Asynchronous Quantum-Resistant Blockchain for Secure Intelligence Sharing
Previous Article in Journal
Comparative Analysis of Mineralogical Composition and Radioactivity Levels in Stone Samples from the Archaeological Site of Halaesa (Sicily, Italy)
Previous Article in Special Issue
A Variational-Mode-Decomposition-Cascaded Long Short-Term Memory with Attention Model for VIX Prediction
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Predicting the Event Types in the Human Brain: A Modeling Study Based on Embedding Vectors and Large-Scale Situation Type Datasets in Mandarin Chinese

by
Xiaorui Ma
and
Hongchao Liu
*
School of Literature, Shandong University, Jinan 250100, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(11), 5916; https://doi.org/10.3390/app15115916 (registering DOI)
Submission received: 17 April 2025 / Revised: 19 May 2025 / Accepted: 22 May 2025 / Published: 24 May 2025
(This article belongs to the Special Issue Application of Artificial Intelligence and Semantic Mining Technology)

Abstract

Event types classify Chinese verbs based on the internal temporal structure of events. The categorization of verb event types is the most fundamental classification of concept types represented by verbs in the human brain. Meanwhile, event types exhibit strong predictive capabilities for exploring collocational patterns between words, making them crucial for Chinese teaching. This work focuses on constructing a statistically validated gold-standard dataset, forming the foundation for achieving high accuracy in recognizing verb event types. Utilizing a manually annotated dataset of verbs and aspectual markers’ co-occurrence features, the research conducts hierarchical clustering of Chinese verbs. The resulting dendrogram indicates that verbs can be categorized into three event types—state, activity and transition—based on semantic distance. Two approaches are employed to construct vector matrices: a supervised method that derives word vectors based on linguistic features, and an unsupervised method that uses four models to extract embedding vectors, including Word2Vec, FastText, BERT and ChatGPT. The classification of verb event types is performed using three classifiers: multinomial logistic regression, support vector machines and artificial neural networks. Experimental results demonstrate the superior performance of embedding vectors. Employing the pre-trained FastText model in conjunction with an artificial neural network classifier, the model achieves an accuracy of 98.37% in predicting 3133 verbs, thereby enabling the automatic identification of event types at the level of Chinese verbs and validating the high accuracy and practical value of embedding vectors in addressing complex semantic relationships and classification tasks. This work constructs datasets of considerable semantic complexity, comprising a substantial volume of verbs along with their feature vectors and situation type labels, which can be used for evaluating large language models in the future.
Keywords: situation type; vector space model; embedding vector; training dataset situation type; vector space model; embedding vector; training dataset

Share and Cite

MDPI and ACS Style

Ma, X.; Liu, H. Predicting the Event Types in the Human Brain: A Modeling Study Based on Embedding Vectors and Large-Scale Situation Type Datasets in Mandarin Chinese. Appl. Sci. 2025, 15, 5916. https://doi.org/10.3390/app15115916

AMA Style

Ma X, Liu H. Predicting the Event Types in the Human Brain: A Modeling Study Based on Embedding Vectors and Large-Scale Situation Type Datasets in Mandarin Chinese. Applied Sciences. 2025; 15(11):5916. https://doi.org/10.3390/app15115916

Chicago/Turabian Style

Ma, Xiaorui, and Hongchao Liu. 2025. "Predicting the Event Types in the Human Brain: A Modeling Study Based on Embedding Vectors and Large-Scale Situation Type Datasets in Mandarin Chinese" Applied Sciences 15, no. 11: 5916. https://doi.org/10.3390/app15115916

APA Style

Ma, X., & Liu, H. (2025). Predicting the Event Types in the Human Brain: A Modeling Study Based on Embedding Vectors and Large-Scale Situation Type Datasets in Mandarin Chinese. Applied Sciences, 15(11), 5916. https://doi.org/10.3390/app15115916

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop