This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Research on Named Entity Recognition of Ancient Chinese Text by Fusing Explicit Features and Implicit Features

School of Information Science, Beijing Language and Culture University, Beijing 100083, China
* Author to whom correspondence should be addressed.
Appl. Sci. 2026, 16(11), 5398; https://doi.org/10.3390/app16115398
Submission received: 23 April 2026 / Revised: 18 May 2026 / Accepted: 18 May 2026 / Published: 28 May 2026

Abstract

Named entity recognition (NER) of ancient Chinese texts is the foundation for their development and utilization. Previous studies have focused on data-driven methods that exploit the semantic features of ancient Chinese text. As ancient Chinese linguistic resources and textual data continue to accumulate, a major open challenge is how to make full use of these data resources and the associated lexical knowledge, with the help of new-generation information technology, to improve semantic comprehension and NER performance. To address this, this paper proposes a named entity recognition model for ancient Chinese text that fuses explicit and implicit features (NERM). The model extracts both kinds of features using a pre-trained model and a multi-head attention mechanism. The GuwenBERT model extracts the semantic features of ancient Chinese texts, which serve as the explicit features. The implicit features comprise relative positional relations, part-of-speech, and character radicals. Experimental results on the GuNER 2023 corpus show that NERM achieves an F1 value of 90.67%, outperforming existing models. Ablation results show that implicit features provide a modest but meaningful improvement over explicit features alone, and that the implicit features rank in order of importance as follows: character radicals, part-of-speech, and relative positional relations.
Keywords: ancient Chinese text; named entity recognition; explicit feature; implicit feature; GuwenBERT; multi-head attention mechanism
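
The abstract reports an F1 value but does not state how it is computed. As a rough illustration only, the sketch below shows one common convention for NER, entity-level micro-averaged F1 over BIO-tagged sequences, where a predicted entity counts as correct only if its span and type both match the gold annotation. The BIO scheme, the function names `extract_spans` and `entity_f1`, and the labels `PER` and `LOC` are assumptions made for this example and are not taken from the paper.

```python
from typing import List, Set, Tuple

# (start index, end index exclusive, entity type); a hypothetical helper type.
Span = Tuple[int, int, str]


def extract_spans(tags: List[str]) -> Set[Span]:
    """Collect entity spans from one BIO tag sequence.

    Assumption: an I- tag that does not continue an open entity of the
    same type is ignored rather than treated as the start of an entity.
    """
    spans: Set[Span] = set()
    start, label = None, None
    # A trailing "O" sentinel closes any entity still open at the end.
    for i, tag in enumerate(tags + ["O"]):
        continues = tag.startswith("I-") and label == tag[2:]
        if continues:
            continue
        if label is not None:
            spans.add((start, i, label))
        if tag.startswith("B-"):
            start, label = i, tag[2:]
        else:
            start, label = None, None
    return spans


def entity_f1(gold: List[List[str]], pred: List[List[str]]) -> Tuple[float, float, float]:
    """Return micro-averaged (precision, recall, F1) over exact span matches."""
    tp = n_gold = n_pred = 0
    for gold_tags, pred_tags in zip(gold, pred):
        gold_spans = extract_spans(gold_tags)
        pred_spans = extract_spans(pred_tags)
        tp += len(gold_spans & pred_spans)
        n_gold += len(gold_spans)
        n_pred += len(pred_spans)
    precision = tp / n_pred if n_pred else 0.0
    recall = tp / n_gold if n_gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Toy data with illustrative labels: the prediction misses the second entity.
gold = [["B-PER", "I-PER", "O", "B-LOC"]]
pred = [["B-PER", "I-PER", "O", "O"]]
precision, recall, f1 = entity_f1(gold, pred)
print(f"P={precision:.4f} R={recall:.4f} F1={f1:.4f}")
```

In this toy case one of the two gold entities is recovered and the single prediction is correct, so precision is 1.0, recall is 0.5, and F1 is 2/3. Other conventions (token-level or macro-averaged scoring) would give different numbers for the same predictions.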

Share and Cite

MDPI and ACS Style

Liu, Z.; Zhao, W. Research on Named Entity Recognition of Ancient Chinese Text by Fusing Explicit Features and Implicit Features. Appl. Sci. 2026, 16, 5398. https://doi.org/10.3390/app16115398

AMA Style

Liu Z, Zhao W. Research on Named Entity Recognition of Ancient Chinese Text by Fusing Explicit Features and Implicit Features. Applied Sciences. 2026; 16(11):5398. https://doi.org/10.3390/app16115398

Chicago/Turabian Style

Liu, Zhongbao, and Wenjuan Zhao. 2026. "Research on Named Entity Recognition of Ancient Chinese Text by Fusing Explicit Features and Implicit Features" Applied Sciences 16, no. 11: 5398. https://doi.org/10.3390/app16115398

APA Style

Liu, Z., & Zhao, W. (2026). Research on Named Entity Recognition of Ancient Chinese Text by Fusing Explicit Features and Implicit Features. Applied Sciences, 16(11), 5398. https://doi.org/10.3390/app16115398
