Next Article in Journal
The Impact of Space-Based AIS Antenna Orientation on In-Orbit AIS Detection Performance
Previous Article in Journal
Estimating Non-Sustainable Soil Erosion Rates in the Tierra de Barros Vineyards (Extremadura, Spain) Using an ISUM Update
Open AccessArticle

Constructing Uyghur Commonsense Knowledge Base by Knowledge Projection

1,2,3, 1,2,3,*, 1,2,3 and 1,2,3,4
1
Xinjiang Technical Institute of Physics & Chemistry, Chinese Academy of Sciences, Urumqi 830011, China
2
Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China
3
University of Chinese Academy of Sciences, Beijing 100049, China
4
Department of Information Security Engineering, Xinjiang Police College, Urumqi 830011, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2019, 9(16), 3318; https://doi.org/10.3390/app9163318
Received: 24 July 2019 / Revised: 8 August 2019 / Accepted: 9 August 2019 / Published: 13 August 2019
  |  
PDF [2177 KB, uploaded 13 August 2019]
  |  

Abstract

Although considerable effort has been devoted to building commonsense knowledge bases (CKB), it is still not available for many low-resource languages such as Uyghur because of expensive construction cost. Focusing on this issue, we proposed a cross-lingual knowledge-projection method to construct an Uyghur CKB by projecting ConceptNet’s Chinese facts into Uyghur. We used a Chinese–Uyghur bilingual dictionary to get high-quality entity translation in facts and employed a back-translation method to eliminate the entity-translation ambiguity. Moreover, to tackle the inner relation ambiguity in translated facts, we made a hand-crafted rule to convert the structured facts into natural-language phrases and built the Chinese–Uyghur lingual phrases based on the similarity of phrases that corresponded to the bilingual semantic similarity scoring model. Experimental results show that the accuracy of our semantic similarity scoring model reached 94.75% for our task, and they successfully project 55,872 Chinese facts into Uyghur as well as obtain 67,375 Uyghur facts within a very short period. View Full-Text
Keywords: commonsense knowledge bases; Uyghur; knowledge projection; low-resource languages commonsense knowledge bases; Uyghur; knowledge projection; low-resource languages
Figures

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).
SciFeed

Share & Cite This Article

MDPI and ACS Style

Anwar, A.; Li, X.; Yang, Y.; Wang, Y. Constructing Uyghur Commonsense Knowledge Base by Knowledge Projection. Appl. Sci. 2019, 9, 3318.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Appl. Sci. EISSN 2076-3417 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top