Next Article in Journal
Encrypting and Preserving Sensitive Attributes in Customer Churn Data Using Novel Dragonfly Based Pseudonymizer Approach
Next Article in Special Issue
Subunits Inference and Lexicon Development Based on Pairwise Comparison of Utterances and Signs
Previous Article in Journal
Correlations and How to Interpret Them
Previous Article in Special Issue
The Usefulness of Imperfect Speech Data for ASR Development in Low-Resource Languages

Terminology Translation in Low-Resource Scenarios

School of Computing, Dublin City University, Dublin 9 Glasnevin, Ireland
Department of Computer Science, Cork Institute of Technology, T12 P928 Cork, Ireland
Author to whom correspondence should be addressed.
Information 2019, 10(9), 273;
Received: 30 June 2019 / Revised: 17 August 2019 / Accepted: 22 August 2019 / Published: 30 August 2019
(This article belongs to the Special Issue Computational Linguistics for Low-Resource Languages)
Term translation quality in machine translation (MT), which is usually measured by domain experts, is a time-consuming and expensive task. In fact, this is unimaginable in an industrial setting where customised MT systems often need to be updated for many reasons (e.g., availability of new training data, leading MT techniques). To the best of our knowledge, as of yet, there is no publicly-available solution to evaluate terminology translation in MT automatically. Hence, there is a genuine need to have a faster and less-expensive solution to this problem, which could help end-users to identify term translation problems in MT instantly. This study presents a faster and less expensive strategy for evaluating terminology translation in MT. High correlations of our evaluation results with human judgements demonstrate the effectiveness of the proposed solution. The paper also introduces a classification framework, TermCat, that can automatically classify term translation-related errors and expose specific problems in relation to terminology translation in MT. We carried out our experiments with a low resource language pair, English–Hindi, and found that our classifier, whose accuracy varies across the translation directions, error classes, the morphological nature of the languages, and MT models, generally performs competently in the terminology translation classification task. View Full-Text
Keywords: machine translation; terminology translation; phrase-based statistical machine translation; neural machine translation; terminology translation evaluation machine translation; terminology translation; phrase-based statistical machine translation; neural machine translation; terminology translation evaluation
Show Figures

Figure 1

MDPI and ACS Style

Haque, R.; Hasanuzzaman, M.; Way, A. Terminology Translation in Low-Resource Scenarios. Information 2019, 10, 273.

AMA Style

Haque R, Hasanuzzaman M, Way A. Terminology Translation in Low-Resource Scenarios. Information. 2019; 10(9):273.

Chicago/Turabian Style

Haque, Rejwanul, Mohammed Hasanuzzaman, and Andy Way. 2019. "Terminology Translation in Low-Resource Scenarios" Information 10, no. 9: 273.

Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

Back to TopTop