
Study on Unknown Term Translation Mining from Google Snippets

by B. Li 1,* and J. Yao 2,*
1 School of Information Engineering, Anhui Open University, Hefei 230041, China
2 Provincial Key Laboratory of Computer Information Processing Technology, Soochow University, Suzhou 215006, China
* Authors to whom correspondence should be addressed.
Information 2019, 10(9), 267
Received: 16 May 2019 / Revised: 5 August 2019 / Accepted: 12 August 2019 / Published: 28 August 2019
Bilingual web pages are widely used to mine translations of unknown terms. This study addressed three parts of that task: obtaining relevant web pages, extracting translations with correct lexical boundaries, and ranking the translation candidates. This research used co-occurrence information to identify subject terms and then expanded the source query with the translations of those subject terms to collect useful bilingual search engine snippets. Valid candidates were then extracted from the resulting small, noisy bilingual corpora using an improved frequency change measurement that incorporates adjacent information. Finally, this research developed a ranking method that considers surface patterns, frequency–distance, and phonetic features to select an appropriate translation. The experimental results showed that the proposed method performed well for mining translations of unknown terms.
Keywords: unknown term; translation mining; web mining; Google snippets

MDPI and ACS Style

Li, B.; Yao, J. Study on Unknown Term Translation Mining from Google Snippets. Information 2019, 10, 267.
