Next Article in Journal
Effectiveness of Interest Rate Policy of the Fed in Management of Subprime Mortgage Crisis
Next Article in Special Issue
Ensemble Learning or Deep Learning? Application to Default Risk Analysis
Previous Article in Journal
Does the Assumption on Innovation Process Play an Important Role for Filtered Historical Simulation Model?
Article Menu

Export Article

Open AccessFeature PaperArticle
J. Risk Financial Manag. 2018, 11(1), 8; doi:10.3390/jrfm11010008

Estimation of Cross-Lingual News Similarities Using Text-Mining Methods

Izumi lab, Department of System Innovation, Graduate School of Engineering, The University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033, Japan
Yahoo! Japan Research, Kioicho 1-3, Chiyoda-ku, Tokyo 102-8282, Japan
Authors to whom correspondence should be addressed.
Received: 31 December 2017 / Revised: 23 January 2018 / Accepted: 25 January 2018 / Published: 31 January 2018
(This article belongs to the Special Issue Empirical Finance)
View Full-Text   |   Download PDF [1016 KB, uploaded 31 January 2018]   |  


In this research, two estimation algorithms for extracting cross-lingual news pairs based on machine learning from financial news articles have been proposed. Every second, innumerable text data, including all kinds news, reports, messages, reviews, comments, and tweets are generated on the Internet, and these are written not only in English but also in other languages such as Chinese, Japanese, French, etc. By taking advantage of multi-lingual text resources provided by Thomson Reuters News, we developed two estimation algorithms for extracting cross-lingual news pairs from multilingual text resources. In our first method, we propose a novel structure that uses the word information and the machine learning method effectively in this task. Simultaneously, we developed a bidirectional Long Short-Term Memory (LSTM) based method to calculate cross-lingual semantic text similarity for long text and short text, respectively. Thus, when an important news article is published, users can read similar news articles that are written in their native language using our method. View Full-Text
Keywords: text similarity; text mining; machine learning; SVM; neural network; LSTM text similarity; text mining; machine learning; SVM; neural network; LSTM

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. (CC BY 4.0).

Supplementary material

Scifeed alert for new publications

Never miss any articles matching your research from any publisher
  • Get alerts for new papers matching your research
  • Find out the new papers from selected authors
  • Updated daily for 49'000+ journals and 6000+ publishers
  • Define your Scifeed now

SciFeed Share & Cite This Article

MDPI and ACS Style

Wang, Z.; Liu, E.; Sakaji, H.; Ito, T.; Izumi, K.; Tsubouchi, K.; Yamashita, T. Estimation of Cross-Lingual News Similarities Using Text-Mining Methods. J. Risk Financial Manag. 2018, 11, 8.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics



[Return to top]
J. Risk Financial Manag. EISSN 1911-8074 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top