Transfer Learning for Named Entity Recognition in Financial and Biomedical Documents
AbstractRecent deep learning approaches have shown promising results for named entity recognition (NER). A reasonable assumption for training robust deep learning models is that a sufficient amount of high-quality annotated training data is available. However, in many real-world scenarios, labeled training data is scarcely present. In this paper we consider two use cases: generic entity extraction from financial and from biomedical documents. First, we have developed a character based model for NER in financial documents and a word and character based model with attention for NER in biomedical documents. Further, we have analyzed how transfer learning addresses the problem of limited training data in a target domain. We demonstrate through experiments that NER models trained on labeled data from a source domain can be used as base models and then be fine-tuned with few labeled data for recognition of different named entity classes in a target domain. We also witness an interest in language models to improve NER as a way of coping with limited labeled data. The current most successful language model is BERT. Because of its success in state-of-the-art models we integrate representations based on BERT in our biomedical NER model along with word and character information. The results are compared with a state-of-the-art model applied on a benchmarking biomedical corpus. View Full-Text
Share & Cite This Article
Francis, S.; Van Landeghem, J.; Moens, M.-F. Transfer Learning for Named Entity Recognition in Financial and Biomedical Documents. Information 2019, 10, 248.
Francis S, Van Landeghem J, Moens M-F. Transfer Learning for Named Entity Recognition in Financial and Biomedical Documents. Information. 2019; 10(8):248.Chicago/Turabian Style
Francis, Sumam; Van Landeghem, Jordy; Moens, Marie-Francine. 2019. "Transfer Learning for Named Entity Recognition in Financial and Biomedical Documents." Information 10, no. 8: 248.
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.