Open Access Article

Viability of Neural Networks for Core Technologies for Resource-Scarce Languages

Centre for Text Technology, North-West University, Potchefstroom 2520, South Africa
* Author to whom correspondence should be addressed.
Information 2020, 11(1), 41; https://doi.org/10.3390/info11010041
Received: 15 November 2019 / Revised: 22 December 2019 / Accepted: 25 December 2019 / Published: 12 January 2020
(This article belongs to the Special Issue Computational Linguistics for Low-Resource Languages)
In this paper, we evaluate the viability of neural network implementations of core text technologies for ten resource-scarce South African languages. Neural networks are increasingly replacing other machine learning methods in many natural language processing tasks, with good results. In the South African context, however, where most languages are resource-scarce, very little research has been done on neural network implementations of core language technologies. We address this gap by evaluating neural network implementations of four core technologies: part-of-speech (POS) tagging, named entity recognition (NER), compound analysis, and lemmatization. For each task, we implemented neural architectures that performed well on similar tasks in other settings and compared their performance with the machine learning implementations currently used for each technology. The neural network models outperform the baselines for compound analysis, are viable and comparable to the baseline on most languages for POS tagging and NER, and are viable, but not on par with the baseline, for Afrikaans lemmatization.
Keywords: resource-scarce languages; core technologies; South African languages; neural networks; machine learning
MDPI and ACS Style

Loubser, M.; Puttkammer, M.J. Viability of Neural Networks for Core Technologies for Resource-Scarce Languages. Information 2020, 11, 41.

