
Measuring Language Distance of Isolated European Languages

Centro Singular de Investigación en Tecnoloxías Intelixentes (CiTIUS), Universidade de Santiago de Compostela, 15782 Galiza, Spain
Imaxin|software, Santiago de Compostela, 15702 Galiza, Spain
IXA NLP Group, University of Basque Country, 48940 Bilbao, Spain
Information 2020, 11(4), 181
Received: 24 February 2020 / Revised: 23 March 2020 / Accepted: 25 March 2020 / Published: 27 March 2020
(This article belongs to the Special Issue Digital Humanities)
Phylogenetics is a sub-field of historical linguistics that aims to classify a group of languages by considering their distances within a rooted tree representing their historical evolution. A few European languages do not belong to the Indo-European family or are otherwise isolated in the European rooted tree. Although phylogenetic links cannot be established for them using basic strategies, the distances between these isolated languages and the rest can be calculated using simple corpus-based techniques and natural language processing methods. The objective of this article is to select some isolated languages and measure the distances among them and between them and the other European languages, so as to shed light on the linguistic distances and proximities of these controversial languages without considering phylogenetic issues. The experiments were carried out with 40 European languages, including six languages that are isolated in their corresponding families: Albanian, Armenian, Basque, Georgian, Greek, and Hungarian.
Keywords: language distance; phylogenetics; perplexity; clustering; Kullback-Leibler divergence
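
The keywords name perplexity as one of the corpus-based measures. As a rough illustration only, the sketch below shows one way a perplexity-based distance between two text samples could be computed with a character n-gram model. The function names (`char_ngrams`, `train`, `perplexity`, `distance`), the trigram order, the add-one smoothing, and the symmetric averaging are all assumptions made for this example; the abstract does not specify the configuration actually used in the article.

```python
import math
from collections import Counter


def char_ngrams(text, n):
    """Return the character n-grams of text, left-padded with spaces."""
    padded = " " * (n - 1) + text
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def train(text, n=3):
    """Count n-grams and their (n-1)-character contexts in a training text."""
    grams = char_ngrams(text, n)
    ngram_counts = Counter(grams)
    context_counts = Counter(g[:-1] for g in grams)
    return ngram_counts, context_counts, set(text)


def perplexity(model, text, n=3, alpha=1.0):
    """Perplexity of a non-empty text under the model (add-alpha smoothing, assumed)."""
    ngram_counts, context_counts, vocab = model
    # Alphabet size: characters seen in training plus those in the test text.
    v = len(vocab | set(text))
    grams = char_ngrams(text, n)
    log_prob = 0.0
    for g in grams:
        p = (ngram_counts[g] + alpha) / (context_counts[g[:-1]] + alpha * v)
        log_prob += math.log(p)
    return math.exp(-log_prob / len(grams))


def distance(text_a, text_b, n=3):
    """Symmetric distance: mean of the two cross-perplexities (an assumption)."""
    ppl_ab = perplexity(train(text_a, n), text_b, n)
    ppl_ba = perplexity(train(text_b, n), text_a, n)
    return 0.5 * (ppl_ab + ppl_ba)
```

With real corpora, `distance` would be called on comparable text samples for each language pair, and the resulting matrix could then be passed to a clustering step. Lower values indicate that each sample is more predictable from the other.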
MDPI and ACS Style

Gamallo, P.; Pichel, J.R.; Alegria, I. Measuring Language Distance of Isolated European Languages. Information 2020, 11, 181.

AMA Style

Gamallo P, Pichel JR, Alegria I. Measuring Language Distance of Isolated European Languages. Information. 2020; 11(4):181.

Chicago/Turabian Style

Gamallo, Pablo, José R. Pichel, and Iñaki Alegria. 2020. "Measuring Language Distance of Isolated European Languages" Information 11, no. 4: 181.
