Next Article in Journal
Understanding Humans: The Extensions of Digital Media
Next Article in Special Issue
MiNgMatch—A Fast N-gram Model for Word Segmentation of the Ainu Language
Previous Article in Journal
Multivariate Maps—A Glyph-Placement Algorithm to Support Multivariate Geospatial Visualization
Previous Article in Special Issue
Subunits Inference and Lexicon Development Based on Pairwise Comparison of Utterances and Signs
Open AccessArticle

A Sustainable and Open Access Knowledge Organization Model to Preserve Cultural Heritage and Language Diversity

1
Groupe d’Études et de Recherche Interdisciplinaire en Information et Communication (GERiiCO), Université de Lille, 59000 Lille, France
2
Laboratoire d’Informatique pour la Mécanique et les Sciences de l’Ingénieur-Centre National de la Recherche Scientifique (LIMSI-CNRS), Université Paris-Saclay, 91400 Orsay, France
3
Centre d’Etudes en Civilisations, Langues et Littératures Etrangères (CECILLE), Université de Lille, 59000 Lille, France
4
Department of English, Stanford University, 94305 California, CA, USA
*
Author to whom correspondence should be addressed.
Information 2019, 10(10), 303; https://doi.org/10.3390/info10100303
Received: 31 July 2019 / Revised: 16 September 2019 / Accepted: 24 September 2019 / Published: 28 September 2019
(This article belongs to the Special Issue Computational Linguistics for Low-Resource Languages)
This paper proposes a new collaborative and inclusive model for Knowledge Organization Systems (KOS) for sustaining cultural heritage and language diversity. It is based on contributions of end-users as well as scientific and scholarly communities from across borders, languages, nations, continents, and disciplines. It consists in collecting knowledge about all worldwide translations of one original work and sharing that data through a digital and interactive global knowledge map. Collected translations are processed in order to build multilingual parallel corpora for a large number of under-resourced languages as well as to highlight the transnational circulation of knowledge. Building such corpora is vital in preserving and expanding linguistic and traditional diversity. Our first experiment was conducted on the world-famous and well-traveled American novel Adventures of Huckleberry Finn by the American author Mark Twain. This paper reports on 10 parallel corpora that are now sentence-aligned pairs of English with Basque (an European under-resourced language), Bulgarian, Dutch, Finnish, German, Hungarian, Polish, Portuguese, Russian, and Ukrainian, processed out of 30 collected translations. View Full-Text
Keywords: library and information science; knowledge organization systems; information flow; knowledge diversity; under-resourced languages; parallel corpora; translated texts library and information science; knowledge organization systems; information flow; knowledge diversity; under-resourced languages; parallel corpora; translated texts
Show Figures

Figure 1

MDPI and ACS Style

Fraisse, A.; Zhang, Z.; Zhai, A.; Jenn, R.; Fisher Fishkin, S.; Zweigenbaum, P.; Favier, L.; Mustafa El Hadi, W. A Sustainable and Open Access Knowledge Organization Model to Preserve Cultural Heritage and Language Diversity. Information 2019, 10, 303.

Show more citation formats Show less citations formats
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Back to TopTop