Open Access, Feature Paper, Article
Information 2018, 9(11), 290; https://doi.org/10.3390/info9110290

Annotating a Low-Resource Language with LLOD Technology: Sumerian Morphology and Syntax

1
Department of Informatik und Mathematik, Goethe University Frankfurt, D-60325 Frankfurt, Germany
2
Department of Near Eastern Languages and Cultures, University of California Los Angeles, Los Angeles, CA 90095, USA
3
Department of Near and Middle Eastern Civilizations, University of Toronto, Toronto, ON M5S 1C1, Canada
This paper is an extended version of our paper published as Christian Chiarcos, Ilya Khait, Émilie Pagé-Perron, Niko Schenk, Jayanth, and Lucas Reckling. “Annotating Sumerian: A LLOD-enhanced Workflow for Cuneiform Corpora.” In Proceedings of the 6th Workshop on Linked Data in Linguistics (LDL-2018): Towards Linguistic Data Science, Miyazaki, Japan, May 2018.
*
Author to whom correspondence should be addressed.
Received: 17 September 2018 / Revised: 24 October 2018 / Accepted: 25 October 2018 / Published: 19 November 2018
(This article belongs to the Special Issue Towards the Multilingual Web of Data)

Abstract

This paper describes work on the morphological and syntactic annotation of Sumerian cuneiform as a model for low-resource languages in general. Cuneiform texts are invaluable sources for the study of the history, languages, economy, and cultures of Ancient Mesopotamia and its surrounding regions. Assyriology, the discipline dedicated to their study, has vast research potential but lacks modern means for computational processing and analysis. Our project, Machine Translation and Automated Analysis of Cuneiform Languages, aims to fill this gap by bringing together corpus data, lexical data, linguistic annotations, and object metadata. The project’s main goal is to build a pipeline for machine translation and annotation of Sumerian Ur III administrative texts. The rich, structured data are then to be made accessible as (Linguistic) Linked Open Data (LLOD), which should open them to a larger research community. Our contribution is twofold: in terms of language technology, our work represents the first attempt to develop an integrative infrastructure for the annotation of morphology and syntax on the basis of RDF technologies and LLOD resources. With respect to Assyriology, we work towards producing the first syntactically annotated corpus of Sumerian.
Keywords: linked open data; linguistic linked open data; morphology; syntax; parsing; RDF; SPARQL; low-resource languages; Sumerian; Cuneiform

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).
MDPI and ACS Style

Chiarcos, C.; Khait, I.; Pagé-Perron, É.; Schenk, N.; Jayanth; Fäth, C.; Steuer, J.; Mcgrath, W.; Wang, J. Annotating a Low-Resource Language with LLOD Technology: Sumerian Morphology and Syntax. Information 2018, 9, 290.
