Open Access Article

Ontological Semantic Annotation of an English Corpus Through Condition Random Fields

Departamento de Informática, Centro de Ciências Exatas e Tecnológicas, Universidade Federal de Viçosa, Viçosa MG 36570-900, Brazil
* Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Information 2019, 10(5), 171; https://doi.org/10.3390/info10050171
Received: 27 February 2019 / Revised: 28 April 2019 / Accepted: 30 April 2019 / Published: 9 May 2019
(This article belongs to the Special Issue Text Mining: Classification, Clustering, and Summarization)

Abstract

One way to improve machine understanding of texts is to add semantic information to lexical items through metadata tags, a process known as semantic annotation. Several semantic aspects can be attached to words, among them the nature of the concept a word denotes, expressed by associating it with a category of an ontology. Ontologies can be applied to the annotation task across multiple domains. This research, however, focuses on top-level ontologies because of their generalizing character. Because annotation is an arduous task that demands time and specialized personnel, much research has gone into ways of performing semantic annotation automatically. Machine learning techniques are the most effective approaches to the annotation process. Another factor of great importance for successfully training supervised learning algorithms is a corpus that is sufficiently large and able to capture the linguistic variation of natural language. In this sense, this article presents an automatic approach to enrich documents from the American English corpus using a conditional random field (CRF) model for semantic annotation with top-level ontology classes from Schema.org. The research uses two approaches to the model, obtaining promising results for the development of semantic annotation based on top-level ontologies. Although it is a new line of research, the use of top-level ontologies for automatic semantic enrichment of texts can contribute significantly to improving text interpretation by machines.
Keywords: information extraction; semantic annotation; ontology; condition random fields
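
The abstract does not specify the model's features, label set, or training setup. As a rough illustration of how a linear-chain CRF can assign ontology categories to tokens, the Python sketch below decodes the best label sequence with the Viterbi algorithm. Everything in it is an assumption made for illustration: the label set (three Schema.org types plus "O" for unlabeled tokens), the feature templates, the hand-set weights that stand in for learned parameters, and the function names `token_features` and `viterbi`. Training is omitted entirely.

```python
# Illustrative sketch only: labels, features, and weights are assumptions,
# not the model, features, or data used in the article.
from typing import Dict, List, Tuple

# A few Schema.org types, plus "O" for tokens outside any type.
LABELS = ["O", "Person", "Place", "Organization"]


def token_features(tokens: List[str], i: int) -> List[str]:
    """Return simple indicator features for the token at position i."""
    word = tokens[i]
    feats = ["bias", f"lower={word.lower()}"]
    if i > 0:
        feats.append(f"prev_lower={tokens[i - 1].lower()}")
        # Capitalization is only informative away from the sentence start.
        if word[:1].isupper():
            feats.append("mid_sentence_capital")
    return feats


# Hand-set weights standing in for parameters that training would learn.
EMISSION: Dict[Tuple[str, str], float] = {
    ("bias", "O"): 1.0,
    ("mid_sentence_capital", "Person"): 1.5,
    ("mid_sentence_capital", "Place"): 1.5,
    ("mid_sentence_capital", "Organization"): 1.5,
    ("prev_lower=in", "Place"): 1.0,
    ("prev_lower=dr.", "Person"): 1.0,
}
TRANSITION: Dict[Tuple[str, str], float] = {
    ("Person", "Person"): 0.5,  # multi-token names tend to continue
}


def emission_score(feats: List[str], label: str) -> float:
    """Sum the weights of all (feature, label) pairs that fire."""
    return sum(EMISSION.get((f, label), 0.0) for f in feats)


def viterbi(tokens: List[str]) -> List[str]:
    """Return the highest-scoring label sequence for the tokens."""
    if not tokens:
        return []
    feats = [token_features(tokens, i) for i in range(len(tokens))]
    # best[i][y]: best score of any labeling of tokens[:i + 1] ending in y.
    best = [{y: emission_score(feats[0], y) for y in LABELS}]
    back: List[Dict[str, str]] = []
    for i in range(1, len(tokens)):
        scores: Dict[str, float] = {}
        pointers: Dict[str, str] = {}
        for y in LABELS:
            prev = max(
                LABELS,
                key=lambda p: best[-1][p] + TRANSITION.get((p, y), 0.0),
            )
            scores[y] = (
                best[-1][prev]
                + TRANSITION.get((prev, y), 0.0)
                + emission_score(feats[i], y)
            )
            pointers[y] = prev
        best.append(scores)
        back.append(pointers)
    # Follow the back-pointers from the best final label.
    path = [max(LABELS, key=lambda y: best[-1][y])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]
```

With these toy weights, `viterbi(["She", "lives", "in", "Boston"])` returns `["O", "O", "O", "Place"]`. In a trained model the weights would be estimated from an annotated corpus rather than set by hand, which is why the size and linguistic coverage of the corpus matter.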
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).
MDPI and ACS Style

de Andrade, G.C.; de Paiva Oliveira, A.; Moreira, A. Ontological Semantic Annotation of an English Corpus Through Condition Random Fields. Information 2019, 10, 171.
