Ontological Semantic Annotation of an English Corpus Through Condition Random Fields
Abstract
One way to improve machine understanding of texts is to add semantic information to lexical items through metadata tags, a process known as semantic annotation. Several semantic aspects can be attached to words, among them the nature of the concept a word denotes, expressed by associating it with a category of an ontology. Ontologies applied to the annotation task can span multiple domains; this research, however, focuses on top-level ontologies because of their generalizing character. Because annotation is an arduous task that demands time and specialized personnel, much work has gone into performing semantic annotation automatically. Machine learning techniques are the most effective approaches to the annotation process. Another factor of great importance for successfully training supervised learning algorithms is a corpus that is sufficiently large and able to capture the linguistic variance of natural language. In this sense, this article presents an automatic approach to enriching documents from the American English corpus using a conditional random field (CRF) model for semantic annotation with top-level ontology classes from Schema.org. The research uses two approaches to the model and obtains promising results for the development of semantic annotation based on top-level ontologies. Although this is a new line of research, the use of top-level ontologies for the automatic semantic enrichment of texts can contribute significantly to improving the interpretation of text by machines.
Cite This Article
MDPI and ACS Style
de Andrade, G.C.; de Paiva Oliveira, A.; Moreira, A. Ontological Semantic Annotation of an English Corpus Through Condition Random Fields. Information 2019, 10, 171.
AMA Style
de Andrade GC, de Paiva Oliveira A, Moreira A. Ontological Semantic Annotation of an English Corpus Through Condition Random Fields. Information. 2019; 10(5):171.
Chicago/Turabian Style
de Andrade, Guidson C.; de Paiva Oliveira, Alcione; Moreira, Alexandra. 2019. "Ontological Semantic Annotation of an English Corpus Through Condition Random Fields." Information 10, no. 5: 171.
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers.