Event Extraction and Representation: A Case Study for the Portuguese Language
Abstract: Text information extraction is an important natural language processing (NLP) task that aims to automatically identify, extract, and represent information from text. In this context, event extraction plays a relevant role, allowing actions, agents, objects, places, and time periods to be identified and represented. The extracted information can be represented by specialized ontologies, supporting knowledge-based reasoning and inference processes. In this work, we describe in detail our proposal for event extraction from Portuguese documents. The proposed approach is based on a pipeline of specialized natural language processing tools, namely a part-of-speech tagger, a named entity recognizer, a dependency parser, a semantic role labeler, and a knowledge extraction module. The architecture is language-independent, but its modules are language-dependent and can be built using appropriate AI methodologies (i.e., rule-based or machine learning). The developed system was evaluated with a corpus of Portuguese texts, and the obtained results are presented and analysed. The current limitations and future work are discussed in detail.
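To make the pipeline architecture described in the abstract more concrete, the following is a minimal sketch in Python, not the authors' implementation. It chains five stages in the order the abstract lists them, with each stage adding one annotation layer to a shared document. Everything in it is an assumption made for illustration: the names (`Document`, `run_pipeline`, `extract_event`), the tiny lexicons, and the toy hand-written rules that stand in for real taggers and parsers. A plain dictionary stands in for the ontology-based representation.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Document:
    """Shared container; each stage adds one annotation layer (hypothetical design)."""
    text: str
    tokens: List[str] = field(default_factory=list)
    layers: Dict[str, object] = field(default_factory=dict)


Stage = Callable[[Document], Document]

# Toy lexicons invented for this example only; a real module would be
# rule-based or learned from Portuguese data.
POS_LEXICON = {"comprou": "VERB", "um": "DET", "livro": "NOUN",
               "em": "ADP", "ontem": "ADV"}
ENTITY_LEXICON = {"Maria": "PERSON", "Lisboa": "LOCATION"}


def pos_tagger(doc: Document) -> Document:
    # Whitespace tokenization plus lexicon lookup; unknown capitalized
    # tokens are guessed to be proper nouns.
    doc.tokens = doc.text.rstrip(".").split()
    doc.layers["pos"] = [
        POS_LEXICON.get(t.lower(), "PROPN" if t[0].isupper() else "X")
        for t in doc.tokens
    ]
    return doc


def entity_recognizer(doc: Document) -> Document:
    # Map token index -> entity type by exact lexicon match.
    doc.layers["entities"] = {
        i: ENTITY_LEXICON[t] for i, t in enumerate(doc.tokens)
        if t in ENTITY_LEXICON
    }
    return doc


def dependency_parser(doc: Document) -> Document:
    # Toy rules: the first verb is the root (raises ValueError if there is
    # none); a nominal before it is the subject; a nominal after it is an
    # object unless a preposition intervenes, in which case it is oblique.
    pos = doc.layers["pos"]
    root = pos.index("VERB")
    deps = {"root": root}
    for i, tag in enumerate(pos):
        if tag in ("NOUN", "PROPN"):
            if i < root:
                deps.setdefault("nsubj", i)
            elif "ADP" in pos[root + 1:i]:
                deps.setdefault("obl", i)
            else:
                deps.setdefault("obj", i)
        elif tag == "ADV":
            deps.setdefault("advmod", i)
    doc.layers["deps"] = deps
    return doc


def semantic_role_labeler(doc: Document) -> Document:
    # Turn syntactic relations into event roles; the entity layer is used
    # to accept an oblique argument as a place only if it is a LOCATION.
    deps, entities = doc.layers["deps"], doc.layers["entities"]
    roles = {}
    for rel, i in deps.items():
        if rel == "nsubj":
            roles["agent"] = doc.tokens[i]
        elif rel == "obj":
            roles["object"] = doc.tokens[i]
        elif rel == "obl" and entities.get(i) == "LOCATION":
            roles["place"] = doc.tokens[i]
        elif rel == "advmod":
            roles["time"] = doc.tokens[i]
    doc.layers["roles"] = roles
    return doc


def knowledge_extractor(doc: Document) -> Document:
    # Assemble the final event record from the action (root verb) and roles.
    action = doc.tokens[doc.layers["deps"]["root"]]
    doc.layers["event"] = {"action": action, **doc.layers["roles"]}
    return doc


def run_pipeline(doc: Document, stages: List[Stage]) -> Document:
    # The pipeline itself is language-independent; only the stages are not.
    for stage in stages:
        doc = stage(doc)
    return doc


def extract_event(text: str) -> Dict[str, str]:
    stages = [pos_tagger, entity_recognizer, dependency_parser,
              semantic_role_labeler, knowledge_extractor]
    return run_pipeline(Document(text), stages).layers["event"]
```

Calling `extract_event("Maria comprou um livro em Lisboa ontem.")` returns a dictionary with the action, agent, object, place, and time of the sentence. Because `run_pipeline` only depends on the `Stage` signature, any stage can be swapped for a different rule-based or machine-learning module without touching the rest, which is the sense in which the architecture is language-independent while its modules are not.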
Quaresma, P.; Nogueira, V.B.; Raiyani, K.; Bayot, R. Event Extraction and Representation: A Case Study for the Portuguese Language. Information 2019, 10, 205.
Quaresma P, Nogueira VB, Raiyani K, Bayot R. Event Extraction and Representation: A Case Study for the Portuguese Language. Information. 2019; 10(6):205.
Quaresma, Paulo; Nogueira, Vítor B.; Raiyani, Kashyap; Bayot, Roy. 2019. "Event Extraction and Representation: A Case Study for the Portuguese Language." Information 10, no. 6: 205.