Next Article in Journal
The Optimization of the Interior Permanent Magnetic Motor Case Study
Previous Article in Journal
A Survey of Non-Autoregressive Neural Machine Translation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Effective Event Extraction Method via Enhanced Graph Convolutional Network Indication with Hierarchical Argument Selection Strategy

1
School of Chinese Ethnic Minority Languages and Literatures, Minzu University of China, Beijing 100081, China
2
Key Laboratory of Ethnic Language Intelligent Analysis and Security Governance, Ministry of Education, Minzu University of China, Beijing 100081, China
3
School of Information Engineering, Minzu University of China, Beijing 100081, China
*
Authors to whom correspondence should be addressed.
Electronics 2023, 12(13), 2981; https://doi.org/10.3390/electronics12132981
Submission received: 29 May 2023 / Revised: 24 June 2023 / Accepted: 29 June 2023 / Published: 6 July 2023
(This article belongs to the Topic Recent Advances in Data Mining)

Abstract

:
As one of foundation technologies for massive data processing for AI, event mining is attracting more and more attention, mainly including event detection (event trigger identification and event classification) and argument extraction. At present, EE-GCN is one of the most effective methods for event detection. However, since EE-GCN only focuses on event detection, complete event multi-tuple extraction needs to be improved. Inspired by the EE-GCN event detection method, this paper proposes an effective event extraction method via graph convolutional network indication with a hierarchical argument selection strategy. The method mainly includes the following steps. (1) Based on the ACE2005 argument extraction template, a new argument extraction template is established for the Baidu event extraction dataset. (2) The trigger events and event classification detected by EE-GCN are used as indicators to determine the argument extraction template, and the alternative arguments are extracted via named entity recognition based on the determined template. (3) Making full use of the side information of EE-GCN graph to solve the local and global correlation degree, and based on the local and global correlation degrees, the final argument multi-tuple is determined. (4) Finally, several experiments are conducted on the Baidu event extraction dataset to compare the proposed method with other methods. The experimental results show that the proposed method has improved the accuracy and completeness of the event extraction compared to other existing methods.

1. Introduction

Due to the rapid growth of information on the Internet, event extraction has become a crucial task in data processing. This task involves automatically extracting structured information from massive and diverse unstructured texts. It is the foundation for mining the value of big data by using AI technology, such as automatic summarization, intelligent writing, machine translation, etc. Furthermore, it has been successfully applied in many important fields such as network public opinion monitoring, emergency alarms, intelligence collection, business and economy, automatic response, human–computer interaction, and biomedicine [1,2,3,4,5,6,7,8].
Generally, the study of event extraction can be divided into two sub-tasks. The first sub-task is event detection (event trigger identification and event classification). The second sub-task is to extract the elements involved in the event, such as person, time, place and so on, which are called arguments. In recent years, due to the great success of deep learning (especially deep convolutional neural networks and graph convolutional neural networks) and its powerful representation and learning ability in many fields [9,10], most emerging research focuses on a deep learning-based event extraction approach. According to the order of completion of the two sub-tasks, event extraction methods based on deep learning can be divided into pipeline methods and joint learning methods. Theoretically, the joint learning method is superior to the pipeline method because it considers the interaction between trigger words and arguments. However, trigger recognition and argument recognition are coupled and restricted to each other, so the learning algorithm and training mode are complex, which makes it difficult to implement. Because of this, people tend to use the pipeline method to study event extraction. The pipeline method focuses on trigger identification and takes the syntactic dependency between arguments as a supplement. Event detection is performed first, and then argument extraction is performed. Based on this idea, Cui et al. [11] proposed a new event detection model, Edge-Enhanced Graph Convolution Networks for event detection with Syntactic Relation (EE-GCN). The adjacency matrix describing the presence or presence of syntactic dependencies is transformed into a tensor describing the presence or presence of syntactic dependencies and the category of syntactic dependencies. The recognition of event trigger words and event categories is realized by using a graph convolutional neural network and softmax, which is verified on the ACE2005 dataset. Experiments show that this method is one of the most effective methods for event detection.
However, EE-GCN only completes the first sub-task of event extraction, namely event detection. For complete event extraction, it is necessary to further extract the event arguments. Inspired by this idea, on the basis of the existing research on EE-GCN, this paper proposes an event tuple extraction method. This method extends beyond existing research on EE-GCN by proposing a novel approach that effectively extracts complete event multi-tuples using a combination of graph convolutional network indication and hierarchical argument selection strategy. The hierarchical argument selection strategy can effectively improve the accuracy and completeness of multi-tuple extraction. The key contributions of this paper are as follows:
(1)
We establish a more complete argument extraction template for the Baidu event extraction dataset based on the ACE2005.
(2)
We propose an effective event extraction method using trigger events and event classifications detected by EE-GCN as indicators. Moreover, we employ a hierarchical argument selection strategy, which helps in determining the final argument multi-tuple more accurately.
(3)
A series of experiments are conducted on the Baidu event extraction dataset to compare the proposed method with other methods, demonstrating that it improves both accuracy and completeness of event extraction compared to existing methods.

2. Related Works

From the development process of event extraction, the methods can be divided into three categories, which are pattern matching-based methods, machine learning-based methods and deep learning methods.
At present, machine learning methods and deep learning methods have gradually become the mainstream technology of event extraction. Compared to the pattern matching method used by early researchers such as Grishman et al. [12], it is easier to implement, more scalable, and portable to various fields. Following the rise of GCN (Kipf and Welling [13]), Orr et al. [14] proposed a directed acyclic graph GRU model that introduces syntactic structure into the sequence structure. Nguyen and Grishman [15] et al. proposed to transform the syntactic dependency tree into a graph, and used GCN to carry out event detection through information propagation on the graph. They used the graph to represent the syntactic dependency relationship of fields in a sentence, which captured deeper semantic features.
Event extraction methods based on deep learning can be divided into pipeline methods and joint learning methods. Ahn [16] clearly divides event extraction into four steps: trigger detection, argument detection, event alignment and event relationship detection, and adopts the pipeline extraction idea to extract the four tasks relatively independently. With the development of information technology, researchers can enhance the effect of event extraction by combining additional information. Ji and Grishman [17] collect event extraction results from a set of related documents and use rule-based methods to infer and constrain, thus improving the performance of event extraction. Differing from the above cross-document research perspective, Liao and Grishman [18] focused on cross-event information within the same document, and believed that the previous research may be limited to the extraction of a single event in the document, ignoring the logical and temporal relationship between all different events in the same document. Liu et al. [19] designed a model based on Probabilistic Soft Logic (PSL), proposed three types of local information and two kinds of global information that play a positive role in event extraction, and combined local information and global information for event extraction. This makes the research on event extraction based on deep learning even further advanced. Due to the phased training, the pipeline method will face the problem of error propagation. A misclassified trigger will affect the results of the argument extraction, and this negative influence is irreversible. Therefore, another group of researchers prefer joint learning models that can slow down the impact of error propagation. Nguyen et al. [20] applied a recurrent neural network (RNN) to the study of event extraction and introduced the global text features proposed by Li et al. [21] to achieve the joint extraction of trigger words and arguments by combining global text features and deep learning. Liu [22] designed a joint learning event extraction framework to extract events through graph convolutional networks. Although the joint learning method has certain advantages in theory, its model design is complex, and more parameters and multi-thread output need to be considered, which causes some difficulties in practical operation.
Considering the limitations of existing methods in effectively addressing tuple extraction challenges, this study draws inspiration from the EE-GCN approach, and proposes a novel method that seeks to enhance both the accuracy and completeness of extraction. This is achieved by utilizing an improved EE-GCN model in conjunction with a hierarchical argument selection strategy, thereby offering a more robust solution for tuple extraction tasks.

3. Methods

Figure 1 shows the proposed event multi-tuple extraction model based on graph convolution indication and event correlation degree. First, a new argument role table based on ACE2005 was constructed for the dataset extracted from Baidu events. Second, the event trigger words and event categories obtained using EE-GCN were used as argument extraction indicators. Third, based on the new argument role table and indicator, named entity recognition was used to obtain all the alternative arguments of the event. Fourth, based on the side information of EE-GCN graph, the local correlation degree between the trigger word and all the alternative arguments is calculated, and the alternative arguments are divided into confirmed argument 1 and a to-be-confirmed argument based on the local correlation degree. Fifth, the global correlation degree between confirmed argument 1 and the non-argument words in the sentence was calculated, and the to-be-confirmed arguments were further divided into confirmed argument 2 and other arguments according to the global correlation degree. Last, confirmed argument 1 and confirmed argument 2 are merged into the final event argument, and then the final event argument, event classification, and event trigger words are combined to form an event tuple.

3.1. Argument Role Table Construction for Baidu Dataset

The Baidu event extraction dataset [19] defines 65 event types and provides a table of event types and corresponding argument roles, but does not give the entity types corresponding to argument roles. The ACE 2005 Role Table [22] is a table used in the ACE2005 task, which is a language-based information extraction task that aims to identify named entities, relations, and events from natural language texts. The table describes the roles involved in events, including the names, meanings, and occurrences of the roles in the corpus for different event types. Referring to the correspondence between argument roles and entity types in ACE 2005, we construct the correspondence between argument roles and entity types as follows. (1) For argument roles in the Baidu event extraction dataset that are similar to the argument roles in ACE 2005, the definitions in ACE2005 are directly adopted. For example, “award winners” and “person to thank” in the Baidu event extraction data-set are equivalent to “PERSON” in ACE 2005. (2) If there is no similar argument role, its corresponding entity type, such as “translation:sale price” and other argument roles in the Baidu event extraction dataset should be defined, and “MONEY” defined as its corresponding named entity type.
Using the 65 event types defined by the Baidu event extraction dataset, and using the above ideas, the corresponding relationship between argument roles and entity types is established based on 121 event argument roles, and a new argument role table is constructed. The correspondence between partial argument roles and entity types is shown in Table 1.
According to Table 1, the set of event types can be defined as E T = e t 1 , e t 2 , , e t 65 , where the set of argument roles of event type e t i (i = 1, 2, …, 65) is a r g u m e n t i = A r i 1 , A r i 2 , , A r i a i , and a i is the number of argument roles of event e t i . Then the current event M I i can be represented as M I i = 0,0 , , e t i , , 0 and the type number of the current event M I i is:
T N i = N u m M I i
where N u m represents the position sequence number of “ e t i ” in M I i .

3.2. Event Multi-Argument Extraction

3.2.1. Generation of Argument Indicators Based on EE-GCN

Suppose the input is a sentence sequence S = w 1 , w 2 , , w M , where M represents the number of words in the sentence, w i i 1 , 2 , , M is the word vector w i R d w of the ith word in the sentence, and d w is the dimension of the word vector. The recognition is performed in the input sentence using STANZA [23] to obtain the named entity sequence N ^ = N 1 , N 2 , , N M , where N i i 1 , 2 , , M represents the named entity type for each word. Make the embedding of each element in sequence N ^ , and transform it into sequence N = n 1 , n 2 , , n M , where n i R d n is the embedding corresponding to N i , and d n is the dimension of the embedding of a named entity type.
Concatenate w i with n i to form a vector embedding x i = [ w i ; n i ] R d w + d n for each word. Finally, the sequence of embedding vectors of the input sentence is X = [ x 1 , x 2 , , x M ] . By inputting X into BiLSTM, the hidden state sequence H 0 = [ h 1 0 , h 2 0 , , h M 0 ] of X = [ x 1 , x 2 , , x M ] is obtained, where H 0 R M × d , d are the dimensions of h i 0 i 1 , 2 , , M and h 1 0 , h 2 0 , . . . , h M 0 corresponds to x 1 , x 2 , , x M in the original sentence sequence one-to-one. H 0 is taken as the initial node of the EE-GCN model. Bidirectional Long Short-Term Memory (BiLSTM) is a type of artificial neural network incorporating information from both past and future time steps in a sequence. The main advantage of BiLSTM is its ability to capture context information from both forward and backward directions, resulting in a more accurate prediction. This makes it particularly useful for tasks that involve analyzing sequences of data. BiLSTM models have proven successful in tasks, including genomic sequence analysis, life prediction, and video analysis [24,25].
Then, we use STANZA [23] for a syntactic dependency analysis to obtain syntactic dependency relations between words, and to produce dependency embedding. The initial adjacency matrix E 0 = { E : , : , : 0 } R M × M × K and K are the dimensions of the dependency embedding. Consider E 0 the initial edge of the EE-GCN model.
The update iteration formula for the nodes and edges of EE-GCN, composed of initial nodes and edges, is shown in Formulas (2)–(4) [11].
H l = r e l u A v e r a g e _ p o o l i n g H 1 l , H 2 l , , H K l
H 1 l = E : , : , k l 1 H l 1 W H
E i , j , : l = W E E i , j , : l 1 h i l h j l , i , j 1 , M
where E : , : , k l 1 is the kth channel component of E l 1 and E i , j , : l R K represents the i , j th element of E l 1 . W E R ( 2 × d + K ) × K and W H R d × d , d = d w + d n are two learnable weight matrices. h i l and h j l are the representations of the i and j nodes of the l t h layer, respectively.
At the last layer, each node h i l is fed into a fully connected network ( F C N i ), followed by a Softmax ( s o f t m a x i . ) used to calculate the probabilities of h i l for all event types, expressed as Equation (5) [14].
p e t j h i l = s o f t m a x j w j h i l + b j , i , j 1 , M
where w j maps the node representation h i l to the feature score for each event type, and b j is a bias term.
So, the type of event and position of t r i g g e r   w o r d are calculated with Equation (6), and it is taken as the indicator of event detection, expressed with I .
I = Arc max i 1 , M , j 1 , q p e t j h i l
Figure 2 illustrates the process of generating argument indicators using EE-GCN.

3.2.2. Extraction of Alternative Arguments

According to the event trigger word and event type determined by Formula (6), suppose e t i , combined with the event indicator determined by Formula (1), the corresponding argument extraction template is found based on the event argument role table of the new Baidu dataset, and the argument role list a r g u m e n t i = A r i 1 , A r i 2 , , A r i a l i corresponding to the event type is obtained. The set of arguments is A r i j = R O i , j 1 , R O i , j 2 , , R O i , j a r j , j 1 , a l i , where R O i , j t = r o l e i , j 1 , r o l e i , j 2 , , r o l e i , j r t , t 1 , a r j , r o l e i , j p , p 1 , r t denotes the pth argument in the jth argument set of the ith type event, and r t is the number of roles. Then, each word in the sentence is BIO-tagged via named entity recognition. Finally, the candidate arguments are selected according to the template determined and the entity type of the word.
After determining the argument roles and arguments using named entity recognition [26], the arguments indicated in the template are extracted from the sentence, and they are regarded as alternative arguments. The formula is described as follows:
E v e n t m u l t i t u p l e = e t i , t r i g g e r   w o r d , R O i
where t r i g g e r   w o r d is the event trigger word, e t i is the event type corresponding to the event trigger word, and R O i , j t represents the argument list of the event.
Obviously, if there is only one argument for each argument role, then it can be directly included in the final argument list. If there is no argument corresponding to an argument role, then the argument corresponding to that role is empty in the final argument list. If there are multiple arguments corresponding to the same argument role in the sentence, these multiple arguments need to be further filtered. Algorithm 1 shows the extraction method of alternative arguments.
Algorithm 1 ERAR based on NER
Input: Event type E, Event Argument Dataset S, the initial empty argument dictionary D, Input text T.
Output: the final argument dictionary D.
1
  roles ← S.get(E)
2
  ner_results ← NER(T)
3
  for e ← roles do
4
  role_name ← e [0]
5
  role_entity_type ← e [1…len(e)]
6
  temp_words ← []
7
  for v ← ner_results.values do
8
  if v in role_entity_type then
9
  add the key corresponding to the value to the temp_words
10
end if
11
end for
12
D.put(role_name,temp_words)
13
end for
14
return D

3.3. Hierarchical Argument Selection Strategy

3.3.1. The Selection of Arguments Based on the Degree of Local Correlation

Based on the output of the EE-GCN model, not only can the event trigger words be determined, but the arguments can also be filtered by analyzing the weight of the graph network edges. The edge tensor information in the output of the graph convolutional layer of the EE-GCN model is denoted as E o u t R M × M × K , where M represents the number of nodes in the graph and K is the dimension of the weight vector between two nodes. The weight of the edge between the ith node and the jth node in the graph is denoted as the dimension vector E o u t ( i , j ) = e i , j 1 , e i , j 2 , , e i , j K . In the convolution process, nodes with a closely related degree will interact more and produce more information. In other words, information about the degree of association between nodes is contained in the edge tensor message E o u t R M × M × K . Therefore, in this section, the association relevance of edges between two nodes is first defined to represent the closeness between nodes, as shown in Formula (8):
r e l e v a n c e i , j = q = 1 K e i . j q 2 , i , j 1 , M
If the degree of association between the alternative argument and the event trigger is closer, the probability that the alternative argument will be identified as the final argument is higher. Therefore, this paper uses the local relevance degree between alternative arguments and event trigger words to filter alternative arguments in the first round.
Denote the trigger word as T r i g g e r i in the sentence sequence S , the arguments in the t-th alternative argument set R O i , j t of the jth argument role A r i j , and the edges corresponding to r o l e i , j p , p 1 , r t as E o u t ( i , p ) = e i , j p , 1 , e i , j p , 2 , , e i , j p , K . Define the mean correlation degree of the edges connected to the event trigger word and the alternative arguments as the local correlation degree threshold t h r e s h o l d l o c a l . Based on Formula (8), the local correlation degree threshold t h r e s h o l d l o c a l can be calculated from Formulas (9) and (10):
r e l e v a n c e i , p = q = 1 K e i . j p , q 2 , i , j 1 , M
t h r e s h o l d l o c a l = 1 r t p = 1 r t r e l e v a n c e i , p
Use Formula (9) to calculate the correlation degree r e l e v a n c e i , p between each argument r o l e i , j p , p 1 , r t and the trigger word in R O i , j t . If r e l e v a n c e i , p t h r e s h o l d l o c a l , r o l e i , j p , p 1 , r t is regarded as the confirmed argument role and stored in the confirmed argument set R O i , j t 1 , that is, R O i , j t 1 = r o l e i , j 1 , r o l e i , j 2 , , r o l e i , j r t 1 and r t 1 are the number of confirmed arguments. Otherwise, r o l e i , j p , p 1 , r t is regarded as an unconfirmed argument role stored in the unconfirmed argument set R O i , j t 1 ^ , that is, R O i , j t 1 ^ = r o l e ^ i , j 1 , r o l e ^ i , j 2 , , r o l e ^ i , j r t 1 ^ , r t 1 ^ is the number of unconfirmed arguments.

3.3.2. Argument Selection Based on the Global Correlation Degree

For R O i , j t 1 ^ = r o l e ^ i , j 1 , r o l e ^ i , j 2 , , r o l e ^ i , j r t 1 ^ , if the closeness between the unconfirmed argument role in R O i , j t 1 ^ and the confirmed argument role in R O i , j t 1 = r o l e i , j 1 , r o l e i , j 2 , , r o l e i , j r t 1 is higher, the probability that the unconfirmed argument will be determined as the final argument is higher. Remember that all the words excluding the alternative argument set in the sentence sequence are S e = S / R O i , j t 1 = w e ^ i , j t , 1 , w e ^ i , j t , 2 , , w e ^ i , j t , M r t 1 (including the unknown argument role), and the dimension vector of the edge weight between any word w e ^ i , j t , u in S e and any argument r o l e i , j v in R O i , j t 1 = r o l e i , j 1 , r o l e i , j 2 , , r o l e i , j r t 1 is E o u t ( w e ^ i , j t , u , r o l e i , j v ) = e i , j u , v 1 , e i , j u , v 2 , , e i , j u , v K . Then, according to Formula (8), the correlation degree between the two can be calculated as follows:
r e l e v a n c e u , v = q = 1 K e i , j u , v q 2 , u 1 , M r t 1 , v 1 , r t 1
define the global correlation threshold as t h r e s h o l d g l o b a l :
t h r e s h o l d g l o b a l = 1 r t 1 M r t 1 v = 1 r t 1 u = 1 M r t 1 r e l e v a n c e u , v
using Formula (11), calculate the degree of correlation degree r e l e v a n c e u , v between each argument r o l e ^ i , j u in R O i , j t 1 ^ = r o l e ^ i , j 1 , r o l e ^ i , j 2 , , r o l e ^ i , j r t 1 ^ and any argument r o l e i , j v in R O i , j t 1 = r o l e i , j 1 , r o l e i , j 2 , , r o l e i , j r t 1 . When r e l e v a n c e u , v t h r e s h o l d g l o b a l , r o l e ^ i , j u , u 1 , r ^ t 1 is regarded as a confirmed argument role and stored into the confirmed argument set R O i , j t 2 , that is, R O i , j t 2 = r o l e i , j 1 , r o l e i , j 2 , , r o l e i , j r t 2 and r t 2 are the number of confirmed arguments. Otherwise, it can be discarded.

3.3.3. Event Multi-Tuple Generation

The confirmation argument set R O i , j t 1 is merged with the confirmation argument set R O i , j t 2 to generate the jth argument role A r j final argument list.
R O i , j t F = R O i , j t 1 R O i , j t 2
Finally, based on Formulas (5)–(7) and (13), the event type, event trigger word, and the final argument list are merged to generate a multi-tuple of events.
E v e n t   m u l t i t u p l e = e t i , t r i g g e r w o r d , R O i , j F
To better illustrate the above process, we provide a brief description of the argument selection procedure using an example of multi-tuple extraction from a single sentence, as shown in Figure 3.

4. Experiment

4.1. Experimental Environment

Several experiments were conducted to compare the effectiveness of the algorithm proposed in this paper with existing methods. The experiments mainly consist of three parts, as follows. (1) The sentence-level event multi-tuple extraction is implemented using our method. (2) The HIT language technology platform LTP is used to extract the sentence-level event multi-tuple. (3) The Baidu ERNIE model is used to achieve sentence-level event tuple extraction. The experimental results of the proposed method are compared with the results of LTP and Baidu to illustrate the effectiveness of the proposed method.
These experiments are composed of a hardware and software environment. The hardware mainly includes 128 GB memory, an Intel(R) Xeon(R) E5-2620 v4 CPU and a NVIDIA GeForce GTX 1080 Ti GPU. The software framework employed for the experiments includes Ubuntu 9.4.0 as the operating system and python3.7 as the programming language. The Baidu event extraction dataset [27] was utilized as the primary dataset for the experiments.

4.2. Extracting the Event Multi-Tuple

4.2.1. Extraction of Alternative Arguments

Four sentences (id 1–4, as shown in Table 2) were randomly selected from the Baidu event extraction dataset as the input sentence text. First, the trigger words and event types were generated using Formulas (1)–(6), and the argument was determined using the trigger words and event types. Then, based on the argument template, Formula (7) is used to determine the alternative arguments, and the output results are shown in Table 2, Table 3, Table 4 and Table 5, where the second column is the content of the input sentence, the third column is the generated indicator, and the fourth column is the alternative argument.
Table 2, Table 3, Table 4 and Table 5 shows that for sentences 1 and 2, each role of the argument corresponds to only one argument, and the event tuple can be obtained by matching by the type of the named entity. For sentences 2–4, in the event tuple extracted after matching by the type of named entity, there is a situation in which one argument role corresponds to more than one argument. For example, the argument role “organization” in sentence 2 corresponds to two arguments: “the county management over-speed station” and “county’s discipline inspection commission”, one of which needs to be further selected.

4.2.2. Selecting of Alternative Arguments

For the problem that one argument role corresponds to more than one argument, the selection method proposed in this paper in Section 3.3 is used. For a specific alternative argument, (a) Formula (10) is used to calculate the local correlation degree of the edges connecting the event trigger word and the alternative argument; then, Formula (9) is used to calculate the correlation degree between each alternative argument and the trigger word, and the two are compared. According to the comparison results, the alternative arguments are classified into confirmed argument 1 and the to-be-confirmed argument. (b) Formula (12) is used to calculate the global correlation degree between the words in the text, except the alternative argument and the confirmation argument 1; secondly, Formula (11) is used to calculate the correlation degree between the argument to be confirmed and the confirmed argument 1. The two arguments were compared, and arguments to be confirmed were classified as confirmed argument 2, or deleted according to the comparison results. (c) The final argument is the union of confirmation argument 1 and confirmation argument 2. The results are shown in Table 6, where the second column is the alternative arguments, the third column is the result based on the selection, and the fourth column is the result of the second round of screening.
Table 6 shows that by calculating the correlation degree of edges between each alternative argument and the trigger word and comparing it with the local correlation degree threshold, the alternative arguments are divided into confirmed argument 1 and to-be-confirmed arguments, and confirmed argument 1 can be directly included in the final event multi-tuple. Then, by calculating the degree of correlation of edges between the to-be-confirmed argument and the confirmed argument 1 and comparing it with the global correlation threshold, the to-be-confirmed argument is divided into confirmed argument 2 or deleted. In sentence 2, the to-be-confirmed argument “county’s discipline inspection commission” is deleted in the second round of selecting, which determines the argument composition in the final event multi-tuple. However, in sentence 3, the to-be-confirmed argument “1.4-T engine” is deleted, while “CS35PLUS” is confirmed. Therefore, in the multi-tuple event in sentence 5, there are two arguments, “Yidong” and “CS35PLUS”, in the argument role of “publish products”, and a similar phenomenon also appears in sentence 4.
Finally, the resulting final arguments are combined with event triggers and event types to form the final event multi-tuple, as shown in Table 7.
Obviously, the event tuples extracted using the proposed method are the same as the event tuples refined after human reading.

4.3. Case Study of the Comparison Experiment

To prove the effectiveness of the method proposed in this paper, the HIT language technology platform (LTP, a Chinese natural language tool processing package) and Baidu ERNIE model were selected as baselines and compared with our model for the same input sentences; the results are shown in Table 8 and Table 9.
In order to prove the effectiveness of the proposed method, we used LTP, the Baidu model, and the proposed method to extract arguments for the same set of sentences, and compared the arguments extracted by the three methods.
As can be seen in Table 8, the arguments extracted by the proposed method have better completeness and a higher accuracy compared with LTP. For example, in sentence 3, the arguments extracted by this paper’s method are “publish products” are “Yidong” and “CS35PLUS”, while the arguments extracted by LTP are “equipped with a 1.4 T engine”. According to the meaning of the sentence, the arguments extracted by this paper’s method are correct. In the sixth sentence, the argument roles extracted by LTP are only “product” and “time”, while the argument roles extracted by the method in this paper are “publish products”, “publish time”, and “publisher”.
As can be seen in Table 9, for the Baidu ERINE model, the proposed method outperforms it in terms of completeness, and has higher precision. For example, in sentence 2, ERINE incorrectly extracted the argument “suspension person: negative” whose argument role is “suspended person: [‘Ma’]”. In sentence 4, the proposed method extracted the argument “Dazhou City” with the argument role as “location”, while ERINE erroneously extracted “the main rivers of Dazhou city”.

4.4. Statistical Analysis of the Comparison Experiment

Traditional overlapping matching evaluation methods frequently produce sub-optimal outcomes due to their reliance on scoring metrics that depend on the ratio between the length of extracted arguments and reference values. As a result, shorter yet more precise arguments often receive lower scores, compromising the overall efficacy of these techniques. For instance, the extracted result “employee representative supervisor Xing Jun” obtains a higher matching score than “Xing Jun” in sentence 1. Upon examining the Baidu dataset and the extracted results, numerous instances resembling Sentence 1 were identified. Employing traditional overlapping matching for evaluation purposes could lead to significant discrepancies, as demonstrated by the aforementioned case study.
To ensure the reliability of evaluation outcomes, this study adopted a combination of sampling and manual assessment methodologies. Five native language evaluators were assigned to randomly select and review 100 data samples from the experimental results. These evaluators examined each sample using three key metrics (coverage (C), precision (P), and recall (R)), thereby establishing a rigorous and systematic evaluation process. Besides, the harmonic mean of C, P, and R—namely, the F1-value—was calculated to provide a comprehensive measure of performance. This approach allowed for a more logical and academically sound analysis of the dataset under investigation.
N t o t a l e x represents the total number of predicted arguments. N r i g h t e x denotes the number of correctly predicted arguments. N r e signifies the total number of annotated arguments. These key metrics can be calculated as follows:
C o v e r a g e = N t o t a l e x N r e
P r e c i s i o n = N r i g h t e x N t o t a l e x
R e c a l l = N r i g h t e x N r e
F 1 = C P R 3 C P + P R + C R
The final results are shown as Table 10 and ranked based on the F1 score, which provides a comprehensive measure of performance by considering all key metrics. This approach allows for a more logical and academically sound analysis of the results. Upon examining data samples from various systems in comparison with our method, it was observed that our method effectively extracts not only the correct arguments, but also those beyond the reference set, with high accuracy rates. Additionally, our approach demonstrates greater granularity and precision compared to other systems. Although the recall (R) value is marginally lower than that of ERINE, our method’s overall performance surpasses that of both ERINE and LTP. With an F1 value 1.6 points higher than ERINE and significantly stronger than LTP, it is evident from Table 10 that our method possesses superior extraction capabilities and can extract more accurate arguments.

5. Concluding Remarks

In this paper, we propose an effective event extraction method via graph convolutional network indication with a hierarchical argument selection strategy. The retrieval and classification of event trigger words based on EE-GCN is used as the indicator of an argument extraction template, and the argument extraction template is selected for the Baidu event extraction dataset. On the basis of this template, the candidate arguments are obtained using named entity recognition. Then, the local and global correlation degree based on EE-GCN solution is used to further filter the alternative arguments to improve the accuracy and completeness of the extracted arguments. The experimental results compared with the LTP and Baidu model verify the effectiveness of the proposed method. The results show that the proposed method is an effective event multi-tuple extraction method, which provides an effective support technology for text mining applications. At present, this method is limited to sentence-level event multi-tuple extraction. The extension of this method to document-level event multi-tuple extraction is an important objective of future research works.

Author Contributions

Z.L. proposed the initial ideas and the overarching research aims. In addition, he proofread and revised the manuscript. Y.L. and Y.Z. carried out the experiment. Y.L. and K.Y. wrote the main manuscript text. C. and Y.W. proposed the research. All authors have read and agreed to the published version of the manuscript.

Funding

We would like to thank Guosheng Yang for providing helpful constructive comments that improved the manuscript substantially, and for providing the financial support for the project leading to this publication. This work was supported in part by the National Key Research and Development Program of China under Grant 2020YFB1406702-3, and in part by the National Natural Science Foundation of China under Grant 62006257.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The dataset used for this work is the DuEE 1.0 dataset published by Li X [27]. It can be found at https://aistudio.baidu.com/aistudio/competition/detail/65/0/introduction (accessed on 28 June 2023).

Conflicts of Interest

The authors declare that they have no conflict of interest.

References

  1. Zhan, L.; Jiang, X. Survey on event extraction technology in information extraction research area. In Proceedings of the IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), Chengdu, China, 15–17 March 2019; pp. 2121–2126. [Google Scholar]
  2. Xiang, W.; Wang, B.; Bang, W. A survey of event extraction from text. IEEE Access 2019, 7, 173111–173137. [Google Scholar] [CrossRef]
  3. Conlon, S.J.; Abrahams, A.S.; Simmons, L.L. Terrorism information extraction from online reports. J. Comput. Inf. Syst. 2015, 55, 20–28. [Google Scholar] [CrossRef]
  4. Tanev, H.; Piskorski, J.; Atkinson, M. Real-time news event extraction for global crisis monitoring. In Proceedings of the 13th International Conference on Applications of Natural Language to Information Systems (NLDB 2008), London, UK, 24–27 June 2008; pp. 207–218. [Google Scholar]
  5. Atkinson, M.; Du, M.; Piskorski, J.; Tanev, H.; Yangarber, R.; Zavarella, V. Techniques for multilingual security-related event extraction from online news. In Computational Linguistics; Springer: Berlin, Germany, 2013; pp. 163–186. [Google Scholar]
  6. Nuij, W.; Hogenboom, F.; Frasincar, F.; Kaymak, U. An automated framework for incorporating news into stock trading strategies. IEEE Trans. Knowl. Data Eng. 2014, 26, 823–835. [Google Scholar] [CrossRef]
  7. Capet, P.; Delavallade, T.; Nakamura, T.; Sandor, A.; Tarsitano, C.; Voyatzi, S. A risk assessment system with automatic extraction of event types. In Proceedings of the Intelligent Information Processing IV: 5th IFIP International Conference on Intelligent Information Processing, Beijing, China, 19–22 October 2008; pp. 220–229. [Google Scholar]
  8. Vanegas, J.A.; Matos, S.; González, F.; Oliveira, J.L. An overview of biomolecular event extraction from scientic documents. Comput. Math. Methods Med. 2015, 2015, 571381. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Zhang, J.; Li, X.; Tian, J.; Jiang, Y.; Luo, H.; Yin, S. A variational local weighted deep sub-domain adaptation network for remaining useful life prediction facing cross-domain condition. Reliab. Eng. Syst. Saf. 2023, 231, 108986. [Google Scholar] [CrossRef]
  10. Zhang, J.; Li, X.; Tian, J.; Luo, H.; Yin, S. An integrated multi-head dual sparse self-attention network for remaining useful life prediction. Reliab. Eng. Syst. Saf. 2023, 233, 109096. [Google Scholar] [CrossRef]
  11. Cui, S.; Yu, B.; Liu, T.; Zhang, Z.; Wang, X.; Shi, J. Edge-enhanced graph convolution networks for event detection with syntactic relation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), Online, 16–20 November 2020; p. 10757. [Google Scholar]
  12. Grishman, R. Information extraction: Techniques and challenges. In Lecture Notes in Computer Science; Springer: Cham, Switzerland, 1997; pp. 10–27. [Google Scholar]
  13. Kipf, T.N.; Welling, M. Semi-supervised classification with graph convolutional networks. arXiv 2016, arXiv:1609.02907. [Google Scholar]
  14. Orr, J.W.; Tadepalli, P.; Fern, X. Event detection with neural networks:A rigorous empirical evaluation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, 31 October–4 November 2018; pp. 999–1004. [Google Scholar]
  15. Nguyen, T.H.; Grishman, R. Graph convolutional networks with argumant-aware pooling for event detection. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence, New Orleans, LO, USA, 2–7 February 2018; pp. 5900–5907. [Google Scholar]
  16. Ahn, D. The stages of event extraction. In Proceedings of the Workshop on Annotating and Rcasoning about Time and Events, Sydney, Australia, 23 July 2006; pp. 1–8. [Google Scholar]
  17. Ji, H.; Grishman, R. Refining event extraction through cross-document inference. In Proceedings of the Meeting of the Association for Computational Linguistics, Columbus, OH, USA, 15–20 June 2008; pp. 254–262. [Google Scholar]
  18. Liao, S.; Grishman, R. Using document level cross-event inference to improve eventextraction. In Proceedings of the Meeting of the Association for Computational Linguistics, Uppsala, Sweden, 11–16 July 2010; pp. 789–797. [Google Scholar]
  19. Liu, S.; Liu, K.; He, S.Z.; Zhao, J. A Probabilistic Soft Logic based approach to exploiting latent and global information in event classification. In Proceedings of the 30th AAAI Conference on Artificial Intelligence, Phoenix, AZ, USA, 12–17 February 2016; pp. 2993–2999. [Google Scholar]
  20. Nguyen, T.H.; Cho, K.; Grishman, R. Joint event extraction via recurent neural networks. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, San Diego, CA, USA, 12–17 June 2016; pp. 300–309. [Google Scholar]
  21. Li, Q.; Wan, H.; Huang, L. Joint event extraction via structured prediction with global features. In Proceedings of the Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, 4–9 August 2013; pp. 73–82. [Google Scholar]
  22. Walker, C.; Strassel, S.; Medero, J.; Maeda, K. ACE 2005 Multilingual Training Corpus LDC2006T06; Web Download; Linguistic Data Consortium: Philadelphia, PA, USA, 2006. [Google Scholar]
  23. Qi, P.; Zhang, Y.; Zhang, Y.; Bolton, J.; Manning, C.D. Stanza: A Python natural language processing toolkit for many human languages. In Proceedings of the Association for Computational Linguistics (ACL) System Demonstrations, Online, 5–10 July 2020. [Google Scholar]
  24. Zhang, J.; Tian, J.; Li, M.; Leon, J.I.; Franquelo, L.G.; Luo, H.; Yin, S. A parallel hybrid neural network with integration of spatial and temporal features for remaining useful life prediction in prognostics. IEEE Trans. Instrum. Meas. 2022, 72, 1–12. [Google Scholar] [CrossRef]
  25. Zhang, J.; Huang, C.; Chow, M.Y.; Li, X.; Tian, J.; Luo, H.; Yin, S. A Data-model Interactive Remaining Useful Life Prediction Approach of Lithium-ion Batteries Based on PF-BiGRU-TSAM. IEEE Trans. Ind. Inform. 2023. [Google Scholar] [CrossRef]
  26. Akbik, A.; Blythe, D.; Vollgraf, R. Contextual String Embeddings for Sequence Labeling. In Proceedings of the 27th International Conference on Computational Linguistics, Santa Fe, NM, USA, 20–26 August 2018; pp. 1638–1649. [Google Scholar]
  27. Li, X.; Li, F.; Pan, L.; Chen, Y.; Peng, W.; Wang, Q.; Lyu, Y.; Zhu, Y. DuEE: A Large-scale dataset for chinese event extraction in real-world scenarios. In Proceedings of the Natural Language Processing and Chinese Computing: 9th CCF International Conference (NLPCC 2020), Zhengzhou, China, 14–18 October 2020; pp. 534–545. [Google Scholar]
Figure 1. The overall architecture.
Figure 1. The overall architecture.
Electronics 12 02981 g001
Figure 2. The generation of argument indicators based on EE-GCN.
Figure 2. The generation of argument indicators based on EE-GCN.
Electronics 12 02981 g002
Figure 3. The extraction of arguments.
Figure 3. The extraction of arguments.
Electronics 12 02981 g003
Table 1. Role of event argument in Baidu dataset.
Table 1. Role of event argument in Baidu dataset.
Event TypeEvent Argument Role [Entity Type]
finance/trading—listinglocation [GPE LOC FAC]
listed enterprises [ORG]
amount of financing [MONEY]
product behavior—publishtime [DATE TIME]
publish products [PRODUCT WORK_OF_ART]
publisher [PERSON ORG GPE NORP]
product behavior—releasetime [DATE TIME]
release party [ORG]
release film and television [WORK_OF_AR]
competitive behavior—winningtime [DATE TIME]
winner [PERSON ORG]
championship event [EVENT]
life—marriagetime [DATE TIME]
married parties [PERSON]
Table 2. Extraction of alternative arguments (1).
Table 2. Extraction of alternative arguments (1).
Sentence ID1
SentenceTsingtao Brewery shares (00168) employee representative supervisor Xing Jun resigned.
Indicatortrigger: resign
Event Type: organizational relationship—resignation
Alternative argumentstime: []
resigned person: [‘Xing Jun’]
original organization: [‘Tsingtao Brewery Shares’]
Table 3. Extraction of alternative arguments (2).
Table 3. Extraction of alternative arguments (2).
Sentence ID2
SentenceA man named Ma has been suspended and is being investigated by the county’s discipline inspection commission, according to a person in charge of the county management over-speed station on June 4.
Indicatortrigger: suspension
Event Type: organizational relations—suspension
Alternative argumentstime: [‘June 4’]
organization: [‘the county management over-speed station’,
‘county’s discipline inspection commission’]
suspended person: [‘Ma’]
Table 4. Extraction of alternative arguments (3).
Table 4. Extraction of alternative arguments (3).
Sentence ID3
SentenceAt the 2019 Chongqing International Auto Show, Changan officially published Yidong and CS35 PLUS with a 1.4 T engine.
Indicatortrigger: publish
Event Type : product behavior—publish
Alternative argumentstime: [‘2019’]
publish products: [‘Yidong’, ‘CS35PLUS’, ‘1.4 T engine’]
publisher: [‘Chang’ an’]
Table 5. Extraction of alternative arguments (4).
Table 5. Extraction of alternative arguments (4).
Sentence ID4
SentenceOn July 18, the reporter learned from the provincial flood control and disaster reduction work dispatch meeting that since July 16, Dazhou City has experienced heavy rainfall. Affected by rainfall, Dazhou City’s main rivers have suffered varying degrees of flooding.
Indicatortrigger: flood
Event Type: disaster/accident—flood
Alternative argumentstime: [‘16 July’, ‘18 July’]
location: [‘Dazhou City’]
Table 6. Results of the selection of alternative arguments.
Table 6. Results of the selection of alternative arguments.
Sentence IDAlternative ArgumentsResults of the Selection Based on Local AssociationResults of the Selection Based on Global Association
2time: [‘June 4’]
organization:
[‘the county management over-speed station’,
‘county’s discipline inspection commission’]
suspended person: [‘Ma’]
organization:{confirmed argument 1: [‘the county management over-speed station’]
to-be-confirmed argument: [‘county’s discipline inspection commission’]})
organization:
{confirmed argument 2: []}
3time: [‘2019’]
publish products: [‘Yidong’, ‘CS35PLUS’, ‘1.4 T engine’]
publisher:
[‘Chang’ an’]
publish products: {confirmed argument 1:
[‘Yidong’]
to-be-confirmed arguments: [‘CS35PLUS’,’1.4 T engine’]}
publish product:
{confirmed argument 2:
[‘CS35PLUS’]}
4time:
[‘16 July’, ‘18 July’]
location:
[‘Dazhou City’]
time:{confirmed argument 1: [‘16 July’]
to-be-confirmed arguments: [‘18 July’]}
time:
{confirmed argument 2: []}
Table 7. The final result of the event multi-tuple.
Table 7. The final result of the event multi-tuple.
SentenceEvent Multi-Tuple
2A man named Ma, has been suspended and is being investigated by the county’s discipline inspection commission, according to a person in charge of the county management over-speed station on June 4.organizational relations-suspension
Trigger: suspension
time: [‘June 4’]
Argument: organization:
[‘the county management over-speed station’]
Argument:
suspended person: [‘Ma ‘]
3At the 2019 Chongqing International Auto Show, Changan officially published Yidong and CS35 PLUS with a 1.4 T engine.Event Type: product behavior—publish
Trigger: publish
Argument: time: [‘2019’]
Argument: publish product:
[‘Yidong’, ’CS35PLUS’]
Argument: publisher: [‘Chang’ an ‘]
4On July 18, the reporter learned from the provincial flood control and disaster reduction work dispatch meeting that since July 16, Dazhou City has experienced heavy rainfall. Affected by rainfall, Dazhou City’s main rivers have suffered varying degrees of flooding.Event Type: disaster/accident—flood
Trigger: flood
Argument: time: [‘July 16th’]
Argument: location:
[‘Dazhou City’]
Table 8. Experimental results compared with LTP.
Table 8. Experimental results compared with LTP.
Sentence IDSentenceOurLTP
1Tsingtao Brewery shares (00168) employee representative supervisor Xing Jun resigned.Event Type: organizational relationship-resignation
Trigger: resignation
Argument: resigned person: Xing Jun
Argument: original organization:
Tsingtao Brewery shares
[]
2A man named Ma has been suspended and is being investigated by the county’s discipline inspection commission, according to a person in charge of the county management over-speed station on June 4.Event Type: organizational relations-suspension
Trigger: suspension
Argument: time: [‘June 4th’]
Argument: organization:
[‘the county management over-speed station’]
Argument: suspended person: [‘Ma’]
Trigger: intervene
Argument: entity: county’s discipline inspection commission
Argument: time:for now, June 4th
3At the 2019 Chongqing International Auto Show, Changan officially published Yidong and CS35 PLUS with a 1.4 T engine.Event Type:
product behavior-publish
Trigger: publish
Argument: time: [‘2019’]
Argument: publish product: [‘Yidong’, ‘CS35PLUS’]
Argument: publisher: [‘Chang’ an ‘]
Trigger: publish
Argument: publisher: Chang’ an
Argument: publish product: equipped with a 1.4 T engine)
Argument: time: 2019
Table 9. Experimental results compared with ERINE.
Table 9. Experimental results compared with ERINE.
Sentence IDSentenceOurERINE
1Tsingtao Brewery shares (00168) employee representative supervisor Xing Jun resigned.Event Type: organizational relationship-resignation
Trigger: resignation
Argument: resigned person: Xing Jun
Argument: original organization:
Tsingtao Brewery shares
Event Type: organizational relationship-resignation
Trigger: resignation
Argument: resigned person: employee representative supervisor Xing Jun
Argument: original organization:
Tsingtao Brewery shares
2A man named Ma has been suspended and is being investigated by the county’s discipline inspection commission, according to a person in charge of the county management over-speed station on June 4.Event Type: organizational relations-suspension
Trigger: suspension
Argument: time: [‘June 4th’]
Argument: organization:
[‘the county management over-speed station’]
Argument: suspended person: [‘Ma’]
Event Type: organizational relations-suspension
Trigger: suspension
Argument: suspension person:negative)
Argument: suspended person: [‘The involved man, Ma’]
4On July 18, the reporter learned from the provincial flood control and disaster reduction work dispatch meeting that since July 16, Dazhou City has experienced heavy rainfall. Affected by rainfall, Dazhou City’s main rivers have suffered varying degrees of flooding.Event Type: disaster/accident—flood
Trigger: flood
Argument: time: [‘July 16th’]
Argument:
location: [‘Dazhou City’]
Event Type: disaster/accident—flood
Trigger: flood
Argument: time: [‘July 16th’]
Argument: location: Dazhou City
Argument: location: the main rivers of Dazhou city
Table 10. Statistical analysis results of the comparison experiment.
Table 10. Statistical analysis results of the comparison experiment.
SystemsCoveragePrecisionRecallF1
LTP69.879.776.575.1
ERINE94.883.986.888.3
OUR97.587.485.689.9
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, Z.; Li, Y.; Zhang, Y.; Weng, Y.; Yang, K.; Chaomurilige. Effective Event Extraction Method via Enhanced Graph Convolutional Network Indication with Hierarchical Argument Selection Strategy. Electronics 2023, 12, 2981. https://doi.org/10.3390/electronics12132981

AMA Style

Liu Z, Li Y, Zhang Y, Weng Y, Yang K, Chaomurilige. Effective Event Extraction Method via Enhanced Graph Convolutional Network Indication with Hierarchical Argument Selection Strategy. Electronics. 2023; 12(13):2981. https://doi.org/10.3390/electronics12132981

Chicago/Turabian Style

Liu, Zheng, Yimeng Li, Yu Zhang, Yu Weng, Kunyu Yang, and Chaomurilige. 2023. "Effective Event Extraction Method via Enhanced Graph Convolutional Network Indication with Hierarchical Argument Selection Strategy" Electronics 12, no. 13: 2981. https://doi.org/10.3390/electronics12132981

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop