Within-Document Arabic Event Coreference: Challenges, Datasets, Approaches and Future Direction

Aldawsari, Mohammed; Kolhar, Manjur; Dawood Omer, Omer Salih

doi:10.3390/app131911004

Open AccessArticle

Within-Document Arabic Event Coreference: Challenges, Datasets, Approaches and Future Direction

by

Mohammed Aldawsari

^*

,

Manjur Kolhar

and

Omer Salih Dawood Omer

Department Computer Science, College of Arts and Science, Wadi Ad Dwaser, Prince Sattam Bin Abdulaziz University, Al-Kharj 16273, Saudi Arabia

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2023, 13(19), 11004; https://doi.org/10.3390/app131911004

Submission received: 7 September 2023 / Revised: 29 September 2023 / Accepted: 3 October 2023 / Published: 6 October 2023

(This article belongs to the Special Issue AI for Computational Vision, Natural Language Processing, and Geoinformatics)

Download Versions Notes

Abstract

:

Event coreference resolution is a crucial component in Natural Language Processing (NLP) applications as it directly affects text summarization, machine translation, classification, and textual entailment. However, the research on this task for Arabic language is limited, compared to other languages such as English, Chinese and Spanish. This paper aims to review the state-of-the-art approaches in event coreference (EC) within the context of coreference resolution tasks, emphasizing the significance of EC in NLP. The focus is placed on the latest developments in Arabic language processing related to event coreference. To fill this gap, a comprehensive study of existing work is conducted, and new approaches are suggested. The paper highlights the challenges specific to Arabic event coreference resolution, such as the variability of verb forms, pronoun ambiguity, ellipsis and null arguments, lexical and morphological variation, lack of annotated resources, discourse and pragmatic context, and cultural and contextual sensitivity. Addressing these challenges requires a deep understanding of Arabic linguistics, advanced NLP techniques, and the availability of annotated resources. Furthermore, this paper examines the existing datasets and methods for Arabic event coreference and proposes an annotation scheme. By leveraging existing NLP algorithms and developing event coreference resolution systems tailored for Arabic, the accuracy and performance of NLP tasks can be significantly improved.

Keywords:

natural language processing; coreference resolution; event coreference; Arabic language processing; linguistic representations

1. Introduction

In the context of the ACE [1] and TimeML [2] schemas, an event is a significant occurrence or activity that happens at a specific point in time. However, the definitions of an event in these two schemas are slightly different. In the ACE2005 schema, an event is defined as a type of entity that represents a happening or occurrence that takes place at a particular time, such as a meeting, an attack, or an election. An event in ACE2005 is often characterized by attributes such as its type, subtype, and participants. In contrast, the TimeML schema defines an event as a concept that represents a single point or interval in time where something happens. TimeML events can include specific occurrences, such as an earthquake or a meeting, as well as more abstract events, such as the start or end of a period of time. Overall, both ACE2005 and TimeML define an event as a significant occurrence that takes place at a specific time, but the distinction between the two lies in how they represent and categorize events. ACE2005 is more focused on the type and attributes of events, while TimeML is more concerned with the temporal relationships between events and other temporal expressions in a text. In Table 1, bolded words are annotated as events in both corpora.

Event coreference resolution is a task in Natural Language Processing (NLP) that involves identifying expressions (i.e., words) in a text that refer to the same real-world event. In other words, event coreference aims to find all the different ways that an event is mentioned in a text and link them together, so that NLP systems can better understand the relationship between events and the entities that participate in them. In Table 2, all bolded words are coreferencing events.

Event coreference is important for natural language processing because it affects how well we can do things like summarizing, translating, and understanding texts. For instance, if we want to summarize a text, we need to know which events are the most important and how they are related to each other. Event coreference is also difficult because it requires knowing how humans make sense of and communicate events before using words. Humans can process the text directly without focusing on which event refers to another because they have a rich mental representation of events that is based on their perception, memory, and knowledge. They can use cues such as tense, aspect, modality, and discourse markers to infer the relations between events. However, these cues are not always explicit or consistent in natural language, and they may vary across languages and genres. This paper concentrates on event coreference for Arabic, which has more challenges and less research than event coreference for other languages. Arabic is a morphologically rich and syntactically complex language that has many variations and dialects. It also has different writing systems and conventions that affect how events are expressed in texts. This paper surveys the existing works on Arabic event coreference and challenges as well as available datasets. Furthermore, the paper proposes a schema for annotating Arabic event coreference based on the ACE (Automatic Content Extraction) Arabic Annotation Guidelines for Events [3].

Challenges in Arabic Event Coreference

The Arabic language presents several challenges when it comes to event coreference resolution due to its unique linguistic characteristics. Some of these challenges include then following:

Lexical and Morphological Variation: As shown in Table 3 Arabic has a complex system of inflections and morphological variations that can lead to a variety of surface forms for a given event. As a result of this morphological richness, it may be difficult to identify and link event mentions expressed in different inflected forms.

Consider two sentences: “أحمد زار المكتبة” (Ahmed visited the library) and “زرت المكتبة” (I visited the library). In the second sentence, the verb form “زرت” is different due to the speaker being first-person singular, making it challenging to link this event to the first sentence.

Dropped Pronouns: Arabic frequently drops subject pronouns due to its tendency to drop them, making it more difficult to determine the agents of events, which are crucial for resolving coreferences [4]. For instance, in this sentence “ذهبت إلى المدرسة (Went to school)” the subject pronoun “أنا” (I) is dropped, which makes it more difficult to determine the agent of the event.

Pronouns Ambiguity: Due to the extensive use of pronouns with multiple forms and genders, Arabic pronouns are highly ambiguous, as are other Arabic words. For example, in this sentence: “قالت لها أنها تحبه (She told her that she loves him)” both pronouns “ها” and “ها” refer to different entities. Based on context, it is necessary to disambiguate pronoun referents in order to resolve coreferring events accurately.

Events Ellipses: These refer to situations where an event is referred to or implied without being explicitly mentioned. Arabic uses events ellipses extensively. In this sentence, “والبنت برتقالة أكل الولد تفاحة (The boy ate an apple and the girl ate an orange)”, the event of eating is referred to twice, but the second time it is implied without being explicitly mentioned in a sentence. By recognizing event ellipsis, the coherence of discourse is enhanced as it aids in resolving coreferring events accurately.

Cultural and Contextual Sensitivity: The interpretation and resolution of Arabic coreferences may involve a consideration of cultural and contextual factors specific to the Arabic language and its diverse dialects.

In order to address these challenges, a deep understanding of Arabic linguistics, advanced natural language processing techniques, and the availability of annotated resources specific to Arabic event coreference resolution are required.

When transferring NLP models from English to Arabic or any other language, it is imperative to understand the linguistic and domain-specific characteristics of both languages and datasets. The key to achieving good performance in cross-lingual NLP tasks is careful analysis, adaptation, and evaluation. In addition, Arabic has several distinctive features, including its root-based morphology, its right-to-left script, and its complex verb conjugation. These linguistic differences could challenge models that are trained on Indo-European languages such as English.

A key difference between within-document event coreference resolution and cross-document event coreference resolution lies in the scope and objective of each task. As opposed to within-document resolution, which emphasizes coherence within a single text by connecting mentions of events, cross-document resolution focuses on connecting and consolidating information about events across multiple texts to improve understanding of events that occur in various contexts. Among the NLP tasks, both event-centered analysis and information retrieval have their own challenges and applications. It is more challenging to resolve cross-document event coreferences than to resolve within-document events because it often involves reasoning about the same event in various contexts, dealing with variations in event descriptions, and resolving ambiguity caused by various descriptions of the same event.

2. Various Approaches and Data

2.1. Data

When it comes to EC resolution, the English language domain faces several major challenges. Additionally, there are a variety of annotation schemas, topologies, and conceptual definitions of what an event is available in the corpora. Therefore, event coreference methodologies cannot be used to compare datasets for which they were not designed. A benchmark for entity and event coreference resolution systems in the English language domain has long been the OntoNotes corpus [5]. The large-scale multi-domain text collection contains annotations at the entity and event levels. Additionally, OntoNotes does not distinguish between entity and event labels: both are simply referred to as mentions. In spite of the fact that event coreference and entity coreference share many aspects, researchers currently face vastly different challenges when it comes to the two tasks. Additionally, verbal events cannot be classified as a single-word mention in the OntoNotes corpus unless there is an equivalent noun phrase. The data become less consistent as a result, especially since coreference resolution algorithms are intended for use in practical, real-world applications.

Table 4 shows the available English corpora annotated with event reference relations and Table 5 presents the state-of-the-art systems for these corpora. The ACE 2005 [1] is used for evaluating the ACE corpora. A set of predefined type actions in English and Chinese are annotated as events and coreferences within the ACE 2005 corpus. In spite of the fact that ACE’s event schema is relatively limited, its general approach has evolved over time. ACE methodology, for example, is incorporated into the TAC-KBP corpora [6], which combine ACE’s typology with more complex and informative annotation styles. There are documents included in this dataset in English, Arabic, Chinese, and Spanish, as well as co-referential links within each document. These datasets all annotate co-referential links within documents, which makes cross-document event correlative resolution even more challenging. A monolingual English ECB+ corpus [7] is the standard for cross-document research, extending the more limited ECB corpus [8]. This corpus contains a substantial number of newspaper documents that contain multi-word event spans that have been annotated according to Rich ERE guidelines [9]. In addition to the use of a semi-automated data leveraging method, the WEC-Eng project developed the WEC-Eng dataset [10], the second and final cross-document English corpus. As a result of this method, event mentions and references are not limited to predefined categories. English, Dutch, Spanish, and Italian annotations are provided at the cross-document level. In terms of coverage, the dataset is largely unrestricted. Authors [10], proposed an efficient method for acquiring a large-scale dataset for cross-document event correlation is using Wikipedia’s event conference.

Several datasets are available in different languages, including Chinese [1,11], Greek [12], and Spanish [1,13]. Based on the available information at present, it has been ascertained that there are no existing corpora specifically annotated for Arabic event coreference. Therefore, there is no system yet to detect Arabic event coreference.

Table 4. Statistics of the commonly used English corpora annotated with event coreference relation [14].

Dataset	Docs	Events Mentions	Chains
ACE 2005	599	5349	4090
ECB+ [7]	982	14,884	9875
TAC KBP [11]	1075	29,471	19,257
MAVEN-ERE [14]	4480	112,276	103,193

Table 5. English state-of-the-art systems in event coreference. The AVG is the average F-score of four metrics: MUC, B3, CEAFe and BLANC. The CoNLL score is the average of the first three metrics [15].

Model	Dataset	CoNLL	AVG
End-to-End within-document [16]	ACE2005	64.56	62.11
Gold triggers within-document [16]	ACE2005	86.78	86.63
EPASE-within-document [17]	ECB+	88.3	-
EPASE-cross-document [17]	ECB+	84.3	-
End-to-End within-document [18]	KBP 2016	43.55	40.61

2.2. Approaches

According to Table 5, in their model [16], multiple representations are learned and integrated from both event alone and event pair data. In order to create more discriminatory representations of events, they introduced multiple linguistically motivated event alone features. To capture the distinctions between event pairs, they considered multiple similarity measures. They demonstrate the effectiveness of their proposed model by achieving a state-of-the-art on the ACE2005 benchmark. The model is also compared with ground truth triggers and predicted event-alone features in order to ensure a thorough comparison with the CDGM model.

A new model for event coreference resolution, called EPASE, was proposed in [17]. It can cover event paraphrases in a broader range of situations, improving generalization, by identifying deep paraphrase relationships within an event-specific context of sentences. In addition, argument roles are embedded in event embedding without relying on a fixed number or type of arguments, leading to greater EPASE scalability. There is consistent and significant superiority of this method over existing methods, both within- and cross-document correlations.

The resolution of event coreferences is an important research problem with many applications. Although pre-trained language models have achieved remarkable success in recent times, we argue that using symbolic features is still highly beneficial. The automatic extraction of symbolic features is subject to noise and errors, since reference resolution typically comes from upstream components in the information extraction pipeline. Furthermore, certain features may be more informative than others depending on the context. In response to these observations, the authors in [18] proposed a novel context-dependent gated module for adaptively controlling the information flow from the input symbolic features. With the help of a simple noisy training method, their proposed models achieve state-of-the-art results on two datasets: ACE 2005 and KBP 2016.

According to reference [15], event coreference resolution typically follows the same paradigms as entity coreference resolution. It should be noted, however, that the methods described in the following discussion are limited to the English and Chinese languages. Coreference resolution is currently characterized by three predominant paradigms. Typically, the mention–pair approach is used, which involves transforming the process of forming clusters of co-referential mentions into a binary classification process. Based on this approach, pairs of event mentions are generated and classified using a binary classification algorithm. In order to reconstruct the event coreference chain from the binary output, a clustering algorithm is used. Recently, mention–pair systems have evolved in tandem with developments in machine learning methods frequently used in natural language processing.

Prior to deep neural networks [19], support vector machines [20] and decision trees [4] were used to analyze feature-based data. In studies, outward lexical similarity has been demonstrated to be the most powerful indicator of coreference among these approaches, and features based on string comparisons have also been demonstrated to be the most powerful. A further feature that models the document’s structure was also successful in resolving coreference in its context [15,21]. However, feature-based methods for coreference resolution have encountered competition from transformer-based approaches. These newer techniques employ large language models to produce powerful contextual representations of mentions, forming the foundation for their classification algorithms [22].

Transformer-based approaches that are span-based [23] have demonstrated state-of-the-art performance in the English language. It is important to note that these pre-trained language models are optimized to encode longer word sequences, which results in more robust contextual representations of events (including multi-word events). Even though mention–pair models generally perform better in coreference resolution tasks, one of their main limitations is their inability to account for event coreference chains involving more than two events. Rather than considering the entire discourse, the algorithm reduces to making pairwise decisions. The second paradigm, mention-ranking, addresses some of the limitations of the mention–pair approach. Based on the feature representation of the mention and its antecedents, the possible antecedents of a mention are ranked.

The algorithm calculates the probability of all co-referential relationships [24] based on a partitioning of co-referential chains. A third method of resolving event correlations is known as the easy-first modeling approach. An event coreference approach is applied using rule-based multi-pass sieve algorithms that have been found to be successful in entity coreference research [25]. A system in which mentions which are relatively “easier” are resolved first is determined by a combination of a series of classification rules or sieves arranged in decreasing order of precision. However, it is possible to include global coreference cluster information even though rule systems are primarily based on pairwise comparisons. As a result, mention–pair approaches are addressed, albeit to a minor extent. In addition, within-chain event argument propagation [26] and agglomerative clustering [27] can further improve the performance of simple first methods. Event coreference are resolved using gold-standard event mentions in the methods and algorithms discussed thus far. End-to-end systems must, however, first extract mentions from raw text. To resolve coreferences end-to-end, a pipeline or a joint approach can be used.

It is possible to detect and resolve event mentions in a pipeline configuration using several different methods, and any self-contained detection method may be coupled with any of the above methods for resolving events. In spite of the fact that such systems can be relatively easy to implement and highly customizable, they are prone to error propagation, since errors in one component can pass without being corrected to the next. Alternatively, joint event coreference resolution aims to model both event detection and coreference resolution simultaneously. Integer linear programming and Markov logic networks can be used to perform joint inference [28,29]. Each component can be enhanced by incorporating background knowledge. By utilizing segment-based decoding, a joint coreference resolution algorithm can be generated as part of a full-blown joint-learning approach, which combines the two tasks into one structured prediction task. It has been demonstrated that joint methods, particularly joint inference methods, perform best in this field [30], particularly when combined with high-performance entity coreference resolution systems [15] and transformer-based architectures [23].

3. Proposed Arabic Event Coreference Annotation

3.1. Event Trigger Annotation

For annotating event triggers, we plan to follow the ACE2005 schema for Arabic event annotation. That is, in order to tag events triggers, annotators must adhere to the ACE2005 Arabic event guidelines [3]. Table 6 shows examples of the main event types and subtypes extracted from the ACE2005 Arabic event guidelines.

3.2. Event Coreference Annotation

For annotating Arabic event coreference, annotators will follow the less strict schema Rich ERE schema [9] in annotating event coreference. That is, event mentions that refer to the same event occurrence will be grouped into Event Hoppers. Event Hopper is a more inclusive, less strict notion of event coreference as compared strict event coreference in ACE2005 and Light ERE. Event hoppers contain mentions of events that “feel” coreferential to the annotator even if they do not meet the strict event identity requirement in ACE2005. More specifically, event mentions that have the following features go into the same hopper: The bolded text in the following text has been annotated

When events mentions refer to the same real-world event and have the same event type.

1. قتل ثلاثة وأربعون شخصا في الهجوم على بغداد , توفي ثلاثة وأربعون شخصا في هجوم بغداد

2. الهجوم وقع بعاصمة سوريا .…. قتل في الهجوم على دمشق اربعة مسلحين

Events that have the same temporal and location scope, though not necessarily the same temporal expression or specifically the same date.
1. هجوم في بغداد الخميس ….. قصف في المنطقة الخضراء الاسبوع الماضي
Event arguments may be non-coreferential or conflicting.
1. قتل 18 شخصا ….. عشرات القتلى

Furthermore, In order to develop an effective annotation schema for Arabic event coreference resolution, it is necessary to consider the specific linguistic challenges associated with Arabic. In addition to the high-level descriptions previously provided, let us examine more detailed strategies for addressing these challenges within the annotation schema.

Morphological Variation:
○
A strategy should be developed to train annotators in identifying morphological variations in verbs and how they are related to the same event. Within an event chain, guidelines should provide examples and rules for handling different verb forms.
○
Example: Provide annotators with examples of verb conjugations and instruct them to connect verbs with the same root and semantic event, even if they have different morphological forms. For instance, “زار” (visited) and “زرت” (I visited) share the same root and should be linked if they refer to the same event.
Pronoun Ambiguity:
○
In order to disambiguate pronoun references, guidelines should provide explicit instructions. The annotator should be guided to consider the context, antecedents, and gender/number agreement when making annotations.
○
Example: Instruct annotators to look for the closest noun or entity that agrees in gender and number with the ambiguous pronoun. For “ساره رأت محمد وقالت له أنها ستأتي,” annotators should link “له” (him) to “محمد” since they agree in gender and number.
Dialectal Variations:
○
Strategy: Annotators should be trained to recognize different expressions for the same event in order to be aware of dialectal variations. A section on common dialectal variations can be included in the guidelines.
○
Example: If annotators encounter a dialectal phrase that refers to an event, they should be instructed to link it to the standard Arabic expression that represents the same event.
Verb Ellipsis:
○
Strategy: Guidelines should specify how verb ellipses should be handled, emphasizing that omitted verbs should be interpreted in light of the context.
Example: For “أحمد أكل التفاحة ومحمد أيضًا,” annotators should understand that the omitted verb “ate” applies to both Ahmed and Mohammad.
○
Providing annotators with clear guidelines, training, and regular feedback sessions can also assist in addressing linguistic challenges effectively. When faced with ambiguous cases, the schema should include mechanisms for annotator discussion and consensus building. In order to improve the quality of event coreference annotations for Arabic text, constant communication between annotators and project supervisors is essential.

4. Evaluation Metrics

Following standard practice for event coreference systems evaluation, the most common evaluation metrics for event coreference resolution can be used to evaluate Arabic event coreference systems, such as MUC [31], B-Cubed [32], CEAF [33], and BLANC [34], all of which report results in terms of recall (R), precision (P), and F-score (F). Additionally, the CoNLL score [35] can be used for Arabic event coreference evaluation, which is the unweighted average of the MUC, B3, and CEAF F-scores.

5. Conclusions

The successful resolution of event coreference in Arabic holds the potential to significantly benefit a range of applications, including information extraction, sentiment analysis, document summarization, and machine translation. Nevertheless, this task encounters substantial challenges within the Arabic linguistic landscape.

The intricate lexical and morphological variations in Arabic, coupled with the frequent omission of subject pronouns, contribute to the complexity of event coreference resolution. Moreover, the extensive utilization of ambiguous pronouns and events ellipses further amplifies this complexity. Additionally, the scarcity of annotated corpora tailored to Arabic event coreference presents a major hindrance to the development of specialized systems.

Arabic event coreference remains an underexplored area compared to languages with more established resources. The lack of specialized systems and comprehensive datasets highlights the need for concerted efforts in constructing suitable corpora. Addressing this gap, we have put forth a schema for annotating Arabic event coreference. This schema is designed to effectively capture the nuanced relationships between events and thereby provide crucial support for the development of advanced coreference resolution systems.

In the future, work will be conducted on the resolution of event conflicts, which often involves entities such as individuals and organizations as participants. The integration of entity coreference resolution with event coreference can result in knowledge graphs or databases that are more comprehensive and coherent. By leveraging shared context, joint entity and event coreference resolution can also improve accuracy.

Author Contributions

Conceptualization, M.A. and M.K.; methodology, M.A.; software, M.A. and O.S.D.O.; validation, M.A., M.K. and O.S.D.O.; formal Analysis, M.A., M.K. and O.S.D.O.; investigation, M.A., M.K. and O.S.D.O.; resources, M.A., M.K. and O.S.D.O.; data curation, M.A., M.K. and O.S.D.O.; writing—original draft preparation, M.A. and M.K.; writing—review and editing, M.A.; supervision, M.A.; project administration, M.A.; funding acquisition, M.A. All authors have read and agreed to the published version of the manuscript.

Funding

The authors extend their appreciation to the Deputyship for Research & Innovation, Ministry of Education in Saudi Arabia for funding this research work through the project number (IF2/PSAU/2022/01/21848).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Doddington, G.R.; Mitchell, A.; Przybocki, M.A.; Ramshaw, L.A.; Strassel, S.M.; Weischedel, R.M. The automatic content extraction (ace) program—Tasks, data, and evaluation. In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04), Lisbon, Portugal, 26–28 May 2004; Volume 2, pp. 837–840. [Google Scholar]
Verhagen, M.; Gaizauskas, R.J.; Schilder, F.; Hepple, M.; Katz, G.; Pustejovsky, J. TimeML: Robust Specification of Event and Temporal Expressions in Text. J. Semant. 2007, 24, 37–75. [Google Scholar]
Arabic Events Guidelines Version 5.4.4. Available online: https://www.ldc.upenn.edu/sites/www.ldc.upenn.edu/files/arabic-events-guidelines-v5.4.4.pdf (accessed on 13 June 2023).
Cybulska, A.; Vossen, P. Translating Granularity of Event Slots into Features for Event Coreference Resolution. In Proceedings of the 3rd Workshop on EVENTS: Definition, Detection, Coreference, and Representation, Denver, CO, USA, 4 June 2015; pp. 1–10. [Google Scholar]
OntoNotes Release 5.0. LDC2013T19. Web Download. Philadelphia: Linguistic Data Consortium. 2013. Available online: https://catalog.ldc.upenn.edu/LDC2013T19 (accessed on 9 May 2023).
National Institute of Standards and Technology. TAC Knowledge Base Population (KBP) 2017. Available online: https://tac.nist.gov/2017/KBP/index.html (accessed on 1 May 2023).
Cybulska, A.; Vossen, P. Using a sledgehammer to crack a nut? Lexical diversity and event coreference resolution. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14), Reykjavik, Iceland, 26–31 May 2014; pp. 4545–4552. [Google Scholar]
Adrian, B.C.; Sanda, H. Unsupervised event coreference resolution with rich linguistic features. Uppsala. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL), Uppsala, Sweden, 11–16 July 2010. [Google Scholar]
Song, Z.; Bies, A.; Strassel, S.; Riese, T.; Mott, J.; Ellis, J.; Wright, J.; Kulick, S.; Ryant, N.; Ma, X. From light to rich ere: Annotation of entities, relations, and events. In Proceedings of the 3rd Workshop on EVENTS: Definition, Detection, Coreference, and Representation, Denver, CO, USA, 4 June 2015; pp. 89–98. [Google Scholar]
Eirew, A.; Cattan, A.; Dagan, I. WEC: Deriving a large-scale cross-document event coreference dataset from Wikipedia. arXiv 2021, arXiv:2104.05022. [Google Scholar]
Getman, J.; Ellis, J.; Strassel, S.; Song, Z.; Tracey, J. Laying the groundwork for knowledge base population: Nine years of linguistic resources for tac kbp. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan, 7–12 May 2018. [Google Scholar]
Hürriyetoğlu, A.; Zavarella, V.; Tanev, H.; Yörük, E.; Safaya, A.; Mutlu, O. Automated extraction of socio-political events from news (AESPEN): Workshop and shared task report. arXiv 2020, arXiv:2005.06070. [Google Scholar]
Hürriyetoğlu, A.; Mutlu, O.; Yörük, E.; Liza, F.F.; Kumar, R.; Ratan, S. Multilingual protest news detection-shared task 1, case 2021. In Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-Political Events from Text (CASE 2021), Online, 5–6 August 2021; pp. 79–91. [Google Scholar]
Wang, X.; Chen, Y.; Ding, N.; Peng, H.; Wang, Z.; Lin, Y.; Han, X.; Hou, L.; Li, J.; Liu, Z.; et al. MAVEN-ERE: A Unified Large-scale Dataset for Event Coreference, Temporal, Causal, and Subevent Relation Extraction. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Abu Dhabi, United Arab Emirates, 7–11 December 2022; pp. 926–941. [Google Scholar]
Lu, J.; Ng, V. Event Coreference Resolution: A Survey of Two Decades of Research. In Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence Survey Track, Stockholm, Sweden, 13–19 July 2018; pp. 5479–5486. [Google Scholar]
Yao, Y.; Li, Z.; Zhao, H. Learning Event-aware Measures for Event Coreference Resolution. In Proceedings of the Association for Computational Linguistics: ACL 2023, Toronto, ON, Canada, 9–14 July 2023; pp. 13542–13556. [Google Scholar]
Zeng, Y.; Jin, X.; Guan, S.; Guo, J.; Cheng, X. Event coreference resolution with their paraphrases and argument-aware embeddings. In Proceedings of the 28th International Conference on Computational Linguistics, Barcelona, Spain (Online), 8–13 December 2020; pp. 3084–3094. [Google Scholar]
Lai, T.; Ji, H.; Bui, T.; Tran, Q.H.; Dernoncourt, F.; Chang, W. A context-dependent gated module for incorporating symbolic semantics into event coreference resolution. arXiv 2021, arXiv:2104.01697. [Google Scholar]
Nguyen, T.H.; Meyers, A.; Grishman, R. New York University 2016 System for KBP Event Nugget: A Deep Learning Approach; TAC: Tokyo, Japan, 2016. [Google Scholar]
Chen, C.; Ng, V.S. An end-to-end Chinese event coreference resolver. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14), Reykjavik, Iceland, 26–31 May 2014; Volume 2, pp. 4532–4538. [Google Scholar]
De Langhe, L.; De Clercq, O.; Hoste, V. Investigating Cross-Document Event Coreference for Dutch. In Proceedings of the Fourth Workshop on Computational Models of Reference, Anaphora and Coreference, Gyeongju, Republic of Korea, 16–17 October 2022; pp. 88–98. [Google Scholar]
Joshi, M.; Levy, O.; Weld, D.S.; Zettlemoyer, L. BERT for coreference resolution: Baselines and analysis. arXiv 2019, arXiv:1908.09091. [Google Scholar]
Joshi, M.; Chen, D.; Liu, Y.; Weld, D.S.; Zettlemoyer, L.; Levy, O. Spanbert: Improving pre-training by representing and predicting spans. Trans. Assoc. Comput. Linguist. 2020, 8, 64–77. [Google Scholar] [CrossRef]
Lu, J.; Ng, V. Learning antecedent structures for event coreference resolution. In Proceedings of the 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA), Cancun, Mexico, 18–21 December 2017; pp. 113–118. [Google Scholar]
Raghunathan, K.; Lee, H.; Rangarajan, S.; Chambers, N.; Surdeanu, M.; Jurafsky, D.; Manning, C.D. A multi-pass sieve for coreference resolution. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, Cambridge, MA, USA, 9–11 October 2010; pp. 492–501. [Google Scholar]
Liu, Z.; Araki, J.; Hovy, E.H.; Mitamura, T. Supervised Within-Document Event Coreference using Information Propagation. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14), Reykjavik, Iceland, 26–31 May 2014; pp. 4539–4544. [Google Scholar]
Choubey, P.K.; Huang, R. Event coreference resolution by iteratively unfolding inter-dependencies among events. arXiv 2017, arXiv:1707.07344. [Google Scholar]
Chen, C.; Ng, V. Joint inference over a lightly supervised information extraction pipeline: Towards event coreference resolution for resource-scarce languages. In Proceedings of the AAAI Conference on Artificial Intelligence, Phoenix, AZ, USA, 12–17 February 2016; Volume 30. [Google Scholar]
Araki, J.; Mitamura, T. Joint event trigger identification and event coreference resolution with structured perceptron. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal, 17–21 September 2015; pp. 2074–2080. [Google Scholar]
Lu, J.; Ng, V. Conundrums in event coreference resolution: Making sense of the state of the art. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Online and Punta Cana, Dominican Republic, 7–11 November 2021; pp. 1368–1380. [Google Scholar]
Vilain, M.; Burger, J.D.; Aberdeen, J.; Connolly, D.; Hirschman, L. A model-theoretic coreference scoring scheme. In Proceedings of the Sixth Message Understanding Conference (MUC-6), Columbia, MD, USA, 6–8 November 1995. [Google Scholar]
Bagga, A.; Baldwin, B. Algorithms for scoring coreference chains. In Proceedings of the First International Conference on Language Resources and Evaluation Workshop on Linguistics Coreference, Granada, Spain, 28–30 May 1998; Volume 1, pp. 563–566. [Google Scholar]
Luo, X. On coreference resolution performance metrics. In Proceedings of the Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, BC, Canada, 6–8 October 2005; pp. 25–32. [Google Scholar]
Recasens, M.; Hovy, E. BLANC: Implementing the Rand index for coreference evaluation. Nat. Lang. Eng. 2011, 17, 485–510. [Google Scholar] [CrossRef]
Pradhan, S.; Moschitti, A.; Xue, N.; Uryupina, O.; Zhang, Y. CoNLL-2012 shared task: Modeling multilingual unrestricted coreference in OntoNotes. In Proceedings of the Joint Conference on EMNLP and CoNLL-Shared Task, Jeju, Republic of Korea, 13 July 2012; pp. 1–40. [Google Scholar]

Table 1. An Example of annotated events in ACE2005 and TimeML.

Arabic	English	Transliteration
أصيب جنديان في الهجوم	Two soldiers were injured in the attack	Usyeb jundyan fi alhujum

Table 2. An example of coreferencing events.

Arabic	English	Transliteration
توالت الادانات العربية للجريمة البشعة التي اقترفها تنظيم داعش بإقدامه على حرق الطيار الاردني حيا... فأدانت السعودية تلك الجريمة.	Arab condemnations continued for the heinous crime committed by ISIS, as they burned the Jordanian pilot alive. Saudi Arabia condemned that crime	Tawalat al-adanat al-arabiyya lil-jarimah al-bashiyah allati iqtarafaha tanzeem Da’esh bi-iqdamih ‘ala harq al-tayar al-urdoni hayyan... fa-adanat al-Su’udiyyah tilka al-jarimah

Table 3. General overview of the inflectional forms for the word “attack—هجم (hujm)” in Arabic and English language.

Lemma.	English Inflectional Forms	Arabic Inflectional Forms
attack— هجم (hujm)	attack, attacks, attacked, attacking	هجم - هاجم - تهاجم - يهاجم - نهاجم - تهاجمونَ - يهاجمونَ – سأهاجم – ستهاجم - سيهاجم - سنهاجم - ستهاجمونَ - سيهاجمونَ - هاجموا- هاجمتُ - هاجمتَ - هاجمتما - أهاجم

Table 6. Event Types and Sub Types are extracted from ACE2005 event annotation guidelines.

No	Event Type	Event Sub Types	Annotated	Not Annotated	Note
1	Life	Be-Born	Mohamed was born in England on 18 June 1963 1963 محمد ولد في لندن في 13 يونيو Being born without my hand, I have never experienced any other way. أنا ولدت بدون أيدي, لا أحس أن هنالك أي فرق	University was born in August 1990. الجامعة ولدت في أغسطس 1990.	The birth of other things or ideas is not encompassed.
		Marry	Amna and ahmed were married on 9 June 1998. امنة و أحمد تزوجوا عام 1998 amna and ahmed are married. (resultative) امنة و أحمد متزوجان
		Divorce	A two-year marriage ended in divorce for the couple. انتهى زواج لمدة عامين بالطلاق للزوجين.
		Injure	The attack resulted in two soldiers being injured. اصيب جنديان في الهجوم A soldier who has been injured. (resultative) الجندي المصاب		LIFE events
		Die	Ronald Reagan was the target of an assassination attempt by John Hinckley. جون هينكلي قام بمحاولة اغتيال رونالد ريجان Foreign hostages have been threatened with death by terrorist groups. المجموعات الارهابية هددت بقتل الرهائن الاجانب An automobile accident resulted in her death. توفيت في حادث
2	Movement	Transport	Fred went to New York on Friday to visit Harry. ذهب فريد إلى نيويورك يوم الجمعة لزيارة هاري. The leaders of Palestine cautioned that Israel should withdraw its troops from the surrounding areas of Palestinian cities. القادة الفلسطينيين حذروا بان علي الاسرائيليين ان يخلوا جنودهم من المدن الفلسطينية
3	Transaction	Transfer-Ownership	A total of two nuclear submarines have been purchased by China from Russia اشترت الصين ما مجموعه غواصتين نوويتين من روسيا This report pertains to the submarines that China has recently obtained. (attributive) اقتنت الصين غواصتين حديثا
3	Transaction	Transfer-Money	There were suspicions that the charity provided funds to an organization. الجمعيات الخيرية متهمة بتمويل منظمة القاعدة	I paid $9 for the movie ticket. دفعت تسعة دولارات ثمنا لتذكرة السينما	TRANSFER-MONEY event.
4	Business	Start-Org	Joseph Conrad Parkhurst, the founder of Cycle World motorcycle magazine in 1962, has passed away. .... جوزيف آونارد الذي انشا مجلة السيارات عام 1962		The event of establishing independence of a geopolitical entity (GPE) or spinning off a subsidiary of an organization (ORG) will not be marked as a STARTORG event in the annotation.
		Merge-Org	It was announced in September that the long-planned merger with KLM Royal Dutch Airlines was not going to take place أُعلن في سبتمبر أن الاندماج المخطط له منذ فترة طويلة مع الخطوط الجوية الملكية KLM لن الهولندية
		Declare-Bankruptcy	In 1995, Orange County declared bankruptcy. اشهرت شركة كوكي افلاسها عام 1995
		End-Org	FOO Corp folded in 2002. تم طي فو كروب في العام 2002
5	Conflict	Attack	The bombing of Fallujah by U.S. forces persisted. القوات الامريكية استمرت في قصف الفالوجا
5	Conflict	Demonstrate	On Monday, the strike of the union started. وبدأ إضراب النقابة يوم الاثنين. Demonstrators gathered in protest. المعارضون تظاهروا امام البيت الابيض
6	Contact	Meet	General Motors (GM) is currently in negotiations with Chrysler for the acquisition of Jeep. شركة جي ام تجري مباحثات مع آريزلر لشراء السيارة جيب
6	Contact	Phone-Write	An e-mail was sent by John to Jane. جون ارسل رسالة اليكترونية الي جين		The event of a very common PERSON talking to reporters or issuing a statement is not taggable.
7	Personnel	Start-Position	In June 1998, Mary Smith became the CEO of Foo Corp. ماري سميث التحقت بالشركة كرئيس مجلس ادارة في يونية عام 1998		A job creation or other large-scale economic trends in employment will not be annotated in general.
		End-Position	Mary Smith departed from Foo Corp. in July 2000 ماري سميث تركت شركة كوكي في عام 2000م
		Nominate	We expect the party to nominate him for president. نتوقع أن يرشحه الحزب لمنصب
		Elect	In 1993, Greg Lashutka won the election and became the mayor of Columbus. جورج انتخب عمده لكولومبيا عام 1993
8	Justice	Arrest-Jail	Scott Peterson was taken into custody for the killing of his spouse تم القبض على سكوت بيترسون بتهمة قتل زوجته. Since May, more than 20 individuals suspected of terrorism have been imprisoned in Russia without trial. منذ مايو الماضي اعتقلت روسيا أكثر من عشرين من الارهابيين المشتبه فيهم بدون اي محاكمة		Only events that can be linked to the legal system of a GPE entity that can be tagged will be annotated as JUSTICE events.
		Release-Parole	In accordance with his parole, Fred has been released. وفقًا للإفراج المشروط عنه ، تم إطلاق سراح فريد المسجون
		Trial-Hearing	Jenna Raleigh is facing trial in a military court. جينا رالي ستحاكم في محكمة عسكرية. This week, the trial resumed المحاكمة المنعقدة هذا الاسبوع
		Charge-Indict	The grand jury indicted Joy Fenter on eleven counts of mail fraud. وجهت هيئة محلفين كبرى اتهامات إلى جوي فينتر في إحدى عشرة تهمة بالاحتيال عبر البريد.
		Sue	She threatened to sue me هددت بمقاضاتي
		Convict	The court will convict the suspect المحكمة ستقضي بإدانة المشتبه به بالجريمة
		Sentence	The court sentenced him to 20 years’ hard labor. وحكمت عليه المحكمة بالأشغال الشاقة 20 سنة
		Fine	She was acquitted on all counts تمت تبرئتها من جميع التهم الموجهة إليه
		Execute	The execution of David Goran by lethal injection took place in March 1987. تم إعدام ديفيد جوران بالحقنة المميتة في مارس 1987.
		Extradite	The ex-leader was sent to Burkina Faso after extradition. تم إرسال الزعيم السابق إلى بوركينا فاسو بعد ترحيله
		Acquit	Chase was acquitted after a trial in the Senate. تمت تبرئة تشيس بعد محاكمة في مجلس الشيوخ.
		Appeal	Ahmed submitted an appeal against the court ruling قدم أحمد طلب إستئناف الحكم الصادر من المحكمة
		Pardon	The prisoner was granted a pardon after serving the sentence. تم منح العفو للمسجون بعد قضاء فترة العقوبة.

Annotated words are bolded.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Aldawsari, M.; Kolhar, M.; Dawood Omer, O.S. Within-Document Arabic Event Coreference: Challenges, Datasets, Approaches and Future Direction. Appl. Sci. 2023, 13, 11004. https://doi.org/10.3390/app131911004

AMA Style

Aldawsari M, Kolhar M, Dawood Omer OS. Within-Document Arabic Event Coreference: Challenges, Datasets, Approaches and Future Direction. Applied Sciences. 2023; 13(19):11004. https://doi.org/10.3390/app131911004

Chicago/Turabian Style

Aldawsari, Mohammed, Manjur Kolhar, and Omer Salih Dawood Omer. 2023. "Within-Document Arabic Event Coreference: Challenges, Datasets, Approaches and Future Direction" Applied Sciences 13, no. 19: 11004. https://doi.org/10.3390/app131911004

APA Style

Aldawsari, M., Kolhar, M., & Dawood Omer, O. S. (2023). Within-Document Arabic Event Coreference: Challenges, Datasets, Approaches and Future Direction. Applied Sciences, 13(19), 11004. https://doi.org/10.3390/app131911004

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Within-Document Arabic Event Coreference: Challenges, Datasets, Approaches and Future Direction

Abstract

1. Introduction

Challenges in Arabic Event Coreference

2. Various Approaches and Data

2.1. Data

2.2. Approaches

3. Proposed Arabic Event Coreference Annotation

3.1. Event Trigger Annotation

3.2. Event Coreference Annotation

4. Evaluation Metrics

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI