Event Argument Extraction for Rainstorm Disasters Based on Social Media: A Case Study of the 2021 Heavy Rains in Henan
Abstract
:1. Introduction
- In view of the importance of rainstorm disaster event information extraction for subsequent natural language processing tasks, this paper generates a rainstorm event extraction (RainEE) dataset based on rainstorm disaster texts on Weibo.
- In order to enhance the perception of trigger words in event text and improve the accuracy and completeness of event argument extraction, a BERT-based question and answer network (ETEN_BERT_QA) is proposed with an event text enhancement network in the coding section.
- To verify the superiority of our proposed network, we conduct experiments on the RainEE dataset, the DuEE dataset, and the 20 July 2021 Henan rainstorm case. The results show that our proposed network outperforms the baseline model in both accuracy and number of event extractions.
2. Materials and Methods
2.1. Materials
2.1.1. Production of RainEE
2.1.2. Existing Event Extraction Datasets
2.2. Methods
2.2.1. Optimizing the BERT_QA Model by Adding Event Text Augmentation Block
2.2.2. Event Text Enhancement Network
- Event text construction: For the original event text , the text is vectorized by the event content in the form of the following:Among them, represents the length after vectorization.
- Trigger word representation: The trigger word form obtained from the event detection subtask is as follows:
- Enhanced representation construction: In order to improve the model’s ability to understand event text, we adopted the strategy of adding special separators ‘[SEP]’ before and after the trigger word . Specifically, we construct the enhanced text , which is in the following form:
2.2.3. Event Extraction Evaluation Metrics
- Precision measures the proportion of correctly identified positive samples out of all samples predicted as positive. It is calculated as follows:
- Recall, also known as sensitivity, indicates the proportion of actual positive samples that are correctly predicted by the model. It is calculated as follows:
- The F1 score is the harmonic mean of precision and recall, providing a single metric to evaluate the model’s performance. It is calculated as follows:
3. Experiment and Result
3.1. Experiment
3.2. Result
3.2.1. Accuracy Results for Event Argument Extraction
3.2.2. Quantitative Results and Specific Examples of Event Argument Extraction
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Lei, H. Climate Change and Extreme Weather Events in China in 2007. In The China Environment Yearbook; Brill: Leiden, The Netherlands, 2009. [Google Scholar]
- Khan, S.M.; Shafi, I.; Butt, W.H.; Diez, I.D.L.T.; Flores, M.A.L.; Galán, J.C.; Ashraf, I. A Systematic Review of Disaster Management Systems: Approaches, Challenges, and Future Directions. Land 2023, 12, 1514. [Google Scholar] [CrossRef]
- Mavrodieva, A.V.; Shaw, R. Social Media in Disaster Management. In Media and Disaster Risk Reduction: Advances, Challenges and Potentials; Shaw, R., Kakuchi, S., Yamaji, M., Eds.; Springer: Singapore, 2021; pp. 55–73. ISBN 9789811602856. [Google Scholar]
- Wu, K.; Wu, J.; Ding, W.; Tang, R. Extracting Disaster Information Based on Sina Weibo in China: A Case Study of the 2019 Typhoon Lekima. Int. J. Disaster Risk Reduct. 2021, 60, 102304. [Google Scholar] [CrossRef]
- Dong, Z.S.; Meng, L.; Christenson, L.; Fulton, L. Social Media Information Sharing for Natural Disaster Response. Nat. Hazards 2021, 107, 2077–2104. [Google Scholar] [CrossRef]
- Liu, J.; Min, L.; Huang, X. An Overview of Event Extraction and Its Applications. arXiv 2021, arXiv:2111.03212. [Google Scholar] [CrossRef]
- Panem, S.; Gupta, M.; Varma, V. Structured Information Extraction from Natural Disaster Events on Twitter. In Proceedings of the 5th International Workshop on Web-Scale Knowledge Representation Retrieval & Reasoning, Shanghai, China, 3 November 2014; ACM: New York, NY, USA, 2014; pp. 1–8. [Google Scholar]
- Edouard, A.; Cabrio, E.; Tonelli, S.; Le Thanh, N. Graph-Based Event Extraction from Twitter. In Proceedings of the RANLP17-Recent Advances in Natural Language Processing, Varna, Bulgaria, 2–8 September 2017. [Google Scholar]
- Liu, K.; Chen, Y.; Liu, J.; Zuo, X.; Zhao, J. Extracting Events and Their Relations from Texts: A Survey on Recent Research Progress and Challenges. AI Open 2020, 1, 22–39. [Google Scholar] [CrossRef]
- Sardianos, C.; Katakis, I.M.; Petasis, G.; Karkaletsis, V. Argument Extraction from News. In Proceedings of the 2nd Workshop on Argumentation Mining, Denver, CO, USA, 4 June 2015; Cardie, C., Ed.; Association for Computational Linguistics: Denver, CO, USA, 2015; pp. 56–66. [Google Scholar]
- Chen, Y.; Xu, L.; Liu, K.; Zeng, D.; Zhao, J. Event Extraction via Dynamic Multi-Pooling Convolutional Neural Networks. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Beijing, China, 26–31 July 2015; Zong, C., Strube, M., Eds.; Association for Computational Linguistics: Beijing, China, 2015; pp. 167–176. [Google Scholar]
- Nguyen, T.H.; Cho, K.; Grishman, R. Joint Event Extraction via Recurrent Neural Networks. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, CA, USA, 12–17 June 2016; Knight, K., Nenkova, A., Rambow, O., Eds.; Association for Computational Linguistics: San Diego, CA, USA, 2016; pp. 300–309. [Google Scholar]
- Li, Q.; Ji, H.; Huang, L. Joint Event Extraction via Structured Prediction with Global Features. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Sofia, Bulgaria, 4–9 August 2013; Schuetze, H., Fung, P., Poesio, M., Eds.; Association for Computational Linguistics: Sofia, Bulgaria, 2013; pp. 73–82. [Google Scholar]
- Ahn, D. The Stages of Event Extraction. In Proceedings of the Workshop on Annotating and Reasoning about Time and Events, Sydney, Australia, 23 July 2006; Boguraev, B., Muñoz, R., Pustejovsky, J., Eds.; Association for Computational Linguistics: Sydney, Australia, 2006; pp. 1–8. [Google Scholar]
- Chen, Y.; Liu, S.; Zhang, X.; Liu, K.; Zhao, J. Automatically Labeled Data Generation for Large Scale Event Extraction. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Vancouver, BC, Canada, 30 July–4 August 2017; Barzilay, R., Kan, M.-Y., Eds.; Association for Computational Linguistics: Vancouver, BC, Canada, 2017; pp. 409–419. [Google Scholar]
- Liu, J.; Chen, Y.; Liu, K.; Zhao, J. Event Detection via Gated Multilingual Attention Mechanism. Proc. AAAI Conf. Artif. Intell. 2018, 32. [Google Scholar] [CrossRef]
- Huang, L.; Ji, H.; Cho, K.; Dagan, I.; Riedel, S.; Voss, C. Zero-Shot Transfer Learning for Event Extraction. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Melbourne, Australia, 15–20 July 2018; Gurevych, I., Miyao, Y., Eds.; Association for Computational Linguistics: Melbourne, Australia, 2018; pp. 2160–2170. [Google Scholar]
- Chen, D.; Bolton, J.; Manning, C.D. A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Berlin, Germany, 7–12 August 2016; Erk, K., Smith, N.A., Eds.; Association for Computational Linguistics: Berlin, Germany, 2016; pp. 2358–2367. [Google Scholar]
- Devlin, J.; Chang, M.-W.; Lee, K.; Toutanova, K. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, MN, USA, 2–7 June 2019; Burstein, J., Doran, C., Solorio, T., Eds.; Association for Computational Linguistics: Minneapolis, MN, USA, 2019; pp. 4171–4186. [Google Scholar]
- Feng, S.Y.; Gangal, V.; Wei, J.; Chandar, S.; Vosoughi, S.; Mitamura, T.; Hovy, E. A Survey of Data Augmentation Approaches for NLP. arXiv 2021, arXiv:2105.03075. [Google Scholar]
- Yuan, X.; Wang, T.; Gulcehre, C.; Sordoni, A.; Bachman, P.; Zhang, S.; Subramanian, S.; Trischler, A. Machine Comprehension by Text-to-Text Neural Question Generation. In Proceedings of the 2nd Workshop on Representation Learning for NLP, Vancouver, BC, Canada, 3 August 2017; Blunsom, P., Bordes, A., Cho, K., Cohen, S., Dyer, C., Grefenstette, E., Hermann, K.M., Rimell, L., Weston, J., Yih, S., Eds.; Association for Computational Linguistics: Vancouver, BC, Canada, 2017; pp. 15–25. [Google Scholar]
- Duan, N.; Tang, D.; Chen, P.; Zhou, M. Question Generation for Question Answering. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark, 9–11 September 2017; Palmer, M., Hwa, R., Riedel, S., Eds.; Association for Computational Linguistics: Copenhagen, Denmark, 2017; pp. 866–874. [Google Scholar]
- Elsahar, H.; Gravier, C.; Laforest, F. Zero-Shot Question Generation from Knowledge Graphs for Unseen Predicates and Entity Types. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), New Orleans, LA, USA, 1–6 June 2018; Walker, M., Ji, H., Stent, A., Eds.; Association for Computational Linguistics: New Orleans, LA, USA, 2018; pp. 218–228. [Google Scholar]
- Gao, S.; Sethi, A.; Agarwal, S.; Chung, T.; Hakkani-Tur, D. Dialog State Tracking: A Neural Reading Comprehension Approach. In Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, Stockholm, Sweden, 11–13 September 2019; Nakamura, S., Gasic, M., Zukerman, I., Skantze, G., Nakano, M., Papangelis, A., Ultes, S., Yoshino, K., Eds.; Association for Computational Linguistics: Stockholm, Sweden, 2019; pp. 264–273. [Google Scholar]
- Li, X.; Yin, F.; Sun, Z.; Li, X.; Yuan, A.; Chai, D.; Zhou, M.; Li, J. Entity-Relation Extraction as Multi-Turn Question Answering. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 28 July–2 August 2019; Korhonen, A., Traum, D., Màrquez, L., Eds.; Association for Computational Linguistics: Florence, Italy, 2019; pp. 1340–1350. [Google Scholar]
- Levy, O.; Seo, M.; Choi, E.; Zettlemoyer, L. Zero-Shot Relation Extraction via Reading Comprehension. In Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), Vancouver, BC, Canada, 3–4 August 2017; Levy, R., Specia, L., Eds.; Association for Computational Linguistics: Vancouver, BC, Canada, 2017; pp. 333–342. [Google Scholar]
- Li, X.; Li, F.; Pan, L.; Chen, Y.; Peng, W.; Wang, Q.; Lyu, Y.; Zhu, Y. DuEE: A Large-Scale Dataset for Chinese Event Extraction in Real-World Scenarios. In Proceedings of the Natural Language Processing and Chinese Computing, Hangzhou, China, 1–3 November 2024; Zhu, X., Zhang, M., Hong, Y., He, R., Eds.; Springer International Publishing: Cham, Switzerland, 2020; pp. 534–545. [Google Scholar]
- Water Industry Standard of the People’s Republic of China (SL 579-2012): Flood Disaster Evaluation Criteria (Chinese Edition) by Zhong Hua Ren Min Gong He Guo Shui Li Bu: New Paperback|Liu Xing. Available online: https://www.abebooks.com/Water-industry-standard-Peoples-Republic-China/14934923043/bd (accessed on 11 October 2024).
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention Is All You Need. arXiv 2023, arXiv:1706.03762. [Google Scholar]
- Liu, Y.; Ott, M.; Goyal, N.; Du, J.; Joshi, M.; Chen, D.; Levy, O.; Lewis, M.; Zettlemoyer, L.; Stoyanov, V. RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv 2019, arXiv:1907.11692. [Google Scholar]
- Rajpurkar, P.; Jia, R.; Liang, P. Know What You Don’t Know: Unanswerable Questions for SQuAD. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Melbourne, Australia, 15–20 July 2018; Gurevych, I., Miyao, Y., Eds.; Association for Computational Linguistics: Melbourne, Australia, 2018; pp. 784–789. [Google Scholar]
- Yang, Z.; Dai, Z.; Yang, Y.; Carbonell, J.; Salakhutdinov, R.; Le, Q.V. XLNet: Generalized Autoregressive Pretraining for Language Understanding. arXiv 2020, arXiv:1906.08237. [Google Scholar]
- Sun, Y.; Wang, S.; Li, Y.; Feng, S.; Chen, X.; Zhang, H.; Tian, X.; Zhu, D.; Tian, H.; Wu, H. ERNIE: Enhanced Representation through Knowledge Integration. arXiv 2019, arXiv:1904.09223. [Google Scholar] [CrossRef]
- Du, X.; Cardie, C. Event Extraction by Answering (Almost) Natural Questions. arXiv 2021, arXiv:2004.13625. Available online: https://arxiv.org/pdf/2004.13625 (accessed on 17 October 2024).
Event Types | Argument Roles |
---|---|
Waterlogging | Time, location, depth of water accumulation |
Vegetation damage | Time, location, vegetation name |
Transportation | Time, location, transportation name |
Rescue operations | Time, location, rescuer, rescued, supplies/equipment |
Personal safety | Time, location, victims, number of missing persons, number of dead, number of injured, number of trapped persons, number of disaster victims |
Landslides | Time, location, landslide subject |
Building collapse/damage | Time, location, subject of collapse/damage |
Dataset | Training Set | Test Set | Validation Set |
---|---|---|---|
DuEE | 11,958 | 35,000 | 1498 |
RainEE | 1021 | / | 411 |
Network | Recall (%) | Precision (%) | F1 (%) |
---|---|---|---|
BERT_QA | 72.7 | 73.9 | 73.3 |
ETEN_BERT_QA | 76.9 | 75.7 | 76.3 |
Network | Recall (%) | Precision (%) | F1 (%) |
---|---|---|---|
BERT_QA | 82.7 | 69.0 | 75.2 |
ETEN_BERT_QA | 79.3 | 72.4 | 75.7 |
Network | Total Number of Items | Number of Extractions |
---|---|---|
BERT_QA | 30,475 | 25,369 |
ETEN_BERT_QA | 28,261 |
Network | Text | Event Argument Extraction |
---|---|---|
BERT_QA | There were multiple demonstrations in Paris on the 21st, local time, with preliminary statistics showing more than 10,000 people participating. | Organizational Behavior_Parade_Location: Paris |
ETEN_BERT_QA | Organizational Behavior_Parade_Time: the 21st local time Organizational Behavior_Parade_Location: Paris Organizational behavior_Parade_number of marchers: more than 10,000 people |
Network | Total Number of Items | Number of Extractions |
---|---|---|
BERT_QA | 6000 | 4713 |
ETEN_BERT_QA | 5461 |
Network | Text | Event Argument Extraction |
---|---|---|
BERT_QA | More than one hundred people were trapped at the entrance of the Zhengzhou Confucian Temple Station. | Personal Safety_Location: the entrance of Zhengzhou Confucian Temple Station |
ETEN_BERT_QA | Personal Safety_number of trapped persons: More than one hundred people Personal Safety_Location: the entrance of Zhengzhou Confucian Temple Station |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
He, Y.; Yang, B.; He, H.; Fei, X.; Fan, X.; Liu, J. Event Argument Extraction for Rainstorm Disasters Based on Social Media: A Case Study of the 2021 Heavy Rains in Henan. Water 2024, 16, 3535. https://doi.org/10.3390/w16233535
He Y, Yang B, He H, Fei X, Fan X, Liu J. Event Argument Extraction for Rainstorm Disasters Based on Social Media: A Case Study of the 2021 Heavy Rains in Henan. Water. 2024; 16(23):3535. https://doi.org/10.3390/w16233535
Chicago/Turabian StyleHe, Yun, Banghui Yang, Haixia He, Xianyun Fei, Xiangtao Fan, and Jian Liu. 2024. "Event Argument Extraction for Rainstorm Disasters Based on Social Media: A Case Study of the 2021 Heavy Rains in Henan" Water 16, no. 23: 3535. https://doi.org/10.3390/w16233535
APA StyleHe, Y., Yang, B., He, H., Fei, X., Fan, X., & Liu, J. (2024). Event Argument Extraction for Rainstorm Disasters Based on Social Media: A Case Study of the 2021 Heavy Rains in Henan. Water, 16(23), 3535. https://doi.org/10.3390/w16233535