The Graph Reasoning Approach Based on the Dynamic Knowledge Auxiliary for Complex Fact Verification
Abstract
:1. Introduction
- According to the given claim, the retrieval module will retrieve documents and sentences relative to the claim.
- The auxiliary knowledge module predicts a key span in the retrieved evidence.
- Retrieve knowledge related to the claim and evidence from the external knowledge resources, such as the ConceptNet [26] and large-scale text corpora [27]. For our paper, we adopt four external knowledge sources: ConceptNet [26], WordNet subset (used in [10]), OMCS (Open Mind Common Sense subset), and ARC (AI2 Reasoning Challenge dataset) [27], where both structured and unstructured knowledge resources are included.
- Based on the above steps, predict knowledge gaps and fill the gap with external knowledge.
- Construct the collaborative graph with external knowledge and context information.
- Reason on the collaborative graph and label the given claim as “supported”, “refused”, or “not enough information”.
- This paper is the first attempt to introduce a joint knowledge-driven and data-driven mechanism into fact verification, and verify the effectiveness of the approach in a small sample and heterogeneous web text source, which will provide an important reference for further research.
2. Related Work
2.1. Pre-Training Language Processing and Background Knowledge for FV
2.2. Graph Neural Network for FV
3. Methodology
3.1. External Knowledge for Retrieved Evidence
3.1.1. Identifying the Knowledge Gaps
3.1.2. Evidence Relevance
3.1.3. Filling the Gap
- The module predicts the relation between the fact span and the 4 evidential sentences information representations for each evidential sentence , as shown in Equations (3) and (4):
- Combine the predicted relation with evidence to score the additional knowledge by the feedforward neural network
3.2. Constructing the Entity Graph
3.3. Claim Verification with the GNN Reasoning Method
4. Experiments
4.1. Experimental Setting
4.1.1. Dataset
4.1.2. Baselines
4.1.3. Evaluation Metrics
4.1.4. Data Processing
4.2. Performance
4.3. Further Expeeriments of FV System on Diverse Web Information
- Accuracy: Accuracy is simply a ratio of correct predictions to the total number of predictions.
- Precision: Precision is the ratio of true positives to the total predicted positive observations.
- Recall: Recall is the ratio of true positives to actual positives.
- F1: F1 is the harmonic mean of precision and recall.
5. Limitation of the Study
6. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Hanselowski, A.; Zhang, H.; Li, Z.; Sorokin, D.; Schiller, B.; Schulz, C.; Gurevych, I. Ukp-athene: Multi-sentence textual entailment for claim verification. arXiv 2018, arXiv:1809.01479. [Google Scholar]
- Yoneda, T.; Mitchell, J.; Welbl, J.; Stenetorp, P.; Riedel, S. Ucl machine reading group: Four factor framework for fact finding (hexaf). In Proceedings of the First Workshop on Fact Extraction and VERification (FEVER), Brussels, Belgium, 1 November 2018; pp. 97–102. [Google Scholar]
- Nie, Y.; Chen, H.; Bansal, M. Combining fact extraction and verification with neural semantic matching networks. In Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA, 27 January–1 February 2019; pp. 6859–6866. [Google Scholar]
- Zhou, J.; Han, X.; Yang, C.; Liu, Z.; Wang, L.; Li, C.; Sun, M. GEAR: Graph-based evidence aggregating and reasoning for fact verification. arXiv 2019, arXiv:1908.01843. [Google Scholar]
- Liu, Z.; Xiong, C.; Sun, M.; Liu, Z. Fine-grained Fact Verification with Kernel Graph Attention Network. arXiv 2019, arXiv:1910.09796. [Google Scholar]
- Thorne, J.; Vlachos, A.; Christodoulopoulos, C.; Mittal, A. Fever: A large-scale dataset for fact extraction and verification. arXiv 2018, arXiv:1803.05355. [Google Scholar]
- Yang, Z.; Qi, P.; Zhang, S.; Bengio, Y.; Cohen, W.W.; Salakhutdinov, R.; Manning, C.D. Hotpotqa: A dataset for diverse, explainable multi-hop question answering. arXiv 2018, arXiv:1809.09600. [Google Scholar]
- Khashabi, D.; Chaturvedi, S.; Roth, M.; Upadhyay, S.; Roth, D. Looking beyond the surface: A challenge set for reading comprehension over multiple sentences. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, New Orleans, LA, USA, 1–6 June 2018; Volume 1 (Long Papers), pp. 252–262. [Google Scholar]
- Welbl, J.; Stenetorp, P.; Riedel, S. Constructing datasets for multi-hop reading comprehension across documents. Trans. Assoc. Comput. Linguist. 2018, 6, 287–302. [Google Scholar] [CrossRef] [Green Version]
- Mihaylov, T.; Clark, P.; Khot, T.; Sabharwal, A. Can a suit of armor conduct electricity? a new dataset for open book question answering. arXiv 2018, arXiv:1809.02789. [Google Scholar]
- Pîrtoacă, G.-S.; Rebedea, T.; Ruseti, S. Answering questions by learning to rank--Learning to rank by answering questions. arXiv 2019, arXiv:1909.00596. [Google Scholar]
- Banerjee, P.; Pal, K.K.; Mitra, A.; Baral, C. Careful selection of knowledge to solve open book question answering. arXiv 2019, arXiv:1907.10738. [Google Scholar]
- Mitra, A.; Banerjee, P.; Pal, K.K.; Mishra, S.; Baral, C. Exploring ways to incorporate additional knowledge to improve natural language commonsense question answering. arXiv 2019, arXiv:1909.08855. [Google Scholar]
- Khot, T.; Sabharwal, A.; Clark, P. What’s Missing: A Knowledge Gap Guided Approach for Multi-hop Question Answering. In Proceedings of the International Joint Conference on Natural Language Processing, Hong Kong, China, 3–7 November 2019; pp. 2814–2828. [Google Scholar]
- Mihaylov, T.; Frank, A. Knowledgeable Reader: Enhancing Cloze-Style Reading Comprehension with External Commonsense Knowledge. In Proceedings of the Meeting of the Association for Computational Linguistics, Melbourne, Australia, 15–20 July 2018; pp. 821–832. [Google Scholar]
- Chen, Q.; Zhu, X.; Ling, Z.; Inkpen, D.; Wei, S. Neural Natural Language Inference Models Enhanced with External Knowledge. In Proceedings of the Meeting of the Association for Computational Linguistics, Melbourne, Australia, 15–20 July 2018; pp. 2406–2417. [Google Scholar]
- Yang, A.; Wang, Q.; Liu, J.; Liu, K.; Lyu, Y.; Wu, H.; She, Q.; Li, S. Enhancing pre-trained language representations with rich knowledge for machine reading comprehension. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 28 July–2 August 2019; pp. 2346–2357. [Google Scholar]
- Wang, C.; Jiang, H. Explicit utilization of general knowledge in machine reading comprehension. arXiv 2018, arXiv:1809.03449. [Google Scholar]
- Jansen, P.; Ustalov, D. TextGraphs 2019 shared task on multi-hop inference for explanation regeneration. In Proceedings of the Thirteenth Workshop on Graph-Based Methods for Natural Language Processing (TextGraphs-13), Hong Kong, China, 4 November 2019; pp. 63–77. [Google Scholar]
- Banerjee, P. ASU at TextGraphs 2019 shared task: Explanation ReGeneration using language models and iterative re-ranking. arXiv 2019, arXiv:1909.08863. [Google Scholar]
- Kadlec, R.; Schmid, M.; Bajgar, O.; Kleindienst, J. Text Understanding with the Attention Sum Reader Network. arXiv 2016, arXiv:1603.01547. [Google Scholar]
- Weissenborn, D.; Kocisky, T.; Dyer, C. Dynamic Integration of Background Knowledge in Neural NLU Systems. arXiv 2018, arXiv:1706.02596. [Google Scholar]
- Bollacker, K.; Evans, C.; Paritosh, P.; Sturge, T.; Taylor, J. Freebase: A collaboratively created graph database for structuring human knowledge. In Proceedings of the International Conference on Management of Data, Vancouver, BC, Canada, 10–12 June 2008; pp. 1247–1250. [Google Scholar]
- Krishnamurthy, J.; Dasigi, P.; Gardner, M. Neural Semantic Parsing with Type Constraints for Semi-Structured Tables. In Proceedings of the Empirical Methods in Natural Language Processing, Copenhagen, Denmark, 9–11 September 2017; pp. 1516–1526. [Google Scholar]
- Petrochuk, M.; Zettlemoyer, L. SimpleQuestions Nearly Solved: A New Upperbound and Baseline Approach. In Proceedings of the Empirical Methods in Natural Language Processing, Brussels, Belgium, 31 October–4 November 2018; pp. 554–558. [Google Scholar]
- Speer, R.; Chin, J.; Havasi, C. ConceptNet 5.5: An Open Multilingual Graph of General Knowledge. In Proceedings of the National Conference on Artificial Intelligence, Copenhagen, Denmark, 9–11 September 2017; pp. 4444–4451. [Google Scholar]
- Clark, P.; Cowhey, I.; Etzioni, O.; Khot, T.; Sabharwal, A.; Schoenick, C.; Tafjord, O. Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge. arXiv 2018, arXiv:1803.05457. [Google Scholar]
- Tu, M.; Huang, J.; He, X.; Zhou, B. Graph sequential network for reasoning over sequences. arXiv 2020, arXiv:2004.02001. [Google Scholar]
- Rajpurkar, P.; Zhang, J.; Lopyrev, K.; Liang, P. Squad: 100,000+ questions for machine comprehension of text. arXiv 2016, arXiv:1606.05250. [Google Scholar]
- Rajpurkar, P.; Jia, R.; Liang, P. Know what you don’t know: Unanswerable questions for SQuAD. arXiv 2018, arXiv:1806.03822. [Google Scholar]
- Joshi, M.; Choi, E.; Weld, D.S.; Zettlemoyer, L. Triviaqa: A large scale distantly supervised challenge dataset for reading comprehension. arXiv 2018, arXiv:1705.03551. [Google Scholar]
- Nguyen, T.; Rosenberg, M.; Song, X.; Gao, J.; Tiwary, S.; Majumder, R.; Deng, L. Ms marco: A human-generated machine reading comprehension dataset. arXiv 2016, arXiv:1611.09268. [Google Scholar]
- Devlin, J.; Chang, M.-W.; Lee, K.; Toutanova, K. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv 2018, arXiv:1810.04805. [Google Scholar]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. In Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; pp. 5998–6008. [Google Scholar]
- Goldberg, Y. Assessing BERT’s syntactic abilities. arXiv 2019, arXiv:1901.05287. [Google Scholar]
- Peters, M.E.; Neumann, M.; Zettlemoyer, L.; Yih, W.-T. Dissecting contextual word embeddings: Architecture and representation. arXiv 2018, arXiv:1808.08949. [Google Scholar]
- Chen, D.; Bolton, J.; Manning, C.D. A thorough examination of the cnn/daily mail reading comprehension task. arXiv 2016, arXiv:1606.02858. [Google Scholar]
- Bauer, L.; Wang, Y.; Bansal, M. Commonsense for generative multi-hop question answering tasks. arXiv 2018, arXiv:1809.06309. [Google Scholar]
- Zhong, W.; Tang, D.; Duan, N.; Zhou, M.; Wang, J.; Yin, J. Improving question answering by commonsense-based pre-training. In Proceedings of the CCF International Conference on Natural Language Processing and Chinese Computing, Dunhuang, China, 9–14 October 2019; pp. 16–28. [Google Scholar]
- Seo, M.; Kembhavi, A.; Farhadi, A.; Hajishirzi, H. Bidirectional attention flow for machine comprehension. arXiv 2016, arXiv:1611.01603. [Google Scholar]
- Manning, C.D.; Surdeanu, M.; Bauer, J.; Finkel, J.R.; Bethard, S.; Mcclosky, D. The Stanford CoreNLP Natural Language Processing Toolkit. In Proceedings of the Meeting of the Association for Computational Linguistics, Baltimore, MD, USA, 22–27 June 2014; pp. 55–60. [Google Scholar]
- Qiu, L.; Xiao, Y.; Qu, Y.; Zhou, H.; Li, L.; Zhang, W.; Yu, Y. Dynamically fused graph network for multi-hop reasoning. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 28 July–2 August 2019; pp. 6140–6150. [Google Scholar]
- Yang, Z.; Yang, D.; Dyer, C.; He, X.; Smola, A.; Hovy, E. Hierarchical attention networks for document classification. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human language Technologies, San Diego, CA, USA, 12–17 June 2016; pp. 1480–1489. [Google Scholar]
- Qian, F.; Gong, C.; Sharma, K.; Liu, Y. Neural User Response Generator: Fake News Detection with Collective User Intelligence. In Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence Main track (IJCAI), Stockholm, Sweden, 13–19 July 2018; pp. 3834–3840. [Google Scholar]
- Guo, H.; Cao, J.; Zhang, Y.; Guo, J.; Li, J. Rumor detection with hierarchical social attention network. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management, Turin, Italy, 22–26 October 2018; pp. 943–951. [Google Scholar]
- Ruchansky, N.; Seo, S.; Liu, Y. Csi: A hybrid deep model for fake news detection. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, Singapore, 6–10 November 2017; pp. 797–806. [Google Scholar]
- Shu, K.; Cui, L.; Wang, S.; Lee, D.; Liu, H. defend: Explainable fake news detection. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA, 4–8 August 2019; pp. 395–405. [Google Scholar]
- Han, Y.; Karunasekera, S.; Leckie, C. Graph Neural Networks with Continual Learning for Fake News Detection from Social Media. arXiv 2020, arXiv:2007.03316. [Google Scholar]
Model | LA (%) | FEVER Score (%) |
---|---|---|
Athene [1] | 68.49 | 64.74 |
UCL MRG [2] | 69.66 | 65.41 |
UNC NLP [3] | 69.72 | 66.49 |
GEAR [4] | 74.84 | 70.69 |
KGAT [5] | 78.02 | 75.86 |
DKAR | 79.64 | 77.12 |
Supported (G) | Refused (G) | Not Enough Information (G) | Total | |
---|---|---|---|---|
Supported (P) | 5942 (↑8%) | 625 | 398 | 6965 |
Refused (P) | 957 | 4397 (↑4.6%) | 1031 | 6385 |
Not enough information (P) | 561 (↓ 9.7%) | 499 (↓ 3.6%) | 5588 (↑17%) | 6648 |
total | 7460 | 5521 | 7017 | 19,998 |
Dataset | Metrics | 100 | 500 | 1000 | 1500 |
---|---|---|---|---|---|
PolitiFact | Accuracy | 0.879 | 0.892 | 0.903 | 0.908 |
Precision | 0.897 | 0.913 | 0.921 | 0.925 | |
Recall | 0.891 | 0.909 | 0.910 | 0.913 | |
F1 | 0.895 | 0.903 | 0.915 | 0.919 | |
GossipCop | Accuracy | 0.890 | 0.896 | 0.900 | 0.903 |
Precision | 0.906 | 0.910 | 0.915 | 0.916 | |
Recall | 0.898 | 0.901 | 0.905 | 0.907 | |
F1 | 0.903 | 0.906 | 0.913 | 0.914 |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, Y.; Xia, C.; Si, C.; Zhang, C.; Wang, T. The Graph Reasoning Approach Based on the Dynamic Knowledge Auxiliary for Complex Fact Verification. Electronics 2020, 9, 1472. https://doi.org/10.3390/electronics9091472
Wang Y, Xia C, Si C, Zhang C, Wang T. The Graph Reasoning Approach Based on the Dynamic Knowledge Auxiliary for Complex Fact Verification. Electronics. 2020; 9(9):1472. https://doi.org/10.3390/electronics9091472
Chicago/Turabian StyleWang, Yongyue, Chunhe Xia, Chengxiang Si, Chongyu Zhang, and Tianbo Wang. 2020. "The Graph Reasoning Approach Based on the Dynamic Knowledge Auxiliary for Complex Fact Verification" Electronics 9, no. 9: 1472. https://doi.org/10.3390/electronics9091472
APA StyleWang, Y., Xia, C., Si, C., Zhang, C., & Wang, T. (2020). The Graph Reasoning Approach Based on the Dynamic Knowledge Auxiliary for Complex Fact Verification. Electronics, 9(9), 1472. https://doi.org/10.3390/electronics9091472