Toward Knowledge-Enhanced Geohazard Intelligence: A Review of Knowledge Graphs and Large Language Models
Abstract
1. Introduction
- To trace the historical evolution of knowledge-guided geohazard research from early expert systems to contemporary KG- and LLM-based frameworks;
- To synthesize how ontologies and knowledge graphs structure, represent, and operationalize geohazard knowledge, supporting semantic interoperability and causal reasoning;
- To analyze how machine learning and large language models integrate with domain knowledge to enhance interpretability, adaptability, and decision-making in geohazard modeling.
2. Bibliometric Analysis of Knowledge-Guided Approaches to Geohazards
2.1. Search Strategy and Literature Selection

2.2. Bibliometric Analysis Results
2.2.1. Publication Trends and Evolution
2.2.2. Thematic Evolution and Research Focus
3. Knowledge Representation in Geohazard
3.1. Evolution of Knowledge Representation in Geohazard Research
3.2. Comparative Review: Ontologies vs. Knowledge Graphs
3.3. Advances in Knowledge Reasoning and Fusion
4. Knowledge-Guided and Hybrid Modeling Approaches in Geohazards
4.1. Evolution from Data-Driven to Knowledge-Guided Modeling
4.2. Constraint-Based Knowledge Injection: Physics- and Rule-Constrained Learning
4.3. Structure- and Reasoning-Based Hybrid Frameworks
5. Cognitive and Generative Knowledge Systems in Geohazard
5.1. Cognitive Transformation of Geohazard Knowledge
5.2. Large Language Models and Generative Cognition in Geological Reasoning
5.3. Toward Adaptive Knowledge Evolution and Integrated Cognitive Systems
6. Conclusions and Future Perspectives
6.1. Current Gaps
6.2. Near-Term Opportunities
6.3. Long-Term Vision
6.4. Limitations and Ethical Considerations
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Froude, M.J.; Petley, D.N. Global fatal landslide occurrence from 2004 to 2016. Nat. Hazards Earth Syst. Sci. 2018, 18, 2161–2181. [Google Scholar] [CrossRef]
- Zhou, Y.; Zhang, L.; Zhang, A.; Wang, J. Geoscience Big Data Mining and Machine Learning; Sun Yat-Sen University Press: Guangzhou, China, 2018. [Google Scholar]
- Zhou, Y.; Zuo, R.; Liu, G.; Yuan, F.; Mao, X.; Guo, Y.; Xiao, F.; Liao, J.; Liu, Y. A decade of advances in mathematical geoscience: Big data and artificial intelligence are reshaping geology. Bull. Mineral. Petrol. Geochem. 2021, 40, 556–573. [Google Scholar]
- Kirschbaum, D.; Stanley, T.; Zhou, Y. Spatial and temporal analysis of a global landslide catalog. Geomorphology 2020, 249, 4–15. [Google Scholar] [CrossRef]
- Gariano, S.L.; Guzzetti, F. Landslides in a changing climate. Earth-Sci. Rev. 2016, 162, 227–252. [Google Scholar] [CrossRef]
- Emberson, R.; Kirschbaum, D.B.; Amatya, P.; Tanyas, H.; Marc, O. Insights from the topographic characteristics of a large global catalog of rainfall-induced landslide event inventories. Nat. Hazards Earth Syst. Sci. Discuss. 2022, 22, 1129–1149. [Google Scholar] [CrossRef]
- Zhou, Y.; Zuo, R. Application of Big Data Mining, Machine Learning and Artificial Intelligence in Ore Deposits; MDPI: Basel, Switzerland, 2025; 222p. [Google Scholar] [CrossRef]
- Gill, J.C.; Malamud, B.D. Anthropogenic processes, natural hazards, and interactions in a multi-hazard framework. Earth-Sci. Rev. 2017, 166, 246–269. [Google Scholar] [CrossRef]
- Reichenbach, P.; Rossi, M.; Malamud, B.D.; Mihir, M.; Guzzetti, F. A review of statistically-based landslide susceptibility models. Earth-Sci. Rev. 2018, 180, 60–91. [Google Scholar] [CrossRef]
- Merghadi, A.; Yunus, A.P.; Dou, J.; Whiteley, J.; ThaiPham, B.; Bui, D.T.; Avtar, R.; Abderrahmane, B. Machine learning methods for landslide susceptibility studies: A comparative overview of algorithm performance. Earth-Sci. Rev. 2020, 207, 103225. [Google Scholar] [CrossRef]
- Karpatne, A.; Atluri, G.; Faghmous, J.H.; Steinbach, M.; Banerjee, A.; Ganguly, A.; Shekhar, S.; Samatova, N.; Kumar, V. Theory-guided data science: A new paradigm for scientific discovery from data. IEEE Trans. Knowl. Data Eng. 2017, 29, 2318–2331. [Google Scholar] [CrossRef]
- Bergen, K.J.; Johnson, P.A.; de Hoop, M.V.; Beroza, G.C. Machine learning for data-driven discovery in solid Earth geoscience. Science 2019, 363, eaau0323. [Google Scholar] [CrossRef]
- Reichstein, M.; Camps-Valls, G.; Stevens, B.; Jung, M.; Denzler, J.; Carvalhais, N.; Prabhat, F. Deep learning and process understanding for data-driven Earth system science. Nature 2019, 566, 195–204. [Google Scholar] [CrossRef] [PubMed]
- Gil, Y.; Pierce, S.A.; Babaie, H.; Banerjee, A.; Borne, K.; Bust, G.; Cheatham, M.; Ebert-Uphoff, I.; Gomes, C.; Hill, M.; et al. Intelligent systems for geosciences: An essential research agenda. Commun. ACM 2019, 62, 76–84. [Google Scholar] [CrossRef]
- Kuhn, W. Geospatial semantics: Why, of what, and how? In Journal on Data Semantics III; Springer: Berlin/Heidelberg, Germany, 2005; pp. 1–24. [Google Scholar] [CrossRef]
- Janowicz, K.; Hitzler, P.; Adams, B.; Kolas, D.; Vardeman, C. Five stars of linked data vocabulary use. Semant. Web 2012, 5, 173–176. [Google Scholar] [CrossRef]
- Willard, J.; Jia, X.; Xu, S.; Steinbach, M.; Kumar, V. Integrating scientific knowledge with machine learning for engineering and environmental systems. ACM Comput. Surv. 2022, 55, 1–37. [Google Scholar] [CrossRef]
- Jia, X.; Willard, J.; Karpatne, A.; Read, J.S.; Zwart, J.A.; Steinbach, M.; Kumar, V. Physics-guided machine learning for scientific discovery: An application in simulating lake temperature profiles. ACM/IMS Trans. Data Sci. 2021, 2, 1–26. [Google Scholar] [CrossRef]
- Richard, S.M.; Model, C.D.; Testbed Working Group. GeoSciML-A GML Application for Geoscience Information Interchange. In Digital Mapping Techniques ’06; USGS Open File Report 2007-1285; U.S. Geological Survey: Reston, VA, USA, 2007. [Google Scholar]
- Qiu, Q.J.; Wu, L.; Ma, K.; Xie, Z.; Tao, L. A knowledge graph construction method for geohazard chain for disaster emergency response. Earth Sci. 2023, 48, 1875–1891. [Google Scholar]
- Zhang, Y.; Chen, Y.; Wang, J.; Pan, Z. Unsupervised deep anomaly detection for multi-sensor time-series signals. IEEE Trans. Knowl. Data Eng. 2022, 35, 2118–2132. [Google Scholar] [CrossRef]
- Bommasani, R.; Hudson, D.A.; Adeli, E.; Altman, R.; Arora, S.; von Arx, S.; Bernstein, M.S.; Bohg, J.; Bosselut, A.; Brunskill, E.; et al. On the opportunities and risks of foundation models. arXiv 2022, arXiv:2108.07258. [Google Scholar]
- Zhao, W.X.; Zhou, K.; Li, J.; Tang, T.; Wang, X.; Hou, Y.; Min, Y.; Zhang, B.; Zhang, J.; Dong, Z.; et al. A survey of large language models. arXiv 2023, arXiv:2303.18223. [Google Scholar]
- Raeissi, M.M.; Knapen, R. Applications of Generative Large Language Models in Environmental Science: A Systematic Review. Adv. Environ. Eng. Res. 2025, 6, 028. [Google Scholar] [CrossRef]
- Yang, L.; Chen, H.; Li, Z.; Ding, X.; Wu, X. Give us the facts: Enhancing large language models with knowledge graphs for fact-aware language modeling. arXiv 2024. [Google Scholar] [CrossRef]
- Ma, X. Knowledge Graph Construction and Application in Geosciences: A Review. Comput. Geosci. 2022, 161, 105082. [Google Scholar] [CrossRef]
- Zhao, T.; Wang, S.; Ouyang, C.; Chen, M.; Liu, C.; Zhang, J.; Yu, L.; Wang, F.; Xie, Y.; Li, J.; et al. Artificial intelligence for geoscience: Progress, challenges, and perspectives. Innovation 2024, 5, 100691. [Google Scholar] [CrossRef] [PubMed]
- Pranckutė, R. Web of Science (WoS) and Scopus: The Titans of Bibliographic Information in Today’s Academic World. Publications 2021, 9, 12. [Google Scholar] [CrossRef]
- Martín-Martín, A.; Orduna-Malea, E.; Thelwall, M.; Delgado López-Cózar, E. Google Scholar, Web of Science, and Scopus: A systematic comparison of citations in 252 subject categories. J. Informetr. 2018, 12, 1160–1177. [Google Scholar] [CrossRef]
- Carrara, A.; Cardinali, M.; Guzzetti, F.; Reichenbach, P. GIS techniques in mapping landslide hazard. In Geographical Information Systems in Assessing Natural Hazards; Carrara, A., Guzzetti, F., Eds.; Springer: Berlin/Heidelberg, Germany, 1991; pp. 135–175. [Google Scholar] [CrossRef]
- Zhou, C.H.; Lee, C.F.; Li, J.; Xu, Z.W. On the spatial relationship between landslides and causative factors on Lantau Island, Hong Kong. Geomorphology 2002, 43, 197–207. [Google Scholar] [CrossRef]
- Gruber, T.R. A translation approach to portable ontology specifications. Knowl. Acquis. 1993, 5, 199–220. [Google Scholar] [CrossRef]
- Guarino, N. Formal ontology and information systems. In Proceedings of the FOIS’98; IOS Press: Amsterdam, The Netherlands, 1998; pp. 3–15. [Google Scholar]
- Raskin, R.; Pan, M. Knowledge representation in the semantic web for Earth and environmental terminology (SWEET). Comput. Geosci. 2005, 31, 1119–1125. [Google Scholar] [CrossRef]
- SWEET Team/ESIPFed. SWEET: Semantic Web for Earth and Environment Technology. Available online: http://sweetontology.net/sweetAll (accessed on 1 June 2025).
- Phengsuwan, J.; Shah, T.; James, P.; Thakker, D.; Barr, S.; Ranjan, R. Ontology-based discovery of time-series data sources for landslide early warning system. Computing 2020, 102, 745–763. [Google Scholar] [CrossRef]
- Wen, M.; Qiu, Q.; Zheng, S.; Ma, K.; Zheng, S.; Xie, Z.; Tao, L. Construction and application of a multilevel geohazard domain ontology: A case study of landslide geohazards. Appl. Comput. Geosci. 2023, 20, 100134. [Google Scholar] [CrossRef]
- Hogan, A.; Blomqvist, E.; Cochez, M.; D’amato, C.; De Melo, G.; Gutierrez, C.; Kirrane, S.; Gayo, J.E.L.; Navigli, R.; Neumaier, S.; et al. Knowledge graphs. ACM Comput. Surv. 2021, 54, 1–37. [Google Scholar] [CrossRef]
- Sen, M.; Tim, D. GeoSciML: Development of a generic geoscience markup language. Comput. Geosci. 2005, 31, 1095–1103. [Google Scholar] [CrossRef]
- Qiu, Q.; Xie, Z.; Ma, K.; Tao, L.; Zheng, S. NeuroSPE: A neuro-net spatial relation extractor for natural language text fusing gazetteers and pre-trained models. Trans. GIS 2023, 27, 1485–1510. [Google Scholar] [CrossRef]
- Zhang, Q.; Zhou, Y.; Guo, L.; Yuan, Q.; Yu, P.; Wang, H.; Zhu, B.; Han, F.; Long, S. Intelligent applications of knowledge graphs in mineral exploration: A case study of the Qin–Hang metallogenic belt porphyry copper deposit. Earth Sci. Front. 2024, 31, 7–15. [Google Scholar] [CrossRef]
- Han, F.; Deng, Y.; Liu, Q.; Zhou, Y.; Wang, J.; Huang, Y.; Zhang, Q.; Bian, J. Construction and application of the knowledge graph method in management of soil pollution in contaminated sites: A case study in South China. J. Environ. Manag. 2022, 3019, 115685. [Google Scholar] [CrossRef]
- Han, Y.; Semnani, S.J. Integration of Physics-Based and Data-Driven Approaches for Landslide Susceptibility Assessment. Int. J. Numer. Anal. Methods Geomech. 2025, 49, 3060–3097. [Google Scholar] [CrossRef]
- Ji, J.; Zhou, Y.; Cheng, Q.; Jiang, S.; Liu, S. Landslide susceptibility mapping based on deep learning algorithms using. Land 2023, 12, 1125. [Google Scholar] [CrossRef]
- Qiu, Q.; Xie, Z.; Zhang, D.; Ma, K.; Tao, L.; Tan, Y.; Zhang, Z.; Jiang, B. Knowledge graph for identifying geological disasters by integrating computer vision with ontology. J. Earth Sci. 2023, 34, 1418–1432. [Google Scholar] [CrossRef]
- Ge, X.; Yang, Y.; Chen, J.; Li, W.; Huang, Z.; Zhang, W.; Peng, L. Disaster prediction knowledge graph based on multi-source spatio-temporal information. Remote Sens. 2022, 14, 1214. [Google Scholar] [CrossRef]
- Wu, Q.; Xie, Z.; Tian, M.; Qiu, Q.; Chen, J.; Tao, L.; Zhao, Y. Integrating Knowledge Graph and Machine Learning Methods for Landslide Susceptibility Assessment. Remote Sens. 2024, 16, 2399. [Google Scholar] [CrossRef]
- Sajjadian, M.; Scheider, S. Geodata source retrieval by multilingual/semantic query expansion: The case of Google Translate and WordNet. Agil. GISci. Ser. 2022, 3, 60. [Google Scholar] [CrossRef]
- Ji, S.; Pan, S.; Cambria, E.; Marttinen, P.; Yu, P.S. A survey on knowledge graphs: Representation, acquisition, and applications. IEEE Trans. Neural Netw. Learn. Syst. 2022, 33, 494–514. [Google Scholar] [CrossRef] [PubMed]
- Du, W.; Liu, C.; Xia, Q.; Wen, M.; Hu, Y.; Chen, Z.; Xu, L.; Zhang, X.; Terfa, B.K.; Chen, N. OFPO & KGFPO: Ontology and knowledge graph for flood process observation. Environ. Model. Softw. 2025, 185, 106317. [Google Scholar] [CrossRef]
- Ma, K.; Tian, M.; Tan, Y.; Qiu, Q.; Xie, Z.; Huang, R. Ontology-Based BERT Model for Automated Information Extraction from Geological Hazard Reports. J. Earth Sci. 2023, 34, 1390–1405. [Google Scholar] [CrossRef]
- Chen, L.; Ge, X.; Yang, L.; Li, W.; Peng, L. An improved multi-source data-driven landslide prediction method based on spatio-temporal knowledge graph. Remote Sens. 2023, 15, 2126. [Google Scholar] [CrossRef]
- Chen, L.; Peng, L. Improving landslide prediction: Innovative modeling and evaluation of landslide scenario with knowledge graph embedding. Remote Sens. 2024, 16, 145. [Google Scholar] [CrossRef]
- Sun, Q.; Ding, Y.; Hou, J.; Zhu, Q.; Wu, Y.; Wu, T.; Wang, X.; Zhao, X.; Shao, S. LHAKG: A knowledge graph construction framework for landslide hazard assessment by using XLNet-BiLSTM-CRF from geoscience literature. Int. J. Digit. Earth 2025, 18, 2577292. [Google Scholar] [CrossRef]
- Li, J.; Qin, J.; Kang, K.; Liang, M.; Liu, K.; Ding, X. Enhanced Spatiotemporal Landslide Displacement Prediction Using Dynamic Graph-Optimized GNSS Monitoring. Sensors 2025, 25, 4754. [Google Scholar] [CrossRef]
- Chen, X.; Hu, D.; Zhang, L.; Wu, Y.; Dai, K.; Feng, Y.; Xu, Q. TPE: Time-parameterized edge for sequential link prediction. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management; ACM: Singapore, 2020; pp. 1963–1972. [Google Scholar] [CrossRef]
- Gottschalk, S.; Demidova, E. EventKG: A multilingual event-centric temporal knowledge graph. In European Semantic Web Conference; Springer: Berlin/Heidelberg, Germany, 2018; pp. 272–287. [Google Scholar] [CrossRef]
- Bordogna, G.; Frigerio, L.; Kliment, T.; Brivio, P.A.; Hossard, L.; Manfron, G.; Sterlacchini, S. “Contextualized VGI” creation and management to cope with uncertainty and imprecision. ISPRS Int. J. Geo-Inf. 2016, 5, 234. [Google Scholar] [CrossRef]
- Laskey, K.B. MEBN: A language for first-order Bayesian knowledge bases. Artif. Intell. 2008, 172, 140–178. [Google Scholar] [CrossRef]
- Berg, R.; Kipf, T.N.; Welling, M. GCMC: Graph convolutional matrix completion. arXiv 2017. [Google Scholar] [CrossRef]
- Gu, Y.; Wang, C.; Liu, Y.; Zhou, R. An ontology-based multi-hazard coupling accidents simulation and deduction system for underground utility tunnel: A case study of earthquake-induced disaster chain. Reliab. Eng. Syst. Saf. 2025, 253, 110559. [Google Scholar] [CrossRef]
- Guzzetti, F.; Reichenbach, P.; Cardinali, M.; Galli, M.; Ardizzone, F. Probabilistic landslide hazard assessment at the basin scale. Geomorphology 2006, 72, 272–299. [Google Scholar] [CrossRef]
- Dou, J.; Yunus, A.P.; Bui, D.T.; Merghadi, A.; Sahana, M.; Zhu, Z.; Chen, C.W.; Khosravi, K.; Yang, Y.; Pham, B.T. Assessment of advanced random forest and decision tree algorithms for modeling rainfall-induced landslide susceptibility in the Izu-Oshima Volcanic Island, Japan. Sci. Total Environ. 2019, 662, 332–346. [Google Scholar] [CrossRef]
- Hong, H.; Liu, J.; Zhu, A.X. Modeling landslide susceptibility using LogitBoost alternating decision trees and forest by penalizing attributes with the bagging ensemble. Sci. Total Environ. 2018, 644, 1108–1119. [Google Scholar] [CrossRef]
- Li, J.; Zhang, J.; Wang, L.; Zhao, A. A hierarchical spatiotemporal data model based on knowledge graphs for representation and modeling of geohazards. Sustainability 2024, 16, 10271. [Google Scholar] [CrossRef]
- Wang, Y.; Fang, Z.; Hong, H.; Peng, L. Flood susceptibility mapping using convolutional neural network frameworks. J. Hydrol. 2020, 582, 124482. [Google Scholar] [CrossRef]
- Wang, Z.; Li, W.; Tang, C. Ontology-based semantic reasoning for multi-hazard knowledge integration and risk assessment. Int. J. Appl. Earth Obs. Geoinf. 2023, 118, 103285. [Google Scholar] [CrossRef]
- Karniadakis, G.E.; Kevrekidis, I.G.; Lu, L.; Perdikaris, P.; Wang, S.; Yang, L. Physics-informed machine learning. Nat. Rev. Phys. 2021, 3, 422–440. [Google Scholar] [CrossRef]
- Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019, 378, 686–707. [Google Scholar] [CrossRef]
- Pei, T.; Qiu, T.; Shen, C. Landslide susceptibility mapping using physics-guided machine learning: A case study of a debris flow event in the Colorado Front Range. Acta Geotech. 2024, 19, 6617–6641. [Google Scholar] [CrossRef]
- Pei, T.; Maroufi, M.; Shen, C.; Tian, Y. Physics-Informed Machine Learning Framework for Predicting Rainfall-Induced Shallow Landslides in the Colorado Front Range. In Proceedings of the Geo-Extreme 2025, Long Beach, CA, USA, 2–5 November 2025; pp. 97–106. [Google Scholar] [CrossRef]
- Monaco, S.; Apiletti, D.; Malnati, G. Theory-Guided Deep Learning Algorithms: An Experimental Evaluation. Electronics 2022, 11, 2850. [Google Scholar] [CrossRef]
- Wu, R.; Huang, M.; Ma, H.; Huang, J.; Li, Z.; Mei, H.; Wang, C. A Multi-Temporal Knowledge Graph Framework for Landslide Monitoring and Hazard Assessment. GeoHazards 2025, 6, 39. [Google Scholar] [CrossRef]
- Belvederesi, G.; Tanyas, H.; Lipani, A.; Dahal, A.; Lombardo, L. Distribution-agnostic landslide hazard modelling via Graph Transformers. Environ. Model. Softw. 2025, 183, 106231. [Google Scholar] [CrossRef]
- Yang, C.; Yin, Y.; Zhang, J.; Ding, P.; Liu, J. A graph deep learning method for landslide displacement prediction based on global navigation satellite system positioning. Geosci. Front. 2024, 15, 101690. [Google Scholar] [CrossRef]
- Ge, Y.; Cao, S.; Tang, H.; Zhang, Y. Graph neural network for spatiotemporal landslide prediction: A case study of the Three Gorges Reservoir area, China. Geomorphology 2023, 441, 108891. [Google Scholar] [CrossRef]
- Hungr, O.; Leroueil, S.; Picarelli, L. The Varnes classification of landslide types, an update. Landslides 2014, 11, 167–194. [Google Scholar] [CrossRef]
- Lu, N.; Godt, J.W. Hillslope Hydrology and Stability; Cambridge University Press: Cambridge, UK, 2013. [Google Scholar] [CrossRef]
- Cui, H.-Z.; Tong, B.; Wang, T.; Dou, J.; Ji, J. A hybrid data-driven approach for rainfall-induced landslide susceptibility mapping: Physically-based probabilistic model with convolutional neural network. J. Rock Mech. Geotech. Eng. 2025, 17, 4933–4951. [Google Scholar] [CrossRef]
- Dahal, A.; Lombardo, L. Towards physics-informed neural networks for landslide prediction. Eng. Geol. 2025, 344, 107852. [Google Scholar] [CrossRef]
- Zhu, X.; Xu, Q.; Tang, M.; Li, H.; Liu, F. A hybrid machine learning and computing model for forecasting displacement of multifactor-induced landslides. Neural Comput. Appl. 2018, 30, 3825–3835. [Google Scholar] [CrossRef]
- Zhu, J.; Dang, P.; Cao, Y.; Lai, J.; Guo, Y.; Wang, P. A flood knowledge-constrained large language model interactable with GIS: Enhancing public risk perception of floods. Int. J. Geogr. Inf. Sci. 2024, 38, 456–481. [Google Scholar] [CrossRef]
- Li, W.; Wu, L.; Xu, X.; Xie, Z.; Qiu, Q.; Liu, H.; Huang, Z.; Chen, J. Deep learning and network analysis: Classifying and visualizing geologic hazard reports. J. Earth Sci. 2024, 35, 1289–1303. [Google Scholar] [CrossRef]
- Xue, D.; Qian, S.; Xu, C. Integrating Neural-Symbolic Reasoning with Variational Causal Inference Network for Explanatory Visual Question Answering. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 7893–7908. [Google Scholar] [CrossRef] [PubMed]
- Hadid, A.; Chakraborty, T.; Busby, D. When geoscience meets generative AI and large language models: Foundations, trends, and future challenges. Expert Syst. 2024, 41, e13654. [Google Scholar] [CrossRef]
- Hofmeister, M.; Bai, J.; Brownbridge, G.; Mosbach, S.; Lee, K.F.; Farazi, F.; Hillman, M.; Agarwal, M.; Ganguly, S.; Akroyd, J.; et al. Semantic agent framework for automated flood assessment using dynamic knowledge graphs. Data-Centric Eng. 2024, 5, e14. [Google Scholar] [CrossRef]
- Li, S.; Erickson, C.; Zajac, M.; Guo, X.; Duan, Q.; Gong, J. A Semi-Automated Framework for Flood Ontology Construction with an Application in Risk Communication. Water 2025, 17, 2801. [Google Scholar] [CrossRef]
- Aivalis, T.; Klampanos, I.A.; Troumpoukis, A. LLM-Driven Knowledge Graph Construction from Earth Observation Data for Extreme Events. In Workshop on AI-Driven Data Engineering and Reusability for Earth and Space Sciences (DARES’25), Co-Located with ECAI 2025, Bologna, Italy; CEUR Workshop Proceedings: Aachen, Germany, 2025; Volume 4128. [Google Scholar]
- Areerob, K.; Nguyen, V.-Q.; Li, X.; Inadomi, S.; Shimada, T.; Kanasaki, H.; Wang, Z.; Suganuma, M.; Nagatani, K.; Chun, P.-J.; et al. Multimodal artificial intelligence approaches using large language models for expert-level landslide image analysis. Comput.-Aided Civ. Infrastruct. Eng. 2025, 40, 2900–2921. [Google Scholar] [CrossRef]
- Shimizu, C.; Stephe, S.; Barua, A.; Cai, L.; Christou, A.; Currier, K.; Dalal, A.; Fisher, C.K.; Hitzler, P.; Janowicz, K.; et al. The KnowWhereGraph ontology: Enabling spatially explicit knowledge graphs for disasters and environment. Data Intell. 2023, 5, 304–328. [Google Scholar] [CrossRef]
- Zajac, M.; Kulawiak, C.; Li, S.; Erickson, C.; Hubbell, N.; Gong, J. Unifying flood-risk communication: Empowering community leaders through AI-enhanced, contextualized storytelling. Hydrology 2025, 12, 204. [Google Scholar] [CrossRef]
- Zhou, Y.; Matyas, C.J.; Liu, P.; Li, H. Identification of tropical cyclone–related flash floods from hazard narratives using a large language model–based approach. npj Nat. Hazards 2025, 2, 104. [Google Scholar] [CrossRef]
- Wang, C.; Engler, D.; Li, X.; Hou, J.; Wald, D.J.; Jaiswal, K.; Xu, S. Near-real-time earthquake-induced fatality estimation using crowdsourced data and large-language models. Int. J. Disaster Risk Reduct. 2024, 108, 104680. [Google Scholar] [CrossRef]
- Lin, Z.; Deng, C.; Zhou, L.; Zhang, T.; Xu, Y.; Xu, Y.; He, Z.; Shi, Y.; Dai, B.; Song, Y.; et al. Geogalactica: A scientific large language model in geoscience. arXiv 2023, arXiv:2401.00434. [Google Scholar]
- Wang, S.; Hu, T.; Xiao, H.; Li, Y.; Zhang, C.; Ning, H.; Zhu, R.; Li, Z.; Ye, X. GPT, large language models (LLMs) and generative artificial intelligence (GAI) models in geospatial science: A systematic review. Int. J. Digit. Earth 2024, 17, 2353122. [Google Scholar] [CrossRef]
- Zhang, W.; Zhang, J. Hallucination Mitigation for Retrieval-Augmented Large Language Models: A Review. Mathematics 2025, 13, 856. [Google Scholar] [CrossRef]
- Xu, F.; Ma, J.; Li, N.; Cheng, J.C.P. Large language model applications in disaster management: An interdisciplinary review. Int. J. Disaster Risk Reduct. 2025, 127, 105642. [Google Scholar] [CrossRef]
- Zhou, B.; Li, K. Fusing Geoscience Large Language Models and Lightweight RAG for Enhanced Geological Question Answering. Geosciences 2025, 15, 382. [Google Scholar] [CrossRef]
- Karimanzira, D.; Rauschenbach, T.; Hellmund, T.; Ritzau, L. Improved Flood Management and Risk Communication Through Large Language Models. Algorithms 2025, 18, 713. [Google Scholar] [CrossRef]
- Wang, R.; Gao, Z.; Zhang, L.; Yue, S.; Gao, Z. Empowering large language models to edge intelligence: A survey of edge efficient LLMs and techniques. Comput. Sci. Rev. 2025, 57, 100755. [Google Scholar] [CrossRef]
- Nguyen, V.; Dhopate, V.; Huynh, H.; Bouhlal, H.; Annengala, A.; Scoccia, G.L.; Martinez, M.; Stoico, V.; Malavolta, I. On-Device or Remote? On the Energy Efficiency of Fetching LLM-Generated Content. In Proceedings of the 4th International Conference on AI Engineering—Software Engineering for AI (CAIN 2025), Ottawa, ON, Canada, 27–28 April 2025; pp. 72–82. [Google Scholar]
- Cruden, D.M.; Varnes, D.J. Landslide types and processes. In Landslides: Investigation and Mitigation; Turner, A.K., Schuster, R.L., Eds.; Special Report 247; Transportation Research Board, National Research Council: Washington, DC, USA, 1996; pp. 36–75. [Google Scholar]
- Samek, W.; Montavon, G.; Lapuschkin, S.; Anders, C.J.; Müller, K.R. Explaining deep neural networks and beyond: A review of methods and applications. Proc. IEEE 2021, 109, 247–278. [Google Scholar] [CrossRef]
- Guo, Z.; Zhang, X.; Karniadakis, G.E. Fourier Neural Operators for surrogate modeling in geophysical simulations. Water Resour. Res. 2024, 60, e2023WR034939. [Google Scholar] [CrossRef]






| Ontology-Based Approaches | Knowledge Graph-Based Approaches | |
|---|---|---|
| Conceptual focus | Formal domain semantics and taxonomy | Entity-relation networks with data instances |
| Strengths | Logical consistency, interpretability, semantic interoperability | Scalability, data integration, graph reasoning |
| Limitations | Static, difficult to update, limited instance coverage | Weaker formal semantics, inconsistent schemas |
| Representative works | [15,19,34,36,39,51] | [20,41,46,47,52,53] |
| Model Class | Main Knowledge Source | Injection Mechanism | Representative Geohazard Tasks | Main Strengths | Main Limitations | Representative References |
|---|---|---|---|---|---|---|
| Physics-guided ML | Physical laws and governing equations, including Darcy’s law, Mohr-Colomb criteria, PDEs, and geomechanical constraints | Physics-based loss functions, residual penalties, constrained optimization, hybrid PINN frameworks, knowledge-guided model initialization | Rainfall-induced landslide prediction, slope stability assessment, debris-flow forecasting, geomechanically constrained susceptibility mapping | Mechanistic interpretability; physically consistent predictions under sparse data; stronger numerical consistency; improved generalization under process-informed constraints | Difficult to encode qualitative or symbolic knowledge; computationally expensive for PDE-based settings; performance may depend on simplified process assumptions | [43,68,69,70,71] |
| Theory-guided ML | Expert rules, geotechnical heuristics, threshold logic, symbolic rules, and process-based prior knowledge | Rule-based feature construction, knowledge-guided loss terms, logic constraints, semantic consistency regularization, knowledge-based initialization | Slope-stability prediction, warning classification, hazard screening, semantically constrained susceptibility assessment | Useful when knowledge is qualitative or heuristic rather than fully equation-based; improves interpretability; preserves causal transparency; easier to implement than full physics-informed models | Domain rules may be incomplete or context-specific; difficult to formalize consistently; may lack numerical grounding | [11,17,67,72] |
| KG-regularized ML | Ontologies, knowledge graphs, semantic relations, graph embeddings, hazard-event-factor networks | KG embeddings, semantic regularization, relation constraints, ontology-guided feature fusion, rule/KG-supported inference | KG-supported risk assessment, hazard classification, semantic fusion of remote-sensing and geological data, multi-source geohazard monitoring | Maintains semantic alignment across heterogeneous data; supports explainable reasoning, causal transparency, and interoperability; useful for multi-hazard contexts | Depends on KG quality and schema consistency; knowledge coverage may be uneven; many systems remain project-specific and only semi-automated | [45,46,47,52,54] |
| Graph-based spatiotemporal models | Spatial adjacency, environmental similarity, monitoring-network topology, temporal dependencies, instance-level relational graphs | Graph neural networks, graph convolutions, attention/transformer modules, spatiotemporal graph learning, knowledge-informed graph priors | Landslide displacement prediction, landslide susceptibility mapping, dynamic hazard monitoring, multi-factor-induced landslide forecasting | Captures nonlocal dependencies and structured spatial–temporal interactions; supports trend and fluctuation modeling; effective for dynamic hazard processes | Performance depends on graph construction quality; may remain weakly interpretable without explicit semantic or physical constraints | [55,73,74,75,76] |
| Data Type | Typical Source | Structured Knowledge Representation | Role in Grounded Cognitive Reasoning | Representative References |
|---|---|---|---|---|
| Remote sensing imagery | Sentinel-1/2, Landsat, UAV/aerial imagery, LiDAR-derived slope units | Geomorphology/land-cover ontology; image entity/event graph linking scar, runout, water extent, damaged assets, time, and location | Ontology-aligned labeling and KG grounding transform visual observations into interpretable hazard entities; support cross-source alignment with reports and geospatial metadata, and improve spatially coherent explanation | [45,88,89] |
| Topographic/terrain data | SRTM, ASTER, ALOS PALSAR, national DEMs, slope-unit maps | Geomorphologic ontology and terrain rule schema; geospatial KG linking slope units, adjacency, elevation, drainage, and exposure | Terrain semantics and expert rules act as interpretable priors for susceptibility screening, spatial retrieval, and rule-constrained reasoning; they also contextualize remote-sensing and monitoring evidence within physically meaningful terrain units | [46,78,90] |
| Monitoring/time-series data | InSAR, GNSS, rainfall gauges, hydrological stations, IoT sensors | Temporal ontology or trigger–event–impact KG; provenance-linked observation graph | Temporal relations support event tracking, threshold reasoning, alert propagation, and evidence-linked updating instead of isolated point prediction | [46,55,73,86] |
| Geological/soil/lithology maps | Field surveys, OneGeology, USGS databases, soil maps, fault inventories | Lithology/fault/soil ontology; geospatial KG linking material units, properties, hydrology, and hazard history | Semantic harmonization across heterogeneous geological vocabularies and map layers supports explainable zoning, concept alignment, and crosswalks between geological descriptions and model inputs | [19,34,37,90] |
| Textual and literature data | Scientific articles, hazard bulletins, disaster reports, social media, agency documents | Hazard ontology, event schema, provenance-linked concept graph/document graph | LLMs and NLP extract entities, relations, and event arguments; KG/RAG grounding supports evidence attribution, summarization, causal QA, consistency checking, and expert validation | [51,54,82,87,91,92] |
| Multi-source/multimodal evidence | Combined imagery, GIS layers, sensors, text reports, prior simulations/models | Multimodal KG/ontology + neural embeddings + provenance graph | Cross-modal grounding aligns visual, textual, and numerical evidence; supports hypothesis generation, scenario comparison, and validated knowledge updating across reasoning cycles | [46,86,88,90,93] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Li, W.; Zhou, Y. Toward Knowledge-Enhanced Geohazard Intelligence: A Review of Knowledge Graphs and Large Language Models. GeoHazards 2026, 7, 40. https://doi.org/10.3390/geohazards7020040
Li W, Zhou Y. Toward Knowledge-Enhanced Geohazard Intelligence: A Review of Knowledge Graphs and Large Language Models. GeoHazards. 2026; 7(2):40. https://doi.org/10.3390/geohazards7020040
Chicago/Turabian StyleLi, Wenjia, and Yongzhang Zhou. 2026. "Toward Knowledge-Enhanced Geohazard Intelligence: A Review of Knowledge Graphs and Large Language Models" GeoHazards 7, no. 2: 40. https://doi.org/10.3390/geohazards7020040
APA StyleLi, W., & Zhou, Y. (2026). Toward Knowledge-Enhanced Geohazard Intelligence: A Review of Knowledge Graphs and Large Language Models. GeoHazards, 7(2), 40. https://doi.org/10.3390/geohazards7020040

