Intelligent Prediction and Prevention of Coal Mine Water Inrush: Integrating Hybrid Data Augmentation, HO-SVR, and RAG-LLM Technologies

Ke He; Changfeng Wang; Qiushuang Zheng

doi:10.3390/w17243534

,

and

School of Economics and Management, Beijing University of Posts and Telecommunications, Beijing 100876, China

^*

Author to whom correspondence should be addressed.

Water2025, 17(24), 3534;https://doi.org/10.3390/w17243534
(registering DOI)

This article belongs to the Section New Sensors, New Technologies and Machine Learning in Water Sciences

Version Notes

Order Reprints

Abstract

This study proposes a novel integrated framework that combines a Hippopotamus-Optimized Support Vector Regression (HO-SVR) prediction model with a Retrieval-Augmented Generation-enhanced Large Language Model (RAG-LLM)-based intelligent decision module, addressing the core challenge of bridging prediction and prevention in coal mine water inrush disasters. It represents the first application of the combined HO-SVR and RAG-LLM approach in this field. Methodologically, a hybrid data augmentation technique (SMOTE–GN–Bootstrap) alleviates data scarcity and imbalance, while feature selection and dimensionality reduction optimize the input features. The developed HO-SVR model demonstrates superior prediction accuracy over benchmark models. The key innovation lies in the RAG-LLM module which automatically generates interpretable reports and actionable prevention strategies based on the prediction results and key influencing factors, thereby establishing a closed-loop intelligent system from accurate prediction to informed prevention. Practically, this framework enables proactive risk management through data-driven predictions, significantly reduces water inrush incidents, and provides intelligent decision support for field operations, substantially enhancing mine safety. Furthermore, the study discusses the model’s potential and challenges across different geological settings, charting a course for developing more generalized models

Keywords:

coal mine water inrush; prediction model; support vector regression; hippopotamus optimization algorithm; large language models; retrieval-augmented generation

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Article metric data becomes available approximately 24 hours after publication online.