Rapid Source Identification of Mine Water Inrush Using Spectral Data Combined with BA-RBF Modeling

Wei, Zhonglin; Ji, Yuan; Fang, Huiming; Yu, Lujia; Dong, Donglin

doi:10.3390/w17060790

Open AccessArticle

Rapid Source Identification of Mine Water Inrush Using Spectral Data Combined with BA-RBF Modeling

by

Zhonglin Wei

^1,2,

Yuan Ji

^3,*

,

Huiming Fang

^1,2,

Lujia Yu

³ and

Donglin Dong

³

¹

General Prospecting Institute, China National Administration of Coal Geology, Beijing 100039, China

²

Key Laboratory of Transparent Mine Geology and Digital Twin Technology, National Mine Safety Administration, Beijing 100039, China

³

Department of Geological Engineering and Environment, China University of Mining and Technology-Beijing (CUMTB), Beijing 100083, China

^*

Author to whom correspondence should be addressed.

Water 2025, 17(6), 790; https://doi.org/10.3390/w17060790

Submission received: 14 February 2025 / Revised: 28 February 2025 / Accepted: 6 March 2025 / Published: 10 March 2025

(This article belongs to the Section Hydrogeology)

Download

Browse Figures

Versions Notes

Abstract

Coal mine safety is vital not only for maintaining production operations but also for ensuring the industry’s sustainable development. The threat posed by mine water hazards is especially severe, growing more critical as mining activities become more intense and reach greater depths. Currently, common methods for identifying water sources mainly depend on hydrochemical data, supplemented by analyses of water level and temperature changes. However, due to constraints in cost, time, and the complexity of mining conditions, there is still significant potential for enhancing water source identification techniques. To advance water source identification, this study introduces a novel approach that uses a spectrophotometer to gather spectral data from water sources. These data are then integrated with a bat algorithm (BA)-optimized radial basis function (RBF) neural network to develop a model for identifying water inrush sources. At Baode Coal Mine in China, 105 water samples from four different sources were collected and analyzed using spectral data. The baseline was corrected using the second derivative technique to ensure the data’s integrity. Additionally, 54 sets of historical hydrochemical data were collected for comparison with the spectral data-based model. Theoretical analysis and experimental results show that both hydrochemical and spectral data are effective for identifying water inrush sources. The hydrochemical data model achieved an accuracy of about 90%, whereas the model based on spectral data reached an average accuracy of 95%. Among the tested models: RBF, GA-RBF, PSO-RBF, BA-RBF, and the BA-RBF model demonstrated superior performance, providing the most rapid and accurate identification of water inrush.

Keywords:

spectral data; BA-RBF; mine water inrush; water source identification

1. Introduction

Coal plays a crucial role in China’s primary energy production and consumption. The mining process often involves complex geological formations, including faults and fissure systems. The activity of faults may affect groundwater flow paths and the risk of water surges [1,2]. As mining operations intensify and reach greater depths, the hydrogeological conditions become increasingly complex. Water inrush incidents from mines and tunnels pose significant risks to operational safety and can lead to substantial economic losses [3,4,5]. Consequently, numerous researchers have focused on developing methods for the rapid and accurate identification of mine water inrush sources. Their studies reveal that water from different aquifers exhibits distinct hydrochemical and physical properties [6]. Currently, several methods are commonly used for identifying water inrush sources: (1) Analytical methods that rely on isotope or trace element data are not only costly but also involve complex and cumbersome data testing processes [7,8,9,10]; (2) the method that relies on groundwater level and water temperature data is primarily suitable for mines with straightforward hydrogeological conditions [10]; (3) the method that uses conventional hydrogeochemical data combined with identification models may face challenges due to potential chemical reactions during the sampling and testing of hydrochemical ions. These reactions can impact the accuracy of water source identification [11,12,13,14]; (4) the approach that uses fluorescence spectral data combined with recognition models often suffers from data redundancy, making the processing procedure cumbersome [15,16,17]. Consequently, the approach for classifying coal mine water source predictions necessitates additional investigation.

Spectrophotometry has seen extensive use in the coal industry. One scholar used an atomic absorption spectrophotometer to measure iron content in mine water [18]. Another scholars applied the same technique to determine concentrations of trace metals, such as cadmium, chromium, copper, cobalt, iron, manganese, nickel, lead, mercury, and zinc in river water [19]. And yet another scholars employed an ultraviolet-visible spectrophotometer to measure polyacrylamide polymers in wastewater [20]. Compared to traditional chemical analysis methods, spectrophotometry offers several advantages, including simplicity, enhanced sensitivity, and rapid results. Building on these benefits, this paper proposes the use of ultraviolet-visible spectrophotometry to obtain spectral data from water samples, integrating this data with an identification model to analyze water inrush sources.

When the differences in water quality characteristics between aquifers are minimal or when multiple water sources are involved, mathematical methods are often necessary to construct a discriminant model for identifying the source of water inrush [21]. Research has highlighted the use of discriminant analysis techniques based on multivariate statistical theory for this purpose. These techniques include distance discriminant analysis, Bayesian discriminant methods, fuzzy evaluation, and cluster analysis [22,23,24,25,26]. Additionally, non-linear analytical methods such as Geographic Information Systems (GIS) and Extensible Recognition Techniques have also been employed [27,28,29,30]. While these models can aid in water source identification, their accuracy varies. Moreover, artificial neural networks and support vector machines are increasingly used to identify water inrush sources, with machine learning algorithms generally enhancing prediction accuracy, though they require extensive training samples [31,32,33,34,35].

In recent years, most research has mainly relied on hydrochemical data for water source identification, with relatively few studies using spectral data of water samples for water source identification. Research using spectral data of water samples combined with machine learning methods for water source identification is even rarer.

Based on this, the study proposes a water source identification model optimized with the Bat Algorithm (BA) and Radial Basis Function (RBF) Neural Network. First, spectral data of water samples were obtained through experimental methods. Next, we constructed the BA-RBF model, alongside Genetic Algorithm (GA)-optimized RBF (GA-RBF) and Particle Swarm Optimization Backpropagation (PSO-BP) network classification models. The performance of these models was compared and analyzed to evaluate the usability and accuracy of the spectral data. Finally, the model was validated at Baode Coal Mine to confirm its effectiveness.

2. Data Acquisition

2.1. Geological and Hydrogeological Conditions

Baode Coal Mine (Figure 1) is situated in Baode County, Shanxi Province, characterized by typical North China-type Carboniferous–Permian coalfield features. The mine has an actual production capacity of 4.2 million tons per annum. The geological structure is predominantly monoclinic, with the terrain sloping from north to south towards the center, resulting in a maximum elevation difference of 335.9 m. The mine extracts several key coal seams: #8, #10, #11, and #13. The primary coal seam, #8, is found in the Permian Shanxi Formation, while seams #10, #11, and #13 are located in the Carboniferous Taiyuan Formation.

Baode Coal Mine has four primary sources of mine filling water: (1) Atmospheric Precipitation and Surface Water: This water type is classified as HCO₃-Ca-Mg, with a mineralization of 0.4 g/L. It has a limited impact on the mine’s water inflow. (2) Coal System Sandstone Fissure Water: Characterized by the HCO₃-SO₄-Na-Ca-Mg type, with mineralization ranging from 0.063 to 1.01 g/L. This water is weakly enriched in sandstone fissures both above and below the coal seams and directly contributes to the mine’s water inflow. The Carboniferous and Permian systems, consisting of mudstone, sandstone, thinly bedded marl, and coal seams, have developed inter- and intra-layer joints, creating several water-bearing layers with some hydraulic connections. (3) Ordovician Tuff Karst Water: This water is of the HCO₃-Na-Mg type and has high local chloride ion content. It is a major water-filled aquifer in the area, especially affecting the coal seams. The Ordovician tuff aquifer below the #8 and #11 coal seams is a significant water source for mining operations. Hydrogeological studies show that this aquifer is weakly to moderately water-rich, with both plane and vertical hydraulic connections. (4) Goaf water: Since there are no small kilns in the mine field, goaf water originates from the currently mined-out areas, primarily the #8 coal seam. As the goaf expands, gob water poses a potential threat to the mining of lower coal seams.

2.2. Spectral Data Acquisition

To analyze the primary water filling sources at Baode Coal Mine, researchers collected a total of 105 water samples between November 2019 and November 2020. The samples were categorized as follows: 25 samples of Permian sandstone water (marked as A), 25 samples of Permian goaf water (marked as B), 25 samples of Carboniferous sandstone water (marked as C), and 30 samples of Ordovician limestone water (marked as D). The spectral data for the water samples were obtained using the UV-1700PC UV-visible spectrophotometer system, manufactured by Shanghai Meixi Instrument Co., Ltd (Shanghai, China). This instrument covers a wavelength range of 190 to 1100 nm and has a transmittance accuracy of ±0.3% τ.

2.3. Hydrochemical Data Acquisition

In this study, we collected hydrochemical data from 54 water samples at Baode Coal Mine. The dataset includes 12 samples of Permian sandstone water, 11 samples of Permian goaf water, 16 samples of Carboniferous sandstone water, and 15 samples of Ordovician limestone water. The data consist of measurements for cations (Na⁺, Ca²⁺, Mg²⁺) and anions (HCO₃⁻, SO₄²⁻, Cl⁻).

3. Methods

The research framework for this study is illustrated in Figure 2. The process began with data enhancement and baseline correction of the spectral data. Subsequently, the water chemistry data were analyzed using a Piper trilinear plot and Pearson correlation analysis to verify their reliability. Following this, the BA-RBF machine learning model was developed and compared with three other models: GA-RBF, PSO-RBF, and RBF. Finally, the processed spectral and water chemistry data were assessed using various discriminant models to determine the most effective one. All programming and computations were conducted using Python 3.10.

3.1. Spectral Data Preprocessing

One of the primary goals of spectral data preprocessing is to select the most relevant spectral information. Unstable factors such as instrument condition, acquisition background, and detection settings can impact the consistency of spectral data. Therefore, it is crucial to apply processing techniques that are both theoretically sound and practically effective. In line with the characteristics of the spectral data collected in this study, the second-order differentiation method was employed to enhance and optimize the dataset. This method effectively addresses issues like baseline drift and smoothing background interference, improves the resolution of overlapping peaks, and increases the sensitivity of spectral lines [36]. The conversion formula is as follows:

\frac{{d^{2}}_{y}}{d_{λ^{2}}} = \frac{y_{i + 1} - {2 y_{i} + y}_{i - 1}}{{(Δ λ)}^{2}}

(1)

In the formula, y is the spectral absorbance, i = 1, 2, 3, … represents the wavelength data points, and

λ

signifies the wavelength sampling point spacing.

3.2. RBF

Powell (1987) introduced the radial basis function (RBF) to address multivariate interpolation problems [37]. By the late 1980s, Broomhead and others incorporated the concept of neural network computation into interpolation processes, leading to the development of radial basis functions within the framework of artificial neural network design, thereby establishing the RBF neural network. The RBF neural network is a feed-forward network known for its local approximation capabilities, and its optimization process can be viewed as a surface fitting problem in high-dimensional space [38,39,40]. The structure of the RBF neural network is illustrated in Figure 3:

According to the topological architecture of the neural network, the input consists of the sample data matrix

X

, which includes m training samples

X_{i} (i = 1,2 \dots m)

, with each sample characterized by n attributes. The result of the output layer is Y. In this experiment, the network is fed with spectral and hydrochemical data from various types of mine water sources as input values, while the output values correspond to the types of mine water sources.

The Radial Basis Function (RBF) hidden layer employs the Gaussian function

G (‖x, c_{i}‖)

as the radial basis function, and the radial distance is defined as the distance between any point x in the space and a certain center c, using Euclidean distance, denoted as

‖x - c‖

.

G (‖x, c_{i}‖) = G (‖x - c_{i}‖) = e x p (- \frac{1}{2 σ^{2}} {‖x - c_{i}‖}^{2})

(2)

In the formula,

σ

is the width of basis function,

G ‖x - c_{i}‖

denotes the activation function of the i-th hidden layer node, x denotes the vector of the input layer, and

c_{i}

denotes the i-th center of the hidden layer. In this experiment, the values of

σ

and

c_{i}

are optimized by the BA algorithm. The objective function of the output layer is

y (x) = \sum_{i = 1}^{k} w_{i} G ‖x - c_{i}‖

(3)

In the formula,

y (x)

is the result of the output layer, k is the number of hidden layer nodes, and

w_{i}

is the connection weight between the i-th hidden layer node and the output node.

3.3. BA

The Bat Algorithm (BA), is inspired by the echolocation behavior of bats during hunting. This algorithm effectively merges key features of genetic algorithms and particle swarm optimization, resulting in enhanced search and optimization capabilities. Initially, the virtual bat’s flight parameters: speed (

v_{i}

), position (

x_{i}

), pulse frequency (

f_{i}

), pulse loudness (

A_{i}

), and pulse rate (

γ

) are randomly assigned. As the bat detects prey, it adjusts its speed and position by varying the frequency, decreasing the loudness, and increasing the pulse emission rate. A fitness function then assesses the current position’s effectiveness, guiding the selection of the optimal solution. This forms the core concept of the BA algorithm [41].

Assuming that the size of the bat population is

m

and the search space is

d

dimensional, the update process of the position

x_{i}^{t}

and the velocity

v_{i}^{t}

of bat

i

at time

t

is as follows:

f_{i} = f_{m i n} + (f_{m a x} - f_{m i n}) β

(4)

v_{i}^{t} = v_{i}^{t - 1} + (x_{i}^{t} - x^{*}) f_{i}

(5)

x_{i}^{t} = x_{i}^{t - 1} + v_{i}^{t}

(6)

Among them,

f_{i}

represents the acoustic frequency of bat

i

at the current moment, while

f_{m a x}

and

f_{m i n}

are the maximum and minimum values of the acoustic frequency, respectively. In the whole process of bat speed and position update, the frequency of sound wave

f_{i}

controls the search range of bats and plays a role in adjusting the step size.

β \in [0, 1]

is a random number that obeys uniform distribution, and

x^{*}

represents the current global optimal solution of the bat population.

In the local search phase, when a bat selects one of the current best solutions, it generates a new solution in its vicinity through a random walk. The updated formula for this process is

X_{n e w} = X_{o l d} + ε A^{t}, ε \in [- 1, 1]

(7)

Among them,

ε

is a random number, and

A^{t} = < A_{i}^{t} >

is the average loudness of all bats in this generation.

Then, the loudness

A_{i}

and the pulse rate

γ_{i}

are also updated with the iteration process. The update formula is as follows:

A_{i}^{t} = α A_{i}^{t - 1}

(8)

γ_{i}^{t} = γ_{i}^{0} [1 - e^{- γ t}]

(9)

Here,

α

and

γ

are constants. When initialized, the loudness and pulse rate emitted by each bat are randomly given. In general, the initial loudness

A_{i}

is usually defined between [1, 2], and the initial pulse rate

γ^{0}

is generally close to 0. In the process of flying to the optimal solution,

A_{i}

and

γ_{i}

are constantly updated.

3.4. BA-RBF Model

The RBF neural network prediction model, optimized by the Bat Algorithm (BA), enables faster training and prediction of input data. The steps of the algorithm are outlined as follows (Figure 4):

Step 1: Determine the RBF neural network structure. Set the i-th center

c_{i}

of the hidden layer and the basis function width

σ

to a random uniformly distributed decimal at (0, 1). Input spectral data matrix

X

;

Step 2: Determine the BA structure and initialization parameters. Parameter initialization: bat population size

m =

50, number of iterations

N =

100; search spaced dimension = dimension of RBF input layer + 1; pulse volume attenuation coefficient

α

= 0.8, pulse rate

γ

= 0.5; maximum pulse frequency

f_{m a x} = 10

, the minimum value

f_{m i n} = - 10

; the impulse loudness

A_{i}

is a random number uniformly distributed between [1, 2] produced randomly; the values of

c_{i}

and

σ

are called solutions, the lower limit of the solution is

l o w e r = - 10

, and the upper limit of the solution is

u p p e r = 10

; the random initialization speed

v_{i}

and position

x_{i}

are also within the value range of the solution.

Step 3: Generate a new solution by frequency adjustment, and update the velocity and position according to Equations (4)–(6).

Step 4: Generate a random number,

{r a n d}_{1}

.

{r a n d}_{1}

is a random number on [0, 1]. If

{r a n d}_{1}

>

γ_{i}

, select an optimal individual among the optimal bats, and then generate a local solution near the optimal individual selected by Formula (7); otherwise, update the bat position according to Formula (6).

Step 5: Pass the obtained solution to RBF, and assign it to the hidden layer center

c_{i}

, the basis function width

σ

, and the weight w. According to the classification and recognition results, the accuracy value is calculated, and the accuracy rate is used as the fitness function. The higher the accuracy rate, the better the fitness. The fitness value is calculated and returned to BA.

Step 6: Regenerate a random number,

{r a n d}_{2}

, where

{r a n d}_{2}

is a random number on [0, 1]. If

{r a n d}_{2}

<

A_{i}

, and the fitness of the objective function is better than the new solution in step 3, then accept the position and adjust

A_{i}

(reduce) and

γ_{i}

(increase) by Formulas (8) and (9). Find the current best

x^{*}

, which, when updated, retains the optimal solution value for this generation.

Step 7: Determine whether the maximum number of iterations is satisfied, and repeat steps 3–6. Then assess output classification results and accuracy.

3.5. Model Evaluation

In this experiment, Accuracy (A), Precision (P), Recall (R), and F1 score are selected to evaluate and compare the classification performance of the RBF model, GA-RBF model (Table 1), PSO-RBF model, and BA-RBF model. Accuracy is defined as the ratio of correctly classified samples to the total number of samples in the dataset. The other three metrics are defined as follows: typically, the class of interest is considered positive, while the other class is negative. The classifier’s prediction on the dataset can be categorized into four possible outcomes: true positive, false positive, true negative, and false negative.

P = \frac{T P}{T P + F P}

(10)

R = \frac{T P}{T P + F N}

(11)

F 1 = \frac{2 P * R}{P + R}

(12)

4. Results

4.1. Spectral Data Analysis

This study obtained a total of 105 sets of spectral data from a full scan using a UV-Vis spectrophotometer, covering the wavelength range of 190 to 1100 nm, with ultrapure water as the baseline reference. Each water sample was measured three times, and the results were averaged, with no anomalies detected. The spectral data from these tests are presented in Figure 5.

As shown in Figure 5, the maximum absorbance of Permian sandstone fissure water from Baode Mine is approximately 2.44 at 193 nm, after which the absorbance decreases. For Permian goaf water, the maximum absorbance is around 2.23 between 193 and 198 nm, with an initial increase followed by a decrease between 190 and 210 nm. In the case of Carboniferous sandstone water, the maximum absorbance reaches 3.25 between 200 and 210 nm, with a gradual change between 190 and 220 nm before decreasing. The maximum absorbance of Ordovician limestone water is about 1.99 at 193 nm, followed by a downward trend.

It is noteworthy that the absorbance peaks of the water samples are primarily concentrated in the range of 190 to 240 nm, while the absorbance values from 240 to 1100 nm tend to be less stable. Therefore, it is crucial to focus on the spectral data within the 190 to 240 nm range for subsequent spectral pre-processing and modeling.

In Figure 6, panels A, B, C, and D display the spectral data lines for Permian sandstone water, Permian goaf water, Carboniferous sandstone water, and Ordovician water, respectively, after processing the spectral data from Baode Coal Mine. Panel (a) shows the result of data enhancement, while panel (b) illustrates the result of data baseline correction. After spectral preprocessing, the overlapping peaks for each water sample have been improved, the differences between water samples have been enhanced, and the sensitivity of the spectral lines has been increased.

As shown in Figure 6, the spectrogram provides an intuitive visualization of the absorbance differences among water samples from various aquifers, facilitating the identification of different water sources. Additionally, the second derivative plot allows for a clear view of the position and magnitude of the characteristic points for each water sample.

To enhance the accuracy and speed of identifying water sources, we developed a Bat Algorithm-optimized neural network recognition model (BA-RBF). In this model, 70% of the data from the four water samples were used for training (72 samples), while the remaining 30% was reserved for testing (33 samples).

4.2. Hydrochemical Data Analysis

Figure 7 shows the Piper trilinear diagram, while Figure 8 presents the Pearson correlation analysis diagram for the water samples from Baode Mine.

As shown in Figure 7, the Piper trilinear diagram reveals distinct differences in water quality among the various aquifers at Baode Mine. The anion composition of the water samples from the Permian mining area is generally ordered as HCO₃⁻ > Cl⁻ > SO₄²⁻, with relatively uniform cation concentrations. In the Permian sandstone water samples, the anion content follows HCO₃⁻ > Cl⁻ > SO₄²⁻, with a notably high concentration of Na⁺, making sodium bicarbonate the dominant cation. This indicates that the water quality in this aquifer is primarily a sodium–potassium bicarbonate type. For Carboniferous sandstone water samples, the anion content also follows HCO₃⁻ > Cl⁻ > SO₄²⁻, and the water quality is predominantly a sodium–potassium bicarbonate or calcium bicarbonate type. The Ordovician chert water exhibits an anion composition of HCO₃⁻ > SO₄²⁻ > Cl⁻, with higher local Cl⁻ concentrations, consistent with geological records. This water is primarily classified as sodium–potassium bicarbonate, calcium bicarbonate, or sodium bicarbonate type. In summary, the water chemistry of samples from various aquifers is mainly classified within the bicarbonate group, with fewer samples categorized under the chloride group.

As shown in Figure 8, from the Pearson correlation analysis graph, it is evident that there are significant correlations among various ions in the water samples from Baode Mine. Specifically: Mg²⁺ and Ca²⁺ exhibit a strong positive correlation, with a correlation coefficient of 0.71. Na⁺ and HCO₃⁻ also show a strong positive correlation, with a correlation coefficient of 0.71. SO₄²⁻ and Ca²⁺ have a moderate positive correlation, with a correlation coefficient of 0.51. On the other hand, there are notable negative correlations: Ca²⁺ has a moderate negative correlation with both Na⁺ and HCO₃⁻, with correlation coefficients of −0.57 and −0.68, respectively. Mg²⁺ also shows a negative correlation with Na⁺; and HCO₃⁻, with correlation coefficients of −0.52 and −0.48, respectively. These negative correlations suggest the potential dissolution of carbonate rocks, such as calcite and dolomite, and alternating adsorption of cations. Additionally, SO₄²⁻ exhibits a moderate negative correlation with HCO₃⁻, with a correlation coefficient of −0.45. This could indicate the oxidation of sulfurous iron ore commonly found in coal seams and the alternating adsorption of cations.

Additionally, the concentrations of six ions were used as input variables, while the type of sudden water source served as the output variable. Out of the 54 data sets, 43 were selected for the training set, and 11 were used as the test set to enable the classification of sudden water sources.

4.3. Analysis of Water Inrush Source Identification Model Results

In this experiment, Accuracy (A), Precision (P), Recall (R), and F1 score were selected to evaluate and compare the classification performance of the RBF model, GA-RBF model, PSO-RBF model, and BA-RBF model. Water source discrimination was performed using three different data sets: the original spectral data (a), the second-order processed spectral data (b), and the water chemistry data (c). The computational results are presented in Figure 9.

As shown in Figure 9, the BA-RBF model demonstrates superior overall performance, with higher classification accuracy compared to the other models. The recognition accuracy of the second-order processed spectral data is notably higher. Although the original spectral data reveal differences in absorbance among water samples from various aquifers, its recognition rate is lower in RBF discrimination. Conversely, the processed spectral data yield better recognition results. Additionally, the recognition accuracy for water chemistry data is lower for the GA-RBF, PSO-RBF, and BA-RBF models compared to the spectral data. The identification model based on hydrochemical data exhibits a high misjudgment rate for Permian sandstone water (Type l), incorrectly classifying a significant number of Permian sandstone water samples as Carboniferous sandstone water (Type 3). This may be attributed to the similar hydrochemical composition and closely related water quality types of the water samples from these two aquifers.

The original spectral data, second-order processed spectral data, and water chemistry data were each used with the BA-RBF model for water source discrimination. The error plots for the training set and test set are shown in Figure 10. In these plots, yellow triangles represent the actual categories of water samples, while blue pentagrams indicate the results of the discrimination. When the actual categories and the discriminated categories match, the yellow triangles and blue pentagrams overlap in the same position on the figure. Overlap is expected if the training set contains a larger amount of data, which is a normal occurrence.

From Figure 10, it is evident that the BA-RBF model achieves a discrimination rate of approximately 96.97% using both the original spectral data and the second-order processed spectral data. However, the training accuracy does not reach 100%. This may be related to data noise and the complexity of hydrogeological conditions.

In the discrimination result graph shown in Figure 9, the BA-RBF, PSO-RBF, and GA-RBF models were each subjected to 100 iterations of optimization search. The corresponding fitness curves are illustrated in Figure 11.

According to Figure 11, the fitness value of the BA-RBF model throughout the iteration process outperforms the other two classification models. It reaches the maximum accuracy value and stabilizes when the number of iterations is fewer than 10, demonstrating the best convergence effect. The GA-RBF model also converges around 10 iterations but exhibits slight fluctuations until it stabilizes at approximately 55 generations. The PSO-RBF model stabilizes around 80 generations, but its classification performance is relatively poor. Thus, the optimal value achieved by the BA-RBF algorithm is closest to the actual optimal value of the function.

In summary, the comparison of classification values with true values and the analysis of error results show that the PSO-RBF model, without any preprocessing, has the worst discrimination ability. The BA-RBF model, with its closer alignment to the true values and superior discrimination performance, outperforms both the GA-RBF and PSO-RBF models, which are themselves better than the RBF model. The recognition rate of the BA-RBF recognition model based on processed spectral data reached 96.97%, which was significantly better than that of the hydrochemical recognition model that has been widely studied and applied in recent years.

5. Conclusions

To ensure reliable prediction of mine water emergencies, this paper establishes a rapid discrimination model for mine water sources using the BA-RBF algorithm. This model integrates spectral data of water samples with hydrochemical data, while accounting for geographic and environmental factors. The BA-RBF water source identification model was then applied to the Shanxi Baode Coal Mine. Based on the analysis of water samples from four different aquifers in the mine, the following key conclusions were drawn:

(1) Measuring the absorbance of water samples using a spectrophotometer allows for the rapid creation of sample datasets. With sufficient water sample data, this method can effectively build a spectral database that enhances source identification, particularly in mining areas with complex geological conditions;

(2) The BA-RBF model developed for identifying water inrush sources in this research offers advantages such as rapid processing speed, high accuracy, and minimal sample volume requirements. It achieves predictive accuracy of up to 96.67% for both raw spectral data from different aquifer samples and spectral data that have undergone baseline correction and noise reduction;

(3) A comparison of the prediction accuracy for four different models—RBF, GA-RBF, PSO-RBF, and BA-RBF—was conducted using spectral data from various aquifers. The analysis revealed that the BA-RBF model had the highest recognition accuracy. This model significantly aids in the swift identification of water surge sources and helps mitigate disaster losses in similar mining areas;

(4) Compared to traditional water source identification methods that rely solely on hydrochemical data, the effectiveness of the water source identification approach based on spectral data has been demonstrated.

This study introduces a novel method for identifying sudden water sources. However, due to the constraints of the current research, only one water source identification method was explored. Future research should focus on developing and applying more comprehensive methods for identifying water sources in mixed water conditions. In addition, in future research, endeavors can be made to integrate spectral data with a broader array of technical approaches, such as travel-time tomography technology [42,43]. It is imperative to amass more data from diverse coal mines and enhance long-term monitoring of water samples. This will enable a more in-depth exploration of the model’s robustness, resilience, and scalability in a dynamic environment.

Author Contributions

All authors contributed to the study conception and design. Z.W.: Writing—original draft, Writing—review and editing, Methodology. Y.J.: Writing—original draft, Formal analysis, Methodology, Data curation. H.F.: Visualization. L.Y.: Data curation. D.D.: Funding acquisition. All authors have read and agreed to the published version of the manuscript.

Funding

This study was funded by the National Key Research and Development Program of the Ministry of Science and Technology of China (2023YFC3012101).

Data Availability Statement

The data used in this paper can be accessed by contacting the corresponding author directly.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Wang, J.; Shu, W. A brief review on the role of aseismic slip on experimental faults: Implications for fault-involved geological engineering safety. J. Saf. Sustain. 2025, in press. [Google Scholar] [CrossRef]
Hu, C.; Li, X.; Wu, Y.; Meng, B.; Jia, B. Experimental and numerical studies on mechanical characteristics and fracture behaviours of jointed rock with different roughness. J. Saf. Sustain. 2024, in press. [Google Scholar] [CrossRef]
Dong, S.; Wang, H.; Guo, X.; Zhou, Z. Characteristics of Water Hazards in China’s Coal Mines: A Review. Mine Water Environ. 2021, 40, 325–333. [Google Scholar] [CrossRef]
Meng, Z.P.; Li, G.Q.; Xie, X.T. A geological assessment method of floor water inrush risk and its application. Eng. Geol. 2012, 143, 51–60. [Google Scholar] [CrossRef]
Yin, S.X.; Zhang, J.C.; Liu, D.M. A study of mine water inrushes by measurements of in situ stress and rock failures. Nat. Hazards 2015, 79, 1961–1979. [Google Scholar] [CrossRef]
Wu, Q. Progress, problems and prospects of prevention and control technology of mine water and reutilization in China. J. China Coal Soc. 2014, 39, 795–805. [Google Scholar] [CrossRef]
Dou, H.; Ma, Z.; Cao, H.; Liu, F.; Hu, W.; Li, T. Application of isotopic and hydro-geochemical methods in identifying sources of mine inrushing water. Min. Sci. Technol. 2011, 21, 319–323. [Google Scholar] [CrossRef]
Tichomirowa, M.; Heidel, C.; Junghans, M.; Haubrich, F.; Matschullat, J. Sulfate and strontium water source identification by, O.; S and Sr isotopes and their temporal changes (1997–2008) in the region of Freiberg, central-eastern Germany. Chem. Geol. 2010, 276, 104–118. [Google Scholar] [CrossRef]
Qu, S.; Wang, G.; Shi, Z.; Xu, Q.; Guo, Y.; Ma, L.; Sheng, Y. Using stable isotopes (δD, δ¹⁸O, δ³⁴S and ⁸⁷Sr/⁸⁶Sr) to identify sources of water in abandoned mines in the Fengfeng coal mining district, northern China. Hydrogeol. J. 2018, 26, 1443–1453. [Google Scholar] [CrossRef]
Wang, X.; Xu, Z.; Sun, Y.; Zheng, J.; Zhang, C.; Duan, Z. Construction of multi-factor identification model for real-time monitoring and early warning of mine water inrush. Int. J. Min. Sci. Technol. 2021, 31, 853–866. [Google Scholar] [CrossRef]
Hou, E.; Wen, Q.; Che, X.; Chen, W.; Wei, J.; Ye, Z. Study on recognition of mine water sources based on statistical analysis. Arab. J. Geosci. 2019, 13, 5. [Google Scholar] [CrossRef]
Sun, L.; Gui, H. Establishment of water source discrimination model in coal mine by using hydrogeochemistry and statistical analysis: A case study from Renlou Coal Mine in northern Anhui Province, China. J. Coal Sci. Eng. 2012, 18, 385–389. [Google Scholar] [CrossRef]
Wang, Q.; Dong, S.; Wang, H.; Yang, J.; Huang, H.; Dong, X.; Yu, B. Hydrogeochemical processes and groundwater quality assessment for different aquifers in the Caojiatan coal mine of Ordos Basin, northwestern China. Environ. Earth Sci. 2020, 79, 199. [Google Scholar] [CrossRef]
Wang, Y.; Shi, L.; Wang, M.; Liu, T. Hydrochemical analysis and discrimination of mine water source of the Jiaojia gold mine area, China. Environ. Earth Sci. 2020, 79, 123. [Google Scholar] [CrossRef]
Hu, F.; Zhou, M.; Yan, P.; Li, D.; Lai, W.; Zhu, S.; Wang, Y. Selection of characteristic wavelengths using SPA for laser induced fluorescence spectroscopy of mine water inrush. Spectrochim. Acta A 2019, 219, 367–374. [Google Scholar] [CrossRef]
Yan, P.; Shang, S.; Zhang, C.; Yin, N.; Zhang, X.; Yang, G.; Zhang, Z.; Sun, Q. Research on the Processing of Coal Mine Water Source Data by Optimizing BP Neural Network Algorithm with Sparrow Search Algorithm. IEEE Access 2021, 9, 108718–108730. [Google Scholar] [CrossRef]
Yang, Y.; Yue, J.; Li, J.; Yang, Z. Mine water inrush sources online discrimination model using fluorescence spectrum and CNN. IEEE Access 2018, 6, 47828–47835. [Google Scholar] [CrossRef]
Singh, G. Impact of coal mining on mine water quality. Int. J. Mine Water 1988, 7, 49–59. [Google Scholar] [CrossRef]
Reza, R.; Singh, G. Heavy metal contamination and its indexing approach for river water. Int. J. Environ. Sci. Technol. 2010, 7, 785–792. [Google Scholar] [CrossRef]
Al Momani, F.A.; Örmeci, B. Measurement of polyacrylamide polymers in water and wastewater using an in-line UV–vis spectrophotometer. J. Environ. Chem. Eng. 2014, 2, 765–772. [Google Scholar] [CrossRef]
Dong, S.; Zheng, L.; Tang, S.; Shi, P. A Scientometric Analysis of Trends in Coal Mine Water Inrush Prevention and Control for the Period 2000–2019. Mine Water Environ. 2020, 39, 3–12. [Google Scholar] [CrossRef]
Xu, Z.; Sun, Y.; Gao, S.; Zhao, X.; Duan, R.; Yao, M.; Liu, Q. Groundwater Source Discrimination and Proportion Determination of Mine Inflow Using Ion Analyses: A Case Study from the Longmen Coal Mine, Henan Province, China. Mine Water Environ. 2018, 37, 385–392. [Google Scholar] [CrossRef]
Qian, J.; Zhang, C.; Fang, L.; Gao, Z.; Wang, L. Application of fuzzy excessive criterion model to determine source of inrush water in Xieqiao mine. In Proceedings of the 2010 Sixth International Conference on Natural Computation, Yantai, China, 10–12 August 2010; pp. 159–162. [Google Scholar] [CrossRef]
Huang, P.; Yang, Z.; Wang, X.; Ding, F. Research on Piper-PCA-Bayes-LOOCV discrimination model of water inrush source in mines. Arab. J. Geosci. 2019, 12, 334. [Google Scholar] [CrossRef]
Ju, Q.; Hu, Y. Source identification of mine water inrush based on principal component analysis and grey situation decision. Environ. Earth Sci. 2021, 80, 157. [Google Scholar] [CrossRef]
Ji, Y.; Dong, D.; Gao, J.; Wei, Z.; Ding, J.; Hu, Z. Source Discrimination of Mine Water Inrush Based on Spectral Data and EGA–PNN Model: A Case Study of Huangyuchuan Mine. Mine Water Environ. 2022, 41, 583–593. [Google Scholar] [CrossRef]
Donglin, D.; Wenjie, S.; Sha, X. Water-inrush assessment using a GIS-based Bayesian network for the 12-2 coal seam of the Kailuan Donghuantuo coal mine in China. Mine Water Environ. 2012, 31, 138–146. [Google Scholar] [CrossRef]
Sun, F.; Wei, J.; Wan, Y.; Liu, C. Recognition method of mine water source based on Fisher’s discriminant analysis and centroid distance evaluation. Coal Geol. Explor. 2017, 45, 80–84. [Google Scholar] [CrossRef]
Chen, L.; Feng, X.; Xu, D.; Zeng, W.; Zheng, Z. Prediction of Water Inrush Areas Under an Unconsolidated, Confined Aquifer: The Application of Multi-information Superposition Based on GIS and AHP in the Qidong Coal Mine, China. Mine Water Environ. 2018, 37, 786–795. [Google Scholar] [CrossRef]
Yang, B.; Sui, W.; Duan, L. Risk Assessment of Water Inrush in an Underground Coal Mine Based on GIS and Fuzzy Set Theory. Mine Water Environ. 2017, 36, 617–627. [Google Scholar] [CrossRef]
Tutu, H.; Cukrowska, E.M.; Dohnal, V.; Havel, J. Application of artificial neural networks for classification of uranium distribution in the Central Rand goldfield, South Africa. Environ. Model. Assess. 2005, 10, 143–152. [Google Scholar] [CrossRef]
Wei, W.; Shi, L.; Lu, X.; Feng, Z. Prediction of Mine Water Inflow Based on Support Vector Machine. In Proceedings of the IEEE 2011 Workshop on Digital Media and Digital Content Management, Hangzhou, China, 15–16 May 2011; pp. 326–329. [Google Scholar] [CrossRef]
Wei, J.; Li, G.; Xie, D.; Yu, G.; Man, X.; Wang, J. Discrimination of mine water-inflow sources based on the multivariate mixed model and fuzzy comprehensive evaluation. Arab. J. Geosci. 2020, 13, 873. [Google Scholar] [CrossRef]
Ji, Y.; Yu, L.; Wei, Z.; Ding, J.; Dong, D. Research progress on identification of mine water inrush sources: A visual analysis perspective. Mine Water Environ. 2025. [Google Scholar] [CrossRef]
Ji, Y.; Dong, D.; Mei, A.; Wei, Z. Study on Key Technology of Identification of Mine Water Inrush Source by PSO-LightGBM. Water Supply 2022, 22, 7416–7429. [Google Scholar] [CrossRef]
Roy, I.G. On computing first and second order derivative spectra. J. Comput. Phys. 2015, 295, 307–321. [Google Scholar] [CrossRef]
Powell, M.J.D. Radial Basis Functions for Multivariable Interpolation: A Review; Clarendon Press: Oxford, UK, 1987; pp. 143–167. [Google Scholar]
Er, M.J.; Wu, S.; Lu, J.; Toh, H.L. Face recognition with radial basis function (RBF) neural networks. IEEE Trans. Neural Netw. 2002, 13, 697–710. [Google Scholar] [CrossRef]
Zhang, R.; Huang, G.B.; Sundararajan, N.; Saratchandran, P. Improved GAP-RBF network for classification problems. Neurocomputing 2007, 70, 3011–3018. [Google Scholar] [CrossRef]
Jiang, Q.; Zhu, L.; Shu, C.; Sekar, V. An efficient multilayer RBF neural network and its application to regression problems. Neural Comput. Appl. 2021, 34, 4133–4150. [Google Scholar] [CrossRef]
Kumar, Y.; Kaur, A. Variants of bat algorithm for solving partitional clustering problems. Eng. Comput. 2021, 38, 1973–1999. [Google Scholar] [CrossRef]
Dong, L.; Pei, Z.; Xie, X. Early Identification of Abnormal Regions in Rock-Mass Using Traveltime Tomography. Engineering 2023, 22, 191–200. [Google Scholar] [CrossRef]
Zhao, Z.; Yao, X.; Xu, K.; Yang, L. Simulation study on response characteristics of water-bearing zone in metal mine monitored by resistivity method. J. Saf. Sustain. 2024, 1, 212–222. [Google Scholar] [CrossRef]

Figure 1. Geographical location of the study area and the sampling sites. Note: The blue dots in the image are the sampling points in the nearby area.

Figure 2. Research framework of this study.

Figure 3. Sketch map of the RBF.

Figure 4. Sketch map of the BA-RBF.

Figure 5. Original spectral curve of the water sample from Baode Mine.

Figure 6. Pretreatment images showing two different preprocessing methods.

Figure 7. Piper three-line diagram.

Figure 8. Pearson correlation analysis diagram.

Figure 9. Comparison of prediction performance among the four models. (a) The original spectral data; (b) The second-order processed spectral data; (c) The water chemistry data.

Figure 10. Comparison of true values and predicted values of the BA-RBF model across different data sets.

Figure 11. The fitness curve.

Table 1. Parameter definition of index.

	Positive	Negative
Positive	True Positive (TP)	False Negative (FN)
Negative	False Positive (FP)	True Negative (TN)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wei, Z.; Ji, Y.; Fang, H.; Yu, L.; Dong, D. Rapid Source Identification of Mine Water Inrush Using Spectral Data Combined with BA-RBF Modeling. Water 2025, 17, 790. https://doi.org/10.3390/w17060790

AMA Style

Wei Z, Ji Y, Fang H, Yu L, Dong D. Rapid Source Identification of Mine Water Inrush Using Spectral Data Combined with BA-RBF Modeling. Water. 2025; 17(6):790. https://doi.org/10.3390/w17060790

Chicago/Turabian Style

Wei, Zhonglin, Yuan Ji, Huiming Fang, Lujia Yu, and Donglin Dong. 2025. "Rapid Source Identification of Mine Water Inrush Using Spectral Data Combined with BA-RBF Modeling" Water 17, no. 6: 790. https://doi.org/10.3390/w17060790

APA Style

Wei, Z., Ji, Y., Fang, H., Yu, L., & Dong, D. (2025). Rapid Source Identification of Mine Water Inrush Using Spectral Data Combined with BA-RBF Modeling. Water, 17(6), 790. https://doi.org/10.3390/w17060790

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Rapid Source Identification of Mine Water Inrush Using Spectral Data Combined with BA-RBF Modeling

Abstract

1. Introduction

2. Data Acquisition

2.1. Geological and Hydrogeological Conditions

2.2. Spectral Data Acquisition

2.3. Hydrochemical Data Acquisition

3. Methods

3.1. Spectral Data Preprocessing

3.2. RBF

3.3. BA

3.4. BA-RBF Model

3.5. Model Evaluation

4. Results

4.1. Spectral Data Analysis

4.2. Hydrochemical Data Analysis

4.3. Analysis of Water Inrush Source Identification Model Results

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI