A Novel Approach for Predicting the Height of the Water-Flow Fracture Zone in Undersea Safety Mining

: The height of the water-ﬂow fracture zone (WFZ) is an important reference for designing the size of a waterproof crown pillar. Once the WFZ is connected with the sea, there will be catastrophic consequences, especially for undersea mining. This study suggests using a rotating forest (RoF) model to predict the height of the WFZ for the evaluation of the size of a waterproof crown pillar. To train and test the RoF model, ﬁve indicators with major inﬂuencing factors on undersea safety mining were determined, 107 ﬁeld-measured mining datasets were collected, 75 (70%) datasets were used for training, and 32 (30%) datasets were used for model testing. At the same time, the random forest ensemble algorithm (RFR) and support vector machine (SVM) models were introduced for comparison and veriﬁcation; in the end, the tested results were evaluated by RMSE (root-mean-square error) and R 2 . The comparison shows that the predicted results from the RoF model are signiﬁcantly better than those from the RFR and SVM models. An importance analysis of the impact indicators shows that the mining height and depth have signiﬁcant impacts on the prediction results. The development height of the WFZ in undersea safety mining was predicted via the RoF model. The predicted results via the RoF model were veriﬁed by ﬁeld observations using panoramic borehole televiewers. The RoF prediction results are consistent with the observation results at all depths. Compared with the other two models, the RoF model has the smallest average absolute error at 2.87%. The results show that the RoF model can be applied to predict the height of the WFZ in undersea mining, which could be an e ﬀ ective way of minimizing the mineral resource waste in the study area and in other similar areas in the world under the premise of mine safety.


Introduction
With the rapid growth of the economy, the demand for mineral resources has continued to increase [1]. However, as easy-to-mine orebodies are gradually exhausted, orebodies with complicated mining conditions, such as broken orebodies, high-cold orebodies, deep orebodies, and underwater or seabed orebodies, are becoming important mining targets [2][3][4]. With the development of technology and equipment upgrades, the exploitation of underwater mines is gradually increasing. After mining an underwater orebody, a goaf can be produced, which will cause the overburden to move, deform, and even break down [5], making it easy to form an overburden mining fracture zone, which may lead to water pouring into the mine, causing water inrush accidents [6]. According to the statistics of the government accident investigation system of the State Administration of Work Safety of the People's Republic of China [7], from 2003 to 2017 there were 561 mine water-inrush accidents in China, which led to 2875 deaths, as shown in Figure 1. The occurrence of water-inrush accidents in mines not only causes casualties and resource losses but also damages the original water and land Many approaches have been adopted to estimate the height of the water-flow fracture zone (WFZ). These approaches can be grouped as empirical equations, theoretical calculations, field measurements, physical and numerical simulations, and artificial intelligence methods.
The Chinese, British, and Japanese governments have issued empirical prediction equations based on mining practices [8]. Most of these empirical equations are derived from the accumulation of field experience. For safety reasons, these empirical prediction equations exaggerate the thickness of the crown pillar and lead to the waste of resources.
Beyond these equations, researchers have proposed theoretical calculations of the WFZ based on mining subsidence theory [9], the law of overlying stratum movement [10], and permeability changes in the rock strata [11]. Due to the complexity of the assumption conditions in theoretical calculations and the mining environment in the field, most of these results do not have a wide applicability.
Physical and numerical simulations are methods used to predict the heights of WFZs [12][13][14][15]. Many numerical simulation methods have been proposed to determine the height of the WFZ, such as PFC (Particle Flow Code) [16], UDEC (Universal Distinct Element Code) [17], 3DEC (3 Dimension Distinct Element Code) [18], FLAC (Fast Lagrangian Analysis of Continua) [19], and ANSYS [20]. However, these numerical simulations require many input parameters that may require approximations or assumptions, and physical simulations are time consuming and expensive.
Field measurements, such as the downhole water injection leak detection method, acoustic computed tomography (CT), the borehole TV method, the high-density resistivity method, the drilling mud testing system, and transient electromagnetic methods [21][22][23][24][25], have been used individually or in combination to measure the height of the WFZ. Due to the complexity of underground and underwater mining environments and equipment operations, field measurements are time consuming and expensive, similar to physical simulations.
In recent years, numerous artificial intelligence technologies have been utilized to predict the height of the WFZ, such as artificial neural networks (ANNs) [26], support vector machines (SVMs) [27], random forest regression (RFR) [28], and the gray relational analysis analytic hierarchy process (GRA-AHP) [29]. These technologies can synthetically consider various influencing factors and Many approaches have been adopted to estimate the height of the water-flow fracture zone (WFZ). These approaches can be grouped as empirical equations, theoretical calculations, field measurements, physical and numerical simulations, and artificial intelligence methods.
The Chinese, British, and Japanese governments have issued empirical prediction equations based on mining practices [8]. Most of these empirical equations are derived from the accumulation of field experience. For safety reasons, these empirical prediction equations exaggerate the thickness of the crown pillar and lead to the waste of resources.
Beyond these equations, researchers have proposed theoretical calculations of the WFZ based on mining subsidence theory [9], the law of overlying stratum movement [10], and permeability changes in the rock strata [11]. Due to the complexity of the assumption conditions in theoretical calculations and the mining environment in the field, most of these results do not have a wide applicability.
Physical and numerical simulations are methods used to predict the heights of WFZs [12][13][14][15]. Many numerical simulation methods have been proposed to determine the height of the WFZ, such as PFC (Particle Flow Code) [16], UDEC (Universal Distinct Element Code) [17], 3DEC (3 Dimension Distinct Element Code) [18], FLAC (Fast Lagrangian Analysis of Continua) [19], and ANSYS [20]. However, these numerical simulations require many input parameters that may require approximations or assumptions, and physical simulations are time consuming and expensive.
Field measurements, such as the downhole water injection leak detection method, acoustic computed tomography (CT), the borehole TV method, the high-density resistivity method, the drilling mud testing system, and transient electromagnetic methods [21][22][23][24][25], have been used individually or in combination to measure the height of the WFZ. Due to the complexity of underground and underwater mining environments and equipment operations, field measurements are time consuming and expensive, similar to physical simulations.
In recent years, numerous artificial intelligence technologies have been utilized to predict the height of the WFZ, such as artificial neural networks (ANNs) [26], support vector machines (SVMs) [27], random forest regression (RFR) [28], and the gray relational analysis analytic hierarchy process (GRA-AHP) [29]. These technologies can synthetically consider various influencing factors and predict results quickly and accurately in an economic way. Moreover, their results have a wide Remote Sens. 2020, 12, 358 3 of 18 applicability [30][31][32]. However, field operations show that these artificial intelligence technologies have some limitations. For example, some scholars have noted issues such as the excessive studying effect, as well as the requirement of a large number of computations and a lower speed of convergence than for the ANN model [1].
In undersea mining, continuous excavations will lead to unceasing fractures in the overlying rock strata. Once the WFZ connects with the sea water, an irreversible disaster will occur. To avoid this, it is necessary to retain a certain level of thickness in the roof pillar. However, a pillar that is too thick may lead to a waste of resources. If the pillar is too thin, the WFZ will easily connect with the sea water and cannot guarantee safety. Therefore, accurate analyses and predictions must be made for the height of the WFZ.
Unlike the aforementioned studies, a novel approach based on the RoF algorithm, which can generate base classifiers with large differences and a high precision, can improve the accuracy of prediction models for the height of the WFZ in undersea safety mining. The rotating forest (RoF) algorithm, proposed by Rodríguezin [33], is a new integrated learning algorithm. Compared with the random forest algorithm and other classical integrated learning algorithms, the RoF algorithm can generate base classifiers with large differences and a high precision, giving it a better generalization ability. RoF has been widely used in biomedicine [34], pattern recognition [35], geotechnical engineering [36], and environmental science [37]. In this study, the RoF algorithm is applied to estimate the development height of the WFZ in underwater mines.

Study Areas
Due to the complexity of mining conditions and hydrogeological conditions, many mines in China are threatened by water, especially mines in the north of China. As shown in Figure 2, the provinces of Henan, Anhui, Shandong, and Shaanxi in China form a concentrated area in which mines are threatened by water. In order to ensure the economic development and survival of large populations, mines below rivers and lakes are being extracted. For example, the Beizao mine (No. 3 in Table 1), located in Shandong, is an undersea (Bohai Sea) mine. The No. H2106 working face in this mine is about 330 m deep, the working face length is 150 m, the angle of the ore is 7 • , and the height of the WFZ is 39 m. This mine is threatened by seawater; once the WFZ is connected to the seawater, the results will be devastating to society. The Chenjiagou mine (No. 5 in Table 1), located in Liupan Mountain (the junction of Shaanxi, Gansu and Ningxia), is under the Beirui River. The No. H2106 working face in this mine is about 540 m deep, the working face length is 100 m, the angle of the ore is 21 • , and the height of the WFZ is 134 m. To study the height of the WFZ with the RoF algorithm, large amounts of data on the WFZ must be gathered in these areas.   Table 1).

Data Collection
In this study, 107 cases were collected from several mines in China [38][39][40][41][42][43]. Each data group contains field-measured data on the height of the WFZ and the aforementioned five main controlling factors, which are shown in Table 1. According to the literature, the lithological types of the rocks in these mines can possess different grades, subject to the uniaxial compressive strength of the rock. A value of 1 means that the rock is soft, a value of 2 means that the rock is medium soft, a value of 3 means that the rock is medium hard, and a value of 4 means that the rock is hard.

Rotating Forest Algorithm
The RoF ensemble algorithm can process the original sample features and use a certain feature transformation method to obtain the different sample sets required by the training base classifier. After that, the differences between the individual classifiers can be obtained. Then, the purpose of improving the classification accuracy rate can be achieved. This procedure is shown in Figure 3.
In this study, 107 cases were collected from several mines in China [38][39][40][41][42][43]. Each data group contains field-measured data on the height of the WFZ and the aforementioned five main controlling factors, which are shown in Table 1. According to the literature, the lithological types of the rocks in these mines can possess different grades, subject to the uniaxial compressive strength of the rock. A value of 1 means that the rock is soft, a value of 2 means that the rock is medium soft, a value of 3 means that the rock is medium hard, and a value of 4 means that the rock is hard.

Rotating Forest Algorithm
The RoF ensemble algorithm can process the original sample features and use a certain feature transformation method to obtain the different sample sets required by the training base classifier. After that, the differences between the individual classifiers can be obtained. Then, the purpose of improving the classification accuracy rate can be achieved. This procedure is shown in Figure 3.
First, the training dataset can be defined as follows: Second, the parameters in the RoF algorithm need to be defined in advance-namely s (the feature number of subsets) and L (the scale of integration). Among the parameters, s represents the number of characteristic indexes included in each feature subset, and L represents the number of base classifiers included in the integrated system. The RoF calculation process can be divided into two stages: the generation and synthesis of the base classifier.
Stage 1: Generation of the base classifier (1) The original feature set can be randomly segmented, and each subset contains a number of characteristic indexes. Among these subsets, any two subsets cannot intersect. If the original First, the training dataset can be defined as follows: Second, the parameters in the RoF algorithm need to be defined in advance-namely s (the feature number of subsets) and L (the scale of integration). Among the parameters, s represents the number of characteristic indexes included in each feature subset, and L represents the number of base classifiers included in the integrated system. The RoF calculation process can be divided into two stages: the generation and synthesis of the base classifier.
Stage 1: Generation of the base classifier (1) The original feature set can be randomly segmented, and each subset contains a number of characteristic indexes. Among these subsets, any two subsets cannot intersect. If the original feature set is indivisible, then the remainder forms a subset. In the end, a number of subsets can be obtained. The j-th feature subset of the i-th base classifier (F ij ) can then be expressed. (2) All F ij samples are extracted from the dataset X, and 75% of the samples are randomly extracted with the bootstrap method to generate the sample subset X ij . The subset X ij can be transformed into W ij according to the feature transformation algorithm. The transformed matrix W i1 , W i2 , · · · , W ik can be arranged to generate a sparse matrix W i , which is defined as follows: (3) The matrix W i can be adjusted to obtain the new matrix W i * , and each column is consistent with the original feature set sequence. (6) Use the base classifier G i to predict x i . (7) Repeat steps (5) and (6), and the classification results of all base classifiers can be obtained. Then, the results can be integrated as per the following equation: (8) Place sample x into the category with the highest probability; then, the final integrated classification result can be obtained.

Feature Transformation Algorithm
The decision tree is a sensitive classifier, and any small change in the data may result in a completely different structure of the decision tree in training [44]. Therefore, the feature transformation algorithm plays a crucial role in constructing discrepant data and directly affects the final classification results. Based on high-order statistics, the multifeature data can be processed with Independent Component Analysis (ICA). ICA can transform the features of a dataset into several statistically independent feature combinations to ensure the high-order statistical properties of the dataset. ICA is regarded as an important extension of PCA and is also suitable as a feature change method for RoF. ICA has the following advantages: This approach can provide a more realistic statistical model, which can precisely determine the data at positions in a high-dimensional space. This method also provides a nonorthogonal basis space, which can better reconstruct the data information when the original dataset has outliers. Moreover, ICA is sensitive to high-order statistical information.
We assume that X (the s-dimensional random variable) is a linear combination of random variables S, whose s-dimension components are independent of each other and can be expressed as follows: X = SA. Then, the s-dimension matrix W can be obtained, i.e., U = XW, where U represents the s-dimension matrix and is the estimated value of S, which satisfies the independence requirements of the s-dimensional components. Currently, the estimation algorithms of the matrix include fast ICA, Info Max, JADE, and radical ICA.

Model Evaluation Metrics
Before the model is used for prediction, the accuracy of the model must be evaluated. The root mean square error (RMSE) and coefficient of determination (R 2 ) can be selected as the evaluation indicators. These two indexes are defined as follows: where n -the total number of test datas y i -the observed output value of the test sample y i -the predicted value by the model y i -the average output value of the test samples.

Controlling Factors of WFZ
The WFZ is a complex phenomenon influenced by many factors [39]. In this study, according to the studies in [45][46][47], the indicators with major influencing factors on undersea safety mining were determined. Detailed descriptions of these five factors are provided below.
(1) Mining Depth Geostress increases by increasing the mining depth. After the orebody is mined out, the lateral and vertical movements of the overlying strata lead to the formation of overburden mining fractures in the rock. Therefore, the deeper the mining depth of the underwater mining becomes, the higher the pressure of the overlying strata will be, the more intense the lateral and vertical movements of the rock strata will be, and the more fractures will occur in the rock.
(2) Mining Height The mining height plays a key role in the stability of the overburden and the height of the broken rock. In a way, the mining height restricts the mine's size. At a certain recovery rate, the larger the mining height is, the larger the goaf will be. When the exposed space increases, the height of the overburden strata will also increase. If the mining height shows a growth trend, then the height of the overburden collapse will also exhibit a growth trend.

(3) Lithology Type
Under normal circumstances, due to the differences in plasticity, stiffness, brittleness, and other characteristics of the rock formation, the lithology will have correspondingly different impacts on the extent of the rock fracturing. If the lithology hardness is high, a more serious overburden damage development will occur than under a low lithology hardness, which will yield an only slight overburden damage development. For example, the fracture zone usually has fractures along the vertical or near-vertical plane. The question of whether the rock layer can be fractured, along with the degree of fracture development, are both directly related to the lithology.

(4) Working-Face Length
As the length of the face increases, a greater elastic potential will be released when the surrounding rock is damaged. Naturally, this will also cause a wider range of surrounding rock damage, and the extent of this damage is likely to increase with an increase in the working-face length. If an ore body with a constant thickness is mined, the size of the stope will also have different effects on the height of the overburden. The smaller the stope required for the mining method used, the lower the disturbance to the surrounding rock; and the lower the elastic strain energy released when the surrounding rock is destroyed, the smaller the generated damage height. Conversely, if a different mining method is applied, the larger the mine size required for the method is, the more damage it will cause.

(5) Orebody Dip Angle
In general, the inclination of an inclined orebody is inversely proportional to the height of the caving zone and the fracture zone, as shown in Figure 4. The inclination of the orebody will have a crucial influence on the height of the fractures in the overburden during mining. The upper and lower arch angles of the roof damage are small, but the upper arch angle is the largest. Due to the increase in this angle, the funnel of the roof damage will be reduced. At the same time, the corner of the arch will slowly disappear.
As the length of the face increases, a greater elastic potential will be released when the surrounding rock is damaged. Naturally, this will also cause a wider range of surrounding rock damage, and the extent of this damage is likely to increase with an increase in the working-face length. If an ore body with a constant thickness is mined, the size of the stope will also have different effects on the height of the overburden. The smaller the stope required for the mining method used, the lower the disturbance to the surrounding rock; and the lower the elastic strain energy released when the surrounding rock is destroyed, the smaller the generated damage height. Conversely, if a different mining method is applied, the larger the mine size required for the method is, the more damage it will cause.

(5) Orebody Dip Angle
In general, the inclination of an inclined orebody is inversely proportional to the height of the caving zone and the fracture zone, as shown in Figure 4. The inclination of the orebody will have a crucial influence on the height of the fractures in the overburden during mining. The upper and lower arch angles of the roof damage are small, but the upper arch angle is the largest. Due to the increase in this angle, the funnel of the roof damage will be reduced. At the same time, the corner of the arch will slowly disappear.

Shortcomings and Uncertainty
Compared with the random forest algorithm and other classical integrated learning algorithms, the RoF algorithm can generate base classifiers with large differences and a high precision, thus giving it a better generalization ability. However, mining is a complicated industry affected by hydrogeological conditions. The uncertainty in this research lies in the absence of data on the fault, initial fissure, and pore water pressure in the rockmass. Unlike the five factors in Section 2.6, these data are difficult to obtain, analyze, and quantify. If these indicators experience a large change in the same mine, a more obvious difference could appear in the height of the WFZ, which could threaten the safety of the mine. In order to apply this method accurately, the hydrogeological conditions must be precisely detected.
In order to prevent the movement and fracture of the rock strata, progressively more mines are adopting the cut and fill mining method. The shortcomings in this research lie in the absence of influence from filling. The effects from filling on the fractures in rockmasses are very complex, and the filling materials' strength, filling rate, and compression rate may generate only a slight effect on the height of the WFZ. Future work will consider the effect of filling on the height of the WFZ.

Shortcomings and Uncertainty
Compared with the random forest algorithm and other classical integrated learning algorithms, the RoF algorithm can generate base classifiers with large differences and a high precision, thus giving it a better generalization ability. However, mining is a complicated industry affected by hydrogeological conditions. The uncertainty in this research lies in the absence of data on the fault, initial fissure, and pore water pressure in the rockmass. Unlike the five factors in Section 2.6, these data are difficult to obtain, analyze, and quantify. If these indicators experience a large change in the same mine, a more obvious difference could appear in the height of the WFZ, which could threaten the safety of the mine. In order to apply this method accurately, the hydrogeological conditions must be precisely detected.
In order to prevent the movement and fracture of the rock strata, progressively more mines are adopting the cut and fill mining method. The shortcomings in this research lie in the absence of influence from filling. The effects from filling on the fractures in rockmasses are very complex, and the filling materials' strength, filling rate, and compression rate may generate only a slight effect on the height of the WFZ. Future work will consider the effect of filling on the height of the WFZ.

Validity Verification of RoF Model: Comparison and Evaluation
Among the collected cases, we selected 75 (70%) cases as a training dataset randomly and selected RMSE and R 2 were selected to evaluate the three models. Based on Equations (4) and (5), the RMSE and R 2 of each model was calculated, as indicated in Table 2. As shown, the RoF model has the lowest R 2 value with 0.02 (compared to 0.08 for the RFR model and 0.13 for the SVM model) and a medium RMSE value of 39.81. Therefore, it can be concluded that the three models are reasonable, and that the RoF model has a better performance than the other two models.  Figure 5 shows the comparison of the observed and predicted heights determined using the three models. Compared with the RFR and SVM models, the data predicted by the RoF model are almost the same as those predicted with the measured data, while the results predicted by the other two models are slightly different from the measured data.

Validity Verification of RoF Model: Comparison and Evaluation
Among the collected cases, we selected 75 (70%) cases as a training dataset randomly and selected 32 (30%) cases for testing the dataset.
RMSE and R 2 were selected to evaluate the three models. Based on Equations (4) and (5), the RMSE and R 2 of each model was calculated, as indicated in Table 2. As shown, the RoF model has the lowest R 2 value with 0.02 (compared to 0.08 for the RFR model and 0.13 for the SVM model) and a medium RMSE value of 39.81. Therefore, it can be concluded that the three models are reasonable, and that the RoF model has a better performance than the other two models.  Figure 5 shows the comparison of the observed and predicted heights determined using the three models. Compared with the RFR and SVM models, the data predicted by the RoF model are almost the same as those predicted with the measured data, while the results predicted by the other two models are slightly different from the measured data. In the RoF model, the mining height and mining depth have the highest importance, followed by the lithology type and working-face length. The dip angle has the lowest importance in the Gini index. Regarding the mean decrease in accuracy, the order of the importance degree is consistent with the results obtained for the Gini index method.
Compared with the SVM and RFR models, the mining height and mining depth have the highest importance in the RoF model, followed by the lithology type and working-face length. The dip angle has the lowest importance in the Gini index. Regarding the mean decrease in accuracy, the order of the importance degree is consistent with the results obtained for the Gini index method.     In the RoF model, the mining height and mining depth have the highest importance, followed by the lithology type and working-face length. The dip angle has the lowest importance in the Gini index. Regarding the mean decrease in accuracy, the order of the importance degree is consistent with the results obtained for the Gini index method.
Compared with the SVM and RFR models, the mining height and mining depth have the highest importance in the RoF model, followed by the lithology type and working-face length. The dip angle has the lowest importance in the Gini index. Regarding the mean decrease in accuracy, the order of the importance degree is consistent with the results obtained for the Gini index method.

Project Profile
The Xinli mining area is located to the southwest of the Sanshandao gold mine in Shandong Province, China. The designed production capacity of the Xinli mining district is 1500 t/day. It is the largest undersea metal mine in the world thus far.
Geographically, the Xinli mining district is located at the intersection of Bohai Bay and the Wanghe River. Bohai Bay is located on the west and north sides of the mining area, as shown in Figure 9. The entire coastline is nearly 5000 km long. The south side is the coastal plain, and the east side is the Wanghe River. There is also a surface industrial plant on the coastal plain. Because it is located at the intersection of Bohai Bay and the Wanghe River, the surface water in the area is derived from these water bodies. The orebody is basically under the Wanghe River and Bohai Bay, and the main ore body is at an angle of 60 • with the coastline, while the ore dip angle is 46 • . From the perspective of the relationship between the water bodies, the safety of the Xinli mining district is largely affected by seawater. The test results of the hydrogeological conditions and lithological parameters in the mining area show that the hard rock composed of fricrock and siltstone is an aquifer with a cumulative thickness of about 30 m. The geological well off the coast of Xinli can illustrate the stratum's structure, as shown in Figure 10.
To prevent water inrush disasters, it is important to accurately determine the height of the WFZ in the mining-disturbed overburden.

Prediction the Height of the Water-Flow Fracture Zone
In the Xinli district, the classification of the lithology type is 2, the orebody dip angle is 46°, the mining heights at each depth are 35 m, 40 m, and 40 m, and the working-face lengths at each depth are 49 m, 56 m, and 56 m. By applying the SVM, RFR, and RoF models generated above the stratum fractures during excavation at −165 m, −200 m, and −240 m depths, the height of the WFZ can be predicted, as listed in Table 3.

Prediction the Height of the Water-Flow Fracture Zone
In the Xinli district, the classification of the lithology type is 2, the orebody dip angle is 46°, the mining heights at each depth are 35 m, 40 m, and 40 m, and the working-face lengths at each depth are 49 m, 56 m, and 56 m. By applying the SVM, RFR, and RoF models generated above the stratum fractures during excavation at −165 m, −200 m, and −240 m depths, the height of the WFZ can be predicted, as listed in Table 3. Table 3. Predicting the height of the WFZ with the three models at the three depths.

Methods
Mining Mining Lithology Working-Face Dip Predicted

Prediction the Height of the Water-Flow Fracture Zone
In the Xinli district, the classification of the lithology type is 2, the orebody dip angle is 46 • , the mining heights at each depth are 35 m, 40 m, and 40 m, and the working-face lengths at each depth are 49 m, 56 m, and 56 m. By applying the SVM, RFR, and RoF models generated above the stratum fractures during excavation at −165 m, −200 m, and −240 m depths, the height of the WFZ can be predicted, as listed in Table 3.

Field Observation
To verify the accuracy of the predicted values, a panoramic borehole televiewer ( Figure 11) was employed to measure the height of the WFZ at each depth. The televiewer can present a clear visual inspection of the rocks along the borehole wall with 360 • borehole images and real-time videos. This device can thus record the characteristics of the fractures above the caved zone.

.3. Field Observation
To verify the accuracy of the predicted values, a panoramic borehole televiewer ( Figure 11) was employed to measure the height of the WFZ at each depth. The televiewer can present a clear visual inspection of the rocks along the borehole wall with 360° borehole images and real-time videos. This device can thus record the characteristics of the fractures above the caved zone.  [17,45].
In borehole (1), the developed fractures above 33.64 m had relatively large apertures and tended to be interconnected; water flow traces were also clearly visible, as shown in Figure 12a. However, the fractures in the lower region were isolated, and the apertures were relatively small. In boreholes (2) to (4), similar separations were located at 34.18 m, 35.28 m, and 31.58 m from the bottom of the hole. Based on these data, the height of the WFZ at a −240 m depth could be determined. It is reasonable to infer that the height of the WFZ at a −240 m depth is approximately 35.3 m. The aperture, interconnectivity, fracture development degree, and even traces of the flow of water can be observed by increasing the borehole depth. Furthermore, the top boundaries of the fractured zone above the caved zone can be determined by this device. To obtain the height of the WFZ at the top of the Xinli District, four boreholes were drilled along the roadway to a −200 m depth to measure the height of the WFZ at a −240 m depth. The height of the WFZ at −200 m and −165 m depths was measured by Chen in [17,45].
In borehole (1), the developed fractures above 33.64 m had relatively large apertures and tended to be interconnected; water flow traces were also clearly visible, as shown in Figure 12a. However, the fractures in the lower region were isolated, and the apertures were relatively small. In boreholes (2) to (4), similar separations were located at 34. A comparison between the field observation data and the prediction results obtained via the three models is summarized in Table 4. The field observations at −165 m and −200 m from [17,45] are included in this table. Compared with the field-measured data, the average relative errors are 2.87%, 8.72%, and 16.8%, which indicates that the RoF model has the best application performance.

Conclusions
Owing to its unique hydrogeological environment, undersea mining is a special type of metal mining with high risk. In this study, a novel approach based on the RoF algorithm was proposed to predict the height of the WFZ. To train and test the model, 107 data were collected, and five main controlling factors were employed. Furthermore, the trained and tested model was used to predict the height of the WFZ at a −240 m depth in undersea mining. Compared with other classical integrated learning algorithms, the RoF algorithm can generate base classifiers with large differences and a high precision, giving it a better generalization ability and the smallest R 2 value. To verify the accuracy of the predicted value, a panoramic borehole televiewer was employed to measure the height of the WFZ at each depth. The result indicates that the RoF model has the smallest relative error and best application performance.
Compared with other models, the WFZ development height prediction model based on the RoF algorithm has the following advantages: (1) The model has the features of simple data processing and good compatibility for new data, and it is conducive to the expansion of the database; (2) The model has a high recognition accuracy, good generalization ability, and more reliable mapping; (3) As a base classifier, the decision tree can deal with continuous variables and type variables, while also being A comparison between the field observation data and the prediction results obtained via the three models is summarized in Table 4. The field observations at −165 m and −200 m from [17,45] are included in this table. Compared with the field-measured data, the average relative errors are 2.87%, 8.72%, and 16.8%, which indicates that the RoF model has the best application performance.

Conclusions
Owing to its unique hydrogeological environment, undersea mining is a special type of metal mining with high risk. In this study, a novel approach based on the RoF algorithm was proposed to predict the height of the WFZ. To train and test the model, 107 data were collected, and five main controlling factors were employed. Furthermore, the trained and tested model was used to predict the height of the WFZ at a −240 m depth in undersea mining. Compared with other classical integrated learning algorithms, the RoF algorithm can generate base classifiers with large differences and a high precision, giving it a better generalization ability and the smallest R 2 value. To verify the accuracy of the predicted value, a panoramic borehole televiewer was employed to measure the height of the WFZ at each depth. The result indicates that the RoF model has the smallest relative error and best application performance.
Compared with other models, the WFZ development height prediction model based on the RoF algorithm has the following advantages: (1) The model has the features of simple data processing and good compatibility for new data, and it is conducive to the expansion of the database; (2) The model has a high recognition accuracy, good generalization ability, and more reliable mapping; (3) As a base classifier, the decision tree can deal with continuous variables and type variables, while also being less affected by outliers. At the same time, the model also has inevitable shortcomings: For unbalanced data sets, the recognition results of the RoF algorithm are more inclined toward more categories. In general, it is completely feasible and reliable to use the RoF algorithm to predict the height of WFZ, which is of practical significance for mining design and disaster prevention. More extensive data can lead to a more detailed database and a big data platform, which will make the model more reliable.
However, mining is a complicated industry affected by hydrogeological conditions. The uncertainty in this research lies in the absence of data on the fault, initial fissure, and pore water pressure in the rockmass. Unlike the five factors we studied, these data are hard to obtain, analyze, and quantify. If these indicators experience a large change in the same mine, a more obvious difference in the height of the WFZ could be produced, which could threaten the safety of the mine. To apply this method accurately, the hydrogeological conditions must be precisely detected. In order to prevent the movement and fracture of the rock strata, progressively more mines have adopted the cut and fill mining method. The shortcomings in this research relate to the absence of influence from filling. The effects from filling on fractures in the rockmass are very complex, and the filling materials' strength, filling rate, and compression rate may generate little effect on the height of the WFZ. Future work should consider the effect of filling on the height of the WFZ.
Finally, the results indicate that the machine learning algorithm is a promising technology, and that the resulting WFZ height can be used to manage the last resources and a water inrush reduction. Meanwhile, the empirical prediction equations on the height of the WFZ issued by the government should be updated by applying multiple methods, which can minimize the mineral resource waste and allow mining companies to obtain greater economic benefits under the premise of safety.