Classification of Rock Mass Quality in Underground Rock Engineering with Incomplete Data Using XGBoost Model and Zebra Optimization Algorithm
Abstract
1. Introduction
2. Methodology
2.1. Extreme Gradient Boosting
2.2. Zebra Optimization Algorithm
2.3. Bayesian Optimization Algorithm
2.4. Hybrid Models
- (1)
- Data Preparation: Collect rock mass classification data from underground engineering projects, randomly dividing it into training and test sets. Preprocess the data to eliminate the effects of class imbalance, scale, and magnitude. The same training set is used for both models, while the test set is reserved for evaluation.
- (2)
- Parameter Setting and Initialization: Define the hyperparameter optimization range for XGBoost. Set the population size and iteration number for ZOA and the same iteration number for BO. ZOA randomly generates a group of zebra individuals in the search space, while BO randomly selects several points within the search space.
- (3)
- Fitness Evaluation: Establish an appropriate fitness function for models and calculate the fitness value for each individual or point.
- (4)
- Iterative Update: ZOA updates positions using strategies to evade predators, retaining individuals with higher fitness for the next generation. BO selects the next evaluation point based on the surrogate model through the acquisition function and updates the surrogate model’s parameters with new data points. In each iteration, the model evaluates the objective function.
- (5)
- Output Optimal Solution: Upon reaching the stopping condition, output the optimal hyperparameter combination.
3. Data
3.1. Data Preparation and Analysis
3.2. Data Pre-Processing
4. Modeling
4.1. Model Metrics
4.2. Model Training
5. Results and Discussion
5.1. Model Evaluation
5.2. Model Interpretation
5.3. Engineering Validation
6. Limitations and Future Studies
- The dataset used is relatively small and lacks cases of class I. Expanding the dataset size in the future could further enhance the model’s applicability to a wider range of engineering scenarios.
- While ZOA-XGBoost performs well with missing values, its robustness and stability against other types of uncertainty and noisy data have not been fully verified. Future research could incorporate more noise-handling and uncertainty-evaluation methods to improve model reliability.
- The features used in this study (UCS, RQD, Kv, and W) play important roles in rock mass quality prediction, but other influential factors were not included. Future improvements could involve incorporating additional geological and engineering factors, such as rock mass structural characteristics and stress conditions, to further refine the model.
7. Conclusions
- Performance evaluation results indicate that the ZOA-XGBoost model with a population size of 30 achieved the best predictive performance. The evaluation metrics on the test set were Accuracy of 0.923, Kappa of 0.887, Precision of 0.932, Recall of 0.932, and F1-score of 0.922.
- Multiple feature importance analysis methods demonstrated that RQD and UCS are the key input variables for predicting rock mass quality. ICE analysis revealed that higher RQD and UCS values correspond to better rock mass quality, consistent with actual engineering experience.
- To further validate the performance of the ZOA-XGBoost model, an additional rock mass quality classification dataset containing missing values was constructed. The results showed that ZOA-XGBoost achieved an accuracy of 80.00% on the incomplete dataset, significantly outperforming other machine learning models with imputed missing values. This confirms the reliability and effectiveness of the hybrid model developed in this study for practical engineering applications.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Terzaghi, K. Rock Defects and Loads on Tunnel Supports; Harvard University, Graduate School of Engineering: Cambridge, MA, USA, 1946. [Google Scholar]
- Deere, D.U. Technical Description of Rock Cores for Engineering Purposes; University of Illinois: Urbana, IL, USA, 1962. [Google Scholar]
- Bieniawski, Z.T. Engineering Classification of Jointed Rock Masses. Civ. Eng. Siviele Ingenieurswese 1973, 1973, 335–343. [Google Scholar]
- Barton, N.; Lien, R.; Lunde, J. Analysis of Rock Mass Quality and Support Practice in Tunneling, and a Guide for Estimating Support Requirements: Internal Report; Norges Geotekniske Institute: Trondheim, Norway, 1974. [Google Scholar]
- Ma, J.; Li, T.; Yang, G.; Dai, K.; Ma, C.; Tang, H.; Wang, G.; Wang, J.; Xiao, B.; Meng, L. A Real-Time Intelligent Classification Model Using Machine Learning for Tunnel Surrounding Rock and Its Application. Georisk Assess. Manag. Risk Eng. Syst. Geohazards 2023, 17, 148–168. [Google Scholar] [CrossRef]
- Zhao, J.; Li, D.; Jiang, J.; Luo, P. Uniaxial Compressive Strength Prediction for Rock Material in Deep Mine Using Boosting-Based Machine Learning Methods and Optimization Algorithms. CMES Comput. Model. Eng. Sci. 2024, 140, 275–304. [Google Scholar] [CrossRef]
- Hamdia, K.M.; Ghasemi, H.; Bazi, Y.; AlHichri, H.; Alajlan, N.; Rabczuk, T. A Novel Deep Learning Based Method for the Computational Material Design of Flexoelectric Nanostructures with Topology Optimization. Finite Elem. Anal. Des. 2019, 165, 21–30. [Google Scholar] [CrossRef]
- Liu, K.; Liu, B.; Fang, Y. An Intelligent Model Based on Statistical Learning Theory for Engineering Rock Mass Classification. Bull. Eng. Geol. Environ. 2019, 78, 4533–4548. [Google Scholar] [CrossRef]
- Santos, A.E.M.; Lana, M.S.; Pereira, T.M. Rock Mass Classification by Multivariate Statistical Techniques and Artificial Intelligence. Geotech. Geol. Eng. 2021, 39, 2409–2430. [Google Scholar] [CrossRef]
- Zhou, M.; Chen, J.; Huang, H.; Zhang, D.; Zhao, S.; Shadabfar, M. Multi-Source Data Driven Method for Assessing the Rock Mass Quality of a NATM Tunnel Face via Hybrid Ensemble Learning Models. Int. J. Rock Mech. Min. Sci. 2021, 147, 104914. [Google Scholar] [CrossRef]
- Liu, Q.; Wang, X.; Huang, X.; Yin, X. Prediction Model of Rock Mass Class Using Classification and Regression Tree Integrated AdaBoost Algorithm Based on TBM Driving Data. Tunn. Undergr. Space Technol. 2020, 106, 103595. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; ACM: San Francisco, CA, USA, 2016; pp. 785–794. [Google Scholar]
- Zhou, J.; Wang, Z.; Li, C.; Wei, W.; Wang, S.; Armaghani, D.J.; Peng, K. Hybridized Random Forest with Population-Based Optimization for Predicting Shear Properties of Rock Fractures. J. Comput. Sci. 2023, 72, 102097. [Google Scholar] [CrossRef]
- Trojovska, E.; Dehghani, M.; Trojovsky, P. Zebra Optimization Algorithm: A New Bio-Inspired Optimization Algorithm for Solving Optimization Algorithm. IEEE Access 2022, 10, 49445–49473. [Google Scholar] [CrossRef]
- Li, D.; Liu, Z.; Xiao, P.; Zhou, J.; Armaghani, D.J. Intelligent Rockburst Prediction Model with Sample Category Balance Using Feedforward Neural Network and Bayesian Optimization. Undergr. Space 2022, 7, 833–846. [Google Scholar] [CrossRef]
- Hu, J.; Zhou, T.; Ma, S.; Yang, D.; Guo, M.; Huang, P. Rock Mass Classification Prediction Model Using Heuristic Algorithms and Support Vector Machines: A Case Study of Chambishi Copper Mine. Sci. Rep. 2022, 12, 928. [Google Scholar] [CrossRef] [PubMed]
- Li, S.; Shen, Y.; Lin, P.; Xie, J.; Tian, S.; Lv, Y.; Ma, W. Classification Method of Surrounding Rock of Plateau Tunnel Based on BP Neural Network. Front. Earth Sci. 2023, 11, 1283520. [Google Scholar] [CrossRef]
- Wu, S.; Yang, S.; Du, X. A Model for Evaluation of Surrounding Rock Stability Based on D-S Evidence Theory and Error-Eliminating Theory. Bull. Eng. Geol. Environ. 2021, 80, 2237–2248. [Google Scholar] [CrossRef]
- Yin, H.; Zhao, H.; Xu, L.; Zhao, C.; Ma, D.; Cong, S. Classification of Rock Mass in Mine Based on Improved Fuzzy Comprehensive Evaluation Method. Met. Mine 2020, 53–58. (In Chinese) [Google Scholar] [CrossRef]
- Liu, A.; Su, L.; Zhu, X.; Zhao, G. Rock Quality Evaluation Based on Distance Discriminant Analysis and Fuzzy Mathematic Method. J. Min. Saf. Eng. 2011, 28, 462–467. (In Chinese) [Google Scholar]
- Hu, J.; Ai, Z. Extension Evaluation Model of Rock Mass Quality for Underground Mine Based on Optimal Combination Weighting. Gold Sci. Technol. 2017, 25, 39–45. (In Chinese) [Google Scholar]
- Cai, G. Study of the BP Neural Network on the Stability Classification of Surrounding Rocks. Master’s Thesis, Hohai University, Nanjing, China, 2002. (In Chinese). [Google Scholar]
- Huang, R.; Zhao, Z.; Li, P.; Zhang, X. Tunnel’s Quality Evaluation of Surrounding Rock Based on Entropy Weight Method and Extenics. Highw. Eng. 2012, 37, 139–143. (In Chinese) [Google Scholar]
- He, H.; Bai, Y.; Garcia, E.A.; Li, S. ADASYN: Adaptive Synthetic Sampling Approach for Imbalanced Learning. In Proceedings of the 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), Hong Kong, China, 1–8 June 2008; pp. 1322–1328. [Google Scholar]
- Qiu, Y.; Zhou, J. Short-Term Rockburst Damage Assessment in Burst-Prone Mines: An Explainable XGBOOST Hybrid Model with SCSO Algorithm. Rock Mech. Rock Eng. 2023. [Google Scholar] [CrossRef]
- Li, C.; Zhou, J. Prediction and Optimization of Adverse Responses for a Highway Tunnel after Blasting Excavation Using a Novel Hybrid Multi-Objective Intelligent Model. Transp. Geotech. 2024, 45, 101228. [Google Scholar] [CrossRef]
- Goldstein, A.; Kapelner, A.; Bleich, J.; Pitkin, E. Peeking Inside the Black Box: Visualizing Statistical Learning with Plots of Individual Conditional Expectation. J. Comput. Graph. Stat. 2015, 24, 44–65. [Google Scholar] [CrossRef]
- Wu, S.; Chen, J.; Wu, M. Study on Stability Classification of Underground Engineering Surrounding Rock Based on Concept Lattice—TOPSIS. Arab. J. Geosci. 2020, 13, 346. [Google Scholar] [CrossRef]
- Zheng, X. Rock Mass Quality Classification Method Based on the VWT and Cloud Model. Mod. Min. 2018, 34, 88–90+95. (In Chinese) [Google Scholar]
- Liu, M.; Jiang, J.; Jiang, H.; Chen, Z.; Xie, R. Rock Mass Quality Grading of Underground Mining Mine Based on PCA-EWM-TOPSIS Coupled Algorithm. Gold 2022, 43, 27–31. (In Chinese) [Google Scholar]
- Zhang, H.; Yan, W.; Guo, S.; Jiao, M.; Lei, M. Classification of Rock Mass Quality in a Mine Based on ELM Model and Its Application. Gold 2018, 39, 32–34+38. (In Chinese) [Google Scholar]
- Yang, C.; Ke, C.; Yang, J.; Zhou, P.; Sha, Y.; Wu, Y. Evaluation and Prediction of Rock Mass Quality Based on Fuzzy RES-Extenics Theory: A Case Study of Pulang Copper Mine Area. Saf. Environ. Eng. 2022, 29, 122–131. (In Chinese) [Google Scholar] [CrossRef]












| Class | Statistical Indicators | UCS (MPa) | RQD (%) | Kv | W L (min·10 m)−1 | 
|---|---|---|---|---|---|
| class II | Count and percentage | 37 (28.5%) | 37 (28.5%) | 37 (28.5%) | 37 (28.5%) | 
| Mean value | 102.32 | 77.72 | 0.62 | 5.02 | |
| Standard deviation | 25.74 | 6.94 | 0.11 | 5.30 | |
| Min value | 70.00 | 58.13 | 0.30 | 0 | |
| 25th percentile | 90.00 | 75.00 | 0.55 | 0 | |
| 50th percentile | 94.00 | 77.50 | 0.65 | 1.00 | |
| 75th percentile | 95.00 | 82.00 | 0.70 | 10.00 | |
| Max value | 181.73 | 90.10 | 0.75 | 17.00 | |
| class III | Count and percentage | 43 (33.1%) | 43 (33.1%) | 43 (33.1%) | 43 (33.1%) | 
| Mean value | 66.31 | 55.59 | 0.44 | 21.64 | |
| Standard deviation | 28.90 | 15.49 | 0.16 | 36.34 | |
| Min value | 25.00 | 26.00 | 0.12 | 0 | |
| 25th percentile | 40.00 | 50.00 | 0.32 | 9.50 | |
| 50th percentile | 70.00 | 52.00 | 0.40 | 15.00 | |
| 75th percentile | 93.50 | 69.00 | 0.57 | 20.00 | |
| Max value | 127.92 | 88.60 | 0.70 | 223.00 | |
| class IV | Count and percentage | 27 (20.8%) | 27 (20.8%) | 27 (20.8%) | 27 (20.8%) | 
| Mean value | 45.16 | 45.95 | 0.42 | 30.11 | |
| Standard deviation | 24.90 | 15.93 | 0.15 | 36.45 | |
| Min value | 6.00 | 20.00 | 0.20 | 0 | |
| 25th percentile | 28.60 | 36.75 | 0.30 | 8.75 | |
| 50th percentile | 40.00 | 43.00 | 0.38 | 21.00 | |
| 75th percentile | 53.00 | 52.50 | 0.54 | 40.00 | |
| Max value | 118.00 | 89.50 | 0.71 | 168.00 | |
| class V | Count and percentage | 23 (17.7%) | 23 (17.7%) | 23 (17.7%) | 23 (17.7%) | 
| Mean value | 26.97 | 23.70 | 0.43 | 35.28 | |
| Standard deviation | 18.82 | 9.83 | 0.19 | 31.51 | |
| Min value | 4.00 | 10.00 | 0.07 | 0 | |
| 25th percentile | 14.50 | 16.00 | 0.29 | 11.00 | |
| 50th percentile | 27.00 | 24.00 | 0.43 | 20.00 | |
| 75th percentile | 35.90 | 29.00 | 0.58 | 58.50 | |
| Max value | 86.00 | 44.00 | 0.75 | 125.00 | 
| Hyperparameters | Minimum Value | Maximum Value | 
|---|---|---|
| n_estimators | 1 | 500 | 
| max_depth | 1 | 25 | 
| learning_rate | 0.001 | 1 | 
| gamma | 0 | 5 | 
| reg_alpha | 1 | 15 | 
| reg_lambda | 1 | 15 | 
| Model | Hyperparameters | |||||
|---|---|---|---|---|---|---|
| n_Estimators | Max_Depth | Learning_Rate | Gamma | Reg_Alpha | reg_Lambda | |
| ZOA-XGBoost-30 | 161 | 4 | 0.2212 | 0.0001 | 1.0042 | 1.4136 | 
| ZOA-XGBoost-50 | 125 | 4 | 0.3824 | 0.0020 | 1.0002 | 1.4090 | 
| ZOA-XGBoost-70 | 121 | 4 | 0.2494 | 0.0002 | 1.0027 | 1.7869 | 
| ZOA-XGBoost-90 | 98 | 4 | 0.3771 | 0.0011 | 1.0006 | 1.7440 | 
| ZOA-XGBoost-110 | 59 | 4 | 0.3823 | 0 | 1.0000 | 1.1562 | 
| ZOA-XGBoost-130 | 102 | 4 | 0.3461 | 0.0004 | 1.0002 | 1.0101 | 
| Model | ACC | Kappa | Precision | Recall | F1-Score | 
|---|---|---|---|---|---|
| XGBoost | 0.821 | 0.738 | 0.829 | 0.821 | 0.813 | 
| BO-XGBoost | 0.897 | 0.850 | 0.916 | 0.897 | 0.896 | 
| ZOA-XGBoost | 0.923 | 0.887 | 0.932 | 0.923 | 0.922 | 
| No. | Project | UCS (Mpa) | RQD (%) | Kv | W (L (min·10 m)−1) | Class | 
|---|---|---|---|---|---|---|
| 1 | Jinzhou LPG cavern | 144.00 | 68.00 | 0.90 | - | II | 
| 2 | 160.10 | 82.00 | 0.83 | - | II | |
| 3 | 176.40 | 66.00 | 0.80 | - | II | |
| 4 | Jiaojia gold mine | 39.73 | 68.10 | - | 8.00 | III | 
| 5 | 39.73 | 63.70 | - | 80.00 | IV | |
| 6 | 50.20 | 76.30 | - | 15.00 | III | |
| 7 | A metal mine | 120.35 | 69.80 | 0.47 | - | III | 
| 8 | 120.35 | 59.50 | 0.41 | - | III | |
| 9 | 120.35 | 65.30 | 0.45 | - | III | |
| 10 | 120.35 | 70.70 | 0.48 | - | III | |
| 11 | 120.35 | 63.50 | 0.44 | - | III | |
| 12 | 120.35 | 65.70 | 0.45 | - | III | |
| 13 | 120.35 | 57.50 | 0.40 | - | III | |
| 14 | 120.35 | 56.80 | 0.40 | - | III | |
| 15 | 120.35 | 56.80 | 0.40 | - | III | |
| 16 | 120.35 | 60.90 | 0.42 | - | III | |
| 17 | 120.35 | 68.80 | 0.47 | - | III | |
| 18 | 120.35 | 66.40 | 0.46 | - | III | |
| 19 | 120.35 | 65.60 | 0.45 | - | III | |
| 20 | 120.35 | 70.80 | 0.48 | - | III | |
| 21 | 120.35 | 59.90 | 0.41 | - | III | |
| 22 | 156.75 | 80.00 | 0.54 | - | II | |
| 23 | 89.83 | 30.60 | 0.21 | - | IV | |
| 24 | A gold mine | 30.00 | 47.00 | - | 15.00 | III | 
| 25 | 47.60 | 49.00 | - | 9.80 | III | |
| 26 | 38.00 | 23.00 | - | 16.30 | IV | |
| 27 | 88.00 | 57.00 | - | 19.00 | III | |
| 28 | 70.10 | 51.80 | - | 0.05 | II | |
| 29 | Pulang copper mine | - | 64.00 | 0.73 | 15.06 | II | 
| 30 | - | 65.00 | 0.69 | 15.19 | II | |
| 31 | - | 58.00 | 0.61 | 16.17 | II | |
| 32 | - | 50.00 | 0.57 | 17.62 | III | |
| 33 | - | 56.00 | 0.60 | 18.77 | II | |
| 34 | - | 59.00 | 0.64 | 16.33 | II | |
| 35 | - | 53.00 | 0.66 | 21.66 | II | |
| 36 | - | 41.00 | 0.53 | 24.55 | III | |
| 37 | - | 61.00 | 0.65 | 12.16 | III | |
| 38 | - | 70.00 | 0.71 | 7.16 | III | |
| 39 | - | 62.00 | 0.65 | 12.13 | III | |
| 40 | - | 63.00 | 0.63 | 16.62 | III | |
| 41 | - | 58.00 | 0.62 | 14.69 | III | |
| 42 | - | 61.00 | 0.64 | 16.16 | III | |
| 43 | - | 57.00 | 0.59 | 13.15 | III | |
| 44 | - | 64.00 | 0.67 | 8.02 | III | |
| 45 | - | 64.00 | 0.62 | 19.48 | III | 
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. | 
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Yang, B.; Liu, Y.; Liu, Z.; Zhu, Q.; Li, D. Classification of Rock Mass Quality in Underground Rock Engineering with Incomplete Data Using XGBoost Model and Zebra Optimization Algorithm. Appl. Sci. 2024, 14, 7074. https://doi.org/10.3390/app14167074
Yang B, Liu Y, Liu Z, Zhu Q, Li D. Classification of Rock Mass Quality in Underground Rock Engineering with Incomplete Data Using XGBoost Model and Zebra Optimization Algorithm. Applied Sciences. 2024; 14(16):7074. https://doi.org/10.3390/app14167074
Chicago/Turabian StyleYang, Bo, Yongping Liu, Zida Liu, Quanqi Zhu, and Diyuan Li. 2024. "Classification of Rock Mass Quality in Underground Rock Engineering with Incomplete Data Using XGBoost Model and Zebra Optimization Algorithm" Applied Sciences 14, no. 16: 7074. https://doi.org/10.3390/app14167074
APA StyleYang, B., Liu, Y., Liu, Z., Zhu, Q., & Li, D. (2024). Classification of Rock Mass Quality in Underground Rock Engineering with Incomplete Data Using XGBoost Model and Zebra Optimization Algorithm. Applied Sciences, 14(16), 7074. https://doi.org/10.3390/app14167074
 
        




 
       