Next Article in Journal
Geological and Social Factors Related to Disasters Caused by Complex Mass Movements: The Quilloturo Landslide in Ecuador (2024)
Previous Article in Journal
Co-Seismic Landslide Detection Combining Multiple Classifiers Based on Weighted Voting: A Case Study of the Jiuzhaigou Earthquake in 2017
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Smart Prediction of Rockburst Risks Using Microseismic Data and K-Nearest Neighbor Classification

1
Institute of Energy Infrastructure, Universiti Tenaga Nasional, Kajang 43000, Malaysia
2
Department of Civil Engineering, Bannu Campus, University of Engineering and Technology Peshawar, Bannu 28100, Pakistan
3
Broadridge Financial Solutions Inc., 51 Mercedes Way, Brentwood, NY 11717, USA
4
Department of Civil Engineering, HITEC University, Taxila 47040, Pakistan
5
Department of Civil and Environmental Engineering, College of Engineering, Majmaah University, Al Majmaah 11952, Saudi Arabia
6
Department of Civil Engineering, Balochistan Campus, National University of Sciences and Technology, Quetta 87300, Pakistan
7
State Key Laboratory of Coastal and Offshore Engineering, Dalian University of Technology, Dalian 116024, China
8
National Centre of Excellence in Geology, University of Peshawar, Peshawar 25000, Pakistan
*
Authors to whom correspondence should be addressed.
GeoHazards 2026, 7(1), 5; https://doi.org/10.3390/geohazards7010005 (registering DOI)
Submission received: 31 October 2025 / Revised: 19 December 2025 / Accepted: 25 December 2025 / Published: 1 January 2026

Abstract

Effective mitigation of geotechnical risk and safety management of underground mine requires accurate estimation of rockburst damage potential. The inherent complexity of the rockburst phenomena due to nonlinear, high dimensional, and interdependent nature of the geological factors involved, however, makes predictive modeling a difficult task. The proposed research is based on the use of the K-Nearest Neighbor (KNN) algorithm to predict the risk of rockbursts with the use of microseismic monitoring data. Several key features like the ratio of total maximum principal stress to uniaxial compressive strength, energy capacity of support system, excavation span, geology factor, Richter magnitude of seismic event, distance between rockburst location and microseismic event, and rock density were applied as input parameters to extract critical rockburst precursor activities. In the test stage, the proposed KNN model recorded an accuracy of 75.50%, a precision of 91.3, a recall value of 50.9, and F1 Score of 57.6. The model is reliable with a significant performance indicating its efficacy in practice. The KNN model showed better classification results as compared to recently available models in literature and provided better generalization and interpretability. The model exhibited high prediction in classified low-risk incidents and had strong indicative capabilities towards high-risk situations, attributed to being a useful tool in rockburst hazard measurement.

1. Introduction

The effective mitigation of geotechnical risks and safe operation of the underground mining systems require accurate assessments and evaluation of the possible causes of rockburst damage. The fact that rockburst events are extremely complex due to their nonlinear effects, multi-dimensional parameters, and interrelationships between geological and stress-related variables makes predictive modeling especially harsh. Rockburst is a violent shock release of the elastic energy accumulated in both rock and coal masses that occurs during underground workings. This leads to the ejection of rock fragments, which might bring injuries, collapse, and deformation accordingly to the supporting structures along with the damage of the facilities [1,2,3]. Similar activity exists in open cuts in the bulk of the joint rock [4,5]. Its economic implications are significant in both areas of civil and mining engineering. The mechanism is not still imagined very well due to the complexity and uncertainty of rockburst. It may be necessary to estimate the realistic potential of rockburst in order to reduce the risks associated with rockburst, including damage to equipment, closure of accesses, delays, and loss of lives and property to facilitate the safety of people and effective construction and service consent of underground projects. Traditional approaches based on mechanics do not allow accurate identification of the risk of rockbursts because the connection between geological, geometric, and mechanical parameters of rock masses in underground conditions is very complex. Also, mechanics-based approaches possess multiple assumptions that, in case ignored, can result in biased predictions of a model. This has compelled many researchers over the past few decades to explore other ways through which they can improve the prediction of hazards and the detection of rockburst event. A number of indicators were proposed in order to judge burst potential by several researchers.
Depending on the variance between the location of the rock damage and place of the seismic occurrence, rockbursts may be categorized as self-induced rockburst and remotely induced rockburst [6]. In the case of the former, the damage and seismic event location are coherent. In the case of the latter, rockburst is induced by distant and comparatively high-level magnitude earthquake. In the case of various types of rockburst, influencing factors are different, and that is why the evaluation of the rockburst damage potential is also dissimilar. This paper will evaluate the damage potential of remotely induced rockburst. Owing to the fact that the place of the damage does not correspond to the place of the microseismic event, it is not easy to assess the potential damage only based on the microseismic event. The information of the microseismic events, the paths of the stress waves, and the conditions of the rock mass on the face of the excavation must be taken into consideration at the same time. The assessment of the potential damage caused by rockbursts remains a challenging problem since there are intricate processes and many factors that can influence the process.
In recent years, data mining methods have been demonstrated to be effective when obtaining complex non-linear dependences between predictor and response variables [7,8,9,10,11,12,13,14,15,16,17,18] and can be applied in the determination of sites at risk of rockburst incidents since the case history data is becoming more prevalent. Several researchers have recommended a number of methods which can be used to predict the rockburst, such as Support Vector Machine (SVM) [19], Artificial Neural Networks (ANNs) [20], Distance Discriminant Analysis (DDA) [21], Bayes Discriminant Analysis (BDA) [22], and Fisher Linear Discriminant Analysis (LDA) [23]; moreover, some systems are based upon hybrid (Zhou et al. [24]; Adoko et al. [25]; Liu et al. [26]) or ensemble (Ge and Feng [27]; Dong et al. [28]) learning methods in long-term prediction of rockburst, and their prediction accuracies are compared. Zhao and Chen [29] in a recent study compared and developed a data-driven model with the help of a convolutional neural network (CNN) to that of a conventional neural network. The proposed CNN model will be of high potential over the conventional neural network in rockburst prediction. These algorithms were fed with several rockburst indicators as input, and the training samples of different sizes were fed. Most or rather all the above techniques have been successful in forecasting rockburst risk, but there are faults in them. In such a way, e.g., in the ANN approach, the optimal structure (number of inputs, hidden layers, and transfer functions) has to be defined beforehand. This is normally done through the trial and error process. Another major limitation of ANN model is the black box nature of the model, not to mention that the relationship between input and output parameters of the system is expressed in terms of the weight matrix and biases, the latter not being accessible to the user [30]. The K-Nearest Neighbor (KNN) algorithm was chosen in this research because it is effective in dealing with complex, non-linear, and locally structured data pertaining to microseismic data. Since it is a non-parametric approach, KNN lacks distribution assumptions and is used to classify new observations as similar to previous outcomes, which is why it is relevant to recognize patterns related to rockburst events. Nonetheless, the KNN provided more flexible and interpretable predictions in instances of small and imbalanced datasets, which have been seen to outperform various benchmark models, illustrating its applicability to the task of assessing rockburst risks.
The main objective of this research is to investigate the importance of microseismic data in the prediction of the risk of rockburst using KNN algorithm. The study offered a data-driven classification model based on microseismic predictors such as the ratio of total maximum principal stress to uniaxial compressive strength, the energy capacity of support system, excavation span, geology factor, Richter magnitude of seismic event, distance between rockburst location and microseismic event, and rock density to determine the potential rockburst risks. The predictive accuracy of the KNN model is measured and compared to similar methods in order to determine its efficiency. Finally, the research aims at improving early warning systems and risk mitigation approaches to underground rock engineering.

2. Data Collection and Analytics

As per the original article by Heal [31], the number of rockburst cases included in the study amounted to 254 and were obtained in 13 underground metal mines in Canada and Australia. The acquisition of these cases was dependent on the rock mass failure conditions resulting due to one microseismic event. The microseismic event and failure location data comprise 83 and 254, respectively, and are in this database. It shows that certain failure spots are associated with the same microseismic event. According to the degree of damage of the rock mass and support, the level of the degree of risk damage of rockburst was classified into five levels, namely, none (L1), low (L2), moderate (L3), high (L4), and strong (L5). These included L1 implying the rock mass exhibited no damage or small loss and the support was undamaged; L2 implying the rock mass was slightly damaged with less than 1 ton of rock displaced, support system was loaded consequently causing the meshes to become loose and plates to be deformed; L3 indicating 1 to 10 tons of rock was displaced and some bolts were broken; L4 indicating 10 to 100 tons of rock was displaced and the support system was significantly damaged; L5 indicating the rock mass displaced above 100 tons of rock. Since the investigation has not specified the location of damage in L1, its original database only had L2, L3, L4, and L5. The sample size at these levels was 116, 48, 63, and 27, respectively. The dataset is compiled from previously published studies and has undergone verification by several research groups. Consequently, no duplicate or conflicting samples were present for further cleaning. The original database contained nine parameters: the ratio of the total maximum principal stress to uniaxial compressive strength (I1), the energy capacity of the support system (I2), excavation span (I3), geology factor (I4), Richter magnitude of the seismic event (I5), distance between the rockburst location and the microseismic event (I6), peak particle velocity (I7), rock density (I8), and the types of supports (I9). Literature can be referred to the specific meaning of each indicator [31]. The significant effect on the evaluation results is associated with different indicator combinations. Heal [31] and Zhou et al. [32] selected I1, I2, I3, I4, and I7; and Li et al. [33] chose I1, I2, I3, I4, I5, I7, and I8. Since I7 was calculated by I5 and I6 with the use of an empirical formula and I9 was hard to be quantified, I1, I2, I3, I4, I5, I6, and I8 in this paper were used to assess the rockburst damage. From this dataset, it can be noted that duplicate sample data existed in the dataset, and some samples had the same indicator value but different rockburst damage levels. The duplicate samples were eliminated to enhance the accuracy of prediction. In the case of the samples that have the same value of an indicator but different levels, the samples with highest level were chosen, to be on the safe side. As a result, the sample size in the new dataset comprised 236 entries, the size of samples at L2, L3, L4, and L5 was 107, 45, 57, and 27, respectively. The ratio of the sample size in different levels was 4.0:1.7:2.1:1.0. It demonstrates that the distribution is rather uneven, which could influence the evaluation outcome.
Figure 1 shows the violin plots of the main parameters of input (I1 to I8) in terms of rockburst hazard assessment. The distributions indicate a balanced dataset with a large variation of geomechanical and seismic cases. Parameters like the ratio of the total maximum principal stress to underlying uniaxial compressive strength (I1), and excavation span (I3) have a wide range of distribution, especially at the high end, and this is useful in noticing high-risk cases. Energy capacity of the support system (I2) and density of rock (I8) display significant variation as well, so far as the dataset is able to portray the distinguishing characteristics of a wide range of support designs and geological materials. The geology factor (I4) and seismic magnitude (I5) are quite uniform, which facilitates the consistent site representation; distance to microseismic events (I6) is concentrated, but still informative. In general, these plots corroborate that the data is dense, varied, and well-organized—features that are desirable in making the data more effective in developing reliable and generalizable predictive rock burst hazard models.
Sample data and a summary of the statistical features of important parameters affecting the rockburst hazard according to 236 recorded rockburst cases are presented in Table 1; the details are available in Appendix A. The ratio between the total-maximum stress principal and uniaxial compressional strength (I1) has an average of 53.90 and standard deviation of 17.80, which goes between 18 and 95. The mean of the energy capacity of the support system (I2) is 7.83 and is highly skewed to the positive direction (skewness = 2.38; kurtosis = 8.28) which is an indication of the existence of some support systems which have been devised to absorb high levels of energy in some situations. The excavation span (I3) is averagely 7.30 m, and some values are as high as 30 m; it has significant positive-skew (2.64) and a kurtosis of 13.64, representing a large number of high-capacity excavation work done in a high-risk environment. The geology factor (I4) and Richter magnitude of seismic events (I5) have relatively symmetric distributions, with a mean of 0.90 and 1.88, respectively, though the latter contains a minimum value of −0.3 that could be either a limit of measurement, or a modeling artifact. The range of the distance of the rockburst position and induced microseismic event (I6) is difficult to determine, as it varies considerably between 5 and 70 m (m) (mean = 13.73). Finally, the rock density (I8) varies between 2700 and 4300 kg/m3 with its average of 2974.53 kg/m3. All parameters exhibit acceptable levels of wide variability, which means that the dataset has sufficient coverage of both normal and extreme cases, and thus, makes it more suitable in the development of robust and generalizable predictive models that can be used in classification of rockburst hazard. The rating of the intensity of a rockburst was divided into four different levels, and the actual distribution of these levels within the data is depicted in Figure 2.

3. K-Nearest Neighbor (KNN)

The KNN algorithm is based on the distance of eigenvectors. Its principle of operation is that it identifies a set of k samples which are closest to the unknown samples in the dataset. Based on the k samples, the label of the unknown sample is executed by calculating the mean of the response variables [34,35]. There are two important parameters in the KNN classification algorithm: “n_ neighbors” and “weights” [36]. The value “n_neighbors” specifies the number of nearest neighbors to utilize in the learning process, that is, the k value; the parameter “weights” is utilized to determine the weight of neighbor samples of each sample. The prediction of possible values is realized with the help of the Weight function: The option uniform denotes the case when all nearest neighbor samples have an equal weight; distance takes place in cases when the weight and the distance are inversely proportional to one another [37]. KNN algorithm is a generalization of nearest neighbor rules. The label of the k-sample most similar to the nearest one is its inductive offset to the class label that will be tested. In contrast to the nearest neighbor, it is different because knowing the smallest neighbor, it enlargens to k during the decision-making process. This addition enables the KNN algorithm to acquire additional information and make use of it. It does not contain the learning processing process compared to other classification algorithms that have a given training process. Figure 3 displays the KNN classification diagram.
The nearest neighbor rule is the long-established form of class-reasoning. Its concept of decision-making is quite rudimentary, namely, the sample used as the test sample is the same as the sample category which is the nearest to it. Given that the training collection and the distance function remain in the same draw-off, the decision outcome of the nearest neighbor rule is always unique regardless of the instance that it will have to test. By taking all sample instances in set E, and letting y and x be such that y is the nearest neighbor instance to x, the category of y is the output that is the nearest neighbor rule. Suppose the specific decision process is that of an unknown category sample, X.
g j X = min g i X     i = 1 , 2 , 3 , ... , C
Then the decision result is X W i .
In this case, the nearest neighbor rule is presented under two perspectives, convergence and generalization error. To test the same point again x, the nearest neighbor x′ obtained when two training sets comprising different samples are used is different. The classification result will depend on the nearest neighbor category label of the label set of the nearest neighbor, P e x , x is thus obtained. It is the conditional error rate, and it depends on both x and x′. In this case, the mean of x′ is can be derived as follows:
P e x = P e x , x p x x d x
The P x x is a conditional probability density function. Suppose P is a non-zero continuous function, then the probability that any point lies on the hyper-sphere centered S as x is
P s = P x d x
The likelihood that none of n samples is within the hypersphere is 1 P s * . In case n , the probability approaches zero. One can conclude that when the nearest neighbor x′ becomes such that x converges towards the point x′ that is to be measured as per the probability, then P e x tends to the Dirac delta function as the probability approaches infinity. In the same way, when the KNN decision rule is taken, then the k neighbors can converge to the point x to be measured. The probability that the point to be measured x is not the same as the category c of the nearest neighbor point is x′ is understood as the probability error rate of the nearest neighbor, and the error rate is
P e r r o r = 1 c Y P c x P c x
In this case, it is assumed that there is independence and equality of distribution of each sample. The Bayesian classifier can always be based upon a sample x lying in the d distance set of x so that
c * = arg max c Y P c x
Now
P e r r o r 1 c Y P 2 c * x
The nearest neighbor rule is found not only simple, but also the general error rate of the rule is not more than twice the Bayesian error rate.

4. Performance Evaluation

When the multiclass classifications are considered, the metrics defined by binary classification are not applicable in the complete extent. A multiclass confusion matrix (see Figure 4) is N by N where N is the number of the different class labels C0, C1, …, CN. Thus, it cannot be used, as the True Positive (TP), True Negative (TN), False Positive (FP), and False Negative (FN) cases are not applicable. In its place, one can conduct an analysis of a particular class according to the characterization given in the example of Figure 4. To this end, a set of indicators can be developed against each of the classes. Further, based on a sufficient set of these metrics, one may provide the metrics of confusion matrix in aggregate. Table 2 gives a summary of the measures that are used to specify a multiclass confusion matrix and specifically accuracy, recall, precision, and F1 Score.

5. Results and Discussion

To enhance predictive performance, optimization of the KNN model was used, and the main hyperparameters were adjusted. The best configuration was the number of neighbors (n_neighbors) to be set to 4 with a uniform weighting scheme (weights = ‘uniform’), and to use the Mahalanobis distance metric that takes correlations between the features into consideration and scales the data. This distance measure provides a more-informed similarity measure, when features are well-suited to handling datasets with different feature scales or when there are inter-feature relationships. The model applied uniform weights along with considering feature covariance and treated all neighbors equally; thus, it was able to provide high accuracy and a successful trade-off between precision and recall. Mahalanobis distance was chosen because it accounts for inter-feature correlations and scale differences, which are present in the rockburst dataset, whereas Euclidean and Manhattan distances assume independent variables. Superior performance of these optimized hyperparameters had a considerable role in generating robustness and generalization of this model. In the process of training, the model showed exceptional results in the classification of rockburst hazard levels (see Table 3). It has demonstrated a high accuracy of 99.4% in the most frequent L2 class with a correct classification of 172 out of 173 samples, which reveals a successful learning process in the most common hazard level. Although the size of the sample used in L3, L4, and L5 was smaller, the model was still able to classify correctly some of the instances, such as 14.3% of L4 cases. The majority of the misclassification represented neighboring classes, which is natural since the difference between the levels of hazards is not so strong. The training phase performance of the KNN model is remarkable, providing a high classification accuracy of 93.0%, which means that it is highly predictive. The F1 Score (0.964), precision (0.935), and recall (0.994) are additional indications of the adept performance of the model in terms of accuracy and sensitivity, with close to perfect recognition of positive cases. These findings support the strength and validity of the model to learn using the training data.
Analysis of the testing data indicates that the model performs well in identifying the most common and critical level of L2 hazards, with a perfect recall rate of 100% and a high value of precision of 72.7%. This implies that the model has a high probability of identifying conditions at low-risk of rockbursts and this is critical in efficient monitoring and timely response. In addition, there is high precision (100%) in the model when it comes to making predictions of higher-risk levels (L3, L4, L5), which means when it does show higher-risk levels, it is always right. The KNN model was very suitable in the testing stage, with the model achieving a high accuracy (75.5%) and very good precision (91.30) showing a reliable and stable model in terms of predictions. Although the recall (50.9%) shows improvement could be achieved in detecting every positive case, the overall F1 Score (57.6%) indicates a proper and balanced classification strategy, especially when there is need to have high level of confidence about predictions. The obtained results show the model is highly accurate in indicating safe conditions and multi-confident in signaling increase in hazard on the part projected, offering a robust tool that can help in the management of hazards of rockbursts. Table 4 presents the evaluation results of the various approaches. As it was observed, the KNN model turned out to be the most promising, as it yielded the largest overall accuracy rate of 75.5%. It also had balanced F1 Scores across the four classes: [0.842, 0.667, 0.400, 0.444]. On the other hand, the calculated performance of a grouping of Gaussian Process Classifiers (GPCs) used without prior training of the data during preprocessing stage produced an accuracy of mere 48.98% and F1 Score of 0.1818 at level L5 with a paltry, which underscores the deleterious effects of raw, imbalanced data on the performance of the model. With the GPC ensemble, preprocessing and sampling improved the performance (accuracy: 63.27%; F1 Scores: [0.7907, 0.6, 0.5455, 0.3077]), though it was not entirely accurate to capture all the hazard levels. Among the GPC-based configurations, some did not manage to predict the level L5 at all (F1 Score = 0), which indicates a lack of robustness. The KNN model was superior to all the others in accuracy alone, but also was consistent in its generalization between the majority and minority classes. Its capacity to provide relatively accurate predictions without any overfitting and minimal complex architecture of the ensemble illustrates its strengths and applicability to practical rockburst hazard classification training.
The values of precision, recall, and F1 Score at different levels were obtained as calculated in Table 2 and represented in Figure 5. As can be observed, the level L2 evaluation performance was the best and the level L5 the worst. The results show that the model performs best for L2 rockburst levels, with high recall and balanced precision, making it reliable for lower-risk detection. However, for L3–L5, especially the high and extreme risk levels (L4–L5), the model is overly conservative: precision is high, but recall is very low, meaning many dangerous rockburst events are missed. This imbalance leads to reduced F1 scores and poses a serious safety concern, as recall is more critical than precision for high-severity rockburst prediction. The order of the evaluation performance of the various levels after all the values of precision, recall, and F1 Score were put into consideration was L2 > L3 > L4 > L5.

6. Engineering Application

The rockburst damage within Perseverance nickel mine has been evaluated by means of the proposed methodology. Its orebody is situated in ultramafic rocks which are mined under a sub-level caving method of mining. The hanging wall is composed of stiff Felsic volcanics and metasediments and is prone to mining-tsunami. The microseismic monitoring system is deployed at this mine, and it can manage the risk of rockburst based on the microseismic information. Because most of the ramp and infrastructure is part of the hanging wall, it is essential to estimate the potential of rockburst damage. The Perseverance nickel mine was selected as the engineering verification case due to its complex geological conditions and well-documented rockburst history. At present, comparable engineering datasets from other mines are not publicly available; therefore, the inclusion of additional cases is not feasible.
Heal [31] recorded 12 cases of rockburst damages in this mine, with 6 microseisms cases at the depth of between 950–1100 m). The exact data was depicted in Table 5. Microseismic events of Richter magnitude were between 1.5–2.2, and the actual damages in rockbursts were L2 or L4. The proposed methodology was validated on twelve cases of different intensities of rockburst damages. As indicated in the final column of Table 5, it can be ascertained that the trained KNN model was able to predict the levels of rockburst damages in all twelve cases. To further examine how accurate the KNN model is, six events with twelve true cases of engineering projects were selected to be confirmed, nine of that being an L2 test case, one was L3 test cases and one was an L4 test case [31]. Table 5 shows the actual and the predicted classes of these twelve actual engineering cases. As per the findings of the evaluation shown in Table 5, the only cases not consistent with the actual scenario and correctness of the expectation of 75% are the three cases of the levels of rockburst damages caused by the microseismic events 2 and 6. Based on the KNN, the extent of the damage of the rockbursts, caused by a microseismic event 2 and 6, was not identified, and the accuracy is 66.67%. The results acquired in the evaluation of seven cases in the approach of Heal [31] did not match the real situation, and even the accuracy is only 41.67%. Likewise, four cases in the GPC approach [38] did not match the real situation, and the accuracy is 66.67%. Therefore, the methodology offered in the current study assisted in increasing the level of accuracy of rockburst, i.e., 75% damage to a certain degree.

7. Conclusions and Future Prospect

To achieve the aim of this study, a KNN model was developed to classify the rockburst risk using microseismic data. The inputs to model the variables to frame the rockburst risk were set to the ratio of total maximum principal stress to uniaxial compressive strength (I1), energy capacity of support system (I2), excavation span (I3), geology factor (I4), Richter magnitude of seismic event (I5), distance between rockburst location and microseismic event (I6), and rock density (I8). The performance of the KNN model was measured by calculating accuracy, precision, recall, and F1 Score, as performance metrics in both stages, training, and testing. The KNN model performed well in the testing phase, achieving high accuracy (75.5%) and excellent precision (91.3), indicating reliable and consistent predictions. While the recall (50.9) suggests some room for improvement in identifying all positive cases, the overall F1 Score (57.6) confirms a balanced and effective classification approach, especially for practical applications requiring high confidence in predictions. These findings indicate that the model has a high potential of predicting the level of hazards precisely, hence providing a robust and technically effective tool to aid in rockburst hazard evaluation and management. The proposed model may be improved in future research mines with different geological settings and the inclusion of other input parameters, such as support types and peak particle velocity. The combination of these parameters with microseismic indicators can obtain a more detailed explanation of the rockburst mechanisms. In addition, the performance of KNN classification can be enhanced by implementing different hybrid algorithms (KNN-PSO, KNN-RF, or KNN-SVM, etc.) to enhance its performance, in particular, to demonstrate high accuracy on complex and imbalanced data.

Author Contributions

Conceptualization: M.A.; Methodology: Z.U., S.H. (Shay Haq), and F.A.; Data Curation: M.N.K.; Model Development and Analysis: M.A., F.A., and S.H. (Shay Haq); Visualization: R.C.O.; Validation: A.A., R.C.O., S.H. (Shay Haq), Z.U., S.H. (Sabahat Hussan), and M.N.K.; Writing—Original Draft: M.A.; Writing—Review & Editing: M.A., F.A., Z.U., S.H. (Sabahat Hussan), and M.N.K.; Supervision: R.C.O. and A.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding authors.

Conflicts of Interest

Author Zia Ullah was employed by the company Broadridge Financial Solutions Inc. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix A

Table A1. Rockburst dataset used in KNN modeling.
Table A1. Rockburst dataset used in KNN modeling.
S. No.I1I2I3 (m)I4I5I6 (m)I8 (kg/m3)Level
16054.20.51.72027004
26084.20.51.72527002
380860.51.81027004
48056.21−0.3527004
5708411.81527002
64053.810.4527002
78085.910.6527002
89086.810527004
98087101027002
10808712527004
118084.1121027004
127089.512.2527004
137583.812.21027003
14758412.21027004
156086.211.6527002
16601010.50.51.6527002
176584.310.3527002
186055.60.51.5527004
1945109.10.51.8527005
204359.30.51.8527004
2143109.311.8527002
2243109.40.5−0.2527002
235483.50.51.3527002
244583.611.31027002
258085.411.31027002
265087.811.31527002
275056.211527003
285055.10.51.2527004
295055.111.2527002
305088.310.7527002
315085.510.7527002
3260108.812527002
336056.212527002
346055.212527002
356058.412527002
36605612527002
3760108.412527002
386055.312527002
396010712527002
4060105.40.52527004
417555.110.61527003
427055.110.61027002
437055.110.6527003
447555.211.31027002
457556.711.31527002
467555.111.32027002
47758711.8527003
48758511.81027002
497555.311.8527003
503525.313.11527005
5135210.613.11527005
52351010.613.12027002
533555.91.53.12527004
5435210.61.53.12527004
5535511.81.53.12027002
5635510.60.53.12527003
573587.60.53.11527004
5841.757.61.53.11527002
5941.72713.11027004
60351010.60.53.11527002
61355713.11527004
624021412.81529005
63392712.81029004
643926.512.85029004
654025.711.62529003
6642.8626.41.51.62029003
6735.126.81.53.55029002
68381090.53.51029005
6932.324.51.53.53529003
7043.787.70.53.52029003
7143.71070.53.52029003
7243.724.213.51529004
7342.8250.53.53029005
7442.810100.53.53029003
7539.5550.53.51029004
7644.4101513.51529004
7747.388.313.55029003
7847.385.513.55029003
7951.4359.31.51.91029003
8039.459.51.52.11529002
8139.424.512.12029003
8239.4101112.12529002
8341.724.50.52.16029003
8440.810100.52.13029003
8540.8580.52.12529003
8644.151812.15029002
873681411.21029002
883686.50.51.21529002
8937.785.210.4529003
9040.884.211.8529002
9130105.30.51.2528003
923085.40.51.2528002
9330105.311.21028002
9430105.211.21028003
957485.90.51.5528005
967485.311.51028003
977485.911.51028002
987486.511.52028002
997486.50.51.51528004
100718810.9528004
1017185.310.91028002
1027110810.91028002
1037185.610.91028002
1047110200.52.1528005
1054085.90.52.1527004
1064085.30.52.1527002
1074085.912.1527002
1084086.512.11027002
1094025.612.11027004
1104025.812.1527004
1114085.512.12027002
112705810.8528004
1137010810.81028002
11470105.510.81028002
115705810.81028004
1167085.110.81028003
1177055.110.8528002
1187055.110.81028002
1195455.710.82027002
12054109.10.50.82527002
1213985.710.83028002
1228457.712.91029005
1234554.812.95029002
12484107.412.95029003
12545106.912.91029002
1264557.412.91029004
1275654.612.92529002
1281855.80.50.41030304
1292458.60.50.41030303
13095106.911.5529002
1319556.610.9529005
13245105.111.6529002
13321511.21.50.9530302
1342156.11.50.9530302
1359510811.6529005
13639105.311.61529003
1372155.511.92030303
1382458.70.51.51030305
1392451111.51530302
140671051−0.2529002
14121590.51.81030304
1422110911.81030302
14395106.811529004
14473256.811529003
14527511.513.13030304
1462757.613.14030304
14735511.513.13030304
14850254.513.14029002
14995257.113.12029002
15073254.713.13029002
15195256.313.13029002
15273254.413.14029002
15373259.613.15029002
15454254.813.16029002
1553454.513.17029003
156251011.60.51.4530304
15725511.60.51.4530304
1582451212530305
159395912529004
1602555.111.3530804
16125510.511.31030803
1622557.811.31030802
16325.9710170.52530805
16425.9786.20.521030803
16525.9785.70.52530804
16625.9755.4121030803
16725.9785.3121030802
16825.9785.60.52530804
1697555.211.61028005
170755511.6528005
171655211.6528002
1725059.111.4528003
1735054.60.51.93028003
17470850.51.6543004
175708812543004
17670860.52543004
17770812121043003
1786710110.52.51543005
1796725512.51543002
180675512.51543002
181761060.52.7543005
18240109.20.51.1543003
1834054.811.11043002
184401011.70.52.5543005
1855010912.72043004
186505612.1543004
187555811.9543005
188555612.3543004
18965101112.31043004
19055560.50.9543004
1916055.512.22043004
1925055.50.51.4543005
19350108.50.51.42043002
19450105.50.51.44043002
195505611.5543003
19650103011.7543005
1977054.41.51.72027002
19870104.60.52528004
19990104.51.521028502
2007055.212.1527005
20156.281011528704
20256.21010111028702
20356.286112028702
20457.886.111528703
20557.886.11.511028703
20657.886.511.5528704
20757.81011.311.51028702
20857.8106.511.51028702
2095786.712.2528704
21057109.512.2528704
211571011.212.21028702
2125786.412.22528702
2135786.512.21028702
214571011.50.51.7528704
21557101111.7528702
21657101111.71028702
217571011.511.71028702
21857107.411.71528702
21957.8106.40.52.5528705
22057.81011.20.52.5528705
22157.8106.412.51028702
22257.81010.612.51028702
22358.61012.40.52.23028702
22458.6105.912.23028702
22558.6106.112.23028702
22659.38812.2528705
22759.385.412.21528703
22859.381012.21028702
22959.38812.21528703
23059.388.412.21528702
23159.38512.22028702
23270.3106.90.52.3529005
23370.3101112.31029002
23470.3105.512.31529003
23570.3105.412.31529002
23672.28411.6529003

References

  1. Ortlepp, W.; Stacey, T. Rockburst mechanisms in tunnels and shafts. Tunn. Undergr. Space Technol. 1994, 9, 59–65. [Google Scholar] [CrossRef]
  2. Dou, L.; Chen, T.; Gong, S.; He, H.; Zhang, S. Rockburst hazard determination by using computed tomography technology in deep workface. Saf. Sci. 2012, 50, 736–740. [Google Scholar] [CrossRef]
  3. Cai, M. Principles of rock support in burst-prone ground. Tunn. Undergr. Space Technol. 2013, 36, 46–56. [Google Scholar] [CrossRef]
  4. Al-Shayea, N. Failure of Rock Anchors along the Road Cut Slopes of Dhila Decent Road, Saudi Arabia. In Proceedings of the International Conference on Problematic Soils (GEOPROB 2005), Famagusta, Cyprus, 25–27 May 2005; Eastern Mediterian University: Famagusta, Cyprus, 2005; Volume 3, pp. 1129–1136. [Google Scholar]
  5. Al-Shayea, N.A. Pullout Failure of Rock Anchor Rods at Slope Cuts, Dhila Decent Road, Saudi Arabia. In Proceedings of the 17th International Road Federation (IRF) World Meeting and Exhibition, Riyadh, Saudi Arabia, 10–14 November 2013. [Google Scholar]
  6. Kaiser, P.; McCreath, D.R.; Tannant, D.D. Canadian Rockburst Research Program 1990–1995; Camiro Mining Division: Sudbury, ON, Canada, 1997. [Google Scholar]
  7. Ahmad, M.; Tang, X.-W.; Qiu, J.-N.; Ahmad, F.; Gu, W.-J. Application of machine learning algorithms for the evaluation of seismic soil liquefaction potential. Front. Struct. Civ. Eng. 2021, 15, 490–505. [Google Scholar] [CrossRef]
  8. Ahmad, M.; Amjad, M.; Al-Mansob, R.A.; Kamiński, P.; Olczak, P.; Khan, B.J.; Alguno, A.C. Prediction of liquefaction-induced lateral displacements using Gaussian process regression. Appl. Sci. 2022, 12, 1977. [Google Scholar] [CrossRef]
  9. Ahmad, F.; Tang, X.-W.; Qiu, J.-N.; Wróblewski, P.; Ahmad, M.; Jamil, I. Prediction of slope stability using Tree Augmented Naive-Bayes classifier: Modeling and performance evaluation. Math. Biosci. Eng. 2022, 19, 4526–4546. [Google Scholar] [CrossRef]
  10. Ahmad, F.; Tang, X.-W.; Ahmad, M.; González-Lezcano, R.A.; Majdi, A.; Arbili, M.M. Stability risk assessment of slopes using logistic model tree based on updated case histories. Math. Biosci. Eng. 2023, 20, 21229–21245. [Google Scholar] [CrossRef] [PubMed]
  11. Ahmad, M.; Al Zubi, M.; Almujibah, H.; Sabri, M.M.S.; Mustafvi, J.B.; Haq, S.; Ouahbi, T.; Alzlfawi, A. Improved prediction of soil shear strength using machine learning algorithms: Interpretability analysis using SHapley Additive exPlanations. Front. Earth Sci. 2025, 13, 1542291. [Google Scholar] [CrossRef]
  12. Ahmad, M.; Al-Mansob, R.A.; Ramli, A.B.B.; Ahmad, F.; Khan, B.J. Unconfined compressive strength prediction of stabilized expansive clay soil using machine learning techniques. Multiscale Multidiscip. Model. Exp. Des. 2024, 7, 217–231. [Google Scholar] [CrossRef]
  13. Al-Zubi, M.A.; Ahmad, M.; Abdullah, S.; Khan, B.J.; Qamar, W.; Abdullah, G.M.S.; González-Lezcano, R.A.; Paul, S.; El-Gawaad, N.S.A.; Ouahbi, T. Long short term memory networks for predicting resilient Modulus of stabilized base material subject to wet-dry cycles. Sci. Rep. 2024, 14, 27928. [Google Scholar] [CrossRef]
  14. Ahmad, F.; Tang, X.; Ahmad, M. Predicting Rainfall-Induced Failure Potential of Highway Slopes: Taking Route T-18 in Taiwan as Case of Illustration. In Proceedings of the 1st International Conference of Climate Change and Emerging Trends in Civil Engineering, Topi, Pakistan, 12–13 February 2024. [Google Scholar]
  15. Ahmad, F.; Tang, X.; Hu, J.; Ahmad, M.; Gordan, B. Improved Prediction of Slope Stability under Static and Dynamic Conditions Using Tree-Based Models. CMES—Comput. Model. Eng. Sci. 2023, 1, 3. [Google Scholar] [CrossRef]
  16. Kamran, M.; Ullah, B.; Ahmad, M.; Sabri, M.M.S. Application of KNN-based isometric mapping and fuzzy c-means algorithm to predict short-term rockburst risk in deep underground projects. Front. Public Health 2022, 10, 1023890. [Google Scholar] [CrossRef] [PubMed]
  17. Ahmad, M.; Alsulami, B.T.; Hakamy, A.H.; Majdi, A.; Alqurashi, M.; Sabri, M. The Performance Comparison of the Decision Tree Models on the Prediction of Seismic Gravelly Soil Liquefaction Potential Based on Dynamic Penetration Test. Front. Earth Sci. 2023, 11, 1105610. [Google Scholar] [CrossRef]
  18. Ahmad, M.; Al-Zubi, M.A.; Kubińska-Jabcoń, E.; Majdi, A.; Al-Mansob, R.A.; Sabri, M.M.S.; Ali, E.; Naji, J.A.; Elnaggar, A.Y.; Zamin, B. Predicting California bearing ratio of HARHA-treated expansive soils using Gaussian process regression. Sci. Rep. 2023, 13, 13593. [Google Scholar] [CrossRef]
  19. Hong-Bo, Z. Classification of rockburst using support vector machine. Rock Soil Mech. 2005, 26, 642–644. [Google Scholar]
  20. Chen, D.; Feng, X.; Yang, C.; Chen, B.; Qiu, S.; Xu, D. Neural network estimation of rockburst damage severity based on engineering cases. In Proceedings of the ISRM SINOROCK 2013 (the 3rd ISRM Symposium on Rock Mechanics), Shanghai, China, 18–20 June 2013; pp. 457–463. [Google Scholar]
  21. Gong, F.; Li, X. A distance discriminant analysis method for prediction of possibility and classification of rockburst and its application. Chin. J. Rock Mech. Eng. 2007, 26, 1012–1018. [Google Scholar]
  22. Gong, F.; Li, X.; Zhang, W. Rockburst prediction of underground engineering based on Bayes discriminant analysis method. Rock Soil Mech. 2010, 31, 370–377. [Google Scholar]
  23. Zhou, J.; Shi, X.-Z.; Dong, L.; Hu, H.-Y.; Wang, H.-Y. Fisher discriminant analysis model and its application for prediction of classification of rockburst in deep-buried long tunnel. J. Coal Sci. Eng. (China) 2010, 16, 144–149. [Google Scholar] [CrossRef]
  24. Zhou, J.; Li, X.; Shi, X. Long-term prediction model of rockburst in underground openings using heuristic algorithms and support vector machines. Saf. Sci. 2012, 50, 629–644. [Google Scholar] [CrossRef]
  25. Adoko, A.C.; Gokceoglu, C.; Wu, L.; Zuo, Q.J. Knowledge-based and data-driven fuzzy modeling for rockburst prediction. Int. J. Rock Mech. Min. Sci. 2013, 61, 86–95. [Google Scholar] [CrossRef]
  26. Liu, Z.; Shao, J.; Xu, W.; Meng, Y. Prediction of rock burst classification using the technique of cloud models with attribution weight. Nat. Hazards 2013, 68, 549–568. [Google Scholar] [CrossRef]
  27. Ge, Q.; Feng, X. Classification and prediction of rockburst using AdaBoost combination learning method. Rock Soil Mech.-Wuhan 2008, 29, 943. [Google Scholar]
  28. Dong, L.-J.; Li, X.-B.; Kang, P. Prediction of rockburst classification using Random Forest. Trans. Nonferrous Met. Soc. China 2013, 23, 472–477. [Google Scholar] [CrossRef]
  29. Zhao, H.; Chen, B. Data-Driven Model for Rockburst Prediction. Math. Probl. Eng. 2020, 2020, 5735496. [Google Scholar] [CrossRef]
  30. Javadi, A.A.; Ahangar-Asr, A.; Johari, A.; Faramarzi, A.; Toll, D. Modelling stress–strain and volume change behaviour of unsaturated soils using an evolutionary based data mining technique, an incremental approach. Eng. Appl. Artif. Intell. 2012, 25, 926–933. [Google Scholar] [CrossRef]
  31. Heal, D. Observations and Analysis of Incidences of Rockburst Damage in Underground Mines. Ph.D. Thesis, The University of Western Australia, Perth, Australia, 2010. [Google Scholar]
  32. Zhou, J.; Shi, X.-Z.; Huang, R.-D.; Qiu, X.-Y.; Chen, C. Feasibility of stochastic gradient boosting approach for predicting rockburst damage in burst-prone mines. Trans. Nonferrous Met. Soc. China 2016, 26, 1938–1945. [Google Scholar] [CrossRef]
  33. Li, N.; Zare Naghadehi, M.Z.; Jimenez, R. Evaluating short-term rock burst damage in underground mines using a systems approach. Int. J. Min. Reclam. Environ. 2020, 34, 531–561. [Google Scholar] [CrossRef]
  34. Akbulut, Y.; Sengur, A.; Guo, Y.; Smarandache, F. NS-k-NN: Neutrosophic set-based k-nearest neighbors classifier. Symmetry 2017, 9, 179. [Google Scholar] [CrossRef]
  35. Wei, C.; Huang, J.; Mansaray, L.R.; Li, Z.; Liu, W.; Han, J. Estimation and mapping of winter oilseed rape LAI from high spatial resolution satellite data based on a hybrid method. Remote Sens. 2017, 9, 488. [Google Scholar] [CrossRef]
  36. France, S.L.; Carroll, J.D.; Xiong, H. Distance metrics for high dimensional nearest neighborhood recovery: Compression and normalization. Inf. Sci. 2012, 184, 92–110. [Google Scholar] [CrossRef]
  37. Yesilbudak, M.; Sagiroglu, S.; Colak, I. A new approach to very short term wind speed prediction using k-nearest neighbor classification. Energy Convers. Manag. 2013, 69, 77–86. [Google Scholar] [CrossRef]
  38. Chen, Y.; Da, Q.; Liang, W.; Xiao, P.; Dai, B.; Zhao, G. Bagged ensemble of gaussian process classifiers for assessing rockburst damage potential with an imbalanced dataset. Mathematics 2022, 10, 3382. [Google Scholar] [CrossRef]
Figure 1. Violin plots showing the distribution of input parameters used in rockburst hazard assessment: (a) I1, (b) I2, (c) I3, (d) I4, (e) I5, (f) I6, and (g) I8.
Figure 1. Violin plots showing the distribution of input parameters used in rockburst hazard assessment: (a) I1, (b) I2, (c) I3, (d) I4, (e) I5, (f) I6, and (g) I8.
Geohazards 07 00005 g001
Figure 2. The distribution of the actual intensity levels in rockbursts.
Figure 2. The distribution of the actual intensity levels in rockbursts.
Geohazards 07 00005 g002
Figure 3. Schematic diagram of KNN classification.
Figure 3. Schematic diagram of KNN classification.
Geohazards 07 00005 g003
Figure 4. Confusion matrix (N × N) for classification problem.
Figure 4. Confusion matrix (N × N) for classification problem.
Geohazards 07 00005 g004
Figure 5. Radar plot depicts the performance of the classification model with regards to precision, recall, and F1 Score in terms of different rockburst hazard levels.
Figure 5. Radar plot depicts the performance of the classification model with regards to precision, recall, and F1 Score in terms of different rockburst hazard levels.
Geohazards 07 00005 g005
Table 1. Representative data and statistical properties of the key factors affecting rockburst risks.
Table 1. Representative data and statistical properties of the key factors affecting rockburst risks.
LevelI1I2I3 (m)I4I5I6 (m)I8 (kg/m3)
46054.20.51.7202700
26084.20.51.7252700
480860.51.8102700
370.3105.512.3152900
270.3105.412.3152900
372.28411.652900
Mean53.907.837.300.901.8813.732974.53
Standard Deviation17.804.203.060.260.8312.08449.61
Sample Variance316.8617.669.340.070.69145.82202,146.59
Kurtosis−0.648.2813.640.08−0.304.504.59
Skewness0.102.382.64−0.190.012.062.45
Minimum18220.5−0.352700
Maximum9525301.53.5704300
Count236236236236236236236
Table 2. Performance metrics for a multiclass confusion matrix.
Table 2. Performance metrics for a multiclass confusion matrix.
MetricFormula
Accuracy (%) A c c = i = 1 N T P C i i = 1 N j = 1 N C i , j
Recall of Class Ci T P R C i = T P C i T P C i + F N C i
Precision of Class Ci P P V C i = T P C i T P C i + F P C i
F1 Score of Class Ci F 1 C i = 2 T P R C i P P V C i T P R C i + P P V C i
Note: Acc: Accuracy; TPR: True positive rate; TP: True positive; PPV: Positive predictive value; FP: False positive; FN: False negative.
Table 3. Confusion matrix results for the KNN model during training and testing phases.
Table 3. Confusion matrix results for the KNN model during training and testing phases.
ActualActual/PredictedPredicted
Training PhaseTesting Phase
L2L3L4L5TotalL2L3L4L5Total
L21720101733200032
L35000511002
L46010760208
L51010250027
Total 1840301874412249
Table 4. Comparison of results of evaluation in other literatures.
Table 4. Comparison of results of evaluation in other literatures.
AccuracyF1 ScoreConfusion MatrixModelReference
48.98%[0.6667, 0.4211, 0.3478, 0.1818] 15         4         2         1 3             4         1         1 5             1         4         2 0             1         4         1 Bagged ensemble of GPCs
without data preprocessing
[38]
61.22%[0.7407, 0.5333, 0.6, 0] 20         1         0         1 5             4         0         0 4             0         6         2 3             1         2         0 Bagged ensemble of GPCs
without under-sampling
57.14%[0.7170, 0.4, 0.5714, 0] 19         2         0         1 5             3         1         0 4             0         6         2 3             1         2         0 GPC without under-sampling
63.27%[0.7907, 0.6, 0.5455, 0.3077] 17         4         0         1 2             6         1         0 2             0         6         4 0             1         3         2 Bagged Ensemble of Gaussian Process Classifiers
61.22%--Stochastic gradient
boosting approach
[32]
75.5%[0.842, 0.667, 0.400, 0.444] 32         0         0         0 1             1         0         0 6             0         2         0 5             0         0         2 KNNProposed method
Table 5. Rockburst damage incidents at Perseverance nickel mine.
Table 5. Rockburst damage incidents at Perseverance nickel mine.
Microseismic EventI1I2I3I4I5I6I8ActualResults Using Heal’s Method [31]Results Using GPC [38]Proposed Method
1585120.51.6142700L2L5L2L2
585120.51.6222700L2L5L2L2
47860.51.6292700L2L2L2L2
2478100.51.8102700L3L4L2L2
4786.60.51.8102700L2L2L4L2
4755.90.51.8162700L3L3L4L4
348104.80.51.5102700L2L2L2L2
48101011.5102700L2L2L2L2
4395511.8132700L2L1L2L2
438511.8132700L2L1L2L2
5588120.51.6102700L2L5L2L2
65881112.252700L4L3L2L3
Note: Bold letter represents inconsistent result with the actual situation.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ahmad, M.; Ullah, Z.; Hussan, S.; Alzlfawi, A.; Omar, R.C.; Haq, S.; Ahmad, F.; Khalil, M.N. Smart Prediction of Rockburst Risks Using Microseismic Data and K-Nearest Neighbor Classification. GeoHazards 2026, 7, 5. https://doi.org/10.3390/geohazards7010005

AMA Style

Ahmad M, Ullah Z, Hussan S, Alzlfawi A, Omar RC, Haq S, Ahmad F, Khalil MN. Smart Prediction of Rockburst Risks Using Microseismic Data and K-Nearest Neighbor Classification. GeoHazards. 2026; 7(1):5. https://doi.org/10.3390/geohazards7010005

Chicago/Turabian Style

Ahmad, Mahmood, Zia Ullah, Sabahat Hussan, Abdullah Alzlfawi, Rohayu Che Omar, Shay Haq, Feezan Ahmad, and Muhammad Naveed Khalil. 2026. "Smart Prediction of Rockburst Risks Using Microseismic Data and K-Nearest Neighbor Classification" GeoHazards 7, no. 1: 5. https://doi.org/10.3390/geohazards7010005

APA Style

Ahmad, M., Ullah, Z., Hussan, S., Alzlfawi, A., Omar, R. C., Haq, S., Ahmad, F., & Khalil, M. N. (2026). Smart Prediction of Rockburst Risks Using Microseismic Data and K-Nearest Neighbor Classification. GeoHazards, 7(1), 5. https://doi.org/10.3390/geohazards7010005

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop