1. Introduction
After drilling, rock masses subjected to uneven loads may experience stress concentration areas near the borehole and its vicinity due to stress redistribution. When the stress in these concentration areas exceeds the strength of the rock, fracturing of the borehole wall occurs, termed as borehole breakout [
1]. Borehole breakout significantly impacts drilling efficiency and borehole quality, thus garnering substantial attention from researchers [
2,
3]. Accurate assessment of borehole breakout size under different conditions is crucial for enhancing underground resource development.
The borehole breakout phenomenon was first documented in a mining report from a South African gold mine [
4] and later observed again in oil well development [
5]. Further research revealed that in vertical boreholes, the direction of breakout consistently aligns parallel to the minimum horizontal in situ stress direction [
6,
7,
8]. Meanwhile, the laboratory study identified the existence of two primary shapes of breakout, V-shape and dog-ear shape.
Figure 1 illustrates the dog-ear-shaped borehole breakout shape with associated symbolic annotations. Significant disparities exist in the fracture mechanisms of rocks during the formation of the two breakout shapes, and their sizes demonstrate a notable correlation with the magnitude of in situ stresses [
9]. Consequently, numerous researchers embarked on establishing the correlation between borehole breakout sizes and in situ stress, aiming to further elucidate the mechanism of borehole breakout. For example, Zoback et al. [
10] combined the Mohr–Coulomb criterion with the Kirsch equation to develop a correlation model between in situ stress and borehole breakout size. Al-Ajmi [
11] analyzed the vertical borehole stress distribution by linear elastic model and evaluated the rock stress state by Mogi–Coulomb criterion to construct the correlation between borehole mud pressure, borehole breakout and in situ stress.
However, because the process of borehole breakout is a continuous development, the stresses in the borehole wall and surrounding area will be further redistributed after the fractured rock is exfoliated, which will result in new rock exfoliation. Therefore, the models proposed by Zoback et al. [
10] and Al-Ajmi [
11] have certain limitations. The advancement of numerical simulation methods has offered an effective approach to addressing this issue, with several researchers successfully simulating the continuous development process of borehole breakout [
12,
13,
14]. In recent years, numerous researchers have also been investigating borehole breakout through numerical simulation methods. For instance, in 2019, Lin et al. [
15] examined the influence of borehole size and temperature on borehole breakout development using Discrete Element Method (DEM). They explored the pattern of microcrack development in the borehole wall and its surroundings under varying borehole sizes and temperatures. Zhang et al. [
16] conducted a study on the impacts of borehole breakouts on breakdown pressure using finite element method (FEM) simulation. They established a correlation between in situ stress, borehole breakout size, and breakdown pressure. Xiang et al. [
17] employed a three-dimensional bonded-particle model to simulate borehole breakout and successfully replicated the V-shape breakout phenomenon.
From the existing experimental, theoretical, and numerical simulation analyses and studies on borehole breakout, it is evident that the size of borehole breakout is influenced by factors such as mineral composition, porosity, grain strength, intergranular strength, angle of internal friction, cohesion, borehole mud pressure, temperature, in situ stress, Poisson’s ratio, rock modulus, and borehole size [
18,
19,
20,
21,
22]. To achieve accurate assessment of borehole breakout, it is apparent that all relevant features should be considered as comprehensively as possible. The intricate nonlinear relationship among numerous features poses a significant challenge for conventional experimental and theoretical analyses, while numerical simulation is also complex and challenging. The emergence of machine learning (ML) methods offers a novel approach to address this issue.
Existing ML methods possess exceptional learning capabilities for capturing complex nonlinear relationships among different factors. Consequently, they have been widely applied by researchers for investigating engineering problems involving multi-feature considerations [
23,
24,
25]. Furthermore, in Sharma et al.’s study, data from different wells were used to train the prediction model for the physical data of the rock at the drill bit during drilling, and the results had fairly good accuracy. This shows that machine learning models are well able to find potential nonlinear relationships between data from mixed datasets [
26]. In the domain of borehole breakout assessment, ML methods have also seen extensive application.
Table 1 collects some existing studies on ML for constructing borehole breakout association models, detailing the features, data sources, and algorithms they employed. The comparison reveals that the primary source of data for existing studies is finite element simulation. Additionally, much of the data obtained through literature review also originates from FEM, as seen in the study by Benemaran [
27]. It was observed that studies directly utilizing borehole breakout size as an output are categorized into single and multiple outputs, and multiple output studies mainly employ artificial neural network (ANN) [
28,
29,
30].
Single output refers to constructing multiple models based on the same input features to predict multiple targets separately when dealing with problems involving multiple prediction targets. On the other hand, multi-output refers to constructing a single model to predict multiple targets simultaneously. Compared to single-output approaches, multi-output models take into account the relationship between input features and targets, while also considering potential interdependencies among targets and objectives [
31,
32]. Hence, when encountering multi-output problems with implicit relationships between outputs, multi-output models typically exhibit superior predictive performance compared to single-output models. Moreover, the training process of multi-output models tends to be more resource-efficient [
33,
34]. Existing studies lack theoretical research on the correlation between breakout angle and breakout depth. However, Soroush’s study demonstrates that incorporating breakout angle into input features significantly enhances the prediction accuracy of breakout depth. This indirectly suggests an implicit correlation between breakout angle and breakout depth [
35]. Consequently, for predicting borehole breakout size, multi-output models hold considerable promise.
In existing multi-output studies on borehole breakout, single data sources are typically utilized, and certain crucial factors are often overlooked, resulting in limited generalization performance of the obtained models. Furthermore, no researcher has conducted a comparison between the established multi-output model and the single-output model, thus precluding the ability to ascertain the effectiveness of applying the multi-output structure. Obviously, further development and testing are still needed to validate the applicability of the multi-output structure in borehole breakout size prediction and to obtain models with high generalization performance and prediction accuracy for borehole breakout size prediction.
In this study, the prediction of borehole breakout size is treated as a multi-output problem considering the influence of multiple features. Through literature research, nine features deemed relevant to borehole breakout were selected as inputs to the model, and a dataset comprising 501 sets of related numerical simulation and physical experiment data was collected. To accurately map the potential connections between the input features and the target, a novel meta-heuristic optimization algorithm (WAOA) combined with three base models (ANN, RF, and XGBoost) was chosen to construct a hybrid multi-output model. The trained WAOA-ANN, WAOA-RF, and WAOA-XGBoost models were utilized to predict borehole breakout size under various conditions. Subsequently, the three hybrid multi-output models obtained are compared and discussed alongside the single-output model within the same setup to validate the effectiveness of employing the multi-output structure and to determine the optimal approach for borehole breakout size assessment. Finally, the relationship between the input features and outputs of the optimal models is analyzed and discussed using the SHAP sensitivity analysis method.
Table 1.
Several borehole breakout prediction related studies.
Table 1.
Several borehole breakout prediction related studies.
Researchers | Data Source | Method | Input | Output | Best | Dataset | Output Form |
---|
Zhang et al. [29] | FEM | ANN |
| | R2: 0.99 | training | Multi-output |
|
| R2: 0.99 | validation |
|
| R2: 0.99 | test |
|
Zhang et al. [28] | FEM | ANN |
| | R2: 0.99 | training | Multi-output |
|
| R2: 0.99 | validation |
|
| R2: 0.99 | test |
|
Zhang and Yin [16] | FEM | ANN |
| | R2: 0.98 | training | Multi-output |
|
| R2: 0.94 | validation |
|
| R2: 0.99 | test |
|
Lin, Singh et al. [36] | Literature Review, Laboratory experiment | ANN |
| | R2: 0.74 | training | Single-output |
R2: 0.95 | validation |
R2: 0.82 | test |
Lin, Kang et al. [15] | Literature Review | Kriging |
| | MRE 2: 8.4% | test | Single-output |
Soroush [35] | High quality acoustic image logs | MLPNN 8 |
| | R2: 0.98 | training | Multi-output |
| R2: 0.65 |
| R2: 0.92 | validation |
| R2: 0.60 |
| R2: 0.94 | test |
| R2: 0.55 |
| | R2: 0.99 | training | Single-output |
R2: 0.98 | validation |
R2: 0.98 | test |
Jolfaei and Lakirouhani [37] | FEM | ANN |
| | R2: 0.90 | Total | Single-output |
| R2: 0.85 |
Lin et al. [38] | Literature Review | ANN |
| | RMSE:3.29 | training | Single-output |
RMSE:12.2 | validation |
CART 9 | RMSE:2.29 | training |
RMSE:10.47 | validation |
Benemaran [27] | Literature Review | XGBoost |
| | R2: 0.98 | training | Single-output |
R2: 0.98 | test |
| R2: 0.99 | training |
R2: 0.98 | test |
H. Zhang et al. [39] | Literature Review | SVR 10 |
| | MRE: 9.59% | test | Single-output |
3. Data Description and Evaluation Metrics
Existing research on borehole breakout can be categorized into numerical simulation and physical experiments in terms of experimental methods. Numerical simulation offers greater convenience in setting up various external conditions to explore the influence of multiple dimensions, making it favored by many scholars. However, the results of physical experiments exhibit better consistency with field reality. In this paper, data obtained from both methods are considered. It is important to note that only data from FEM simulations were collected, as most Discrete Element Method (DEM) simulations were two-dimensional and did not account for the effect of vertical in situ stress [
1,
15,
46].
Through literature research [
19,
20,
22,
27,
35], the following input features were selected from a wide range of features associated with borehole breakout: rock internal friction angle (
), cohesion (
), rock modulus (
), Poisson’s ratio (
), mud pressure in the borehole (
), minimum horizontal principal stress (
), vertical principal stress (
), maximum horizontal principal stress (
), and the radius of the borehole (
).
Meanwhile, a total of 501 sets of relevant data were collected from the existing research literature to construct a borehole breakout size dataset [
16,
28,
29,
30,
37,
47,
48,
49,
50,
51]. Considering that the absolute magnitude of the borehole breakthrough depth is significantly correlated with the borehole radius, and that the borehole radius setting varies in different studies, the borehole breakthrough depth is dimensionless normalized to improve the model generalization ability.
Table 2 summarizes the statistical metrics of the constructed dataset, taking into account the significant variation in borehole sizes involved in different studies, and transforming the borehole breakout depths accordingly. Additionally, it is noted that in the physical experiments, none of them set the mud pressure in the borehole, thus it is considered to be 0 MPa.
Figure 5 illustrates the distribution relationship between the breakout sizes. From the figure, it is evident that all the borehole breakout sizes obtained by different researchers exhibit a pronounced nonlinear relationship. This initial observation confirms the feasibility of employing the multi-output approach for borehole breakout size prediction. Additionally, the correlation analysis between the input features and borehole breakout size is further presented in
Figure 6.
The correlation analysis of the dataset aims to assess whether there is excessive correlation among the input features, potentially leading to multicollinearity issues that could impact the model’s performance. As illustrated in
Figure 6a, the highest correlation observed between features is 0.86. Notably, for identical input features, the borehole breakout angle and depth exhibit completely opposite correlations. Additionally, an input-output distribution analysis is conducted to identify and mitigate the presence of outliers in the dataset, which could disrupt model training and evaluation.
Figure 6b illustrates that there are no apparent outliers in the dataset. Furthermore, it is evident that the distribution of features
is more uniform, whereas the distribution of features
is denser. Consequently, the primary focus of the study appears to be on features
.
Reasonably setting evaluation metrics for model assessment is crucial for selecting the optimal model correctly. Drawing from previous ML studies [
52,
53,
54,
55,
56,
57,
58,
59,
60], commonly used evaluation metrics include the Pearson Correlation Coefficient (R
2), Mean Absolute Error (MAE), Variance Accounted For (VAF), and Root Mean Squared Error (RMSE). The calculation methods for these metrics are outlined as:
where
denotes the predicted value,
denotes the mean of the predicted values,
denotes the actual value,
denotes the mean of the actual values,
denotes the number of data.
4. Result and Discussion
4.1. Model Training
In this study, a novel meta-heuristic optimization algorithm, WAOA, is combined with ANN, RF, and XGBoost to construct more efficient hybrid multi-output models for predicting borehole breakout size. According to the experience and theoretical research of researchers on hyperparameter adjustment of ANN, RF and XGBoost regression prediction model with multi-feature input, the hyperparameter objects and ranges optimized by WAOA in this study are shown in
Table 3.
Figure 7 illustrates the construction, optimization and comparison process of the hybrid multi-output model proposed in this study. After determining the influencing factors of borehole breakout size by analyzing and comparing the existing literature, a substantial amount of relevant data was collected from the existing literature to construct a dataset. To balance the accuracy requirement of the model to explore the intricate nonlinear relationship between input and output factors in the training process, and the rigor requirement of the model comparison process, the dataset was randomly divided into the training set (80%) and the test set (20%).
To comprehensively investigate the intricate nonlinear relationship between input features and borehole breakout size, and to leverage the potential correlation between breakout size, the ANN, RF, and XGBoost was employed as the foundation, with WAOA utilized as the optimization tool to develop a hybrid multi-output prediction model for borehole breakout size. In the optimization process of WAOA, MSE is chosen as the fitness evaluation metric which can be calculated as:
Simultaneously, to address inconsistencies in units among different outputs and ensure a balanced consideration of various output prediction accuracies in the overall fitness evaluation of the model, all input and output features were normalized to [0, 1]. The normalization process can be presented as:
where
denotes the value after normalization,
denotes the value before normalization,
denotes the minimum value of the feature,
denotes the maximum value of the feature. This normalization is an equiproportional transformation of the entire range of features, it does not change the distribution shape of the features, and therefore does not disrupt the potential nonlinear relationships in the dataset.
Since model hyperparameter optimization involves continuously adjusting the hyperparameter settings to fit the training data, overfitting is prone to occur if a single training set is used exclusively for training and evaluating the fitness of adjustments. To mitigate the risk of overfitting in the iterative process of the hybrid multi-output model, 5-fold cross-validation was employed during model training. Multifold cross-validation is an effective method for preventing overfitting during iteration and enhancing model generalization performance. By iteratively dividing the training set data into training and testing folds, multiple validations of the model’s performance for a given hyperparameter setting were conducted to accurately assess the effectiveness of hyperparameter adjustments.
Drawing from previous research experience, the optimization effect of the meta-heuristic optimization algorithm on the target model is significantly influenced by the population size. Therefore, in this study, five different population sizes (10, 20, 40, 80, 160) will be employed to enable the optimization algorithm to accurately search for high-performance hyperparameter settings.
Figure 8 illustrates the curve of model fitness with the number of iterations for different population size settings, and it can be seen that the fitness of the model stabilizes as the number of iterations increases. The ultimate hyperparameter settings of the model through the optimization of WAOA at different population size settings are shown in
Table 4.
Table 5,
Table 6 and
Table 7 summarize the performance and ranking scores of the models trained under different pop sizes on the four predictive evaluation metrics used, and
Figure 9 provides a visual comparison of the model performance scores.
Figure 9 combines the rank of the models on various indicators and assigns scores to them, enabling a comprehensive comparison of multiple indicators with different scales. In the subplots of
Figure 9, the internal wind rose plot represents the model’s separate scores for predicting
and
, and the external one represents the model’s combined scores for predicting
and
on the training and test sets.
4.1.1. Determination of Optimal WAOA-ANN Models
In multi-output ANN models, different outputs share the output layer and the hidden layer, and thus the adjustment of the weights and biases of the hidden layer neurons during the training process is jointly affected by all outputs. The differences between different outputs are mainly caused by the differences in the weights and biases adopted by their respective output layer neurons. The fitness curves of the five WAOA-ANN models under different pop sizes are shown in
Figure 8a,b. From the figure, for the prediction of
, the optimal fitness of the model is stable between
and
, while for the prediction of
, the optimal fitness of the model is stable between
and
. There is a significant decrease in fitness compared to when iterating the initial model.
Even though the model fitness is evaluated by 5-fold cross-validation during the iteration process, overfitting of the model is largely avoided. However, during each iteration, the data in the training set are directly or indirectly involved in the assessment of model fitness, and because it is a multi-output model, the model may not perform equally on two predicted objects, so the ranking of the final model fitness cannot completely reflect the model prediction performance. To completely evaluate the model prediction performance and generalization performance to select the optimal WAOA-ANN model, the resulting models are further compared on the training and test sets.
Table 5 lists the prediction performance and ranking scores of the five WAOA-RF models on the training and test sets. As can be seen from
Table 5, there is no model that can rank first in both
and
prediction on both the training and test sets at the same time. For example, the WAOA-RF model obtained with a pop size setting of 10 ranks first in all four metrics for the prediction of
on the training set and test set, but ranks poorly for the prediction of
on test set. This suggests that multi-output ANN models with different hyperparameter settings have their own focus on different outputs. For the sake of model comprehensiveness, a superior multi-output model should be balanced in the prediction performance of different outputs. Therefore, further the predictive performance of different WAOA-ANN models for
and
on the training set and the test set is shown in
Figure 9a. As can be seen in
Figure 9a, for the five WAOA-ANN models obtained, the models with the pop size setting of 10 and 80 accounted for the highest scores for
and
predictions, respectively, and the model with a pop size of 10 (Num dense layers = 4, Num dense nodes = 100, Learning rate =
) had the highest combined score. Therefore, in the subsequent study, the WAOA-ANN model with a pop size of 10 was used as a representative of the WAOA-ANN model.
4.1.2. Determination of Optimal WAOA-RF Models
For multi-output RF models, node splitting takes into account the variance of the different outputs in the child nodes when constructing the decision tree to achieve a comprehensive consideration of the different outputs.
Figure 8c,d illustrate the variation in fitness during iterations of the WAOA-RF model. As can be seen from the figure, the distribution of the optimal fitness of the five WAOA-RF models for the prediction of
ranges from
and
, and the distribution of the prediction of
ranges from
and
.
Table 6 shows the performance of the five WAOA -RF models in terms of evaluation metrics on the training and test sets, as well as their ranking scores. From
Table 6, it can be noticed that the model with a pop size of 80 takes the absolute lead in the performance of the training set and test set, in the prediction of θ (training set: R
2 = 0.9938, MAE = 0.0083, VAF = 0.9864, RMSE = 0.0149, test set: R
2 = 0. 9704, MAE = 0.0174, VAF = 0. 9395, RMSE = 0.0311). Further analyses of
Figure 9b revealed that the models with pop size 80 and 20 occupy the highest scores for predicting
and
, respectively, and the model with the highest combined score is the model with pop size 80 (n_estimators = 1000, max_depth = 1000).
4.1.3. Determination of Optimal WAOA-XGBoost Models
The multi-output XGBoost model has the same node splitting strategy as the multi-output RF model when constructing decision trees. The difference is that XGBoost takes into account the bias of the previous decision tree when constructing a new decision tree, effectively reducing the variation in prediction performance between different outputs. From
Figure 8e,f, the optimal fitness of the five WAOA-XGBoost models is distributed between
and
for the prediction of
, and between
and
for the prediction of
.
Table 7 summarizes the prediction performance and ranking scores of the five WAOA-XGBoost models on the training and test sets. As can be seen from the table, the prediction performance and ranking scores of most WAOA-XGBoost models in
and
are relatively consistent. This can be observed more intuitively in
Figure 9c, where the models with pop sizes of 10, 80 and 160 each have essentially the same scores on
and
. This confirms the balanced consideration of potential links between different outputs and input features in the multi-output XGBoost model.
It can also be seen from
Figure 9c that the model with a pop size of 10 occupies the highest score for predicting
. The model with a pop size of 80 (max_depth = 4, n_estimators = 374, learning_rate = 0.1626) occupies the highest score for predicting
as well as the highest combined score. In the subsequent study, the model with a pop size of 80 was chosen as a representative of WAOA-XGBoost.
4.2. Model Comparison
Through comparing the optimization effects of the optimization algorithms on the multi-output ANN, RF, and XGBoost models across various population size settings, the optimal results of the three hybrid multi-output models were identified. Regarding the predictive performance, all three models exhibited exceptionally strong capabilities. It is clear that they accurately captured the potential relationship between input features and outputs. However, it remains uncertain whether the potential relationship between the outputs has been appropriately utilized. To further select the final recommended hybrid multi-output model and validate the effectiveness of employing a multi-output setup, a comparative analysis of the obtained models is conducted.
Figure 10 provides an intuitive representation of the effectiveness of three different hybrid multi-output models in predicting borehole breakout size. In the scatter plots, where the actual values are plotted on the
x-axis and the predicted values on the
y-axis, the scatter points in all figures closely align around the y = x line, with most data points exhibiting a relative error within 15%. From the kernel density distributions of the actual and predicted values in the upper and right parts of the scatterplot, the distributions of predicted and actual values are very similar in both the training and test sets. This evidences the excellent performance of the three hybrid multi-output models, with accurate exploration of the potential correlation between input and output.
It is noteworthy that there is no significant difference in the predictive performance of the three models, either on the training or test sets, concerning predicting and . However, regarding the difference in prediction performance between the training set and the test set, the WAOA-ANN model demonstrates a more stable performance, whereas the WAOA-RF and WAOA-XGBoost models exhibit a noticeable decrease in predictive performance in the test set compared to the training set. When comparing the three hybrid multi-output models, it becomes evident that the WAOA-ANN model excels in exploring potential relationships between input and output variables while effectively mitigating overfitting and underfitting issues.
Given that the performance of the three hybrid multi-output models is relatively similar, the comparison of their distributions based on the scatterplot may not distinctly distinguish between the model performances. To provide a clearer assessment, the model performance is further illustrated using a Taylor diagram. This diagram facilitates a visual comparison of the three evaluation metrics (R2, RMSE, and standard deviation) of the models in a single image.
In a Taylor diagram, the closer the data points are to the measurement point, the better the performance of the model they represent. Meanwhile, to evaluate the model’s capacity to leverage potential relationships between outputs, single-output ANN, RF, and XGBoost models were constructed with identical hyperparameter settings for comparative analysis.
Figure 11 presents the Taylor diagram of the prediction performance of different models on the test set. As can be seen from the figure, the WAOA-ANN model is closest to the measure point in both
and
prediction. Additionally, there is a notable enhancement in the prediction performance compared to the single-output ANN model. For the WAOA-RF and WAOA-XGBoost model, on the one hand the predictive performance of the model on the test set is weaker than the WAOA-ANN model. On the other hand, the predictive performance of the multi-output RF and XGBoost models is in close agreement with that of the single-output models.
This demonstrates the strong generalization performance of the WAOA-ANN model, as well as its effective utilization of the potential connection between borehole breakout angle and depth. In comparison to the single-output ANN model, the WAOA-ANN model demonstrates superior prediction performance and requires fewer training resources for predicting the angle and depth of borehole breakout. For the WAOA-RF and WAOA-XGBoost models, there is no significant improvement compared to the single-output RF and XGBoost models. Additionally, their hyperparameters result in large model sizes, making the training process time-consuming and memory intensive, offering no practical advantage over the single-output RF and XGBoost models. Based on the comparative analyses of the models, the WAOA-ANN model is recommended for predicting borehole breakout size. Furthermore, the WAOA-ANN model undergoes further sensitivity analyses to explore the impacts of various input features on borehole breakout size.
4.3. Sensitivity Analysis
SHAP (SHapley Additive exPlanations) is a commonly employed model interpretation technique rooted in the concept of Shapley values from game theory [
61,
62,
63,
64]. It quantifies the contribution of input features to the model output through SHAP values. Analysis conducted with SHAP allows for the assessment of feature influence from both global and local perspectives, offering a more intuitive interpretation of model prediction outcomes. Furthermore, based on SHAP analysis results, adjustments to feature sizes can be made more scientifically and effectively to achieve target output values, serving as a crucial reference for design purposes.
The WAOA-ANN model input feature impact rankings obtained based on SHAP analysis are shown in
Figure 12. For the prediction of
, the ranking of the input feature’s impact on the output is
and for the prediction of
, the ranking is
. The impact of input features on the output (promoting or reducing) changes with the value of the feature, and
Figure 13 demonstrates the impact of different features on the WAOA-ANN output. From the figure, it can be seen that
is positively correlated with
, negatively correlated with
, and has a complex relationship with the
.
is positively correlated with
, negatively correlated with
, and has a complex relationship with the
.
5. Conclusions
Borehole breakout, as a critical factor influencing borehole quality, has consistently attracted significant attention from researchers. This study collected a total of 501 datasets from numerical simulations and physical experiments, encompassing in situ stress, borehole dimensions, rock properties of the borehole wall, and mud pressure in the borehole as input features, with borehole breakout size as the output data. Three hybrid multi-output models (WAOA-ANN, WAOA-RF, WAOA-XGBoost) were developed using ANN, RF, and XGBoost as base algorithms, augmented by the WAOA optimization technique. The predictive performance of these models was compared and analyzed, while a single-output model was constructed to further assess the efficacy of the multi-output framework. The principal findings of this investigation are delineated below:
(1) All three hybrid multi-output models exhibit excellent prediction performance for borehole breakout size. Particularly, the WAOA-ANN model demonstrates consistent performance across both the training and test sets, surpassing WAOA-RF and WAOA-XGBoost on the test set (for training (test): R2 = 0.9906 (0.9819), MAE = 0.0095 (0.0142), VAF = 0.9803 (0.9629), RMSE = 0.018 (0.0244); for training (test): R2 = 0.9839 (0.978), MAE = 0.0167 (0.0208), VAF = 0.9679 (0.9556), RMSE = 0.0284 (0.0334)). Consequently, the WAOA-ANN model is deemed to possess superior prediction capabilities and stronger generalization ability.
(2) The WAOA-ANN model effectively leverages the potential correlation between borehole breakout sizes, resulting in a substantial enhancement in prediction performance compared to the single-output model. Conversely, WAOA-RF and WAOA-XGBoost exhibited no improvement in performance relative to the single-output model with identical hyperparameter settings, while also increasing the computational resource requirements of the model.
(3) The sensitivity analyses conducted on WAOA-ANN indicate that for both and , exhibits the highest sensitivity. However, there exists a considerable disparity in the rankings of input features concerning the impact on and (: , : ). Additionally, variations are observed in the correlation of the same feature with and . For instance, displays a positive correlation with but a negative correlation with . Similarly, and exhibit a positive correlation with but a nonmonotonically correlation with .
In conclusion, all three hybrid multi-output models developed achieved effective prediction of borehole breakout size, but only the WAOA-ANN model successfully demonstrated the advantages of the multi-output structure and performed better in terms of overall prediction performance. Therefore, the obtained WAOA-ANN is considered to be an effective hybrid multi-output model for borehole breakout size prediction. It is important to note that prediction results may be biased if the input feature values used exceed the range of the dataset in this paper. Hence, subsequent studies may consider covering a more comprehensive range of eigenvalues to enhance the robustness of the predictions.