Borehole Breakout Prediction Based on Multi-Output Machine Learning Models Using the Walrus Optimization Algorithm

Zhang, Rui; Zhou, Jian; Tao, Ming; Li, Chuanqi; Li, Pingfeng; Liu, Taoying

doi:10.3390/app14146164

Open AccessArticle

Borehole Breakout Prediction Based on Multi-Output Machine Learning Models Using the Walrus Optimization Algorithm

by

Rui Zhang

¹,

Jian Zhou

^1,*

,

Ming Tao

¹,

Chuanqi Li

²

,

Pingfeng Li

³ and

Taoying Liu

^1,*

¹

School of Resources and Safety Engineering, Central South University, Changsha 410083, China

²

Laboratory 3SR, CNRS UMR 5521, Grenoble Alpes University, 38000 Grenoble, France

³

Hongda Blasting Engineering Group Co., Ltd., Guangzhou 510623, China

^*

Authors to whom correspondence should be addressed.

Appl. Sci. 2024, 14(14), 6164; https://doi.org/10.3390/app14146164

Submission received: 21 June 2024 / Revised: 9 July 2024 / Accepted: 12 July 2024 / Published: 15 July 2024

Download

Browse Figures

Versions Notes

Abstract

:

Borehole breakouts significantly influence drilling operations’ efficiency and economics. Accurate evaluation of breakout size (angle and depth) can enhance drilling strategies and hold potential for in situ stress magnitude inversion. In this study, borehole breakout size is approached as a complex nonlinear problem with multiple inputs and outputs. Three hybrid multi-output models, integrating commonly used machine learning algorithms (artificial neural networks ANN, random forests RF, and Boost) with the Walrus optimization algorithm (WAOA) optimization techniques, are developed. Input features are determined through literature research (friction angle, cohesion, rock modulus, Poisson’s ratio, mud pressure, borehole radius, in situ stress), and 501 related datasets are collected to construct the borehole breakout size dataset. Model performance is assessed using the Pearson Correlation Coefficient (R²), Mean Absolute Error (MAE), Variance Accounted For (VAF), and Root Mean Squared Error (RMSE). Results indicate that WAOA-ANN exhibits excellent and stable prediction performance, particularly on the test set, outperforming the single-output ANN model. Additionally, SHAP sensitivity analysis conducted on the WAOA-ANN model reveals that maximum horizontal principal stress (

σ_{H}

) is the most influential parameter in predicting both the angle and depth of borehole breakout. Combining the results of the studies and analyses conducted, WAOA-ANN is considered to be an effective hybrid multi-output model in the prediction of borehole breakout size.

Keywords:

borehole breakout; ANN; RF; XGBoost; Walrus optimization algorithm; multi-output

1. Introduction

After drilling, rock masses subjected to uneven loads may experience stress concentration areas near the borehole and its vicinity due to stress redistribution. When the stress in these concentration areas exceeds the strength of the rock, fracturing of the borehole wall occurs, termed as borehole breakout [1]. Borehole breakout significantly impacts drilling efficiency and borehole quality, thus garnering substantial attention from researchers [2,3]. Accurate assessment of borehole breakout size under different conditions is crucial for enhancing underground resource development.

The borehole breakout phenomenon was first documented in a mining report from a South African gold mine [4] and later observed again in oil well development [5]. Further research revealed that in vertical boreholes, the direction of breakout consistently aligns parallel to the minimum horizontal in situ stress direction [6,7,8]. Meanwhile, the laboratory study identified the existence of two primary shapes of breakout, V-shape and dog-ear shape. Figure 1 illustrates the dog-ear-shaped borehole breakout shape with associated symbolic annotations. Significant disparities exist in the fracture mechanisms of rocks during the formation of the two breakout shapes, and their sizes demonstrate a notable correlation with the magnitude of in situ stresses [9]. Consequently, numerous researchers embarked on establishing the correlation between borehole breakout sizes and in situ stress, aiming to further elucidate the mechanism of borehole breakout. For example, Zoback et al. [10] combined the Mohr–Coulomb criterion with the Kirsch equation to develop a correlation model between in situ stress and borehole breakout size. Al-Ajmi [11] analyzed the vertical borehole stress distribution by linear elastic model and evaluated the rock stress state by Mogi–Coulomb criterion to construct the correlation between borehole mud pressure, borehole breakout and in situ stress.

However, because the process of borehole breakout is a continuous development, the stresses in the borehole wall and surrounding area will be further redistributed after the fractured rock is exfoliated, which will result in new rock exfoliation. Therefore, the models proposed by Zoback et al. [10] and Al-Ajmi [11] have certain limitations. The advancement of numerical simulation methods has offered an effective approach to addressing this issue, with several researchers successfully simulating the continuous development process of borehole breakout [12,13,14]. In recent years, numerous researchers have also been investigating borehole breakout through numerical simulation methods. For instance, in 2019, Lin et al. [15] examined the influence of borehole size and temperature on borehole breakout development using Discrete Element Method (DEM). They explored the pattern of microcrack development in the borehole wall and its surroundings under varying borehole sizes and temperatures. Zhang et al. [16] conducted a study on the impacts of borehole breakouts on breakdown pressure using finite element method (FEM) simulation. They established a correlation between in situ stress, borehole breakout size, and breakdown pressure. Xiang et al. [17] employed a three-dimensional bonded-particle model to simulate borehole breakout and successfully replicated the V-shape breakout phenomenon.

From the existing experimental, theoretical, and numerical simulation analyses and studies on borehole breakout, it is evident that the size of borehole breakout is influenced by factors such as mineral composition, porosity, grain strength, intergranular strength, angle of internal friction, cohesion, borehole mud pressure, temperature, in situ stress, Poisson’s ratio, rock modulus, and borehole size [18,19,20,21,22]. To achieve accurate assessment of borehole breakout, it is apparent that all relevant features should be considered as comprehensively as possible. The intricate nonlinear relationship among numerous features poses a significant challenge for conventional experimental and theoretical analyses, while numerical simulation is also complex and challenging. The emergence of machine learning (ML) methods offers a novel approach to address this issue.

Existing ML methods possess exceptional learning capabilities for capturing complex nonlinear relationships among different factors. Consequently, they have been widely applied by researchers for investigating engineering problems involving multi-feature considerations [23,24,25]. Furthermore, in Sharma et al.’s study, data from different wells were used to train the prediction model for the physical data of the rock at the drill bit during drilling, and the results had fairly good accuracy. This shows that machine learning models are well able to find potential nonlinear relationships between data from mixed datasets [26]. In the domain of borehole breakout assessment, ML methods have also seen extensive application. Table 1 collects some existing studies on ML for constructing borehole breakout association models, detailing the features, data sources, and algorithms they employed. The comparison reveals that the primary source of data for existing studies is finite element simulation. Additionally, much of the data obtained through literature review also originates from FEM, as seen in the study by Benemaran [27]. It was observed that studies directly utilizing borehole breakout size as an output are categorized into single and multiple outputs, and multiple output studies mainly employ artificial neural network (ANN) [28,29,30].

Single output refers to constructing multiple models based on the same input features to predict multiple targets separately when dealing with problems involving multiple prediction targets. On the other hand, multi-output refers to constructing a single model to predict multiple targets simultaneously. Compared to single-output approaches, multi-output models take into account the relationship between input features and targets, while also considering potential interdependencies among targets and objectives [31,32]. Hence, when encountering multi-output problems with implicit relationships between outputs, multi-output models typically exhibit superior predictive performance compared to single-output models. Moreover, the training process of multi-output models tends to be more resource-efficient [33,34]. Existing studies lack theoretical research on the correlation between breakout angle and breakout depth. However, Soroush’s study demonstrates that incorporating breakout angle into input features significantly enhances the prediction accuracy of breakout depth. This indirectly suggests an implicit correlation between breakout angle and breakout depth [35]. Consequently, for predicting borehole breakout size, multi-output models hold considerable promise.

In existing multi-output studies on borehole breakout, single data sources are typically utilized, and certain crucial factors are often overlooked, resulting in limited generalization performance of the obtained models. Furthermore, no researcher has conducted a comparison between the established multi-output model and the single-output model, thus precluding the ability to ascertain the effectiveness of applying the multi-output structure. Obviously, further development and testing are still needed to validate the applicability of the multi-output structure in borehole breakout size prediction and to obtain models with high generalization performance and prediction accuracy for borehole breakout size prediction.

In this study, the prediction of borehole breakout size is treated as a multi-output problem considering the influence of multiple features. Through literature research, nine features deemed relevant to borehole breakout were selected as inputs to the model, and a dataset comprising 501 sets of related numerical simulation and physical experiment data was collected. To accurately map the potential connections between the input features and the target, a novel meta-heuristic optimization algorithm (WAOA) combined with three base models (ANN, RF, and XGBoost) was chosen to construct a hybrid multi-output model. The trained WAOA-ANN, WAOA-RF, and WAOA-XGBoost models were utilized to predict borehole breakout size under various conditions. Subsequently, the three hybrid multi-output models obtained are compared and discussed alongside the single-output model within the same setup to validate the effectiveness of employing the multi-output structure and to determine the optimal approach for borehole breakout size assessment. Finally, the relationship between the input features and outputs of the optimal models is analyzed and discussed using the SHAP sensitivity analysis method.

Table 1. Several borehole breakout prediction related studies.

Researchers	Data Source	Method	Input	Output	Best	Dataset	Output Form
Zhang et al. [29]	FEM	ANN	$σ_{h}$ $σ_{H}$	$r_{b}$	R²: 0.99	training	Multi-output
				$θ$	R²: 0.99	training
				$r_{b}$	R²: 0.99	validation
				$θ$	R²: 0.99	validation
				$r_{b}$	R²: 0.99	test
				$θ$	R²: 0.99	test
Zhang et al. [28]	FEM	ANN	$σ_{h}$ $σ_{H}$	$r_{b}$	R²: 0.99	training	Multi-output
				$θ$	R²: 0.99	training
				$r_{b}$	R²: 0.99	validation
				$θ$	R²: 0.99	validation
				$r_{b}$	R²: 0.99	test
				$θ$	R²: 0.99	test
Zhang and Yin [16]	FEM	ANN	$σ_{h}$ $σ_{H}$	$r_{b}$	R²: 0.98	training	Multi-output
				$θ$	R²: 0.98	training
				$r_{b}$	R²: 0.94	validation
				$θ$	R²: 0.94	validation
				$r_{b}$	R²: 0.99	test
				$θ$	R²: 0.99	test
Lin, Singh et al. [36]	Literature Review, Laboratory experiment	ANN	${B W S}^{1}$ $σ_{v}$ $θ$	$σ_{H}$	R²: 0.74	training	Single-output
					R²: 0.95	validation
					R²: 0.82	test
Lin, Kang et al. [15]	Literature Review	Kriging	$B W S$ $σ_{v}$ $\frac{r_{b}}{r}$	$σ_{H}$	MRE ²: 8.4%	test	Single-output
Soroush [35]	High quality acoustic image logs	MLPNN ⁸	${G R}^{3}$ ${R H O B}^{4}$ ${D T P}^{5}$ ${D T S}^{6}$ ${U C S}^{7}$ $μ$ $ν$	$θ$	R²: 0.98	training	Multi-output
				$r_{b}$	R²: 0.65	training
				$θ$	R²: 0.92	validation
				$r_{b}$	R²: 0.60	validation
				$θ$	R²: 0.94	test
				$r_{b}$	R²: 0.55	test
			$G R$ $R H O B$ $D T P$ $D T S$ $U C S$ $μ$ $ν$ $θ$	$r_{b}$	R²: 0.99	training	Single-output
					R²: 0.98	validation
					R²: 0.98	test
Jolfaei and Lakirouhani [37]	FEM	ANN	$σ_{h}$ $σ_{v}$ $σ_{H}$ $φ$ $c$	$θ$	R²: 0.90	Total	Single-output
Jolfaei and Lakirouhani [37]	FEM	ANN	$σ_{h}$ $σ_{v}$ $σ_{H}$ $φ$ $c$	$\frac{r_{B}}{r}$	R²: 0.85	Total	Single-output
Lin et al. [38]	Literature Review	ANN	$B W S$ $σ_{v}$ $θ$	$σ_{h}$	RMSE:3.29	training	Single-output
		ANN			RMSE:12.2	validation
		CART ⁹			RMSE:2.29	training
		CART ⁹			RMSE:10.47	validation
Benemaran [27]	Literature Review	XGBoost	$σ_{h}$ $σ_{v}$ $σ_{H}$ $φ$ $c$	$θ$	R²: 0.98	training	Single-output
				$θ$	R²: 0.98	test
				$\frac{r_{B}}{r}$	R²: 0.99	training
				$\frac{r_{B}}{r}$	R²: 0.98	test
H. Zhang et al. [39]	Literature Review	SVR ¹⁰	$B W S$ $θ$ $\frac{r_{b}}{r}$	$σ_{h}$	MRE: 9.59%	test	Single-output

¹

B W S

: borehole wall strength; ² MRE: mean relative error; ³

G R

: gamma ray; ⁴

R H O B

: formation bulk density; ⁵

D T P

: compressional sonic; ⁶

D T S

: shear sonic; ⁷

U C S

: uniaxial compressive strength; ⁸ MLPNN: Multilayer Perceptron Neural Network; ⁹ CART: classification and regression tree; ¹⁰ SVR: support vector regressor.

2. Methodologies

2.1. Artificial Neural Network (ANN)

Artificial neural network (ANN) are ML models that mimic the structure and function of biological neural networks, exhibiting strong learning capabilities for complex nonlinear problems [40,41]. Particularly suited for multiple-input multiple-output scenarios, ANN excels at learning and accurately mapping complex relationships between multi-input and multi-output variables. In the ANN model, the primary structure consists of an input layer, an output layer, and several hidden layers. Each layer comprises neurons, with the number of neurons in the input and output layers matching the number of input and output variables, respectively. There is no explicit requirement for the number of neurons per layer in the hidden layer. The input data are processed within neurons using a specific function called the activation function, and the results are then transmitted to neurons in the subsequent layer. The processing of input data by neurons can be represented by as:

y_{j} = f (\sum_{i = 1}^{n} w_{j i} x_{i} + b_{j})

(1)

where

x_{i}

is the

i

-th input data of the

j

-th neuron,

w_{j i}

is the weight of the

i

-th input data of the

j

-th neuron, and

b_{j}

is the bias of the

j

-th neuron. Figure 2 illustrates the fundamental structure of a multi-output ANN with two hidden layers.

2.2. Random Forest (RF)

Random forest (RF) belongs to the ensemble learning methods in ML algorithms, specifically categorized under the Bagging class. Its fundamental building block is the decision tree. The basic process for building a RF model involves: randomly selecting parts of training samples from the training set to create multiple new training sets; training multiple decision trees using the new training sets; aggregating the predictions of these decision trees to make the ultimate prediction. For regression tasks, the final output of RF model is the average of the predictions from the decision trees, while for classification tasks, it is the mode of the decision tree results. Figure 3 illustrates the algorithm flow of the RF model for regression prediction.

RF has been widely used in various fields such as medicine, biology, and engineering since it was proposed by Breiman in 1995 [42,43]. Compared to some existing ML algorithms, RF demonstrates higher accuracy and excellent performance in handling high-dimensional problems. Additionally, RF achieves good prediction accuracy even when dealing with missing values.

2.3. Extreme Gradient Boosting (XGBoost)

XGBoost is an ensemble learning algorithm that extends the gradient boosting decision tree algorithm by introducing regularization terms and second-order derivative information. It belongs to the Boosting class of ensemble algorithms and operates by integrating multiple weak learners into a single strong learner. The fundamental building block of XGBoost is the decision tree. Unlike random forest, in XGBoost, each subsequent decision tree is generated considering the residuals of the previous one, thereby continuously improving the learning of the model. Due to its exceptional learning and generalization capabilities, XGBoost has been successfully applied to numerous complex nonlinear problems. Moreover, XGBoost demonstrates strong applicability and outstanding accuracy in addressing multi-output problems [44].

2.4. The Walrus Optimization Algorithm (WAOA)

Trojovský and Dehghani propose a new meta-heuristic optimization algorithm called WAOA that mimics the behavior of walruses feeding, migrating, and avoiding predators in nature [45]. During each iteration of WAOA, the update of individual locations is divided into three phases: exploration, migration, and exploitation, as shown in Figure 4.

Phase 1 simulates the feeding behavior of walruses, in which walruses in the population move towards the leader which have optimal fitness for better feeding conditions. In this phase, the individual position update method can be expressed as:

{X_{i}^{p}}^{1} (t + 1) = X_{i}^{3} (t) + r_{1} (X_{g} - I_{1} X_{i}^{3} (t))

(2)

X_{i}^{1} (t + 1) = \{\begin{matrix} X_{i}^{p 1} (t + 1), f_{i}^{p 1} < f_{i} \\ X_{i} (t), f_{i}^{p 1} \geq f_{i} \end{matrix}

(3)

where

{X_{i}^{p}}^{1} (t + 1)

denotes the new position of the

i

-th walrus obtained from the phase 1 update at iteration

t + 1

,

X_{i}^{3} (t)

denotes the

i

-th walrus position at iteration

t

,

r_{1}

is a random number form the interval [0, 1],

X_{g}

denotes the global best solution,

I_{1}

is a random number form the interval [1, 2],

X_{i}^{1} (t + 1)

denotes the final position of

i

-th walrus obtained from the phase 1 update at iteration

t + 1

,

{f_{i}^{p}}^{1}

and

f_{i}

denotes the fitness value while walrus’s position at

{X_{i}^{p}}^{1} (t + 1)

and

X_{i} (t)

respectively.

Phase 2 simulates the migration behavior of walruses. During the migration process, the walrus moves toward to a random select walrus, which can represent use:

X_{i}^{p 2} (t + 1) = \{\begin{matrix} X_{i}^{1} (t + 1) + r_{2} (X_{r}^{1} (t + 1) - I_{2} X_{i}^{1} (t + 1)), f_{r} < f_{i}^{1} \\ X_{i}^{1} (t + 1) + r_{2} (X_{i}^{1} (t + 1) - X_{r}^{1} (t + 1)), f_{r} \geq f_{i}^{1} \end{matrix}

(4)

X_{i}^{2} (t + 1) = \{\begin{matrix} X_{i}^{p 2} (t + 1), f_{i}^{p 2} < {f_{i}^{p}}^{1} \\ X_{i}^{1} (t + 1), f_{i}^{p 2} \geq {f_{i}^{p}}^{1} \end{matrix}

(5)

where

{X_{i}^{p}}^{2} (t + 1)

denotes the new position of the

i

-th walrus obtained from the phase 2 update at iteration

t + 1

,

X_{r}^{1} (t + 1)

denotes the position of selected walrus that the

i

-th walrus toward to migrate and

f_{r}

denotes its fitness value,

f_{i}^{1}

denotes the fitness value while walrus’s position at

X_{i}^{1} (t + 1)

,

X_{i}^{2} (t + 1)

denotes the final position of

i

-th walrus obtained from the phase 2 update at iteration

t + 1

,

r_{2}

and

I_{2}

are random number taken from the intervals [0, 1] and [1, 2], respectively,

f_{i}^{p 2}

denotes the fitness value while walrus’s position at

{X_{i}^{p}}^{2} (t + 1)

.

Phase 3 simulates the natural behavior of walruses escaping and fighting with the predators, which can represent use:

{X_{i}^{p}}^{3} (t + 1) = X_{i}^{2} (t + 1) + (L B_{(t + 1)} + (U B_{(t + 1)} - r_{3} L B_{(t + 1)}))

(6)

\{\begin{matrix} L B_{t + 1} = \frac{L B}{t + 1} \\ U B_{t + 1} = \frac{U B}{t + 1} \end{matrix}

(7)

X_{i}^{3} (t + 1) = \{\begin{matrix} X_{i}^{p 3} (t + 1), f_{i}^{p 3} < f_{i}^{2} \\ X_{i}^{2} (t + 1), f_{i}^{p 3} \geq f_{i}^{2} \end{matrix}

(8)

where

{X_{i}^{p}}^{3} (t + 1)

denotes the new position of the

i

-th walrus obtained from the phase 3 update at iteration

t + 1

,

r_{3}

is a random number form the interval [0, 1],

X_{i}^{3} (t + 1)

denotes the final position of

i

-th walrus obtained from the phase 3 update at iteration

t + 1

,

L B

and

U B

denotes the lower and upper bounds of the walrus’s position, respectively,

f_{i}^{2}

denotes the fitness value while walrus’s position at

X_{i}^{2} (t + 1)

,

f_{i}^{p 3}

denotes the fitness value while walrus’s position at

{X_{i}^{p}}^{3} (t + 1)

.

3. Data Description and Evaluation Metrics

Existing research on borehole breakout can be categorized into numerical simulation and physical experiments in terms of experimental methods. Numerical simulation offers greater convenience in setting up various external conditions to explore the influence of multiple dimensions, making it favored by many scholars. However, the results of physical experiments exhibit better consistency with field reality. In this paper, data obtained from both methods are considered. It is important to note that only data from FEM simulations were collected, as most Discrete Element Method (DEM) simulations were two-dimensional and did not account for the effect of vertical in situ stress [1,15,46].

Through literature research [19,20,22,27,35], the following input features were selected from a wide range of features associated with borehole breakout: rock internal friction angle (

φ

), cohesion (

c

), rock modulus (

E

), Poisson’s ratio (

ν

), mud pressure in the borehole (

P_{w}

), minimum horizontal principal stress (

σ_{h}

), vertical principal stress (

σ_{v}

), maximum horizontal principal stress (

σ_{H}

), and the radius of the borehole (

r

).

Meanwhile, a total of 501 sets of relevant data were collected from the existing research literature to construct a borehole breakout size dataset [16,28,29,30,37,47,48,49,50,51]. Considering that the absolute magnitude of the borehole breakthrough depth is significantly correlated with the borehole radius, and that the borehole radius setting varies in different studies, the borehole breakthrough depth is dimensionless normalized to improve the model generalization ability. Table 2 summarizes the statistical metrics of the constructed dataset, taking into account the significant variation in borehole sizes involved in different studies, and transforming the borehole breakout depths accordingly. Additionally, it is noted that in the physical experiments, none of them set the mud pressure in the borehole, thus it is considered to be 0 MPa.

Figure 5 illustrates the distribution relationship between the breakout sizes. From the figure, it is evident that all the borehole breakout sizes obtained by different researchers exhibit a pronounced nonlinear relationship. This initial observation confirms the feasibility of employing the multi-output approach for borehole breakout size prediction. Additionally, the correlation analysis between the input features and borehole breakout size is further presented in Figure 6.

The correlation analysis of the dataset aims to assess whether there is excessive correlation among the input features, potentially leading to multicollinearity issues that could impact the model’s performance. As illustrated in Figure 6a, the highest correlation observed between features is 0.86. Notably, for identical input features, the borehole breakout angle and depth exhibit completely opposite correlations. Additionally, an input-output distribution analysis is conducted to identify and mitigate the presence of outliers in the dataset, which could disrupt model training and evaluation. Figure 6b illustrates that there are no apparent outliers in the dataset. Furthermore, it is evident that the distribution of features

c, φ, σ_{h}, σ_{v}, σ_{H}

is more uniform, whereas the distribution of features

E, ν, P_{w}, r

is denser. Consequently, the primary focus of the study appears to be on features

c, φ, σ_{h}, σ_{v}, σ_{H}

.

Reasonably setting evaluation metrics for model assessment is crucial for selecting the optimal model correctly. Drawing from previous ML studies [52,53,54,55,56,57,58,59,60], commonly used evaluation metrics include the Pearson Correlation Coefficient (R²), Mean Absolute Error (MAE), Variance Accounted For (VAF), and Root Mean Squared Error (RMSE). The calculation methods for these metrics are outlined as:

R^{2} = \frac{{[\sum_{i = 1}^{n} (f_{i} - \bar{f}) (y_{i} - \bar{y})]}^{2}}{\sum_{i = 1}^{n} {(f_{i} - \bar{f})}^{2} \sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}

(9)

M A E = \frac{1}{n} \sum_{i = 1}^{n} |f_{i} - y_{i}|

(10)

V A F = 1 - \frac{var (f_{i} - y_{i})}{var (y_{i})}

(11)

R M S E = \sqrt{\frac{\sum_{i = 1}^{n} {(f_{i} - y_{i})}^{2}}{n}}

(12)

where

f

denotes the predicted value,

\bar{f}

denotes the mean of the predicted values,

y_{i}

denotes the actual value,

\bar{y}

denotes the mean of the actual values,

n

denotes the number of data.

4. Result and Discussion

4.1. Model Training

In this study, a novel meta-heuristic optimization algorithm, WAOA, is combined with ANN, RF, and XGBoost to construct more efficient hybrid multi-output models for predicting borehole breakout size. According to the experience and theoretical research of researchers on hyperparameter adjustment of ANN, RF and XGBoost regression prediction model with multi-feature input, the hyperparameter objects and ranges optimized by WAOA in this study are shown in Table 3.

Figure 7 illustrates the construction, optimization and comparison process of the hybrid multi-output model proposed in this study. After determining the influencing factors of borehole breakout size by analyzing and comparing the existing literature, a substantial amount of relevant data was collected from the existing literature to construct a dataset. To balance the accuracy requirement of the model to explore the intricate nonlinear relationship between input and output factors in the training process, and the rigor requirement of the model comparison process, the dataset was randomly divided into the training set (80%) and the test set (20%).

To comprehensively investigate the intricate nonlinear relationship between input features and borehole breakout size, and to leverage the potential correlation between breakout size, the ANN, RF, and XGBoost was employed as the foundation, with WAOA utilized as the optimization tool to develop a hybrid multi-output prediction model for borehole breakout size. In the optimization process of WAOA, MSE is chosen as the fitness evaluation metric which can be calculated as:

M S E = \frac{\sum_{i = 1}^{n} {(f_{i} - y_{i})}^{2}}{n}

(13)

Simultaneously, to address inconsistencies in units among different outputs and ensure a balanced consideration of various output prediction accuracies in the overall fitness evaluation of the model, all input and output features were normalized to [0, 1]. The normalization process can be presented as:

x = \frac{x^{'} - x_{\min}}{x_{\max} - x_{\min}}

(14)

where

x

denotes the value after normalization,

x^{'}

denotes the value before normalization,

x_{\min}

denotes the minimum value of the feature,

x_{\max}

denotes the maximum value of the feature. This normalization is an equiproportional transformation of the entire range of features, it does not change the distribution shape of the features, and therefore does not disrupt the potential nonlinear relationships in the dataset.

Since model hyperparameter optimization involves continuously adjusting the hyperparameter settings to fit the training data, overfitting is prone to occur if a single training set is used exclusively for training and evaluating the fitness of adjustments. To mitigate the risk of overfitting in the iterative process of the hybrid multi-output model, 5-fold cross-validation was employed during model training. Multifold cross-validation is an effective method for preventing overfitting during iteration and enhancing model generalization performance. By iteratively dividing the training set data into training and testing folds, multiple validations of the model’s performance for a given hyperparameter setting were conducted to accurately assess the effectiveness of hyperparameter adjustments.

Drawing from previous research experience, the optimization effect of the meta-heuristic optimization algorithm on the target model is significantly influenced by the population size. Therefore, in this study, five different population sizes (10, 20, 40, 80, 160) will be employed to enable the optimization algorithm to accurately search for high-performance hyperparameter settings. Figure 8 illustrates the curve of model fitness with the number of iterations for different population size settings, and it can be seen that the fitness of the model stabilizes as the number of iterations increases. The ultimate hyperparameter settings of the model through the optimization of WAOA at different population size settings are shown in Table 4. Table 5, Table 6 and Table 7 summarize the performance and ranking scores of the models trained under different pop sizes on the four predictive evaluation metrics used, and Figure 9 provides a visual comparison of the model performance scores. Figure 9 combines the rank of the models on various indicators and assigns scores to them, enabling a comprehensive comparison of multiple indicators with different scales. In the subplots of Figure 9, the internal wind rose plot represents the model’s separate scores for predicting

θ

and

r_{b} / r

, and the external one represents the model’s combined scores for predicting

θ

and

r_{b} / r

on the training and test sets.

4.1.1. Determination of Optimal WAOA-ANN Models

In multi-output ANN models, different outputs share the output layer and the hidden layer, and thus the adjustment of the weights and biases of the hidden layer neurons during the training process is jointly affected by all outputs. The differences between different outputs are mainly caused by the differences in the weights and biases adopted by their respective output layer neurons. The fitness curves of the five WAOA-ANN models under different pop sizes are shown in Figure 8a,b. From the figure, for the prediction of

θ

, the optimal fitness of the model is stable between

7.06 \times 10^{- 4}

and

1.14 \times 10^{- 3}

, while for the prediction of

r_{b} / r

, the optimal fitness of the model is stable between

8 \times 10^{- 4}

and

1.2 \times 10^{- 3}

. There is a significant decrease in fitness compared to when iterating the initial model.

Even though the model fitness is evaluated by 5-fold cross-validation during the iteration process, overfitting of the model is largely avoided. However, during each iteration, the data in the training set are directly or indirectly involved in the assessment of model fitness, and because it is a multi-output model, the model may not perform equally on two predicted objects, so the ranking of the final model fitness cannot completely reflect the model prediction performance. To completely evaluate the model prediction performance and generalization performance to select the optimal WAOA-ANN model, the resulting models are further compared on the training and test sets. Table 5 lists the prediction performance and ranking scores of the five WAOA-RF models on the training and test sets. As can be seen from Table 5, there is no model that can rank first in both

θ

and

r_{b} / r

prediction on both the training and test sets at the same time. For example, the WAOA-RF model obtained with a pop size setting of 10 ranks first in all four metrics for the prediction of

θ

on the training set and test set, but ranks poorly for the prediction of

r_{b} / r

on test set. This suggests that multi-output ANN models with different hyperparameter settings have their own focus on different outputs. For the sake of model comprehensiveness, a superior multi-output model should be balanced in the prediction performance of different outputs. Therefore, further the predictive performance of different WAOA-ANN models for

θ

and

r_{b} / r

on the training set and the test set is shown in Figure 9a. As can be seen in Figure 9a, for the five WAOA-ANN models obtained, the models with the pop size setting of 10 and 80 accounted for the highest scores for

θ

and

r_{b} / r

predictions, respectively, and the model with a pop size of 10 (Num dense layers = 4, Num dense nodes = 100, Learning rate =

4.8307 \times 10^{- 4}

) had the highest combined score. Therefore, in the subsequent study, the WAOA-ANN model with a pop size of 10 was used as a representative of the WAOA-ANN model.

4.1.2. Determination of Optimal WAOA-RF Models

For multi-output RF models, node splitting takes into account the variance of the different outputs in the child nodes when constructing the decision tree to achieve a comprehensive consideration of the different outputs. Figure 8c,d illustrate the variation in fitness during iterations of the WAOA-RF model. As can be seen from the figure, the distribution of the optimal fitness of the five WAOA-RF models for the prediction of

θ

ranges from

1.817 \times 10^{- 3}

and

1.863 \times 10^{- 3}

, and the distribution of the prediction of

r_{b} / r

ranges from

2.34 \times 10^{- 3}

and

2.47 \times 10^{- 3}

. Table 6 shows the performance of the five WAOA -RF models in terms of evaluation metrics on the training and test sets, as well as their ranking scores. From Table 6, it can be noticed that the model with a pop size of 80 takes the absolute lead in the performance of the training set and test set, in the prediction of θ (training set: R² = 0.9938, MAE = 0.0083, VAF = 0.9864, RMSE = 0.0149, test set: R² = 0. 9704, MAE = 0.0174, VAF = 0. 9395, RMSE = 0.0311). Further analyses of Figure 9b revealed that the models with pop size 80 and 20 occupy the highest scores for predicting

θ

and

r_{b} / r

, respectively, and the model with the highest combined score is the model with pop size 80 (n_estimators = 1000, max_depth = 1000).

4.1.3. Determination of Optimal WAOA-XGBoost Models

The multi-output XGBoost model has the same node splitting strategy as the multi-output RF model when constructing decision trees. The difference is that XGBoost takes into account the bias of the previous decision tree when constructing a new decision tree, effectively reducing the variation in prediction performance between different outputs. From Figure 8e,f, the optimal fitness of the five WAOA-XGBoost models is distributed between

1.6 \times 10^{- 3}

and

2.0 \times 10^{- 3}

for the prediction of

θ

, and between

1.7 \times 10^{- 3}

and

2.5 \times 10^{- 3}

for the prediction of

r_{b} / r

. Table 7 summarizes the prediction performance and ranking scores of the five WAOA-XGBoost models on the training and test sets. As can be seen from the table, the prediction performance and ranking scores of most WAOA-XGBoost models in

θ

and

r_{b} / r

are relatively consistent. This can be observed more intuitively in Figure 9c, where the models with pop sizes of 10, 80 and 160 each have essentially the same scores on

θ

and

r_{b} / r

. This confirms the balanced consideration of potential links between different outputs and input features in the multi-output XGBoost model.

It can also be seen from Figure 9c that the model with a pop size of 10 occupies the highest score for predicting

θ

. The model with a pop size of 80 (max_depth = 4, n_estimators = 374, learning_rate = 0.1626) occupies the highest score for predicting

r_{b} / r

as well as the highest combined score. In the subsequent study, the model with a pop size of 80 was chosen as a representative of WAOA-XGBoost.

4.2. Model Comparison

Through comparing the optimization effects of the optimization algorithms on the multi-output ANN, RF, and XGBoost models across various population size settings, the optimal results of the three hybrid multi-output models were identified. Regarding the predictive performance, all three models exhibited exceptionally strong capabilities. It is clear that they accurately captured the potential relationship between input features and outputs. However, it remains uncertain whether the potential relationship between the outputs has been appropriately utilized. To further select the final recommended hybrid multi-output model and validate the effectiveness of employing a multi-output setup, a comparative analysis of the obtained models is conducted.

Figure 10 provides an intuitive representation of the effectiveness of three different hybrid multi-output models in predicting borehole breakout size. In the scatter plots, where the actual values are plotted on the x-axis and the predicted values on the y-axis, the scatter points in all figures closely align around the y = x line, with most data points exhibiting a relative error within 15%. From the kernel density distributions of the actual and predicted values in the upper and right parts of the scatterplot, the distributions of predicted and actual values are very similar in both the training and test sets. This evidences the excellent performance of the three hybrid multi-output models, with accurate exploration of the potential correlation between input and output.

It is noteworthy that there is no significant difference in the predictive performance of the three models, either on the training or test sets, concerning predicting

θ

and

r_{b} / r

. However, regarding the difference in prediction performance between the training set and the test set, the WAOA-ANN model demonstrates a more stable performance, whereas the WAOA-RF and WAOA-XGBoost models exhibit a noticeable decrease in predictive performance in the test set compared to the training set. When comparing the three hybrid multi-output models, it becomes evident that the WAOA-ANN model excels in exploring potential relationships between input and output variables while effectively mitigating overfitting and underfitting issues.

Given that the performance of the three hybrid multi-output models is relatively similar, the comparison of their distributions based on the scatterplot may not distinctly distinguish between the model performances. To provide a clearer assessment, the model performance is further illustrated using a Taylor diagram. This diagram facilitates a visual comparison of the three evaluation metrics (R², RMSE, and standard deviation) of the models in a single image.

In a Taylor diagram, the closer the data points are to the measurement point, the better the performance of the model they represent. Meanwhile, to evaluate the model’s capacity to leverage potential relationships between outputs, single-output ANN, RF, and XGBoost models were constructed with identical hyperparameter settings for comparative analysis. Figure 11 presents the Taylor diagram of the prediction performance of different models on the test set. As can be seen from the figure, the WAOA-ANN model is closest to the measure point in both

θ

and

r_{b} / r

prediction. Additionally, there is a notable enhancement in the prediction performance compared to the single-output ANN model. For the WAOA-RF and WAOA-XGBoost model, on the one hand the predictive performance of the model on the test set is weaker than the WAOA-ANN model. On the other hand, the predictive performance of the multi-output RF and XGBoost models is in close agreement with that of the single-output models.

This demonstrates the strong generalization performance of the WAOA-ANN model, as well as its effective utilization of the potential connection between borehole breakout angle and depth. In comparison to the single-output ANN model, the WAOA-ANN model demonstrates superior prediction performance and requires fewer training resources for predicting the angle and depth of borehole breakout. For the WAOA-RF and WAOA-XGBoost models, there is no significant improvement compared to the single-output RF and XGBoost models. Additionally, their hyperparameters result in large model sizes, making the training process time-consuming and memory intensive, offering no practical advantage over the single-output RF and XGBoost models. Based on the comparative analyses of the models, the WAOA-ANN model is recommended for predicting borehole breakout size. Furthermore, the WAOA-ANN model undergoes further sensitivity analyses to explore the impacts of various input features on borehole breakout size.

4.3. Sensitivity Analysis

SHAP (SHapley Additive exPlanations) is a commonly employed model interpretation technique rooted in the concept of Shapley values from game theory [61,62,63,64]. It quantifies the contribution of input features to the model output through SHAP values. Analysis conducted with SHAP allows for the assessment of feature influence from both global and local perspectives, offering a more intuitive interpretation of model prediction outcomes. Furthermore, based on SHAP analysis results, adjustments to feature sizes can be made more scientifically and effectively to achieve target output values, serving as a crucial reference for design purposes.

The WAOA-ANN model input feature impact rankings obtained based on SHAP analysis are shown in Figure 12. For the prediction of

θ

, the ranking of the input feature’s impact on the output is

σ_{H} > c > P_{w} > ν > E > r > φ > σ_{h} > σ_{v}

and for the prediction of

r_{b} / r

, the ranking is

σ_{H} > P_{w} > c > σ_{v} > E > φ > r > ν > σ_{h}

. The impact of input features on the output (promoting or reducing) changes with the value of the feature, and Figure 13 demonstrates the impact of different features on the WAOA-ANN output. From the figure, it can be seen that

θ

is positively correlated with

σ_{H}, ν, E, σ_{h}

, negatively correlated with

c, P_{w}, φ, σ_{v}

, and has a complex relationship with the

r

.

r_{b} / r

is positively correlated with

σ_{H}, σ_{v}

, negatively correlated with

P_{w}, c, E, φ

, and has a complex relationship with the

r, ν, σ_{h}

.

5. Conclusions

Borehole breakout, as a critical factor influencing borehole quality, has consistently attracted significant attention from researchers. This study collected a total of 501 datasets from numerical simulations and physical experiments, encompassing in situ stress, borehole dimensions, rock properties of the borehole wall, and mud pressure in the borehole as input features, with borehole breakout size as the output data. Three hybrid multi-output models (WAOA-ANN, WAOA-RF, WAOA-XGBoost) were developed using ANN, RF, and XGBoost as base algorithms, augmented by the WAOA optimization technique. The predictive performance of these models was compared and analyzed, while a single-output model was constructed to further assess the efficacy of the multi-output framework. The principal findings of this investigation are delineated below:

(1) All three hybrid multi-output models exhibit excellent prediction performance for borehole breakout size. Particularly, the WAOA-ANN model demonstrates consistent performance across both the training and test sets, surpassing WAOA-RF and WAOA-XGBoost on the test set (for

θ

training (test): R² = 0.9906 (0.9819), MAE = 0.0095 (0.0142), VAF = 0.9803 (0.9629), RMSE = 0.018 (0.0244); for

r_{b} / r

training (test): R² = 0.9839 (0.978), MAE = 0.0167 (0.0208), VAF = 0.9679 (0.9556), RMSE = 0.0284 (0.0334)). Consequently, the WAOA-ANN model is deemed to possess superior prediction capabilities and stronger generalization ability.

(2) The WAOA-ANN model effectively leverages the potential correlation between borehole breakout sizes, resulting in a substantial enhancement in prediction performance compared to the single-output model. Conversely, WAOA-RF and WAOA-XGBoost exhibited no improvement in performance relative to the single-output model with identical hyperparameter settings, while also increasing the computational resource requirements of the model.

(3) The sensitivity analyses conducted on WAOA-ANN indicate that for both

θ

and

r_{b} / r

,

σ_{H}

exhibits the highest sensitivity. However, there exists a considerable disparity in the rankings of input features concerning the impact on

θ

and

r_{b} / r

(

θ

:

σ_{H} > c > P_{w} > ν > E > r > φ > σ_{h} > σ_{v}

,

r_{b} / r

:

σ_{H} > P_{w} > c > σ_{v} > E > φ > r > ν > σ_{h}

). Additionally, variations are observed in the correlation of the same feature with

θ

and

r_{b} / r

. For instance,

σ_{v}

displays a positive correlation with

θ

but a negative correlation with

r_{b} / r

. Similarly,

ν

and

σ_{h}

exhibit a positive correlation with

θ

but a nonmonotonically correlation with

r_{b} / r

.

In conclusion, all three hybrid multi-output models developed achieved effective prediction of borehole breakout size, but only the WAOA-ANN model successfully demonstrated the advantages of the multi-output structure and performed better in terms of overall prediction performance. Therefore, the obtained WAOA-ANN is considered to be an effective hybrid multi-output model for borehole breakout size prediction. It is important to note that prediction results may be biased if the input feature values used exceed the range of the dataset in this paper. Hence, subsequent studies may consider covering a more comprehensive range of eigenvalues to enhance the robustness of the predictions.

Author Contributions

R.Z., methodology, validation, resources, visualization, software, and writing—original draft; J.Z., conceptualization, methodology, validation, investigation, visualization, writing—review and editing, supervision, and funding acquisition; M.T., formal analysis and writing—review and editing; C.L., investigation and writing—review and editing; P.L., writing—review and editing; T.L., validation, formal analysis, writing—review and editing, and supervision. All authors have read and agreed to the published version of the manuscript.

Funding

This research is partially supported by the National Natural Science Foundation of China (42177164), the Distinguished Youth Science Foundation of Hunan Province of China (2022JJ10073), the Outstanding Youth Project of Hunan Provincial Department of Education (23B0008) and the Zhumadian Key R&D Special Project (ZMDSZDZX2023006).

Data Availability Statement

All relevant data generated throughout this study are included in this article.

Conflicts of Interest

Author Pingfeng Li was employed by the company Hongda Blasting Engineering Group Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Lee, H.; Moon, T.; Haimson, B. Borehole breakouts induced in arkosic sandstones and a discrete element analysis. Rock Mech. Rock Eng. 2016, 49, 1369–1388. [Google Scholar] [CrossRef]
Addis, M.; Barton, N.; Bandis, S.; Henry, J. Laboratory studies on the stability of vertical and deviated boreholes. In Proceedings of the SPE Annual Technical Conference and Exhibition, New Orleans, LA, USA, 23–26 September 1990. SPE-20406-MS. [Google Scholar]
Meier, T.; Rybacki, E.; Reinicke, A.; Dresen, G. Influence of borehole diameter on the formation of borehole breakouts in black shale. Int. J. Rock Mech. Min. Sci. 2013, 62, 74–85. [Google Scholar] [CrossRef]
Leeman, E. The treatment of stress in rock: I. The rock stress measurement: II. Borehole rock stress measuring instrument: III. The results of some rock stress investigations. JS Afr. Inst. Min. Metall. 1964, 65, 254–284. [Google Scholar]
Cox, J.W. The high resolution dipmeter reveals dip-related borehole and formation characteristics. In Proceedings of the SPWLA Annual Logging Symposium, Los Angeles, CA, USA, 3–6 May 1970. SPWLA-1970-D. [Google Scholar]
Gough, D.; Bell, J. Stress orientations from oil-well fractures in Alberta and Texas. Can. J. Earth Sci. 1981, 18, 638–645. [Google Scholar] [CrossRef]
Haimson, B.; Herrick, C. In situ stress evaluation from borehole breakouts. Experimental studies. In Proceedings of the US Symposium on Rock Mechanics; A. A. Balkema: Rotterdam, The Netherlands, 1985; Volume 26, pp. 1207–1218. [Google Scholar]
Haimson, B.; Song, I. Laboratory study of borehole breakouts in Cordova Cream: A case of shear failure mechanism. Int. J. Rock Mech. Min. Sci. Geomech. Abstr. 1993, 30, 1047–1056. [Google Scholar] [CrossRef]
Haimson, B.C.; Song, I. Borehole breakouts in Berea sandstone: Two porosity-dependent distinct shapes and mechanisms of formation. In Proceedings of the SPE/ISRM Rock Mechanics in Petroleum Engineering, Trondheim, Norway, 8–10 July 1998. SPE-47249-MS. [Google Scholar]
Zoback, M.D.; Moos, D.; Mastin, L.; Anderson, R.N. Well bore breakouts and in situ stress. J. Geophys.Res. Solid Earth 1985, 90, 5523–5530. [Google Scholar] [CrossRef]
Al-Ajmi, A.M.; Zimmerman, R.W. Stability analysis of vertical boreholes using the Mogi–Coulomb failure criterion. Int. J. Rock Mech. Min. Sci. 2006, 43, 1200–1211. [Google Scholar] [CrossRef]
Villarroel, F.; Júnior, E.; Rabello, G.; Bloch, M.; de Azevedo, V., Jr. Breakouts: Physical and numerical modeling. In Proceedings of the SPE Europec featured at EAGE Conference and Exhibition, Barcelona, Spain, 14–17 June 2010. SPE-131656-MS. [Google Scholar]
Wu, B.; Chen, Z.; Zhang, X. Stability of borehole with breakouts—An experimental and numerical modelling study. In Proceedings of the ARMA US Rock Mechanics/Geomechanics Symposium, Houston, TX, USA, 26–29 June 2016. ARMA-2016-2466. [Google Scholar]
Shen, B.; Stephansson, O.; Rinne, M. Simulation of borehole breakouts using FRACOD2D. Oil Gas Sci. Technol. 2002, 57, 579–590. [Google Scholar] [CrossRef]
Lin, H.; Kang, W.-H.; Oh, J.; Canbulat, I.; Hebblewhite, B. Numerical simulation on borehole breakout and borehole size effect using discrete element method. Int. J. Min. Sci. Technol. 2020, 30, 623–633. [Google Scholar] [CrossRef]
Zhang, H.; Yin, S.; Aadnoy, B.S. Numerical investigation of the impacts of borehole breakouts on breakdown pressure. Energies 2019, 12, 888. [Google Scholar] [CrossRef]
Xiang, Z.; Moon, T.; Si, G.; Oh, J.; Canbulat, I. Numerical Analysis of V-Shaped Borehole Breakout Using Three-Dimensional Discrete-Element Method. Rock Mech. Rock Eng. 2023, 56, 3197–3214. [Google Scholar] [CrossRef]
Haimson, B.C.; Chang, C. True triaxial strength of the KTB amphibolite under borehole wall conditions and its use to estimate the maximum horizontal in situ stress. J. Geophys. Res. Solid Earth 2002, 107, ETG 15-11–ETG 15-14. [Google Scholar] [CrossRef]
Ewy, R.; Cook, N. Deformation and fracture around cylindrical openings in rock—II. Initiation, growth and interaction of fractures. Int. J. Rock Mech. Min. Sci. Geomech. Abstr. 1990, 27, 409–427. [Google Scholar] [CrossRef]
Ewy, R.; Cook, N. Deformation and fracture around cylindrical openings in rock—I. Observations and analysis of deformations. Int. J. Rock Mech. Min. Sci. Geomech. Abstr. 1990, 27, 387–407. [Google Scholar] [CrossRef]
Martin, C.; Martino, J.; Dzik, E. Comparison of borehole breakouts from laboratory and field tests. In Proceedings of the SPE/ISRM Rock Mechanics in Petroleum Engineering, Delft, The Netherlands, 29–31 August 1994. SPE-28050-MS. [Google Scholar]
Gomar, M.; Goodarznia, I.; Shadizadeh, S.R. Transient thermo-poroelastic finite element analysis of borehole breakouts. Int. J. Rock Mech. Min. Sci. 2014, 71, 418–428. [Google Scholar] [CrossRef]
Zhou, J.; Zhang, Y.; Qiu, Y. State-of-the-art review of machine learning and optimization algorithms applications in environmental effects of blasting. Artif. Intell. Rev. 2024, 57, 5. [Google Scholar] [CrossRef]
Qiu, Y.; Zhou, J. Short-term rockburst damage assessment in burst-prone mines: An explainable XGBOOST hybrid model with SCSO algorithm. Rock Mech. Rock Eng. 2023, 56, 8745–8770. [Google Scholar] [CrossRef]
Zhou, J.; Shen, X.; Qiu, Y.; Shi, X.; Du, K. Microseismic location in hardrock metal mines by machine learning models based on hyperparameter optimization using bayesian optimizer. Rock Mech. Rock Eng. 2023, 56, 8771–8788. [Google Scholar] [CrossRef]
Sharma, A.; Burak, T.; Nygaard, R.; Hoel, E.; Kristiansen, T.; Hellvik, S.; Welmer, M. Projecting Petrophysical Logs at the Bit through Multi-Well Data Analysis with Machine Learning. In Proceedings of the SPE Offshore Europe Conference and Exhibition, Aberdeen, UK, 5–8 September 2023. D031S012R001. [Google Scholar]
Benemaran, R.S. Application of extreme gradient boosting method for evaluating the properties of episodic failure of borehole breakout. Geoenergy Sci. Eng. 2023, 226, 211837. [Google Scholar] [CrossRef]
Zhang, H.; Yin, S.; Aadnoy, B.S. Poroelastic modeling of borehole breakouts for in-situ stress determination by finite element method. J. Pet. Sci. Eng. 2018, 162, 674–684. [Google Scholar] [CrossRef]
Zhang, H.; Yin, S.; Aadnoy, B.S. Finite-element modeling of borehole breakouts for in situ stress determination. Int. J. Geomech. 2018, 18, 04018174. [Google Scholar] [CrossRef]
Zhang, H.; Yin, S. Inference of in situ stress from thermoporoelastic borehole breakouts based on artificial neural network. Int. J. Numer. Anal. Methods Geomech. 2019, 43, 2493–2511. [Google Scholar] [CrossRef]
Kocev, D.; Džeroski, S.; White, M.D.; Newell, G.R.; Griffioen, P. Using single-and multi-target regression trees and ensembles to model a compound index of vegetation condition. Ecol. Model. 2009, 220, 1159–1168. [Google Scholar] [CrossRef]
Tuia, D.; Verrelst, J.; Alonso, L.; Pérez-Cruz, F.; Camps-Valls, G. Multioutput support vector regression for remote sensing biophysical parameter estimation. IEEE Geosci. Remote Sens. Lett. 2011, 8, 804–808. [Google Scholar] [CrossRef]
Burnham, A.J.; MacGregor, J.F.; Viveros, R. Latent variable multivariate regression modeling. Chemom. Intell. Lab. Syst. 1999, 48, 167–180. [Google Scholar] [CrossRef]
Borchani, H.; Varando, G.; Bielza, C.; Larranaga, P. A survey on multi-output regression. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2015, 5, 216–233. [Google Scholar] [CrossRef]
Soroush, H. A Multilayer Perceptron Neural Network Model to Predict Borehole Breakouts Full Geometry using Rock Properties. In Proceedings of the ARMA US Rock Mechanics/Geomechanics Symposium, Virtual, 28 June–1 July 2020. ARMA-2020-1440. [Google Scholar]
Lin, H.; Singh, S.; Oh, J.; Canbulat, I.; Kang, W.-H.; Hebblewhite, B.; Stacey, T.R. A combined approach for estimating horizontal principal stress magnitudes from borehole breakout data via artificial neural network and rock failure criterion. Int. J. Rock Mech. Min. Sci. 2020, 136, 104539. [Google Scholar] [CrossRef]
Jolfaei, S.; Lakirouhani, A. Sensitivity analysis of effective parameters in borehole failure, using neural network. Adv. Civ. Eng. 2022, 2022, 4958004. [Google Scholar] [CrossRef]
Lin, H.; Singh, S.K.; Xiang, Z.; Kang, W.H.; Raval, S.; Oh, J.; Canbulat, I. An investigation of machine learning techniques to estimate minimum horizontal stress magnitude from borehole breakout. Int. J. Min. Sci. Technol. 2022, 32, 1021–1029. [Google Scholar] [CrossRef]
Zhang, H.; Wu, B.; Nie, Y.; Zhang, X.; Chen, Z. Prediction of in-situ stresses by using machine learning and intelligent optimization algorithms. In Proceedings of the ARMA US Rock Mechanics/Geomechanics Symposium, Atlanta, GA, USA, 25–28 June 2023. ARMA-2023-0453. [Google Scholar]
Agatonovic-Kustrin, S.; Beresford, R. Basic concepts of artificial neural network (ANN) modeling and its application in pharmaceutical research. J. Pharm. Biomed. Anal. 2000, 22, 717–727. [Google Scholar] [CrossRef]
Zupan, J. Introduction to artificial neural network (ANN) methods: What they are and how to use them. Acta Chim. Slov. 1994, 41, 327. [Google Scholar]
Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Qi, Y. Random forest for bioinformatics. In Ensemble Machine Learning: Methods and Applications; Springer: New York, NY, USA, 2012; pp. 307–323. [Google Scholar]
Zhang, T.; Zhang, X.; Liu, Y.; Chow, Y.; Iu, H.; Fernando, T. Long-term energy and peak power demand forecasting based on sequential-XGBoost. IEEE Trans. Power Syst. 2023, 39, 3088–3104. [Google Scholar] [CrossRef]
Trojovský, P.; Dehghani, M. A new bio-inspired metaheuristic algorithm for solving optimization problems based on walruses behavior. Sci. Rep. 2023, 13, 8775. [Google Scholar] [CrossRef] [PubMed]
Duan, K.; Kwok, C. Evolution of stress-induced borehole breakout in inherently anisotropic rock: Insights from discrete element modeling. J. Geophys. Res. Solid Earth 2016, 121, 2361–2381. [Google Scholar] [CrossRef]
Zhang, L.; Zhang, H.; Hu, K.; Chen, Z.; Yin, S. Thermoporoelastoplastic Wellbore Breakout Modeling by Finite Element Method. Mining 2022, 2, 52–64. [Google Scholar] [CrossRef]
Lin, H.; Oh, J.; Canbulat, I.; Stacey, T. Experimental and analytical investigations of the effect of hole size on borehole breakout geometries for estimation of in situ stresses. Rock Mech. Rock Eng. 2020, 53, 781–798. [Google Scholar] [CrossRef]
Herrick, C.G.; Haimson, B.C. Modeling of episodic failure leading to borehole breakouts in Alabama limestone. In Proceedings of the ARMA North America Rock Mechanics Symposium, Austin, TX, USA, 1–3 June 1994. ARMA-1994-0217. [Google Scholar]
Haimson, B.; Lee, H. Borehole breakouts and compaction bands in two high-porosity sandstones. Int. J. Rock Mech. Min. Sci. 2004, 41, 287–301. [Google Scholar] [CrossRef]
Lee, H.; Haimson, B. Borehole breakouts and in-situ stress in sandstones. In Proceedings of the In-Situ Rock Stress: International Symposium on In-Situ Rock Stress, Trondheim, Norway, 19–21 June 2006; p. 201. [Google Scholar]
Zhou, J.; Zhang, R.; Qiu, Y.; Khandelwal, M. A true triaxial strength criterion for rocks by gene expression programming. J. Rock Mech. Geotech. Eng. 2023, 15, 2508–2520. [Google Scholar] [CrossRef]
Zhou, J.; Chen, Y.; Li, C.; Qiu, Y.; Huang, S.; Tao, M. Machine learning models to predict the tunnel wall convergence. Transp. Geotech. 2023, 41, 101022. [Google Scholar] [CrossRef]
Wang, Z.; Zhou, J.; Du, K.; Khandelwal, M. Enhanced multi-task learning models for pile drivability prediction: Leveraging metaheuristic algorithms and statistical evaluation. Transp. Geotech. 2024, 47, 101288. [Google Scholar] [CrossRef]
Li, C.; Zhou, J.; Du, K.; Armaghani, D.J.; Huang, S. Prediction of flyrock distance in surface mining using a novel hybrid model of harris hawks optimization with multi-strategies-based support vector regression. Nat. Resour. Res. 2023, 32, 2995–3023. [Google Scholar] [CrossRef]
Li, E.; Zhang, N.; Xi, B.; Yu, Z.; Fissha, Y.; Taiwo, B.O.; Segarra, P.; Feng, H.B.; Zhou, J. Analysis and modelling of gas relative permeability in reservoir by hybrid KELM methods. Earth Sci. Inform. 2024, 1–28. [Google Scholar] [CrossRef]
Qiu, Y.; Zhou, J.; Khandelwal, M.; Yang, H.; Yang, P.; Li, C. Performance evaluation of hybrid WOA-XGBoost, GWO-XGBoost and BO-XGBoost models to predict blast-induced ground vibration. Eng. Comput. 2022, 38, 4145–4162. [Google Scholar] [CrossRef]
Zhou, J.; Dai, Y.; Huang, S.; Armaghani, D.J.; Qiu, Y. Proposing several hybrid SSA—Machine learning techniques for estimating rock cuttability by conical pick with relieved cutting modes. Acta Geotech. 2023, 18, 1431–1446. [Google Scholar] [CrossRef]
Yang, P.; Yong, W.; Li, C.; Peng, K.; Wei, W.; Qiu, Y.; Zhou, J. Hybrid random forest-based models for earth pressure balance tunneling-induced ground settlement prediction. Appl. Sci. 2023, 13, 2574. [Google Scholar] [CrossRef]
Wang, Z.; Zhou, J.; Peng, K. The Potential of Multi-Task Learning in CFDST Design: Load-Bearing Capacity Design with Three MTL Models. Materials 2024, 17, 1994. [Google Scholar] [CrossRef] [PubMed]
Zhang, Y.L.; Qiu, Y.G.; Armaghsni, D.J.; Monjezi, M.; Zhou, J. Enhancing rock fragmentation prediction in mining operations: A Hybrid GWO-RF model with SHAP interpretability. J. Cent. South Univ. 2024, 1–14. [Google Scholar] [CrossRef]
Qiu, Y.; Zhou, J.; He, B.; Armaghani, D.J.; Huang, S.; He, X. Evaluation and interpretation of blasting-induced tunnel overbreak: Using heuristic-based ensemble learning and gene expression programming techniques. Rock Mech. Rock Eng. 2024, 1–29. [Google Scholar] [CrossRef]
Guan, J.; Yu, Z.; Liao, Y.; Tang, R.; Duan, M.; Han, G. Predicting Critical Path of Labor Dispute Resolution in Legal Domain by Machine Learning Models Based on SHapley Additive exPlanations and Soft Voting Strategy. Mathematics 2024, 12, 272. [Google Scholar] [CrossRef]
Qiu, Y.; Zhou, J. Novel rockburst prediction criterion with enhanced explainability employing CatBoost and nature-inspired metaheuristic technique. Undergr. Space 2024, 19, 101–118. [Google Scholar] [CrossRef]

Figure 1. Dog-ear shaped borehole breakout.

Figure 2. Multi-output ANN model with two hidden layers.

Figure 3. Algorithm flow of random forest regressor predict.

Figure 4. Flow chart of WAOA.

Figure 5. Distribution of borehole breakout angles and depths [16,28,29,30,37,47,48,49,50,51].

Figure 6. Correlation and distribution of inputs and outputs: (a) correlation; (b) distribution.

Figure 7. Framework for predicting borehole breakout size.

Figure 8. Curve of model fitness with the number of iterations for different population size settings: (a) WAOA-ANN

θ

prediction; (b) WAOA-ANN

r_{b} / r

prediction; (c) WAOA-RF

θ

prediction; (d) WAOA-RF

r_{b} / r

prediction; (e) WAOA-XGBoost

θ

prediction; (f) WAOA-XGBoost

r_{b} / r

.

Figure 8. Curve of model fitness with the number of iterations for different population size settings: (a) WAOA-ANN

θ

prediction; (b) WAOA-ANN

r_{b} / r

prediction; (c) WAOA-RF

θ

prediction; (d) WAOA-RF

r_{b} / r

prediction; (e) WAOA-XGBoost

θ

prediction; (f) WAOA-XGBoost

r_{b} / r

.

Figure 9. Comprehensive comparison of prediction performance of hybrid multi-output models: (a) WAOA-ANN; (b) WAOA-RF; (c) WAOA-XGBoost; (d) legend.

Figure 10. Scatter plots of actual and predicted values of

θ

and

r_{b} / r

for training and testing datasets: (a) WAOA-ANN

θ

prediction; (b) WAOA-ANN

r_{b} / r

prediction; (c) WAOA-RF

θ

prediction; (d) WAOA-RF

r_{b} / r

prediction; (e) WAOA-XGBoost

θ

prediction; (f) WAOA-XGBoost

r_{b} / r

.

Figure 10. Scatter plots of actual and predicted values of

θ

and

r_{b} / r

for training and testing datasets: (a) WAOA-ANN

θ

prediction; (b) WAOA-ANN

r_{b} / r

prediction; (c) WAOA-RF

θ

prediction; (d) WAOA-RF

r_{b} / r

prediction; (e) WAOA-XGBoost

θ

prediction; (f) WAOA-XGBoost

r_{b} / r

.

Figure 11. Taylor diagram of test set performance.

Figure 12. Average impact on model output of

θ

and

r_{b} / r

.

Figure 12. Average impact on model output of

θ

and

r_{b} / r

.

Figure 13. Influence results of each parameter on

θ

and

r_{b} / r

prediction.

Figure 13. Influence results of each parameter on

θ

and

r_{b} / r

prediction.

Table 2. Statistical description of features in the datasets.

Variables	Unit	FEM			Experiment
Variables	Unit	Min	Max	Mean	Min	Max	Mean
$θ$	°	0	81	45.82	25	138	68.04
$\frac{r_{b}}{r}$		0	2.19	0.49	0.02	1.53	0.52
$r$	cm	10	15	13.61	0.8	1.5	1.1
$σ_{H}$	MPa	50	200	103.19	35.4	93.2	61.74
$σ_{v}$	MPa	30	80	47.83	5	50	31.67
$σ_{h}$	MPa	20	85	48.61	10	40	23.54
$P_{w}$	MPa	0	45	19.03	0	0	0
$ν$		0.2	0.25	0.21	0.17	0.27	0.22
$E$	GPa	14.4	59	26.83	6.66	27	15.12
$c$	MPa	6	55	18.66	8.92	15.5	11.83
$φ$	°	32.5	52.5	39.13	18	39.7	31.54

Table 3. Hyperparameter settings and optimization ranges.

Method	Hyperparameters	Range	Parameter Meaning
ANN	Learning rate	[0.00001, 0.1]	Learning rate of the optimizer
	Num dense layers	[1, 10]	Number of hidden layers
	Num dense nodes	[5, 110]	Number of nodes for each layer
	Activation	Relu	Activation function
RF	n_estimators	[1, 1100]	The number of trees
RF	max_depth	[10, 1100]	The maximum tree depth
XGBoost	max_depth	[1, 10]	The maximum tree depth
	learning_rate	(0, 1]	Rate of iteration
	n_estimators	[1, 500]	The number of trees

Table 4. Optimal parameters obtained by WAOA for different Models.

Model	Hyperparameters	Pop Size
Model	Hyperparameters	10	20	40	80	160
WAOA-ANN	Learning rate	$4.8307 \times 10^{- 4}$	$5.4203 \times 10^{- 4}$	$3.2513 \times 10^{- 4}$	$8.0645 \times 10^{- 4}$	$7.0801 \times 10^{- 4}$
	Num dense layers	4	5	6	3	1
	Num dense nodes	100	83	51	89	82
WAOA-RF	n_estimators	274	478	95	1000	93
WAOA-RF	max_depth	510	273	48	1000	949
WAOA-XGBoost	max_depth	3	4	4	4	4
	n_estimators	500	444	500	374	239
	learning_rate	0.3412	0.2143	0.2275	0.1626	0.4481

Table 5. Evaluation metrics performance of different WAOA-ANN models.

Pop Size	Training
	R² (Score)		MAE (Score)		VAF (Score)		RMSE (Score)
	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$
10	0.9906(5)	0.9839(4)	0.0095(5)	0.0167(5)	0.9803(5)	0.9679(4)	0.018(5)	0.0284(4)
20	0.9885(4)	0.9825(2)	0.0107(4)	0.0168(3)	0.9766(4)	0.9645(2)	0.0195(4)	0.0298(2)
40	0.9867(2)	0.9828(3)	0.0127(2)	0.0175(2)	0.9726(2)	0.9651(3)	0.0212(2)	0.0296(3)
80	0.9874(3)	0.9847(5)	0.0113(3)	0.0167(4)	0.9742(3)	0.969(5)	0.0205(3)	0.0279(5)
160	0.9834(1)	0.9809(1)	0.0132(1)	0.0183(1)	0.9662(1)	0.9613(1)	0.0235(1)	0.0312(1)
	Test
	R² (score)		MAE (score)		VAF (score)		RMSE (score)
	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$
10	0.9819(5)	0.978(2)	0.0142(5)	0.0208(3)	0.9629(5)	0.9556(2)	0.0244(5)	0.0334(2)
20	0.9813(4)	0.9758(1)	0.0157(3)	0.0212(2)	0.9628(4)	0.9511(1)	0.0244(4)	0.035(1)
40	0.9791(2)	0.9803(5)	0.0166(2)	0.0187(5)	0.9572(2)	0.9601(5)	0.0262(2)	0.0316(5)
80	0.9804(3)	0.9784(3)	0.0153(4)	0.0203(4)	0.9601(3)	0.9567(3)	0.0253(3)	0.033(3)
160	0.9742(1)	0.9787(4)	0.0168(1)	0.022(1)	0.9489(1)	0.9574(4)	0.0286(1)	0.0327(4)

Table 6. Evaluation metrics performance of different WAOA-RF models.

Pop Size	Training
	R² (Score)		MAE (Score)		VAF (Score)		RMSE (Score)
	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$
10	0.9936(4)	0.9943(1)	0.0084(3)	0.0096(1)	0.9862(4)	0.9877(1)	0.015(4)	0.0176(1)
20	0.9934(3)	0.9948(5)	0.0083(4)	0.0093(5)	0.9857(3)	0.9885(5)	0.0153(3)	0.017(5)
40	0.9932(2)	0.9947(4)	0.0085(2)	0.0095(2)	0.9852(2)	0.9884(4)	0.0155(2)	0.0171(4)
80	0.9938(5)	0.9946(3)	0.0083(5)	0.0093(4)	0.9864(5)	0.9881(3)	0.0149(5)	0.0173(3)
160	0.9929(1)	0.9944(2)	0.0086(1)	0.0094(3)	0.9848(1)	0.9879(2)	0.0158(1)	0.0174(2)
	Test
	R² (score)		MAE (score)		VAF (score)		RMSE (score)
	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$
10	0.9699(4)	0.9642(1)	0.018(3)	0.0226(1)	0.9382(4)	0.9195(1)	0.0314(4)	0.0449(1)
20	0.9689(3)	0.9661(5)	0.0177(4)	0.0218(5)	0.9367(3)	0.9223(4)	0.0318(3)	0.0441(4)
40	0.968(1)	0.9651(2)	0.0184(1)	0.0224(2)	0.9349(1)	0.9201(2)	0.0323(1)	0.0448(2)
80	0.9704(5)	0.9657(3)	0.0174(5)	0.0218(4)	0.9395(5)	0.9219(3)	0.0311(5)	0.0443(3)
160	0.9685(2)	0.966(4)	0.0181(2)	0.0221(3)	0.9358(2)	0.9237(5)	0.0321(2)	0.0438(5)

Table 7. Evaluation metrics performance of different WAOA-XGBoost models.

Pop Size	Training
	R² (Score)		MAE (Score)		VAF (Score)		RMSE (Score)
	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$
10	0.9999(5)	0.9999(5)	0.0009(5)	0.0008(5)	0.9999(5)	0.9999(5)	0.0013(5)	0.0019(5)
20	0.9995(1)	0.9996(2)	0.0027(1)	0.0029(2)	0.999(1)	0.9992(2)	0.004(1)	0.0044(2)
40	0.9997(3)	0.9996(1)	0.0023(4)	0.003(1)	0.9993(3)	0.9992(1)	0.0033(3)	0.0045(1)
80	0.9995(2)	0.9996(3)	0.0026(2)	0.0029(3)	0.9991(2)	0.9992(3)	0.0039(2)	0.0044(3)
160	0.9997(4)	0.9997(4)	0.0023(3)	0.0025(4)	0.9993(4)	0.9995(4)	0.0033(4)	0.0037(4)
	Test
	R² (score)		MAE (score)		VAF (score)		RMSE (score)
	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$	$θ$	$r_{b} / r$
10	0.9392(1)	0.96(1)	0.0261(1)	0.0262(1)	0.8819(1)	0.9194(1)	0.0435(1)	0.045(1)
20	0.976(3)	0.9816(5)	0.0157(3)	0.0164(5)	0.9524(3)	0.9628(5)	0.0276(3)	0.0305(5)
40	0.9796(5)	0.9748(2)	0.0151(4)	0.0187(3)	0.9594(5)	0.949(2)	0.0255(5)	0.0358(2)
80	0.9776(4)	0.9789(4)	0.0149(5)	0.0172(4)	0.9555(4)	0.9573(4)	0.0267(4)	0.0327(4)
160	0.973(2)	0.9756(3)	0.016(2)	0.0195(2)	0.9462(2)	0.9501(3)	0.0293(2)	0.0354(3)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, R.; Zhou, J.; Tao, M.; Li, C.; Li, P.; Liu, T. Borehole Breakout Prediction Based on Multi-Output Machine Learning Models Using the Walrus Optimization Algorithm. Appl. Sci. 2024, 14, 6164. https://doi.org/10.3390/app14146164

AMA Style

Zhang R, Zhou J, Tao M, Li C, Li P, Liu T. Borehole Breakout Prediction Based on Multi-Output Machine Learning Models Using the Walrus Optimization Algorithm. Applied Sciences. 2024; 14(14):6164. https://doi.org/10.3390/app14146164

Chicago/Turabian Style

Zhang, Rui, Jian Zhou, Ming Tao, Chuanqi Li, Pingfeng Li, and Taoying Liu. 2024. "Borehole Breakout Prediction Based on Multi-Output Machine Learning Models Using the Walrus Optimization Algorithm" Applied Sciences 14, no. 14: 6164. https://doi.org/10.3390/app14146164

APA Style

Zhang, R., Zhou, J., Tao, M., Li, C., Li, P., & Liu, T. (2024). Borehole Breakout Prediction Based on Multi-Output Machine Learning Models Using the Walrus Optimization Algorithm. Applied Sciences, 14(14), 6164. https://doi.org/10.3390/app14146164

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Borehole Breakout Prediction Based on Multi-Output Machine Learning Models Using the Walrus Optimization Algorithm

Abstract

1. Introduction

2. Methodologies

2.1. Artificial Neural Network (ANN)

2.2. Random Forest (RF)

2.3. Extreme Gradient Boosting (XGBoost)

2.4. The Walrus Optimization Algorithm (WAOA)

3. Data Description and Evaluation Metrics

4. Result and Discussion

4.1. Model Training

4.1.1. Determination of Optimal WAOA-ANN Models

4.1.2. Determination of Optimal WAOA-RF Models

4.1.3. Determination of Optimal WAOA-XGBoost Models

4.2. Model Comparison

4.3. Sensitivity Analysis

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI