Research on the Collaborative Safety Optimization of Underground Mine Workings and Surface Roads Based on Machine Learning

Deng, Tao; Chen, Haoyu; Peng, Shouxing; Xia, Xiangsheng; Wang, Guangjin; Chen, Tao; Zhang, Lingling

doi:10.3390/app16115178

Open AccessArticle

Research on the Collaborative Safety Optimization of Underground Mine Workings and Surface Roads Based on Machine Learning

by

Tao Deng

^1,*,

Haoyu Chen

¹,

Shouxing Peng

²,

Xiangsheng Xia

²,

Guangjin Wang

¹,

Tao Chen

² and

Lingling Zhang

¹

Faculty of Land Resources Engineering, Kunming University of Science and Technology, Kunming 650500, China

²

Pangang Group Mining Company Limited, Panzhihua 617063, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2026, 16(11), 5178; https://doi.org/10.3390/app16115178

Submission received: 4 April 2026 / Revised: 19 May 2026 / Accepted: 20 May 2026 / Published: 22 May 2026

(This article belongs to the Special Issue Recent Advances in Geotechnical Structures and Soil–Structure Interaction)

Download

Browse Figures

Versions Notes

Featured Application

This work applies a machine learning–-based framework to optimize mining site and road safety, dynamically adjusting pillar and sidewall parameters to balance economic benefits and infrastructure protection. It offers practical applications in real-time mine planning and risk management.

Abstract

Managing surface subsidence caused by underground mining beneath critical infrastructure requires highly efficient and robust optimization models. This study presents a data-driven, multi-objective framework for the collaborative safety optimization of mine stopes and overlying roads, applied to a quartzite mine beneath a secondary highway in Yunnan Province. Based on 88 FLAC3D simulation samples, an XGBoost surrogate model was developed to predict geomechanical responses (Z-displacement and extraction volume), while a Bayesian-optimized Long Short-Term Memory (BO-LSTM) network was employed to forecast ore price signals. To ensure model generalizability and mitigate overfitting in the limited stope dataset, a 5-fold cross-validation (5-fold CV protocol was systematically incorporated into the model training process. Critically, the CV-supported predictive models underwent stress testing under varying training data sizes (40–90%), Gaussian noise intensities (1–13%), and outlier distributions (5–20% proportion; 10–50%amplitude) to define the boundaries of algorithmic reliability. The NSGA-II algorithm was used to map the Pareto-optimal frontier, which was deployed via a Tkinter graphical user interface for dynamic stope geometry adjustments driven by price fluctuations. Secondary FLAC3D validation of the recommended parameters (21.44 m pillar, 1.60 m sidewall) yielded minimal relative errors of 5.07% for displacement and 2.38% for extraction volume. This validated framework demonstrates numerical robustness while balancing geotechnical safety and dynamic market rewards.

Keywords:

underground mining; machine learning; road safety; multi-objective optimization; critical infrastructure protection

1. Introduction

As underground mining progresses to greater depths, surface subsidence and cracking increasingly threaten the safety and functionality of overlying roads. Surface highways, as critical infrastructure for mineral transport and public traffic, are particularly vulnerable. Mining-induced subsidence can cause uneven pavement settlement, cracking, and localized collapse, increasing accident risks, impairing road function, and raising maintenance costs [1,2]. Ensuring road stability while optimizing mining structures is therefore essential for mining safety and transportation continuity [3].

Traditional multi-objective parameter optimization relies mainly on empirical formulas or repeated numerical simulations. Discrete combinations of barrier pillars and sidewalls are tested to evaluate surface subsidence and economic returns, and the optimal scheme is selected by comparison [4]. However, as the number of variables increases, the combinatorial space grows exponentially, requiring extensive repeated simulations. These methods also depend on subjective experience, often yielding only local optima and failing to capture the nonlinear interactions between parameters and objectives [5].

Recent advances in artificial intelligence have transformed multi-objective optimization. Machine learning (ML) can construct high-fidelity datasets from multiple sources, including geological, mining, and geomechanical data, revealing complex relationships among pillar and sidewall dimensions, surface subsidence, and economic performance [6]. ML enables rapid global identification of Pareto-optimal solutions, reducing computational costs and overcoming limitations of empirical design.

Reasonable design of barrier pillars is crucial for both safety and economic efficiency. Numerous studies have investigated optimal dimensions and configurations: Hu et al. [7] studied safety pillar optimization in combined open-pit and underground mining; Li et al. [8] evaluated pillar stability using limit equilibrium and simulation methods; Ma et al. [9] and Xu et al. [10] focused on roof pillar stability and thickness; Long et al. [11] analyzed stress and displacement under different thicknesses; Lu et al. [12] found that a 20 m pillar minimizes roof deformation. Zhang et al. [13] reviewed empirical, numerical, statistical, and AI-based methods, highlighting the benefits of integrating data-driven and theoretical approaches. Xu et al. [14] investigated artificial pillar substitution during the open-pit to underground transition, and Babanouri et al. [15] compared recovery methods via 3D simulations, confirming the safety and controllability of the “split and fender” strategy.

Analytical and mechanical models have also been widely applied. Guo et al. [16] established a plane strain elasticity model for mine walls; Wang et al. [17] used cusp catastrophe theory to study wall instability and proposed thickness reduction strategies; Li et al. [18] derived stress expressions for horizontal pillars based on thick-plate theory and verified a safe thickness of 20 m; Xie et al. [19] developed stability analysis models under longitudinal and transverse loads using small-deflection elastic plate bending theory.

Most existing studies focus on isolated structural stability, often neglecting the impact of parameter adjustments on ore yield and economic returns, which may result in conservative or suboptimal designs. To address these limitations, this study proposes an ML-based multi-objective optimization framework using simulation-generated datasets and dual objectives: controlling surface subsidence and maximizing economic benefits. Intelligent algorithms perform global optimization, providing a foundation for safe and efficient design of underground mining structures beneath highways [20,21,22,23,24,25].

Research on surface highway safety above mining areas has also progressed. Strategic planning frameworks have been proposed to manage overlapping mining and highway zones [26]; predictive modeling using MITSOUKO software and stochastic methods has been employed to forecast road settlement [27,28]; finite element simulations have been used to study subsidence under complex geological conditions [29]; neural network-based adaptive prediction models have further improved forecast accuracy, reducing errors in real-time monitoring [30,31,32]. These studies provide important references for model validation and optimization.

In summary, as underground mining reaches deeper strata, surface subsidence and cracking increasingly threaten road networks. Optimizing barrier pillars and sidewall thickness is essential for both safety and economic efficiency. By integrating machine learning, numerical simulation, and economic analysis, this study proposes a multi-objective optimization framework to coordinate underground mining with surface highway safety, offering new theoretical and technical references for the industry.

2. Geological Survey and Rock Mechanics Parameter Assessment in Mining Regions

2.1. Overview of Geological Engineering Conditions

As shown in Figure 1, the silicon stone mine in Yunnan is situated in medium-grained biotite granite of the Late Caledonian to Mid-Hercynian period, characterized by hard rock with well-developed joints and surface weathering. The V1 ore body lies shallowly beneath a secondary highway, requiring strict control of surface subsidence. Complex hydrogeological conditions, including porous aquifers and interconnected joint-fracture systems, weaken surrounding rock stability. To ensure efficient extraction and surface protection, the mining layout follows the ore strike, with key parameters: block length 50 m, mining room length 44 m, block height 60 m, pillar width 6 m, bottom pillar height 6 m, and top pillar height 4 m. Security pillars and protective walls are placed at critical locations to enhance mine stability.

2.2. Parameters for Rock Mechanics Evaluation

After collecting samples from the underground mine, as depicted in Figure 2, uniaxial compression and direct shear tests were performed to evaluate the rock’s mechanical properties. These tests allowed for the determination of critical parameters, such as compressive strength and shear strength. Following this, the rock mechanics parameters were further calculated using the Geological Strength Index (GSI) and the Hoek–Brown strength criterion, taking into account the geological structure of the rock mass. The detailed values are presented in Table 1. These results serve as a solid numerical basis for the subsequent stability analysis of the rock mass.

3. Design of Collaborative Optimization Scheme for Mining Site and Road Based on Intelligent Optimization and Sample Construction

In underground mining, the design of the mining site structure directly affects resource recovery and the safety of surface facilities, especially when the mining site is located directly beneath a secondary highway. The stability of the mining site is critical to highway safety. A reasonable design of isolation pillars and sidewall thickness not only affects the stability of the mining site but also determines the level of protection for the highway. Traditional design methods often focus on a single objective, neglecting the balance between economic benefits and safety. By combining machine learning and multi-objective optimization methods, the impact of mining site design parameters on both safety and economic benefits can be accurately predicted. This study uses intelligent optimization of isolation pillar and sidewall thickness to ensure safe mining and highway protection, providing technical support for underground mining operations.

3.1. Scheme Design

In the data preparation stage, this study adheres to the principles of systematization and reproducibility, establishing a database through the combined optimization of numerical simulations and machine learning. The database is constructed in accordance with the geological conditions and design requirements of the underground mining project. As shown in Table 2, the isolation pillar thickness (ranging from 18 to 25 m) and the sidewall thickness (ranging from 0.5 to 3 m) are selected as the main control parameters. A total of 88 parameter combinations are generated to evaluate the safety (using surface subsidence as the safety indicator) and economic benefits (using mining volume as the core variable) under different parameters. Combining nearly 20 years of silicon stone price data, a “Mining Volume-Price Prediction” economic benefit calculation system is established, and the mining strategy is adjusted based on real-time prices. When prices rise, the mining volume is increased, and the pillar and sidewall thicknesses are adjusted accordingly. When prices fall, the mining volume is reduced, and the mining scale is appropriately shrunk, while ensuring highway safety. Through the FLAC3D numerical simulation model, ore extraction volume and surface subsidence are used as supervisory indicators, while mining engineering parameters serve as feature variables, ensuring the model accurately simulates the actual mining operations. The simulation results are used to train the machine learning model, optimize the mining site structure design, and improve prediction accuracy. All input variables are standardized and normalized during the data processing phase to ensure data consistency and the effectiveness of model training.

3.2. Technical Approach

3.2.1. Data Preparation—Constructing the Sample Training Set

Surface subsidence—mining site parameter combination sample training set construction: Based on the scheme design table shown in Table 2, a corresponding numerical model is established using FLAC3D software (Flac3D6.00.72) to obtain the surface subsidence values for each model under different parameter combinations.
Economic benefit—mining site parameter combination sample training set construction: Based on the scheme design table shown in Table 2, the recoverable ore quantity is calculated under different parameter combinations, which is used as the economic benefit value.
Ore price—time sample training set construction: Monthly framework prices from 2020 to 2025 are collected to obtain the ore prices for different time periods.

3.2.2. Establishment of Prediction Models

Surface Subsidence Prediction Model: This model takes the underground mining engineering structure parameters as inputs and predicts the resulting surface subsidence.

Mining Volume Prediction Model: This model uses the underground mining engineering structure parameters as inputs to predict the corresponding mining volume.

Economic Benefit Prediction Model: This model takes underground mining engineering parameters (such as isolation pillar thickness and sidewall thickness) along with the target mining date as input variables. By integrating the ore price-time forecast curve, it predicts the potential economic benefits.

3.2.3. Intelligent Optimization of Underground Mining Engineering Structural Parameters Based on Machine Learning

Environment Setup

In machine learning algorithms, the environment defines how the agent interacts with its surroundings. In the case of multi-objective optimization, setting up the environment is crucial. It is important to clearly define the state, action, and reward functions:

State: Represents the current combination of mining structure parameters (e.g., isolation pillar thickness, sidewall thickness).

Action: Refers to how the agent adjusts the mining structure parameters, such as increasing or decreasing them.

Reward Function: The reward should account for both economic benefits and safety levels.

2.: Multi-Objective Optimization Execution

Using the trained models for predicting economic benefits and safety levels, a combination of multi-objective optimization and machine learning algorithms is employed to identify the optimal mining structure parameters. This approach aims to maximize economic benefits while adhering to safety constraints. Additionally, the price prediction model is integrated to dynamically adjust mining parameters based on real-time fluctuations in ore prices.

3.: Verification and Optimization

To ensure the accuracy of the dual-objective optimization results (economic benefits and safety), the following evaluation procedure is adopted: “Numerical Simulation Verification → Economic Verification → Correlation Analysis.” Ongoing integration of real-world application feedback helps refine the algorithms and parameters, enhancing the reliability and precision of the optimization outcomes.

(1): Numerical Simulation Verification

In FLAC3D, a three-dimensional slope stability model is developed using the optimized mining parameters (e.g., isolation pillar thickness and sidewall thickness). This model is used to compute surface subsidence and assess the safety levels.

(2): Economic Verification

Economic benefits are recalculated using the optimized mining parameters (isolation pillar thickness and sidewall thickness) to verify the accuracy of the predicted economic performance.

3.3. Construction of Surface Subsidence and Economic Benefit Sample Training Set

Construction of Surface Subsidence Sample Training Set

Based on the geological profiles and the design parameters provided in Table 2, a numerical model of the underground mine and the surface was constructed, as shown in Figure 3. The numerical model has dimensions of 625 m in the X direction, 425 m in the Y direction, and 169.5 m in the Z direction, with the ore body having a thickness of 7.4 m and a length of 120 m. The mesh size in the ore body region was set to 2 m × 2 m, while the surrounding area was also meshed with 2 m × 2 m elements. Subsequently, a series of numerical simulations was carried out under various parameter combinations to quantify ore recovery. The 3D geometric meshes generated in the Rhino environment were successfully imported into FLAC3D for stability analysis. The model employed the Mohr–Coulomb constitutive criterion, with the lateral and bottom boundaries fixed during the calculations. Finally, the displacement data obtained from the different scenarios were systematically compiled into a comprehensive training database for the machine learning phase.

2.: Numerical Calculation Analysis

Through the analysis of Figure 4, the surface displacement under different isolation pillar sidewall thicknesses was obtained. Figure 4a,c and e show the surface displacement when the isolation pillar thickness is 18 m, 19 m, and 20 m, respectively, with a sidewall thickness of 1 m. The displacements are 19.29 cm in Figure 4a, 18.25 cm in Figure 4c, and 16.64 cm in Figure 4e. Figure 4b,d and f show the displacement when the isolation pillar is 19 m, and the sidewall thicknesses are 0.5 m, 1 m, and 1.5 m, respectively. The displacements are 46.89 cm in Figure 4b, 18.25 cm in Figure 4d, and 11.08 cm in Figure 4f. Overall, as the sidewall thickness of the isolation pillar increases, the surface displacement shows a decreasing trend. Notably, when the sidewall thickness is 1.5 m, the surface displacement decreases significantly, indicating that increasing the sidewall thickness helps reduce surface displacement.

3.: Construction of the Economic Benefit Sample Training Set

In underground mining engineering, simply pursuing the maximization of mining volume does not equate to optimal economic benefits. Actual economic benefits are determined by a dynamic coupled system involving both mining volume (engineering output) and ore price (market value). During the calculation process, ore price data from 2000 to 2025 were collected. Given that the silicon stone price has shown significant temporal fluctuations over the past 25 years, this study constructs a three-dimensional data structure based on “structural parameters—mining volume—price time series” with the aim of generating a high-quality training dataset that includes “economic benefits” as the core objective variable.

The economic benefit calculation logic defined in this study is as follows:

E = V (T, S) \times P (t)

(1)

In the equation, E represents the economic benefit, V is the mining volume determined through numerical simulation (affected by the isolation pillar thickness and sidewall thickness), and

P (t)

is the forecasted market price of the ore for the target mining period.

A portion of the sample training sets for surface subsidence, mining volume, and ore price is shown in Table 3 and Table 4. This dataset not only reflects the impact of structural parameters on production but also quantifies the nonlinear effects of market fluctuations on economic benefits by incorporating price forecasts, providing data support for subsequent optimization calculations.

4. Intelligent Optimization of Underground Mining Site Structural Parameters Based on Multi-Objective Optimization and Machine Learning Methods

4.1. Principles of the LSTM Algorithm

Long Short-Term Memory (LSTM) networks are an advanced evolution of standard Recurrent Neural Networks (RNNs), specifically designed to overcome the inherent challenges of modeling long-term temporal dependencies. Traditional RNNs often encounter vanishing or exploding gradient problems when processing long sequences, whereas LSTMs mitigate these issues through an innovative architecture that includes memory cells and regulatory gates. This structure enables the network to retain critical information over extended time intervals.

As shown in Figure 5, the operational core of an LSTM is the “cell state,” which acts as a persistent repository controlled by three main functional units: the forget gate, the input gate, and the output gate. These gates collaboratively regulate the flow and updating of information, a process mathematically formulated through a series of interconnected state equations.

Forget Gate: The forget gate controls how much of the information will be “forgotten.” Its output is determined by the current input

x_{t}

and the previous hidden state

h_{t - 1}

:

f_{t} = σ (W_{f} \cdot [h_{t - 1}, x_{t}] + b_{f})

(2)

Here, σ represents the sigmoid activation function,

W_{f}

is the weight matrix for the forget gate, and

b_{f}

is the bias term.

Input Gate: The input gate controls how much of the current input information will be stored in the memory cell. The calculation formula is as follows:

i_{t} = σ (W_{i} \cdot [h_{t - 1}, x_{t}] + b_{i})

(3)

Here, i represents the output of the input gate,

W_{i}

is the weight matrix for the input gate, and

b_{i}

is the bias term.

Candidate Memory Cell: It generates candidate memory information, which, together with the input gate, determines the new information to be stored at the current time step:

{\tilde{C}}_{t} = t a n h (W_{C} \cdot [h_{t - 1}, x_{t}] + b_{C})

(4)

Here,

{\tilde{C}}_{t}

represents the candidate memory cell, and

t a n h

is the hyperbolic tangent activation function.

Memory Cell Update: The memory cell state is updated by combining the forget gate and the input gate:

C_{t} = f_{t} \cdot C_{t - 1} + i_{t} \cdot {\tilde{C}}_{t}

(5)

Here,

C_{t}

represents the memory cell state at the current time step, and

C_{t - 1}

represents the memory cell state at the previous time step.

Output Gate: The output gate controls how much of the current memory cell’s information will affect the current output. The calculation formula is as follows:

o_{t} = σ (W_{o} \cdot [h_{t - 1}, x_{t}] + b_{o})

(6)

Here,

o_{t}

represents the output of the output gate,

W_{o}

is the weight matrix for the output gate, and

b_{o}

is the bias term.

Final Output: The current hidden state is generated through the output gate and the memory cell state at the current time step.

h_{t} = o_{t} \cdot t a n h (C_{t})

(7)

Here,

h_{t}

represents the hidden state at the current time step, which is the output of the LSTM.

In this study, a two-layer LSTM (64 and 32 neurons, 1 output neuron with linear activation) was constructed to predict silica prices. The input sequence length was 18, prices were log-transformed and normalized, an Adam optimizer with MSE loss was used, and EarlyStopping prevented overfitting. Hyperparameters were optimized via Bayesian Optimization, and k-fold validation ensured reproducibility.

4.2. XGBoost Algorithm

XGBoost is an efficient implementation of the Gradient Boosting algorithm, widely used in machine learning for classification and regression problems. Its core idea is based on the Gradient Boosting Decision Tree (GBDT) in ensemble learning, which works by sequentially training a series of weak learners (usually decision trees) to progressively reduce the model’s error.

Review of the Gradient Boosting Decision Tree (GBDT) Framework

Gradient Boosting is an ensemble learning method that combines multiple weak classifiers (e.g., shallow decision trees) into a strong classifier through an additive model. In GBDT, for Mean Squared Error (MSE) loss, each tree fits the residuals of the previous model. For a general loss function, each tree fits the negative gradient of the loss function with respect to the previous prediction (i.e., pseudo-residuals) using gradient descent.

Let the objective function be

L (θ)

, where

θ

represents the model parameters, and the training dataset is

{(x_{i}, y_{i})}_{i = 1}^{n}

, where

x_{i}

denotes the input features and

y_{i}

denotes the corresponding labels. For each iteration, the gradient boosting method updates the model’s prediction by fitting the negative gradient of the loss function.

2.: Objective Function of XGBoost

The core of XGBoost is to optimize the model by minimizing the objective function. Unlike traditional GBDT, XGBoost introduces a regularization term to prevent overfitting. The objective function can be expressed as follows:

L (θ) = \sum_{i = 1}^{n} L (y_{i}, {\hat{y}}_{i}) + Ω (f)

(8)

L (y_{i}, {\hat{y}}_{i})

is the training error term, typically the loss function, used to measure the difference between the model’s predicted values

{\hat{y}}_{i}

and the true values

y_{i}

;

Ω (f)

is the regularization term, which controls the model complexity and prevents overfitting. This term takes into account the complexity of the tree structure.

(1): Training Error Term

The training error term

L (y_{i}, {\hat{y}}_{i})

is typically determined by selecting an appropriate loss function based on the specific task:

For regression problems, the commonly used loss function is Mean Squared Error (MSE):

L (y_{i}, {\hat{y}}_{i}) = {(y_{i} - {\hat{y}}_{i})}^{2}

(9)

For binary classification problems, the commonly used loss function is the log loss:

L (y_{i}, {\hat{y}}_{i}) = - [y_{i} \log ({\hat{y}}_{i}) + (1 - y_{i}) \log (1 - {\hat{y}}_{i})]

(10)

(2): Regularization Term

The regularization term

Ω (f)

is used to control the complexity of the tree and prevent overfitting. Its form is as follows:

Ω (f) = ϒ T + \frac{1}{2} λ \sum_{j = 1}^{T} {w_{j}}^{2}

(11)

T

is the number of leaf nodes in the tree;

w_{j}

is the weight of the

j

leaf node;

ϒ

and

λ

are hyperparameters that control the model complexity.

3.: Tree Structure and Learning Algorithm

In XGBoost, the model is built through a stepwise additive process. In each iteration, the model optimizes the residuals by training a new decision tree and adding it to the existing model. Let

{\hat{y}}_{i}^{(t - 1)}

be the predicted value after the

t - 1

iteration, and the prediction value for the new round,

{\hat{y}}_{i}

, can be expressed as follows:

{\hat{y}}_{i} = {\hat{y}}_{i}^{(t - 1)} + η f_{t} (x_{i})

(12)

where,

f_{t} (x_{i})

represents the predicted value of the tree after the

t

-th round of training;

η \in (0, 1]

is the learning rate, used to reduce the influence of each individual tree and mitigate the risk of overfitting.

During the training of each tree, XGBoost learns the structure and weights of each tree by fitting the negative gradient of the objective function. The learning process of each tree is optimized by minimizing the following objective function:

L^{(t)} = \sum_{i = 1}^{n} L (y_{i}, {\hat{y}}_{i}^{(t)}) + Ω (f_{t})

(13)

To select the best split for each node, XGBoost calculates the gain for each possible split. The gain value reflects the reduction in error after splitting the current node. The formula for calculating the gain is as follows:

G a i n = \frac{1}{2} (\frac{{(\sum_{i \in L} g_{i})}^{2}}{\sum_{i \in L} h_{i} + λ} + \frac{{(\sum_{i \in R} g_{i})}^{2}}{\sum_{i \in R} h_{i} + λ} - \frac{{(\sum_{i \in S} g_{i})}^{2}}{\sum_{i \in S} h_{i} + λ})

(14)

g_{i}

represents the gradient of sample

i

;

h_{i}

represents the second-order gradient of sample

i

;

L

and

R

represent the left and right subtrees of the current node, respectively;

S

represents all samples in the current node.

The gain value is used to measure the effectiveness of the current split, and the split with the maximum gain is selected.

As shown in Figure 6 XGBoost, through multiple optimizations of the traditional gradient boosting method, has become a machine learning algorithm that excels at handling large-scale data. It combines an efficient training mechanism, regularization techniques, and flexible model tuning capabilities, making it a powerful tool in many practical applications.

4.3. NSGA-II Algorithm

As shown in the Figure 7 The Non-dominated Sorting Genetic Algorithm II (NSGA-II) is a multi-objective evolutionary algorithm that improves traditional genetic algorithms using fast non-dominated sorting, crowding distance, and elitism. By combining parent and offspring populations, it preserves high-quality solutions and balances convergence speed with Pareto front diversity.

Population Initialization

The population initialization is the starting point of the NSGA-II algorithm’s iteration. The core task is to construct a valid initial parent population P₀ of size N. The specific steps are as follows:

First, define the decision variable set

X = (x_{1}, x 2, \dots, x_{n})

and the constraint intervals

X i \in \{X i, \min, X i, \max\}

for the multi-objective optimization problem, and determine an appropriate population size N based on the problem’s complexity. Next, use real-valued encoding and generate candidate individuals within the constraint intervals by random sampling or Latin Hypercube Sampling (LHS). LHS ensures that the initial individuals are evenly distributed across the solution space, enhancing population diversity. Then, perform a legality check on the generated candidates, and for individuals that exceed the constraint boundaries, use truncation or reflection methods for correction. Finally, compile N valid individuals to form the initial parent population P₀, which lays the foundation for the subsequent genetic operations and iterative optimization.

2.: Genetic Operations

The parent population

p_{t}

undergoes selection, crossover, and mutation operations to generate an offspring population

Q_{t}

of the same size. In the selection process, a tournament selection strategy is used (typically k = 2). Individuals with better non-dominance levels are prioritized, and when levels are the same, individuals with larger crowding distances are selected. This process is repeated to construct a pairing pool. For crossover, the simulated binary crossover (SBX) strategy is applied to real-valued encoded individuals, adjusting the genetic difference between offspring and parents using a crossover factor. In the mutation process, a polynomial mutation strategy is used to introduce small disturbances to the individual’s genes, breaking the local optimum constraints and maintaining population diversity. This results in the final offspring population

Q_{t}

.

3.: The parent population and the offspring population are merged to form the new generation mixed population.
4.: Non-dominated Sorting

Non-dominated sorting is performed on the mixed population

R t

, which results from the merging of the parent and offspring populations. First, the dominance relationship is defined: An individual

X a

dominates

X b

if

X a

is no worse than

X b

in all optimization objectives and at least one objective is better. Then, the population is traversed to calculate the dominance count and dominance sets for each individual. Individuals with an empty dominance set are placed in the optimal non-dominated front

F 1

. Subsequently, the individuals in the dominance set

F 1

are iteratively reduced in dominance count, and individuals with a dominance count of 0 are placed in the next layer. This process continues until the entire population is layered, providing a basis for priority selection in subsequent steps.

5.: Crowding Distance Calculation

Next, the crowding distance

D i

is calculated for each individual

i

in the current population, as shown below:

In the equation,

y_{i - 1, j}

and

y_{i + 1, j}

represent the function values of two adjacent individuals i after sorting, on the j-th objective.

y_{j}^{\max}

and

y_{j}^{\min}

represent the maximum and minimum values of the j-th objective in the current population. The crowding distance reflects the distribution density around an individual; the larger the crowding distance, the more sparse the individual is, indicating better diversity in the solution set.

6.: Individual Selection

The purpose of individual selection is to filter out a population

P_{t + 1}

of size N from the non-dominated layers after sorting. The selection follows the priority of non-dominated layers from high to low, sequentially adding all individuals from each layer to the candidate set. If the population size has not reached N after adding the current layer, the next layer is added. If the size exceeds N, individuals in the current layer are sorted in descending order of crowding distance, and the top k individuals (where k is the number of individuals needed to fill the gap) are selected. This process ultimately forms the next-generation parent population that meets the size requirement, providing high-quality samples for subsequent iterative optimization.

7.: Repeat steps (2)~(6) until the maximum evolution iteration count is reached, resulting in the Pareto approximate optimal solution set.

4.4. Bayesian Optimization Algorithm

Bayesian Optimization (BO) is an efficient global optimization framework designed for black-box objective functions where evaluation is expensive or derivatives are unavailable. The basic idea is to use probabilistic surrogate models, such as Gaussian Process (GP), to perform Bayesian modeling of the objective function

f : χ \to R

. The posterior distribution of the function values is obtained based on prior knowledge and existing observations. An acquisition function is then constructed based on this posterior, which balances exploration and exploitation, iteratively selecting the next evaluation point to approach the global optimum. If the function values at the input point set

X = [x_{1}, \dots, x_{n}]

are denoted as the vector

f = {[f (x 1), \dots, f (x_{n})]}^{T}

, the joint distribution under the GP assumption is as follows:

f \sim η (m, K)

(15)

where

m = {[m (x 1), \dots, m (x_{n})]}^{T}

is the mean function vector, and the elements of the covariance matrix are given by the kernel function

k (x, x^{'})

, i.e.,

x

. Given the current observational data, the GP can provide the posterior mean

μ (x)

and posterior standard deviation

σ (x)

for any candidate point

x

. A commonly used acquisition function and its closed-form expression (for the maximization case) are as follows:

E I (x) = E [\max (0, f (x) - f_{b e s t} - ξ)] = (μ (x) - f_{b e s t} - ξ) Φ (Z) + σ (x) ϕ (Z)

(16)

P I (x) = Φ (\frac{μ (x) - f_{b e s t} - ξ}{σ (x)}) = Φ (Z)

(17)

U C B (x) = μ (x) + k σ (x)

(18)

where

f_{b e s t}

is the current observed optimal value,

ξ \geq 0

and

k > 0

are the adjustment parameters that balance exploration and exploitation, and

Φ

and

ϕ

are denoted the cumulative distribution function (CDF) and the probability density function (PDF) associated with a standard normal distribution, respectively, where

Z = \frac{μ (x) - f_{b e s t} - ξ}{σ (x)}

(19)

In practical applications, the choice of the kernel function and its hyperparameters can be estimated by maximizing the marginal likelihood or using Bayesian methods. In summary, BO constructs a probabilistic surrogate with uncertainty quantification for the objective function and uses the acquisition function to drive sample selection. This allows for efficient optimization within a limited evaluation budget. It is widely used in engineering design, hyperparameter tuning, and optimization problems with limited experimental resources.

4.5. K-Fold Cross-Validation

Let the dataset be

D = {x i, y i}_{i}^{N} = 1

, which is divided into KKK roughly equal subsets

D 1, D 2, \dots, D K

. In the k-th iteration,

D_{k}

is used as the validation set, and the remaining

K - 1

subsets form the training set

D_{t r a i n}^{(k)}

to train the model

f^{(k)}

. The average performance is computed as follows:

P e r f = \frac{1}{K} \sum_{k = 1}^{K} L (f^{(k)}, D_{k})

where L denotes the evaluation metric (e.g., accuracy, Mean Squared Error). K-fold cross-validation effectively utilizes all data and reduces bias due to random splitting. Common choices are

K = 5

or

K = 10

.

As shown in Figure 8, the study first generates 88 sample datasets using FLAC3D numerical simulations, considering variations in security pillar height and sidewall thickness on surface displacement and mining volume. An XGBoost surrogate model with K-fold cross-validation (k = 5) is constructed for rapid prediction of physical responses. Historical ore prices are similarly predicted using K-fold cross-validated LSTM and BO-LSTM models, with the Bayesian-optimized BO-LSTM providing the most accurate market price signals. For mining parameter optimization, single-objective XGBoost-BO is compared with multi-objective XGBoost-NSGA-II, and NSGA-II is employed to obtain Pareto-optimal solutions balancing safety and economic benefits. All selected models are integrated into a GUI-based decision support system that dynamically adjusts parameter schemes based on predicted market prices and outputs recommended stope designs and highway safety plans. The framework’s reliability is confirmed through comparison with secondary FLAC3D simulation results.

4.6. Model Training and Testing

4.6.1. Convergence Analysis

Convergence Analysis of the Ore Price Prediction Model

Figure 9 presents the convergence analysis of the ore price prediction models. In Figure 9a, the BO-LSTM model shows a rapid decrease in both training and validation loss during the initial epochs, stabilizing at a low loss value after around 20 epochs, indicating fast and stable convergence. In contrast, Figure 9b shows the standard LSTM model, where the training loss decreases more slowly, and the validation loss exhibits larger fluctuations throughout the 100 epochs, suggesting slower convergence and less stable generalization. Overall, these results demonstrate that the BO-LSTM model converges faster and achieves more reliable predictions compared with the conventional LSTM model. Therefore, Bo-LSTM is better suited for this ore price prediction task, not only improving convergence speed but also enhancing global optimization capability and model stability.

2.: Iterative Analysis of Mining Field Structural Parameter Optimization

As shown in Figure 10, the convergence characteristics of different optimization algorithms are presented. In Figure 10a, the XGBoost-NSGA-II surface displacement iteration curve shows that the displacement values decrease rapidly during the initial iterations and stabilize in subsequent iterations, indicating that the algorithm converges quickly and steadily for surface displacement prediction. Figure 10b shows the XGBoost-NSGA-II mining volume iteration curve, which exhibits a similar trend: the mining volume rises rapidly in the early iterations and remains stable thereafter, suggesting that the multi-objective optimization can efficiently approximate Pareto-optimal solutions that balance safety and economic benefits. Figure 10c presents the iteration curve of the single-objective XGBoost-BO, where the loss decreases significantly in the early iterations and changes minimally in subsequent iterations, reflecting the convergence characteristics of single-objective optimization in parameter adjustment. Overall, NSGA-II demonstrates higher convergence speed and stability in multi-objective optimization problems, while XGBoost-BO also converges effectively in single-objective optimization, albeit with relatively limited optimization flexibility.

4.6.2. Accuracy Analysis

To conduct a rigorous assessment of the predictive accuracy across all models, a suite of fundamental statistical indicators was employed: Mean Squared Error (MSE), Coefficient of Determination (R²), Mean Absolute Percentage Error (MAPE), and Mean Absolute Error (MAE). These metrics provide a multi-faceted perspective on model robustness. The comparative outcomes for the four investigated algorithms—specifically regarding their performance in forecasting displacement, excavation volume, and mineral market prices—are synthesized in Table 5 and Table 6.

M S E = \frac{1}{n} \sum_{i = 1}^{n} (y i - \hat{y} i)

(20)

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - \hat{y} i)}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y} i)}^{2}}

(21)

M A P E = \frac{1}{n} \sum_{i = 1}^{n} [\frac{|\hat{y} i - y i|}{y i}] \times 100

(22)

M A E = \frac{1}{n} \sum_{i = 1}^{n} |y i - \hat{y} i|

(23)

In the formula,

y i

represents the true value of the i-th sample;

\hat{y} i

represents the predicted value of the i-th sample; and N is the number of samples.

Goodness-of-Fit Analysis

Figure 11 illustrates the predictive performance of different models and their R goodness-of-fit. Figure 11a and b show the predicted ore prices using BO-LSTM and LSTM, respectively, where BO-LSTM achieves a notably higher R² = 0.97 compared to LSTM’s R² = 0.76, indicating that the Bayesian-optimized LSTM model exhibits superior accuracy in price prediction. Figure 11c,d present the predictions of surface displacement and mining volume by the XGBoost-BO model, with R² values of 0.93 and 0.86, demonstrating that the single-objective optimization model provides good predictive performance for physical responses. Figure 11e,f show the corresponding predictions of the XGBoost-NSGA-II model, with R² values of 0.96 and 0.99 for surface is placement and mining volume, indicating that multi-objective optimization achieves higher fitting accuracy than single-objective optimization and more precisely captures the relationship between actual mining parameters and physical responses. Overall, both BO-LSTM and XGBoost-NSGA-II demonstrate high predictive accuracy and reliability within their respective tasks.

2.: Comparison Analysis of True Values and Predicted Values

Figure 12 presents the comparison between predicted and actual values for different models. The results indicate that BO-LSTM and XGBoost-NSGA-II closely reproduce the actual data, accurately capturing ore prices, surface displacement, and mining output, whereas the single-objective XGBoost-BO model reflects the overall trends but with slightly lower accuracy.

4.6.3. Sensitivity Analysis

Price Sensitivity Analysis

The price prediction model is a typical endogenous variable-driven model. Its core logic is based on the autocorrelation of historical silicon ore prices for evolutionary reasoning, where the feature space is composed only of “historical prices” and the “time dimension.” Since the model does not incorporate external variables such as electricity costs, downstream demand, supply-demand gaps, or macroeconomic policies, it is not possible to observe the causal feedback of output fluctuations by disturbing the independent variables. Under the algorithmic framework, all external factors have been internalized into the statistical characteristics of the price curve through the “comprehensive environmental effects.” If a sensitivity analysis were performed, the results would only reflect the robustness of hyperparameters or data noise, rather than sensitivity to market factors. Therefore, under the logic framework of this project, such an analysis holds no business reference value.

2.: Underground Mining Site Parameter Sensitivity Analysis

As shown in Figure 13, the sensitivity analysis of the model outputs with respect to key parameters is presented. Figure 13a depicts the sensitivity analysis for surface displacement, showing that displacement decreases noticeably with changes in the parameters, but the overall variation is moderate, indicating that the model exhibits a certain degree of robustness in predicting displacement. Figure 13b illustrates the sensitivity analysis for mining volume, which similarly shows a gradual decrease with parameter changes and relatively uniform variation, suggesting limited sensitivity of the model in predicting mining output. Overall, these two figures indicate that, within the current model framework, the physical responses remain relatively stable with respect to changes in input parameters, confirming the robustness of the model.

4.6.4. Analysis of Core Predictive Models Under Varying Sample Sizes and Input Noise

The analysis shows that LSTM-BO achieves higher accuracy in price prediction, while XGBoost-NSGA-II excels in mining site parameter optimization. This section focuses on these two models, examining how sample size, noise, and outliers affect prediction accuracy.

Analysis of Training Data Size for a Single Sample

This experiment analyzes the sample size sensitivity and input perturbation of the core models, focusing on LSTM-BO and XGBoost-NSGA-II. As shown in Table 7, from 88 full-factor samples, 40–90% were used as training sets and the rest as validation sets. The models were trained, and R², MAE, and MSE were computed to assess prediction accuracy across different sample sizes.

Figure 14 illustrates that the proportion of training samples significantly affects model predictive performance. As the training sample ratio increases from 40% to 90%, all models show a marked improvement in goodness-of-fit (R²) and a substantial reduction in Mean Squared Error (MSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE). The most notable performance gains occur when the ratio rises from 40% to 60%, with R² increasing sharply and error metrics decreasing significantly, indicating high model sensitivity to training data. Beyond a 70% sample ratio, performance metrics stabilize, suggesting that further increasing the sample size yields limited additional improvement.

2.: Noise Analysis

As shown in Table 8, this experiment introduced Gaussian noise with standard deviations of 1%, 3%, 5%, 7%, 9%, 11%, and 13% by adjusting the noise proportion in the training data. The BO-LSTM and XGBoost-NSGA-II models were then trained under these conditions, and performance metrics such as R², MAE, and MSE were calculated to quantify the models’ predictive accuracy under varying noise levels.

As shown in Figure 15, the models exhibit varying performance under different noise levels. As the noise level increases from 1% to 13%, the R² values for displacement, mining volume, and price show a decreasing trend, while MSE, MAE, and MAPE generally increase. This indicates that the predictive accuracy of the models declines with increasing data noise, although the magnitude of this decline differs across metrics.

Specifically, the displacement metric is highly sensitive to noise, with R² decreasing rapidly and error metrics rising significantly. The mining volume metric is relatively robust, showing only modest increases in errors. The price metric demonstrates strong noise tolerance, maintaining relatively high R² values and exhibiting slow growth in error measures. These results suggest that model robustness varies across different target variables, and the impact of data noise should be carefully considered in practical applications.

3.: Outlier Sensitivity Analysis

Outlier sensitivity analysis is conducted to assess the extent to which data analysis or model results are affected by outliers. Specifically, in Table 9, outlier proportions are set at 5%, 10%, 15%, and 20%, with deviation magnitudes of 10%, 20%, 30%, 40%, and 50%. Under these conditions, the BO-LSTM and XGBoost-NSGA-II models are trained. Model performance is then quantified by calculating metrics such as R², MAE, and MSE, allowing evaluation of predictive accuracy and robustness under varying outlier conditions.

From Figure 16, experimental results show that as the amplitude and proportion of outliers increase, both models exhibit a stepwise performance decline. Within a “safe zone” of outlier amplitudes below 30%, NSGA-II and Bayesian-optimized models maintain high R² scores. Beyond this threshold, error metrics (MAE, MSE, MAPE) rise sharply, with MSE being most sensitive, indicating that large outliers severely distort the loss function. XGBoost demonstrates stronger stability for discrete parameters, while LSTM is more sensitive to temporal fluctuations, with price predictions showing a regular stratified decline in R² as noise grows. These findings highlight the importance of data cleaning and defining model reliability boundaries for safe and robust deep mining predictions.

The experimental results confirm that the proposed algorithm is reasonable and robust. Accuracy improves with more training data, while moderate noise and outliers have a limited impact. Performance drops only beyond critical thresholds, demonstrating the algorithm’s reliability limits. Overall, the findings show it is suitable for predictive modeling in complex, noisy datasets.

4.7. Intelligent Mining Site Parameter Optimization System for Underground Mining Engineering Structures

4.7.1. System Introduction

From Figure 17, the technical workflow of the system consists of three main modules. First, in the Market Prediction Layer, historical ore prices are used to train a BO-LSTM model, which is validated in the early stage using K-fold cross-validation to ensure prediction accuracy and generalization ability, producing future cycle price signals. Next, in the Engineering Layer, FLAC3D simulation data are used to build an XGBoost surrogate model (Virtual Stope), and NSGA-II multi-objective optimization is applied to simultaneously consider recovery ratio and support stability, yielding Pareto-optimal engineering parameters. Finally, in the Decision Engine Layer, mining parameters are dynamically adjusted based on the market price signals and optimization results: high prices increase output weight to compress pillars, while low prices increase support weight to enhance stability. The system outputs practical mining schemes, including stope geometry, pillar spacing, and support strength, thereby achieving a deep integration of market prediction, engineering optimization, and intelligent decision-making.

4.7.2. Establishment of an Intelligent Decision-Making Platform with Deep Integration of Market Forecasting and Engineering Optimization

The system is an intelligent decision-making platform for mining safety and market optimization, built with Python Tkinter and integrating machine learning, deep learning, and multi-objective optimization.

Market Prediction: BO-LSTM forecasts ore prices, validated with K-fold cross-validation (MAPE). Users can obtain predicted prices and trends for a target month.

Mining Site Optimization: XGBoost surrogate models and NSGA-II optimize pillar height and sidewall thickness, balancing safety and output. Results are visualized alongside historical samples.

Field Validation: Users input measured displacement and mining volume; the system computes deviations from AI predictions and displays measured points on the chart for comparison.

The left panel handles inputs and outputs, while the right panel shows dynamic visualizations, supporting informed decision-making for mining operations.

As shown in Figure 18, it is the GUI interface diagram of the prediction system.

4.7.3. Accuracy Verification of the GUI Prediction Interface System

Ore Price Prediction Accuracy Validation

As shown in Table 10, by comparing historical prices with the predictions from the GUI system, it can be observed that the overall predictions are relatively close to the actual values, but there are still some errors. Specifically, most predicted values have a relative error between 1% and 10% compared to the historical prices. For example, the prediction of 317.63 for a historical price of 320 results in an error of only 0.74%, while the prediction of 313.33 for a historical price of 330 has an error of approximately 5.05%. However, some data points show larger errors, such as the historical price of 450 predicted as 375, with an error of 16.67%, and the historical price of 340 predicted as 285, with an error of 16.18%, indicating that the system’s predictions deviate more in higher values or certain fluctuating ranges. Overall, the prediction system is able to capture the trend accurately in most cases, but further optimization is needed to reduce extreme errors and ensure the stability and reliability of the model.

2.: Validation of Mining Site Structure Parameter Accuracy

Based on the results above, it can be inferred that the GUI interface, integrated with the algorithm, demonstrates reliable accuracy in ore price forecasting, especially in more stable market conditions. This suggests that when using this model to predict future prices, the effectiveness and accuracy of the predictions can be reasonably ensured. Therefore, when forecasting ore prices for the next three years based on this model, it can provide strong support to decision-makers, offering scientific guidance for selecting the optimal mining site structure parameters. This prediction not only provides direction for optimizing mining strategies but also offers data support for resource allocation and production scheduling, further enhancing the precision and scientific nature of decision-making. As shown in Table 11, the mining site structure parameters correspond to the predicted future prices.

3.: Error Validation Calculation Based on FLAC3D

A numerical calculation model was established using FLAC3D. As shown in Figure 19 and Table 12, the surface settlement value is 9.89 cm. The calculated mining volume is 69,834.5 t. The relative error of displacement is 5.07%, and the relative error of mining volume is 2.38%, demonstrating a certain level of accuracy.

P = \frac{|x - x_{t r u e}|}{|x_{t r u e}|} \times 100 %

(24)

A represents the measured or calculated value; B represents the true value.

5. Conclusions

This study tackles the challenge of quartzite mining beneath a secondary highway in Yunnan Province by proposing a hybrid optimization framework that integrates geomechanical numerical simulation with machine learning. Three-dimensional FLAC3D simulations quantified surface displacement (cm) and ore recovery under varying pillar heights (18–25 m) and sidewall thicknesses (0.5–3 m), enabling systematic assessment of tunnel stability, road safety, and economic efficiency.

To accelerate analysis, an XGBoost-based “virtual tunnel” surrogate model was developed for rapid prediction of geomechanical responses. Using 5-fold cross-validation, the model achieved high-accuracy R² = 0.9709 for vertical surface displacement and R² = 0.9963 for ore recovery. Robustness tests against varying training data sizes, noise levels, and outliers identified a “safe operating zone,” ensuring reliable predictions under data perturbations.

For dynamic integration with market signals, the XGBoost outputs were combined with a Bayesian-optimized LSTM (BO-LSTM) via NSGA-II multi-objective optimization. The BO-LSTM tracked ore price fluctuations with high accuracy (MAPE = 1.26%), allowing the framework to adapt tunnel design-narrower pillars during high prices to maximize yield, wider during low prices to prioritize stability and settlement control. NSGA-II delineated a Pareto frontier, balancing surface settlement and ore recovery.

Despite promising results, limitations remain. The XGBoost surrogate relied on only 88 full-factorial samples, and economic modeling simplified nonlinear support costs and external factors without considering legal settlement thresholds. Future work will expand stochastic sampling, refine cross-validation, and integrate NPV-based economics. Incorporating digital twin technology could enable real-time linkage between underground operations and surface monitoring, supporting more adaptive and predictive mining optimization.

Author Contributions

Data curation, formal analysis, investigation, methodology, software, validation, visualization, writing—original draft, T.D.; conceptualization, data curation, investigation, methodology, writing—review and editing, H.C.; data curation, investigation, resources, supervision, writing—review and editing, S.P.; data curation, investigation, writing—review and editing, X.X.; data curation, investigation, writing—review and editing, funding, G.W.; data curation, investigation, writing—review and editing, T.C.; data curation, investigation, writing—review and editing, L.Z. All authors have read and agreed to the published version of the manuscript.

Funding

Central Government Funds for Guiding Local Science and Technology Development (202407AC110019).

Institutional Review Board Statement

Not Applicable.

Informed Consent Statement

Not Applicable.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Conflicts of Interest

Shouxing Peng, Xiangsheng Xia and Tao Chen were employed by Pangang Group Mining Company Limited. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Altun, A.O.; Yilmaz, I.; Yildirim, M. A short review on the surficial impacts of underground mining. Sci. Res. Essays 2010, 5, 3206–3212. [Google Scholar]
Bazaluk, O.; Kuchyn, O.; Saik, P.; Soltabayeva, S.; Brui, H.; Lozynskyi, V.; Cherniaiev, O. Impact of ground surface subsidence caused by underground coal mining on natural gas pipeline. Sci. Rep. 2023, 13, 19327. [Google Scholar] [CrossRef]
Man, T.; Wang, Z.; Huppert, H.; Ren, S.; Galindo-Torres, S.A.; Zhang, X.; Zhou, A. Propagation scaling and micromechanics of buoyant granular column collapses. J. Geophys. Res. Earth Surf. 2026, 131, e2025JF008812. [Google Scholar] [CrossRef]
Guo, R. Investigation on the optimization of safe thickness of horizontal isolation pillars in the east mining region of Hongniu Copper Mine. In Proceedings of the International Conference on Optics, Electronics, and Communication Engineering (OECE 2025); SPIE: Bellingham, WA, USA, 2025; Volume 13965, pp. 1061–1066. [Google Scholar]
Chen, T.; Mitri, H.S. Strategies for surface crown pillar design using numerical modelling–A case study. Int. J. Rock Mech. Min. Sci. 2021, 138, 104599. [Google Scholar] [CrossRef]
Mehra, A.; Budi, G. 3D Modelling approach to identify parametric configurations for pillar stability in underground metal mine: A case study. Geomat. Nat. Hazards Risk 2024, 15, 2367630. [Google Scholar] [CrossRef]
Hu, W.; Zhou, A. Self-organizing Optimization of Parameters for Combined Mining and Retained Layers in Open-pit and Underground Joint Mining. Min. Res. Dev. 2002, 02, 7–9. [Google Scholar]
Li, Y.; Nan, S.; Zhao, X.; Yang, T.; Tang, C.; Zhang, Y.; Tan, Z. Stability study of underground boundary pillars in open-pit mines. Chin. J. Rock Mech. Eng. 2005, 24, 278–283. [Google Scholar]
Ma, T.; Tang, C.; Yang, Y.; Lin, P. Stability Analysis of Roof Pillars in the Transition from Open-pit to Underground Mining. J. Northeast. Univ. 2006, 27, 450–453. [Google Scholar]
Xu, H.; Yang, T.; Zhu, L. Study on the Reasonable Thickness of Roof Pillars at the Transition from Open-pit to Underground Mining in the Sijiaying Iron Mine III. China Min. 2007, 04, 74–76+80. [Google Scholar]
Long, L.; Chen, X.; Liu, C.; Liu, X.; Mo, C. Safety Thickness Optimization of Isolation Pillars Based on the Critic Weighting Method. Chem. Eng. Miner. Process. 2021, 50, 1–5. [Google Scholar]
Lu, P.; Zhang, J.; Yang, Z.; Cheng, Y.; Dong, J. Study on Determining the Isolation Pillars in the Transition from Open-pit to Underground Mining in a Copper Mine. Non Ferr. Met. Min. Sect. 2020, 72, 11–14+19. [Google Scholar]
Zhang, Y.; Qi, H.; Li, C.; Zhou, J. Enhancing safety, sustainability, and economics in mining through innovative pillar design: A state-of-the-art review. J. Saf. Sustain. 2024, 1, 53–73. [Google Scholar] [CrossRef]
Xu, S.; Suorineni, F.T.; An, L.; Li, Y.H.; Jin, C.Y. Use of an artificial crown pillar in transition from open pit to underground mining. Int. J. Rock Mech. Min. Sci. 2019, 117, 118–131. [Google Scholar] [CrossRef]
Babanouri, N.; Beyromvand, H.; Dehghani, H. Evaluation of different methods of pillar recovery in coal mining by numerical simulation: A case study. Environ. Earth Sci. 2023, 82, 110. [Google Scholar] [CrossRef]
Guo, J.; Zhao, Y.; Zhang, W.; Dai, X.; Xie, X. Stress Analysis of Mine Walls in Isolation Pillar Mining under Multidirectional Loading. J. Cent. South Univ. Nat. Sci. Ed. 2018, 49, 3020–3028. [Google Scholar]
Wang, Y.; Xu, H.; Wu, A.; Ai, C.; Wu, P. Instability Mechanism of Temporary Mine Wall Systems Based on the Pointed Discontinuity Model and Optimization of Mine Wall Thickness. J. Min. Saf. Eng. 2016, 33, 662–667+675. [Google Scholar]
Li, X.; Peng, D.; Feng, F.; Li, X. Stability Analysis of Isolation Pillars in Deep Falling to Backfill Mining Based on the Thin Plate Theory. J. China Univ. Min. Technol. 2019, 48, 484–494. [Google Scholar]
Xie, X.; Li, D.; Kong, L. Stability Analysis Model of Mine Walls Based on Elastic Plate Theory and Its Application. J. Min. Saf. Eng. 2020, 37, 698–706. [Google Scholar]
Parmar, H.; Bafghi, A.Y.; Najafi, M. Impact of ground surface subsidence due to underground mining on surface infrastructure: The case of the Anomaly No. 12 Sechahun, Iran. Environ. Earth Sci. 2019, 78, 409. [Google Scholar] [CrossRef]
Ivadilinova, D.T.; Issabek, T.K.; Takhanov, D.K.; Yeskenova, G.B. Predicting Underground Mining Impact on the Earth’s Surface. Sci. Bull. Natl. Min. Univ. 2023, 1, 32–37. [Google Scholar] [CrossRef]
Bugajska, N.J.; Milczarek, W.J. Remote sensing monitoring of influence of underground mining in the area of the S3 Express Road. IOP Conf. Ser. Earth Environ. Sci. 2021, 684, 012028. [Google Scholar] [CrossRef]
Vinay, L.S.; Bhattacharjee, R.M.; Ghosh, N.; Kumar, S. Machine learning approach for the prediction of mining-induced stress in underground mines to mitigate ground control disasters and accidents. Geomech. Geo-Phys. Geo-Energy Geo-Resour. 2023, 9, 159. [Google Scholar]
Deng, T.; He, J.; Sun, J.; Peng, S.; Pang, X.; Chen, T.; Zhang, X. Intelligent optimization of slope step parameters in open pit mines containing weak interbedded layers based on machine learning and multi-objective optimization methods. Front. Earth Sci. 2025, 13, 1666375. [Google Scholar] [CrossRef]
Fattahi, H.; Ghaedi, H.; Armaghani, D.J. Optimizing underground coal mine safety: Leveraging advanced computational algorithms for roof fall rate prediction and risk mitigation. Min. Metall. Explor. 2024, 41, 2849–2867. [Google Scholar] [CrossRef]
Yıldız, T.D. Overlapping of mine sites and highway route in Turkey: Evaluation in terms of mining land use criteria and land-use planning. Land Use Policy 2021, 106, 105444. [Google Scholar] [CrossRef]
Vušović, N.M.; Hejmanowski, R.; Vlahović, M.M. Impact of Mining Disturbance on Highway Sustainability: A Case Study of Aleksinac Mine Area, Serbia. Appl. Sci. 2025, 15, 2291. [Google Scholar] [CrossRef]
Cheng, G.; Liu, H.; Li, F.; Nie, T.; Wang, Q.; Peng, C. Study on subsidence evolution induced by coal mining under highway based on finite element simulation. Energy Explor. Exploit. 2025, 43, 1180–1205. [Google Scholar] [CrossRef]
Deng, W.N.; Zhang, H.X. Present Situation of Research on Coal Mining Subsidence under Highway in China. Adv. Mater. Res. 2013, 664, 954–959. [Google Scholar] [CrossRef]
Nguyen, H.V.; Le, D.Q.; Nguyen, L.Q.; Lipecki, T. Prediction of road subsidence caused by underground mining activities by artificial neural networks. Inżynieria Miner. 2023, 29, 627–637. [Google Scholar] [CrossRef]
Vušović, N.M.; Vlahović, M.M. Simulating the mine subsidence and deformations of highway using a stochastic model. Res. Sq. 2024. preprint. [Google Scholar]
Chen, Y.; Zha, J.; Wang, L. Adaptive dynamic prediction model of mining subsidence aided by measured data. Sci. Rep. 2025, 15, 14754. [Google Scholar] [CrossRef]

Figure 1. Site diagram.

Figure 2. Rock mechanics experiments.

Figure 3. Numerical calculation model of the mining site.

Figure 4. Schematic diagram of numerical calculation: (a) isolation pillar of 18 m with a sidewall thickness of 1 m; (b) isolation pPillar of 19 m with a sidewall thickness of 0.5 m; (c) isolation pillar of 19 m with a sidewall thickness of 1 m; (d) isolation pillar of 19 m with a sidewall thickness of 1 m; (e) isolation pillar of 20 m with a sidewall thickness of 1 m; (f) isolation pillar of 19 m with a sidewall thickness of 1.5 m.

Figure 5. LSTM diagram.

Figure 6. XGBoost algorithm diagram.

Figure 7. NSGA-II algorithm diagram.

Figure 8. Technical roadmap.

Figure 9. Convergence analysis of ore price prediction model: (a) BO-LSTM iteration curve diagram; (b) LSTM iteration curve diagram.

Figure 10. Iteration curve schematic diagram: (a) XGBoost-NSGA-II displacement iteration curve; (b) XGBoost-NSGA-II mining volume iteration curve; (c) XGBoost-BO iteration curve.

Figure 11. R² schematic diagram: (a) BO-LSTM predicted value R² schematic diagram; (b) LSTM-predicted value R² schematic diagram; (c) XGBoost-BO displacement R² schematic diagram; (d) XGBoost-BO mining output R² schematic diagram; (e) XGBoost-NSGA-II displacement R² schematic diagram; (f) XGBoost-NSGA-II mining output R² schematic diagram.

Figure 12. Schematic diagram of comparison between true values and predicted values: (a) BO-LSTM price real vs. predicted ore price comparison chart; (b) LSTM price real vs. predicted ore price comparison chart; (c) XGBoost-NSGA-II displacement real vs. predicted comparison chart; (d) XGBoost-NSGA-II mining output real vs. predicted comparison chart; (e) XGBoost-BO displacement real vs. predicted comparison chart; (f) XGBoost-BO mining output real vs. predicted comparison chart.

Figure 13. Sensitivity analysis chart: (a) displacement sensitivity analysis; (b) mining volume sensitivity analysis.

Figure 14. Evaluation metrics for different training sample sizes.

Figure 15. Evaluation metrics under different noise levels.

Figure 16. Evaluation metrics under varying outlier proportions and deviation magnitudes.

Figure 17. Flowchart of the mining site parameter prediction system.

Figure 18. GUI interface diagram of the prediction system.

Figure 19. Displacement from numerical simulation: (a) displacement contour map of the mining site; (b) displacement at monitoring points.

Table 1. Rock mechanics parameters.

Rock Mass	Weight γ (g/cm³)	Cohesion c (MPa)	Friction Φ (°)	Elastic Modulus (GPa)	Poisson
Highly Weathered Biotite Granite	26.2	0.083	28.07	0.67	0.29
Silicon Stone	26.8	0.172	37.2	2.2	0.24

Table 2. Scheme design table.

Serial Number	Isolation Pillar Thickness /m	Sidewall Thickness /m
1	18	0.5; 0.75; 1; 1.25; 1.5; 1.75; 2; 2.25; 2.5; 2.75; 3
2	19	0.5; 0.75; 1; 1.25; 1.5; 1.75; 2; 2.25; 2.5; 2.75; 3
3	20	0.5; 0.75; 1; 1.25; 1.5; 1.75; 2; 2.25; 2.5; 2.75; 3
4	21	0.5; 0.75; 1; 1.25; 1.5; 1.75; 2; 2.25; 2.5; 2.75; 3
5	22	0.5; 0.75; 1; 1.25; 1.5; 1.75; 2; 2.25; 2.5; 2.75; 3
6	23	0.5; 0.75; 1; 1.25; 1.5; 1.75; 2; 2.25; 2.5; 2.75; 3
7	24	0.5; 0.75; 1; 1.25; 1.5; 1.75; 2; 2.25; 2.5; 2.75; 3
8	25	0.5; 0.75; 1; 1.25; 1.5; 1.75; 2; 2.25; 2.5; 2.75; 3

Table 3. Partial sample training set for surface subsidence displacement and mining output.

Serial Number	Isolation Pillar Thickness /m	Mining Site Sidewall Thickness /m	Displacement /cm	Mining Volume /t
1	25	0.5	13.8981	70,589.86
2	25	0.75	11.0857	68,732.23
3	25	1	8.27336	66,874.61
4	24	1	10.886	68,785.31
5	24	1.25	9.1795	66,874.61
6	24	1.5	7.473	64,963.9
7	23	0.5	23.043	74,623.57
8	23	0.75	17.7465	72,659.79
9	23	1	12.45	70,696.01
…	…	…	…	…

Table 4. Partial sample training set for ore price.

Year	Month	Ore Price
2024	1	330
	2	330
	3	325
	4	320
	5	315
	6	310
	7	305
	8	300
	9	300
	10	310
	11	315
	12	310

Table 5. Evaluation of prediction accuracy for ore prices.

Algorithm	Method	Price
LSTM-BO	MAE	4.2254
	R²	0.97
	MSE	25.4912
	MAPE	1.26%
LSTM	MAE	14.52
	R²	0.76
	MSE	336.09
	MAPE	4.17

Table 6. Evaluation of mining site parameters.

Algorithm	Method	Displacement	Mining Output
XGBoost-NSGA-II	MAE	0.7353	430.6059
	R²	0.9709	0.9963
	MSE	2.1235	251,585.7983
	MAPE	4.5%	0.65%
XGBoost-BO	MAE	1.7923	2499.4637
	R²	0.9106	0.8379
	MSE	8.2553	10,120,834.04
	MAPE	13.3%	3.7%

Table 7. Analysis and calculation of sample training data size.

Metrics	Proportion of Samples
Percentage of samples	40%	50%	60%	70%	80%	90%

Table 8. Calculation of Gaussian Noise Standard Deviations.

Metrics	Artificial Gaussian Noise
Noise level	1%	3%	5%	7%	9%	11%	13%

Table 9. Outlier proportions and deviation magnitudes.

Metrics	Outlier Proportion	Deviation Magnitude
Percentage	5%, 10%, 15%, 20%	10%, 20%, 30%, 40%, 50%

Table 10. Calculation of relative error in GUI price prediction.

Date	Historical Prices/¥	Predicted Prices by the GUI System/¥	Relative Error
2021.2	330	300	9.09%
2023.7	330	313.33	5.05%
2023.9	340	314.05	7.63%
2024.5	320	317.63	0.74%
2020.8	450	375	16.67%
2024.12	310	320.63	3.43%
2021.5	340	285	16.18%
2023.5	340	312.35	8.13%
2024.9	300	319.36	6.45%

Table 11. Mining site parameter prediction results based on monthly price factors.

Date	Predicted Price	Mining Site Structure Parameters		Objective
Date	Predicted Price	Safety Pillar/m	Sidewall Thickness/m	Displacement /cm	Mining output /t
2027.2	330.85	21.44	1.60	9.66	73,579

Table 12. Calculation of relative error.

	Predicted Value		Calculated Value		Relative Error
Objective	Displacement /cm	Mining output /t	Displacement /cm	Mining output /t	Displacement	Mining output
	9.66	73,562	9.89	69,834.5	2.38%	5.07%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Deng, T.; Chen, H.; Peng, S.; Xia, X.; Wang, G.; Chen, T.; Zhang, L. Research on the Collaborative Safety Optimization of Underground Mine Workings and Surface Roads Based on Machine Learning. Appl. Sci. 2026, 16, 5178. https://doi.org/10.3390/app16115178

AMA Style

Deng T, Chen H, Peng S, Xia X, Wang G, Chen T, Zhang L. Research on the Collaborative Safety Optimization of Underground Mine Workings and Surface Roads Based on Machine Learning. Applied Sciences. 2026; 16(11):5178. https://doi.org/10.3390/app16115178

Chicago/Turabian Style

Deng, Tao, Haoyu Chen, Shouxing Peng, Xiangsheng Xia, Guangjin Wang, Tao Chen, and Lingling Zhang. 2026. "Research on the Collaborative Safety Optimization of Underground Mine Workings and Surface Roads Based on Machine Learning" Applied Sciences 16, no. 11: 5178. https://doi.org/10.3390/app16115178

APA Style

Deng, T., Chen, H., Peng, S., Xia, X., Wang, G., Chen, T., & Zhang, L. (2026). Research on the Collaborative Safety Optimization of Underground Mine Workings and Surface Roads Based on Machine Learning. Applied Sciences, 16(11), 5178. https://doi.org/10.3390/app16115178

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Research on the Collaborative Safety Optimization of Underground Mine Workings and Surface Roads Based on Machine Learning

Featured Application

Abstract

1. Introduction

2. Geological Survey and Rock Mechanics Parameter Assessment in Mining Regions

2.1. Overview of Geological Engineering Conditions

2.2. Parameters for Rock Mechanics Evaluation

3. Design of Collaborative Optimization Scheme for Mining Site and Road Based on Intelligent Optimization and Sample Construction

3.1. Scheme Design

3.2. Technical Approach

3.2.1. Data Preparation—Constructing the Sample Training Set

3.2.2. Establishment of Prediction Models

3.2.3. Intelligent Optimization of Underground Mining Engineering Structural Parameters Based on Machine Learning

3.3. Construction of Surface Subsidence and Economic Benefit Sample Training Set

4. Intelligent Optimization of Underground Mining Site Structural Parameters Based on Multi-Objective Optimization and Machine Learning Methods

4.1. Principles of the LSTM Algorithm

4.2. XGBoost Algorithm

4.3. NSGA-II Algorithm

4.4. Bayesian Optimization Algorithm

4.5. K-Fold Cross-Validation

4.6. Model Training and Testing

4.6.1. Convergence Analysis

4.6.2. Accuracy Analysis

4.6.3. Sensitivity Analysis

4.6.4. Analysis of Core Predictive Models Under Varying Sample Sizes and Input Noise

4.7. Intelligent Mining Site Parameter Optimization System for Underground Mining Engineering Structures

4.7.1. System Introduction

4.7.2. Establishment of an Intelligent Decision-Making Platform with Deep Integration of Market Forecasting and Engineering Optimization

4.7.3. Accuracy Verification of the GUI Prediction Interface System

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI