Slope Rock Mass Classification Using Deep Forest Optimized by Three Metaheuristic Algorithms: A Case Study of Luming Molybdenum Mine

Chen, Rongjian; Li, Diyuan; Sun, Jiahao; Cao, Jianfu; Zhou, Tong; Zhang, Chen

doi:10.3390/app16115275

Open AccessArticle

Slope Rock Mass Classification Using Deep Forest Optimized by Three Metaheuristic Algorithms: A Case Study of Luming Molybdenum Mine

by

Rongjian Chen

^1,2,3,

Diyuan Li

¹

,

Jiahao Sun

^1,*

,

Jianfu Cao

³,

Tong Zhou

³ and

Chen Zhang

¹

School of Resources and Safety Engineering, Central South University, Changsha 410083, China

²

China Railway Resources Group Co., Ltd., Beijing 100039, China

³

Yichun Luming Mining Co., Ltd., Yichun 153000, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2026, 16(11), 5275; https://doi.org/10.3390/app16115275

Submission received: 17 April 2026 / Revised: 21 May 2026 / Accepted: 22 May 2026 / Published: 25 May 2026

(This article belongs to the Topic Failure Characteristics of Deep Rocks, 3rd Edition)

Download

Browse Figures

Versions Notes

Abstract

Accurate and efficient rock mass quality classification is a prerequisite for assessing slope stability, designing support schemes, and ensuring mining safety in open-pit mines. However, traditional empirical classification methods rely heavily on expert judgment and often struggle to capture the complex, nonlinear relationships among factors influencing slope stability. Existing intelligent classification models also suffer from limitations, including sensitivity to incomplete data, insufficient feature interaction learning, and unstable performance on small-scale datasets. To address these issues, this study develops a deep forest (DeepForest) model optimized by three metaheuristic algorithms—brown bear optimizer (BBO), tuna swarm optimizer (TSO), and sparrow search algorithm (SSA)—to intelligently classify slope rock mass quality. A rock mass quality dataset containing 204 groups of slope and non-slope cases was established to train and evaluate the classification performance of the DeepForest models. Six influencing factors were set as input parameters: uniaxial compressive strength (UCS) of rock, rock quality designation (RQD), spacing of discontinuities (Sd), rock mass integrity coefficient (Kv), groundwater conditions (W), and site type (St). Multivariate imputation by chained equations (MICE), isolation forest (IsoForest), and synthetic minority over-sampling technique (SMOTE) were used to handle missing values, outliers, and imbalance in the dataset, respectively. The performance of the proposed models was evaluated using five metrics: accuracy, precision, recall, F1-score, and area under the receiver operating characteristic curve (AUC). The experimental results indicate that the BBO-DeepForest model performed best on the independent test set, with accuracy, precision, recall, F1-score, and average AUC values of 0.878, 0.682, 0.678, 0.678, and 0.961, respectively. A comparison with seven well-known imputation algorithms revealed the superiority of the selected imputation algorithm in recovering incomplete rock mass quality datasets. Model interpretation results showed that RQD and UCS are critical feature parameters for classifying slope rock mass quality. At last, the proposed BBO-DeepForest model was employed to verify the rock mass quality of three slopes at the Luming molybdenum mine, resulting in classifications consistent with on-site observations. It demonstrates that combining DeepForest with metaheuristic optimization algorithms is a feasible and accurate approach for intelligently classifying the rock mass quality of slopes.

Keywords:

slope engineering; rock mass classification; deep forest; metaheuristic algorithms

1. Introduction

The stability of rock slopes is a critical issue in open-pit mining, hydropower engineering, and transportation infrastructure construction. As an intrinsic factor controlling the deformation and failure behavior of slopes, rock mass quality plays a vital role in slope stability analysis and engineering design [1]. Accurate and reliable rock mass quality classification not only provides an important basis for slope stability assessment and safety protection, but also helps prevent geological disasters such as collapses and landslides. Therefore, improving the accuracy of rock mass quality classification is crucial to ensuring the safety of infrastructure construction in slope engineering.

Traditional rock mass quality classification methods can be divided into single-factor methods and multi-factor methods based on the number of influencing factors considered. Single-factor methods classify rock mass quality using a single evaluation index, primarily rock strength [2] and rock quality designation [3]. These methods are simple and quick, but they lack accuracy. This is because a single evaluation index cannot truly reflect rock mass quality. In contrast, multi-factor methods use multiple evaluation indices to classify rock mass quality. For instance, the rock mass rating (RMR) system [4] considers five influencing factors: uniaxial compressive strength (UCS), rock quality designation (RQD), spacing of discontinuities (Sd), groundwater, and conditions of discontinuities. The Q-system [5] considers six influencing factors, including RQD, number of joint sets, joint roughness number, joint alteration number, joint water reduction factor, and stress reduction factor. The Geological Strength Index (GSI) classification method [6] introduces the GSI to characterize rock mass quality. These methods provide more accurate and comprehensive results for rock mass classification. However, these empirical classification methods rely heavily on engineering experience and predefined scoring criteria. Their discrete evaluation structures and simplified linear weighting strategies make it difficult to accurately characterize the complex nonlinear interactions among geological parameters under heterogeneous slope conditions. In addition, different experts may produce inconsistent classification results for the same rock mass, reducing the robustness and reproducibility of the evaluation process.

To address the drawbacks of empirical approaches and improve the accuracy and objectivity of classification results, fuzzy comprehensive evaluation methods have been widely employed in rock mass classification research. For instance, Wu et al. [7] established a probability-based rock mass quality classification model by combining Monte Carlo simulation with the ideal-point method. Dai et al. [8] developed a comprehensive evaluation model for classifying the rock mass quality of the roadways at the Sanshandao gold mine by combining the entropy weighting method, ideal point method, and gray relational analysis. Wang and Guo [9] proposed a method for classifying rock mass quality based on an improved cloud model, effectively quantifying qualitative evaluation indicators. Wu et al. [10] established a slope rock mass quality classification model based on interval continuous mathematics. Furthermore, Fan et al. [11] developed a rock mass classification model that combines subjective and objective weighting with topological theory, and obtained classification results at the Kunyang phosphate mine superior to both RMR and Q-system. Although fuzzy comprehensive evaluation methods have enriched rock mass quality assessment research, their performance still depends strongly on prior knowledge, manually designed membership functions, and subjective weighting schemes. Moreover, these methods generally exhibit limited self-learning capability and poor adaptability when dealing with high-dimensional nonlinear geological data.

In recent years, owing to the accelerated development of computer technology, artificial intelligence (AI) has become a powerful tool for addressing nonlinear, multi-parameter, and uncertainty-related problems in mining and geotechnical engineering. Compared with empirical approaches and fuzzy comprehensive evaluation methods, the most distinctive advantage of AI is its ability to learn and build intelligent classification models from large amounts of rock mass quality data. For instance, Liu et al. [12] and Santos et al. [13] applied support vector machines and artificial neural networks, respectively, to construct rock mass quality classification models from datasets containing 25 and 30 cases. Santos et al. [14] further compared the performance of four different intelligent algorithms in rock mass classification. They demonstrated that the ensemble-learning-based random forest algorithm achieves the best classification performance. Considering the advantages of ensemble learning, Sheng et al. [15] combined stacking strategies with deep neural networks to develop an ensemble model for classifying the slope rock mass quality from a dataset containing 310 cases, and achieved accurate classification results at a disused quarry. To further improve classification efficiency and accuracy, Wang et al. [16] compiled a dataset of 266 cases from the Deziwa open-pit mine. Based on this dataset, they established an ensemble classification model integrating comprehensive weight analysis, the equilibrium optimizer, and adaptive gradient boosting. Although these intelligent models have shown promising performance in rock mass quality classification, their applicability remains challenging. For instance, support vector machines are sensitive to parameter settings, neural networks generally require large-scale datasets to avoid overfitting, and boosting-based ensemble methods may exhibit reduced robustness when handling noisy or incomplete data.

In addition, the performance of intelligent models is highly sensitive to hyperparameter settings. Inappropriate configurations not only increase computational cost but also degrade classification accuracy [17]. Therefore, to achieve optimal performance, it is essential to employ suitable optimization algorithms for hyperparameter tuning. In this regard, Li and Wang [18] combined particle swarm optimization with support vector machines to achieve accurate classification of the rock mass quality of highway slopes. Hu et al. [19] applied particle swarm optimization, genetic algorithms, and gray wolf optimization to optimize support vector machine models for classifying rock mass quality at the Chambishi copper mine, demonstrating that gray wolf optimization significantly enhances classification performance across optimal parameter combinations. Yang et al. [20] further improved the classification accuracy of extreme gradient boosting for underground rock mass quality assessment by adopting an advanced zebra optimization algorithm. Nevertheless, several challenges remain unresolved. First, many intelligent models are highly sensitive to incomplete, noisy, or imbalanced geological datasets, which are common in practical engineering investigations. Second, conventional machine-learning algorithms often struggle to effectively capture complex feature interactions and uncertainty characteristics among geological parameters. Third, deep neural network-based methods generally require large-scale labeled datasets and extensive hyperparameter tuning, which may limit their applicability in small-sample geotechnical engineering scenarios.

To address the above shortcomings, this study proposes a hybrid intelligent classification framework integrating the Deep Forest (DeepForest) model with three metaheuristic optimization algorithms—Brown Bear Optimizer (BBO), Tuna Swarm Optimizer (TSO), and Sparrow Search Algorithm (SSA)—for slope rock mass quality classification. The proposed framework aims to improve classification robustness under incomplete and imbalanced datasets, enhance nonlinear feature interaction learning, and optimize model hyperparameters for small-sample geotechnical engineering conditions. DeepForest is an advanced hybrid model that combines the predictive advantages of both neural networks and random forests. Compared with other neural networks and ensemble-learning algorithms, the unique cascaded forest architecture effectively reduces the dependence of DeepForest on hyperparameter tuning and large-scale datasets. This makes DeepForest particularly suitable for small-sample, high-dimensional, and uncertainty-related geotechnical engineering problems. The remaining sections of this paper are outlined below. Section 2 describes the DeepForest model, BBO, TSO, and SSA optimization algorithms in detail. Section 3 introduces the dataset for classifying slope rock mass quality and presents the corresponding data analysis. Section 4 demonstrates the metrics for evaluating performance and the development process of the model. Section 5 evaluates the performance of the proposed model and verifies its applicability through practical engineering applications. Section 6 summarizes the principal conclusions and discusses the limitations of this study along with potential directions for future research.

2. Methodology

2.1. Deep Forest Algorithm

The deep forest (DeepForest) is a deep-learning algorithm based on random forests. It utilizes a multi-layer cascade structure (each layer comprising multiple random forests) to progressively extract features, ultimately outputting classification or regression results [21]. The core architecture of DeepForest primarily comprises two components: multi-grained scanning and a cascade forest. As a novel ensemble-learning algorithm combining random forests and deep neural networks, DeepForest first transforms the original input features through multi-grained scanning to strengthen their representational capacity. Subsequently, it employs a cascade structure for layer-by-layer representation learning, thereby effectively improving the classification or regression performance of the model. A schematic diagram of the DeepForest algorithm is shown in Figure 1.

For the front-end multi-grained scanning structure, the DeepForest algorithm employs a sliding window to divide the feature vectors of the raw data into multiple subsequences or sub-regions, where each subsequence represents data fragments at different scales. These subsequences or sub-regions obtained through the sliding window generate feature vectors of different dimensions, which depend on the window size and the feature extraction method. Obviously, smaller windows generate lower-dimensional feature vectors, while larger windows generate higher-dimensional feature vectors. In this way, data statistical characteristics over a longer time range can be reflected. Through this multi-grained scanning structure, the DeepForest algorithm can comprehensively utilize information from different scales and time spans, thereby gaining a more comprehensive understanding of the complexity and dynamics of the modeling data.

For the back-end cascade forest structure, DeepForest adopts a multi-level decision-making framework to further improve the performance and robustness of the model. In a cascade forest, each layer of the random forest consists of multiple decision trees, each of which is constructed on a random feature space and data subset. At the same time, the output from the previous layer is used as the input for the next layer. This allows deeper trees to learn based on higher-level feature combinations, thereby capturing more complex data patterns and relationships. Through this multi-level ensemble learning, which is structurally similar to deep neural networks, the cascade forest can progressively extract and integrate abstract feature representations of the data, endowing the overall model with stronger generalization ability and prediction accuracy. It is worth noting that this hierarchical design significantly improves the model’s ability to handle complex data and to cope with noise and the curse of dimensionality.

2.2. Brown Bear Optimizer

The brown bear optimizer (BBO) is a metaheuristic optimization algorithm based on natural behavior proposed by Prakash et al. [22]. It is inspired by observations of communication patterns among brown bears, particularly their pedal scent marking and sniffing behaviors. The main characteristics of this behavior include maintaining gait, careful stepping, and twisting footsteps.

In the BBO algorithm, different groups of brown bears inhabiting the same territory are regarded as individual solution sets of the population, and the pedal scent marking generated by each group is treated as a decision variable within each solution set. The territory of the brown bears is regarded as the search space for the problem. During initialization, different groups are randomly generated within the territory of brown bears and assigned a specific amount of pedal scent marking. The markings of different groups have unique characteristics and are retained in their respective territories. The scope of territory is defined by the decision variable boundaries of the corresponding problem. The mathematical expression for the random initialization of brown bear groups is as follows:

P_{i, j} = P_{i, j}^{m i n} + λ \cdot (P_{i, j}^{m a x} - P_{i, j}^{m i n})

(1)

where

P_{i, j}

is the j-th pedal scent marking of the i-th brown bear group, and

λ

is a uniformly distributed random number in the range

[0, 1]

. This initialization strategy ensures that candidate solutions are uniformly distributed within the search space, thereby improving population diversity and reducing the risk of premature convergence in early iterations.

In most cases, only male individuals exhibit the behavior of pedal scent marking. To simplify the problem, the number of male individuals in each group is set to 1. This gives the male member of each group a unique gait when walking. Therefore, the pedal scent markings produced by the male brown bears in each group exhibit distinct characteristics. Assuming that the behavior of pedal scent marking based on the unique gait will continue until one-third of the total number of iterations

N_{t}

. A mathematical model of this process can be described as follows:

P_{i, j, k}^{n e w} = P_{i, j, k}^{o l d} (θ_{k} \cdot α_{i j, k} P_{i, j, k}^{o l d})

(2)

θ_{k} = \frac{C_{t}}{N_{t}}

(3)

where

α_{i j, k}

is a uniformly distributed random number in the range

[0, 1]

;

θ_{k}

is the occurrence factor for the k-th iteration, which increases linearly with the number of iterations; and

C_{t}

is the current iteration number. In this stage, the update mechanism emphasizes exploration by amplifying individual differences among solutions, which helps the algorithm explore a wider search space and avoid early stagnation in local optima.

Between one-third and two-thirds of the total number of iterations, the pedal scent markings of the brown bear are updated according to the characteristics of careful stepping. This is primarily to enhance the behavior of pedal scent markings. A mathematical model of this process can be described as follows:

P_{i, j, k}^{n e w} = P_{i, j, k}^{o l d} + F_{k} \cdot (P_{j, k}^{b e s t} - L_{k} \cdot P_{j, k}^{w o r s t})

(4)

F_{k} = β_{1, k} \cdot θ_{k}

(5)

L_{k} = r o u n d (1 + β_{2, k})

(6)

where

β_{1, k}

and

β_{2, k}

are uniformly distributed random numbers in the range

[0, 1]

;

F_{k}

is the step factor for the k-th iteration; and

L_{k}

is the step length for the k-th iteration. This stage introduces a balance between the best and worst solutions, enabling the algorithm to exploit promising regions while maintaining population diversity. As a result, the search process gradually shifts from global exploration to local refinement.

From two-thirds of the iterations to the final stage, the pedal scent markings of the brown bear are updated according to the characteristics of twisting footsteps. This is to further establish more durable pedal scent markings. At the same time, these markings will also be utilized to create scent maps by other members of the group. A mathematical model of this process can be described as follows:

P_{i, j, k}^{n e w} = P_{i, j, k}^{o l d} + ω_{i, k} \cdot (P_{j, k}^{b e s t} - |P_{i, j, k}^{o l d}|) - ω_{i, k} \cdot (P_{j, k}^{w o r s t} - |P_{i, j, k}^{o l d}|)

(7)

ω_{i, k} = 2 π \cdot θ_{k} \cdot γ_{i, k}

(8)

where

ω_{i, k}

represents the k-th twist angular velocity for the k-th iteration, and

γ_{i, k}

is a uniformly distributed random number in the range

[0, 1]

. This mechanism further enhances local exploitation capability by intensifying search around high-quality solutions, while simultaneously using information from inferior solutions to escape suboptimal regions.

Sniffing behavior is common among members of every brown bear group. By sniffing pedal scent markings, they can communicate with one another and move within their territory. To move, brown bears begin sniffing randomly selected pedal marks within the territory. Then they will move towards the pedal scent markings of their own group, ignoring those of the others. The sniffing behavior selects two random candidate solutions and updates the movement process of brown bears using the following mathematical model:

P_{m, j, k}^{w, m} = \{\begin{matrix} P_{m, j, k}^{w, k} + λ_{j, k} \cdot (P_{m, j, k}^{w, k} - P_{n, j, k}^{w, k}), f (P_{n, k}^{w, k}) < f (P_{n, k}^{w, k}) \\ P_{m, j, k}^{w, k} + λ_{j, k} \cdot (P_{n, j, k}^{w, k} - P_{m, j, k}^{w, k}), f (P_{n, k}^{w, k}) < f (P_{n, k}^{w, k}) \end{matrix}

(9)

where

λ_{j, k}

is a uniformly distributed random number in the range

[0, 1]

. This operation enables information exchange between candidate solutions, allowing individuals to move toward better-performing solutions while preserving stochastic exploration behavior. This helps improve convergence stability and prevents premature convergence. Finally, the BBO optimizer achieves an effective balance between exploration and exploitation through staged behavioral transitions.

2.3. Tuna Swarm Optimizer

The tuna swarm optimizer (TSO) is a metaheuristic optimization algorithm based on population behavior proposed by Xie et al. [23]. It is inspired by observations of two types of foraging behavior in tuna: spiral foraging and parabolic foraging. The basic idea is to treat each individual in the population as a tuna. Each tuna searches for the optimal solution through its own foraging strategy, while also being influenced by other tuna’s foraging. In each iteration of the algorithm, each tuna adjusts its position based on its own fitness and the fitness of surrounding tuna, thereby better adapting to the environment and finding the optimal solution.

Similar to most metaheuristic algorithms based on population behavior, TSO initializes the population by randomly generating individuals within the search space, which can be mathematically expressed as follows:

p_{i}^{n} = r a n d \cdot (u b - l b) + l b, i = 1, 2, 3, \dots, N

(10)

where rand is a random number in the range

[0, 1]

;

u b

and

l b

are the upper and lower limits of the search space, respectively;

p_{i}^{n}

is the initialization value of the tuna swarm; and N is the population size.

Spiral foraging is the first foraging strategy of the tuna swarm. When tuna swarms feed, they first swim in a spiral shape, then drive prey into shallow waters. This is because prey in shallow waters are easier to catch. The specific mathematical model is as follows:

X_{i}^{t + 1} = \{\begin{matrix} \{\begin{matrix} α_{1} \cdot (X_{b e s t}^{t} + β \cdot |X_{b e s t}^{t} - X_{i}^{t}|) + α_{2} \cdot X_{i}^{t}, i = 1; \\ α_{1} \cdot (X_{b e s t}^{t} + β \cdot |X_{b e s t}^{t} - X_{i}^{t}|) + α_{2} \cdot X_{i - 1}^{t}, i = 2,3, \dots, N \end{matrix}, r a n d < \frac{t}{t_{m a x}} \\ \{\begin{matrix} α_{1} \cdot (X_{r a n d}^{t} + β \cdot |X_{r a n d}^{t} - X_{i}^{t}|) + α_{2} \cdot X_{i}^{t}, i = 1 \\ α_{1} \cdot (X_{r a n d}^{t} + β \cdot |X_{r a n d}^{t} - X_{i}^{t}|) + α_{2} \cdot X_{i - 1}^{t}, i = 2,3, \dots, N \end{matrix}, r a n d \geq \frac{t}{t_{m a x}} \end{matrix}

(11)

where

X_{i}^{t + 1}

denotes the position of the individual at the (t + 1)-th iteration;

X_{b e s t}^{t}

denotes the optimal position of the current individual;

X_{r a n d}^{t}

denotes the position of the random individual; and

α_{1}

and

α_{2}

represent the weight coefficients controlling the movement of the individual towards the optimal individual and the previous individual, respectively. Their formulations are given in Equation (12):

\{\begin{matrix} α_{1} = a + (1 - a) \times t / t_{m a x} \\ α_{2} = (1 - a) - (1 - a) \times t / t_{m a x} \end{matrix}

(12)

where

a

is a constant representing the coefficient of the degree to which the tuna follows the optimal individual and the previous individual during the initial stage;

t

represents the current number of iterations;

t_{m a x}

represents the maximum number of iterations; and

β

is the spiral factor, representing the extent to which an individual moves towards a random individual or an optimal individual. The specific formulation of

β

is given as follows:

\{\begin{matrix} β = e^{b l} \cdot \cos (2 π b) \\ l = e^{3 \cos (((t_{m a x} + 1 / t) - 1) π)} \end{matrix}

(13)

where

b

is a random number in the range

[0, 1]

. The tuna swarm improves its search capability for the space surrounding the prey through spiral foraging.

According to the first case in Equation (11), when all tunas are spiraling around their prey, they will seek the optimal position in the hunting space. However, when the optimal position fails to capture prey, blind following will reduce the feeding efficiency of the tuna swarm. Therefore, to enhance the global search capability of the tuna swarm, a random position is introduced as a search point for spiral foraging, as described in the second case of Equation (11). With increasing numbers of iterations, the random position will progressively transform into the optimal position, and the search capability and accuracy of the TSO algorithm will improve significantly.

Parabolic foraging is the second foraging strategy of the tuna swarm. When tuna swarms feed, they swim in a parabolic shape to capture their prey. Additionally, tuna also conduct local searches within their activity area to discover potential food sources. The specific mathematical model is as follows:

X_{i}^{t + 1} = \{\begin{matrix} X_{b e s t}^{t} + r a n d \cdot (X_{r a n d}^{t} - X_{i}^{t}) + T F \cdot p^{2} \cdot (X_{r a n d}^{t} - X_{i}^{t}), & r a n d < 0.5 \\ T F \cdot p^{2} \cdot X_{i}^{t}, & r a n d \geq 0.5 \end{matrix}

(14)

where

T F

is a random number with a value of −1 or 1, which controls the direction of individual position updates; and

p

is an adjustment coefficient, which controls the magnitude of individual position updates.

Through the cooperation of the two aforementioned foraging strategies, the tuna swarm constantly updates individual positions until the stopping condition is satisfied. During each iteration, tuna updates the optimal positions of individuals according to their current location and fitness values, including historical optimums and global historical optimums. At last, the position and fitness value of the optimal individual are returned.

2.4. Sparrow Search Algorithm

The sparrow search algorithm (SSA) is a metaheuristic optimization algorithm based on sparrow predatory behavior proposed by Xue et al. [24]. It categorizes sparrow populations into discoverers, followers, and scouts. Each role exhibits different biological behaviors based on its own state and the external environment. Due to its advantages of simplicity, ease of implementation, and few control parameters, SSA has been widely applied in various optimization problems.

Assume there are

N

sparrows foraging in the search space, with the upper and lower bounds denoted by ub and lb, respectively, and the dimensionality of the space is defined as

D

. The position of the sparrow can be represented as

x_{i} = (x_{i 1}, x_{i 2}, . . ., x_{i D}) (i = 1, 2, . . ., N)

, and the foraging ability (fitness) of the i-th sparrow is expressed as

f (x_{i})

. Based on the fitness of each sparrow, the population can be divided into discoverers and followers, with quantities represented as

P D

and

N - P D

, respectively. Discoverers lead the foraging direction of the population, while followers follow the discoverers to forage. Their position update rules are given in Equations (15) and (16), respectively. Sparrows switch between these two behaviors based on their own fitness.

x_{i, j}^{t + 1} = \{\begin{matrix} x_{i, j}^{t} \cdot \exp (\frac{- i}{α \cdot T}), R_{2} < S T \\ x_{i, j}^{t} \cdot \exp (\frac{- i}{α \cdot T}), R_{2} \geq S T \end{matrix}

(15)

where

x_{i, j}

represents the i-th sparrow in the j-th dimension,

j = 1, 2, \dots, D

;

t

and

T

represent the number of iterations and the maximum number of iterations, respectively; α is a random number in the range

[0, 1]

;

R_{2}

and

S T

are the warning value and safety threshold in the ranges

[0, 1]

and

[0.5, 1]

, respectively. When

R_{2} < S T

, the current sparrow population has not perceived danger, and discoverers continue to search based on the current position. When

R_{2} \geq S T

, the current sparrow flock perceives danger, and discoverers lead the population to move randomly to avoid danger.

x_{i, j}^{t + 1} = \{\begin{matrix} Q \cdot \exp (\frac{x_{w o r s t}^{t} - x_{i, j}^{t}}{i^{2}}), i > \frac{N}{2} \\ x_{P}^{i + 1} + |x_{i j}^{t} - x_{P}^{i + 1}| \cdot A^{+} \cdot L, o t h e r w i s e \end{matrix}

(16)

where

Q

is a random number following a normal distribution;

x_{w o r s t}^{t}

denotes the worst position of the sparrow population at the t-th iteration;

x_{P}^{i + 1}

denotes the optimal position of the discoverer at (t + 1)-th iteration;

A

and

L

are both

1 \times D

dimensional matrices. All elements in matrix

L

are 1, and elements in matrix

A

are randomly assigned 1 or −1:

A^{+} = A^{T} \cdot (A \cdot A^{T})^{- 1}

. When

i > N / 2 (i = P D + 1, \dots, N)

, the current follower is considered to have low fitness and should fly elsewhere to forage; otherwise, the current follower follows the sparrow positioned at the optimal location to forage.

Randomly selected scouts are responsible for detecting the surrounding environment and adjusting their positions to avoid danger. The number of scouts can be represented as

S D

, and their position update is governed by the following formula:

x_{i, j}^{t + 1} = \{\begin{matrix} x_{i, j}^{t} + K \cdot (\frac{|x_{i, j}^{t} - x_{w o r s t}^{t}|}{(f_{i} - f_{w}) + ε}), f_{i} = f_{g} \\ x_{b e s t}^{t} + β \cdot |x_{i, j}^{t} - x_{b e s t}^{t}|, f_{i} > f_{g} \end{matrix}

(17)

where

K

is a random number in the range

[- 1, 1]

;

f_{i}

represents the fitness of the sparrow

x_{i}

;

f_{w}

and

f_{g}

represent the fitness of the sparrow population at the optimal and worst positions, respectively;

ε

is the smallest constant that ensures the denominator is not zero;

x_{b e s t}^{t}

denotes the optimal position of the sparrow population at the t-th iteration; and

β

is a normally distributed random number in the range

[0, 1]

. When

f_{i} = f_{g}

, the sparrows located at the center move toward the peripheral sparrows. When

f_{i} > f_{g}

, the sparrows at the periphery move toward the center to reduce the likelihood of encountering danger.

2.5. Hybrid Optimization Model Based on DeepForest

The objective of this study is to optimize the DeepForest model for classifying slope rock mass quality. For this purpose, three metaheuristic optimization algorithms named BBO, TSO, and SSA were employed to find the optimal hyperparameter combination of the DeepForest model, resulting in the BBO-DeepForest, TSO-DeepForest, and SSA-DeepForest models, respectively. Although these algorithms differ significantly in their metaheuristic principles, their cores are all related to swarm intelligence behavior. Consequently, their optimization procedures for the DeepForest model follow a consistent framework. The specific optimization flow is illustrated in Figure 2.

(1): Data preprocessing: Rock mass data are collected from real geotechnical engineering projects. Randomly split training and test sets, ensuring that all three models are trained and evaluated on identical data partitions. Process the training set, including imputing missing values, removing outliers, and balancing the categories.
(2): Hyperparameter selection and initialization: The hyperparameters of the DeepForest model and their corresponding search ranges are defined. In addition, the population size and the maximum number of iterations for the BBO, TSO, and SSA algorithms are specified.
(3): Iterative optimization: An appropriate loss function is established to assess the classification performance of the model during the optimization process. During each iteration, the optimization algorithms generate candidate hyperparameter combinations, which are subsequently used to construct and train the corresponding DeepForest model. The classification performance of the DeepForest model on the validation set is then returned to the optimization algorithms as the fitness value for updating the population. The optimal hyperparameters are determined by minimizing the loss value over successive iterations.
(4): Optimal model output: Once the termination condition of the iterative process is satisfied, the optimal model with the best combination of hyperparameters is obtained. An independent test set is subsequently employed to assess the performance of the model in slope rock mass classification.

3. Data

3.1. Data Collection and Description

Over the past few decades, numerous rock mass quality classification cases with different rock structures and stability conditions have been recorded. To develop a DeepForest model for intelligently classifying slope rock mass quality, a large number of cases were collected from rock slopes in open-pit mines, transportation highways, and hydropower projects [25,26]. In addition, to expand the rock mass quality classification database, some non-slope rock mass quality samples from underground mines and deep-buried tunnels were also incorporated [19,27,28,29,30]. Due to the fact that the collected engineering cases originated from different regions, projects, and investigation conditions, the original measurement records could not be retrospectively accessed. Therefore, it was difficult to re-obtain raw rock specimens, unify laboratory testing standards, or reconstruct field investigation procedures for each case. To minimize data heterogeneity and potential inconsistencies, several basic quality-control principles were strictly followed during dataset construction, including unit normalization, variable-name standardization, unified category encoding, and range verification of engineering parameters. In addition, samples with physically unreasonable or obviously contradictory parameter values reported in the original literature were removed. Only cases explicitly associated with rock mass quality classification parameters and complete classification labels were retained in the final dataset. However, some collected cases still contained missing values. This missingness was primarily caused by differences in reporting schemes and research focuses among studies, rather than by physical measurement limitations. Therefore, the missing mechanism was considered to be more consistent with the Missing at Random (MAR) assumption in statistical learning. This issue will be addressed during subsequent data preprocessing.

The constructed database consists of 204 data groups, each containing six feature parameters: UCS, RQD, Sd, rock mass integrity coefficient (Kv), groundwater conditions (W), and site type (St). These feature parameters well reflect characteristics such as rock strength, degree of rock fragmentation, development degree of rock mass structural planes, and the weakening effect of groundwater on the rock mass. They not only comprehensively characterize the true quality of the rock mass, but are also easy to obtain in practical engineering.

Rock mass quality is categorized into five grades according to the RMR system: very good (Grade I), good (Grade II), fair (Grade III), poor (Grade IV), and very poor (Grade V). The detailed classification criteria are provided in Table 1. It is worth noting that, except for groundwater conditions, all other feature parameters are quantitative indicators. This is because groundwater conditions are represented quantitatively in tunnels but qualitatively in slopes. To ensure database consistency and rationality, groundwater conditions were mapped using a scoring system.

Figure 3 shows the proportion of total rock mass quality samples and their grades in this study. It is evident that the dataset is imbalanced. The proportion of Grade III rock mass quality samples is relatively high, while the proportion of Grade I rock mass quality is relatively low. Descriptive statistics of the rock mass quality dataset are summarized in Table 2. The maximum and minimum values measure the variation range of feature parameters; the mean and median describe the concentration of data; the standard deviation describes the dispersion of the data; skewness and kurtosis reflect the distribution of the dataset; and the count reflects the completeness of the feature parameters. As shown in Table 2, there are 143 and 23 missing values for the parameters Sd and Kv, respectively. In machine learning, missing data means incomplete information, which can hinder training and degrade predictive performance. Therefore, the problems of imbalance and missing values in the rock mass quality dataset will be further addressed in the subsequent work.

3.2. Data Analysis

The scatter matrix of the six feature parameters in the rock mass quality dataset is shown in Figure 4. The diagonal elements represent the distribution of each feature parameter corresponding to the five rock mass quality grades. A larger distribution area indicates a stronger relationship between the parameter and rock mass quality. Several feature parameters exhibit clear linear relationships, suggesting that careful consideration is required in feature selection.

Figure 5 presents the correlation between feature parameters and the target variable. It can be seen that UCS and Kv show a strong positive correlation, with the maximum correlation coefficient of 0.68; RQD and Sd show a strong negative correlation, with the minimum correlation coefficient of −0.38. In addition, all six feature parameters show significant negative correlations with rock mass quality grade. The higher the parameter value, the poorer the rock mass quality. Overall, the correlation coefficients among the six feature parameters are relatively moderate, and they can be used for subsequent rock mass classification.

3.3. Data Preprocessing

Considering the missing values and imbalance in the rock mass classification dataset, stratified random sampling was utilized to split the dataset into a training set (80%) and a test set (20%). This approach preserves the original class distribution within each subset, ensuring better representativeness. The training set is utilized for developing models and optimizing hyperparameters, while the testing set is employed for assessing model performance. The training set and test set are independent of each other and have no intersection.

To preserve existing data as much as possible and avoid destroying the distribution structure of the dataset, multivariate imputation by chained equations (MICE) [32] was adopted to estimate missing values in the training and test sets. MICE iteratively estimates missing values by treating each feature containing missing values as a target of other features. It can fully utilize the available information in the dataset to minimize estimation bias, thereby avoiding distortion of the original data distribution. The data distributions before and after imputation using MICE on the training set are shown in Figure 6. It can be observed that the complete dataset after imputation has a probability distribution that is nearly consistent with that of the missing dataset before imputation.

For machine learning, not only missing values but also outliers can significantly impair model performance. Identifying and handling potential outliers is a critical step in data preprocessing. Therefore, after imputing missing values, the isolation forest (IsoForest) algorithm [33] was used to detect and handle outliers. IsoForest isolates observations by randomly selecting features and splitting feature values, without considering any distance- or density-based discrimination thresholds. The results of outlier detection using the default parameters of the IsoForest on the training set are shown in Figure 7.

According to Figure 7, 43 samples were identified as outliers, including all four Class I samples. This result was considered unreasonable because excessive removal of samples may destroy the original distribution structure of the dataset, particularly for minority classes. On the contrary, moderate outlier samples may reflect real engineering variability and can contribute to improving the robustness of machine-learning models. Therefore, a further sensitivity analysis was conducted on the contamination parameter of IsoForest, which controls the expected proportion of outliers in the dataset. It should be emphasized that Class I samples are excluded from outlier detection. This is primarily to avoid excessive removal of minority-class samples and preserve the original class structure of the training dataset. The sensitivity analysis results are shown in Figure 8. It can be observed that removing more outlier samples did not lead to a consistent improvement in model performance. However, their accuracy on the test set remained higher than that of the baseline model. Therefore, considering the balance between model performance and preserving the original data distribution, a relatively conservative contamination level of 0.03 was selected. Under this setting, only five samples were removed from the training dataset, and the remaining 158 samples were retained for subsequent model construction.

It is well recognized that the accuracy and reliability of machine-learning models are sensitive to the size and quality of the training dataset. When trained on imbalanced datasets, models tend to be biased toward the majority class, leading to poor classification performance on minority classes. Therefore, the synthetic minority over-sampling technique (SMOTE) [34] was employed to balance the training set. Unlike simple duplication, SMOTE generates new synthetic samples for the minority class based on the k-nearest neighbors within the local feature space. After applying SMOTE, the number of training samples increased from 158 to 340. Furthermore, prior to model development, all features were normalized to minimize the influence of differences in feature scales across parameters.

4. Model Development

4.1. Evaluation Metrics

Performance evaluation is a crucial step in the process of model development. Appropriate evaluation methods facilitate a comprehensive insight into model performance and ensure robustness and generalization capability in rock mass quality classification. In this study, five class labels were defined as the output categories for predicting rock mass quality. Therefore, five suitable metrics were selected to evaluate the classification performance of the developed models, including accuracy, precision, recall, F1-score, and receiver operating characteristic (ROC) curve. Among these five metrics, accuracy is utilized to assess the overall classification performance of the model. Precision is utilized to assess the local classification performance of the model for each class. Recall represents the proportion of correctly classified samples within a given class and reflects the capability of the model to identify positive samples. The F1-score is the harmonic average of precision and recall, and can provide a balanced assessment of overall classification performance. The ROC curve is utilized to assess the overall classification performance of the model at different thresholds. Meanwhile, the area under the ROC curve (AUC) directly reflects the average classification capability over all possible thresholds. All five metrics are calculated from the confusion matrix illustrated in Figure 9. The higher the metric value, the better the classification performance of the model. If their values reach 1, the model achieves the highest classification accuracy.

4.2. Model Construction

Before developing the DeepForest model, it is necessary to optimize its hyperparameters. Hyperparameter optimization can significantly enhance the prediction performance and generalization ability of the model. As an ensemble-learning algorithm that combines neural networks and random forests, DeepForest demonstrates excellent performance in handling complex data. Therefore, this study employs three meta-heuristic optimization algorithms to develop DeepForest models, including BBO, TSO, and SSA.

Since DeepForest incorporates both neural network and random forest architectures, its hyperparameter configuration should be considered comprehensively. In this regard, several critical hyperparameters of the DeepForest model were selected as optimization targets, including max_layers, n_estimators, n_trees, max_depth, min_samples_split, min_samples_leaf, criterion, and n_bins. Their optimization ranges and detailed explanation are provided in Table 3.

During the process of optimizing model hyperparameters employing BBO, TSO, and SSA algorithms, population size and number of iterations are crucial parameters. They jointly determine the runtime of the optimization. Larger population sizes and more iterations can improve exploration ability, but they also significantly raise the computational costs. In contrast, smaller values may cause premature convergence to local optima. Therefore, appropriate settings should be selected based on problem complexity and available computational resources. Considering these factors, six population sizes (20, 40, 60, 80, 100, and 120) and 100 iterations were ultimately adopted to optimize the DeepForest model. Due to the limited number of Class I samples in the training set, the average logarithmic loss obtained from a three-fold cross-validation was used as the objective function. A lower average loss indicates higher classification accuracy. In addition, to avoid potential data leakage during hyperparameter optimization, SMOTE was moved to each cross-validation fold. The iterative convergence processes of the three optimization algorithms applied to the DeepForest model are illustrated in Figure 10. It can be observed that larger population sizes lead to faster convergence. After approximately the 60th iteration, the average loss values of most models stabilized and reached their optimal states.

As shown in Figure 10, the BBO, TSO, and SSA algorithms achieve the lowest average loss values during iteration at population sizes of 60, 80, and 80, respectively. This indicates that, under these configurations, the model can achieve the highest classification accuracy while maintaining computational efficiency. Therefore, the selected population sizes for BBO, TSO, and SSA are 60, 80, and 80, respectively, with the maximum number of iterations set to 60 for all three algorithms. The results of hyperparameter optimization for DeepForest using BBO, TSO, and SSA are presented in Table 4. At this stage, we have completed the development of the BBO-DeepForest, TSO-DeepForest, and SSA-DeepForest models.

5. Result and Discussion

5.1. Model Evaluation

After obtaining optimal hyperparameter combinations for the DeepForest model, an independent test set was used to assess the classification performance of the following models: BBO-DeepForest, TSO-DeepForest, SSA-DeepForest, and DeepForest without hyperparameter optimization. The confusion matrices of four models on the test set are presented in Figure 11. The confusion matrix is a reliable and intuitive visualization tool for classification results, where the values on the diagonal represent correctly classified samples, and those on the off-diagonal represent incorrectly classified samples. The unoptimized DeepForest model performs significantly worse at classifying Grade II and Grade III rock mass quality than the optimized models. In contrast, the BBO-DeepForest model achieves high classification accuracy for these two grades.

The accuracy, precision, recall, and F1-score, calculated from confusion matrices, are summarized in Table 5. The unoptimized DeepForest model exhibits the poorest performance, with accuracy, precision, recall, and F1-score values of 0.780, 0.629, 0.622, and 0.624, respectively. In contrast, the optimized models achieve improved performance across all evaluation metrics. Overall, BBO-DeepForest demonstrates the best classification performance among the three optimized models, obtaining the highest values for all four metrics (accuracy: 0.878, precision: 0.682, recall: 0.678, and F1-score: 0.678). Nevertheless, all four models showed limited classification capability for Grade I samples, which may be attributed to the extremely limited number of Grade I cases available in the dataset (only five samples). Consequently, the ranking scores of all models are further presented in Figure 12. The results indicate that SSA-DeepForest outperforms TSO-DeepForest due to its higher score. Ultimately, BBO-DeepForest exhibited the best classification performance with the highest ranking score of 16.

To evaluate the classification performance of the models more comprehensively, Figure 13 presents the ROC curves of the four models along with their corresponding AUC values. For ROC curves, the larger the distance between the curve and the diagonal line, the higher the AUC value, and the better the model performance. According to Figure 13, all models achieve average AUC values exceeding 0.9. Particularly, the BBO-DeepForest model, which achieved the highest AUC values for rock mass classification (with AUC values of 1.0, 0.974, 0.941, 0.900, and 0.989 for Grades I, II, III, IV, and V, respectively). The TSO-DeepForest and SSA-DeepForest models follow closely, with average AUC values of 0.959 and 0.960, respectively. The DeepForest model exhibits the lowest AUC (0.930). It further demonstrates that the classification performance of the unoptimized DeepForest is inferior to that of the optimized models, particularly BBO-DeepForest.

5.2. Model Comparison

In complex geological environments, collecting complete rock mass data is generally regarded as a severe challenge. Therefore, the proper handling of missing data becomes crucial for developing accurate and reliable classification models. Considering that feature Sd exhibited the highest missing value ratio (approximately 70%) in the constructed dataset, a sensitivity analysis was conducted to evaluate whether this heavily imputed variable could systematically influence the classification results. Specifically, the feature Sd was removed from the dataset, and the entire modeling procedure was repeated under the same experimental settings using the unoptimized DeepForest model, including data imputation, model training, and performance evaluation. The average results of 10 independent experiments are presented in Table 6.

The comparison results indicate that the overall prediction performance remains relatively stable after removing the high-missing-ratio variable. Compared with the model including Sd, the classification accuracy of the model without Sd increased only slightly from 0.798 to 0.817, while precision, recall, and F1-score also exhibited only marginal changes. These findings suggest that the proposed framework does not excessively rely on heavily imputed features and that the MICE-based imputation process did not introduce severe systematic bias into the final classification results.

To further demonstrate the rationality and reliability of the MICE method adopted in this study, it was compared with seven widely used imputation methods, including mean imputation, median imputation, k-nearest neighbors (KNN), MissForest, expectation maximization (EM), generative adversarial imputation networks (GAIN), and multiple imputation using denoising autoencoders (MIDA). DeepForest and BBO-DeepForest were selected as classifiers. To ensure a fair comparison, identical training and testing sets were used across all methods. IsoForest and SMOTE were also not performed on the training set. The comparison results are presented in Figure 14.

According to Figure 14, the imputed dataset using the MICE method helped the DeepForest and BBO-DeepForest models achieve the highest accuracy in rock mass classification. Moreover, by comparing the accuracy of BBO-DeepForest and DeepForest across different imputation methods, it can be observed that the BBO algorithm consistently enhances the classification performance of DeepForest. Notably, for the imputed dataset using the KNN method, the classification accuracy is improved by 9.76%. This further demonstrates the superiority of the BBO algorithm.

To further explain the above comparison results, an additional evaluation of imputation quality was conducted. Specifically, the root mean square error (RMSE) and Jensen–Shannon divergence (JSD) were adopted to quantitatively evaluate the imputation performance of different algorithms. RMSE was used to measure the deviation between imputed values and true values, while JSD was employed to evaluate the similarity between the probability distributions of the imputed and original datasets. Lower RMSE and JSD values indicate better imputation performance and distribution consistency, respectively. The comparison experiment was conducted using samples with complete observations. Specifically, portions of the observed values were randomly masked at missing ratios of 10%, 20%, and 30%, thereby generating pseudo-missing datasets with known ground truth. Then, different imputation algorithms were employed to estimate the missing values, and the imputed datasets were evaluated using RMSE and JSD. To ensure a fair and unbiased comparison, the average results of 10 independent runs were reported. Table 7 summarizes the imputation performance of different algorithms.

The results indicate that MICE achieved relatively lower imputation errors and better distribution consistency in most cases compared with the other imputation algorithms. In particular, under moderate and high missing ratios (20% and 30%), MICE consistently maintained competitive RMSE and JSD values, demonstrating its capability to provide relatively reliable estimations for missing geological parameters. These findings demonstrate that the superiority of MICE in the proposed framework is not only reflected in classification accuracy but also in the quality and distribution consistency of the imputed data.

Furthermore, to better illustrate the competitiveness of the proposed model relative to other machine-learning models, several advanced classification models were introduced for comparison, including SVM, artificial neural network (ANN), gradient boosting decision tree (GBDT), and extreme gradient boosting (XGBoost). To ensure a fair and robust comparison, repeated random sub-sampling validation was adopted. Specifically, the dataset was randomly divided into training (80%) and testing (20%) subsets ten times using different random seeds, while maintaining the same preprocessing strategy, including missing value imputation, outlier removal, data balancing, and normalization. The average and standard deviation values of accuracy, precision, recall, and F1-score were calculated to evaluate both classification performance and model stability. The hyperparameters of all models followed the default settings in the scikit-learn library. The comparative results are presented in Table 8.

As shown in Table 8, DeepForest achieved the best classification performance among all models, with an average accuracy of 80.5% and an F1-score of 76.1%. In contrast, SVM and ANN exhibited relatively lower predictive capability, indicating limited adaptability to the nonlinear and heterogeneous characteristics of the rock mass dataset. As representative boosting-based ensemble-learning models, XGBoost and GBDT have demonstrated competitive performance compared with traditional machine-learning models. In particular, XGBoost achieved an average accuracy of 77.3%, which was close to that of DeepForest. However, DeepForest still outperformed XGBoost in all evaluation metrics, especially in terms of macro-average F1-score and Recall, suggesting a stronger capability for handling imbalanced multi-class rock mass quality classification tasks. Furthermore, DeepForest exhibited relatively stable performance across repeated random experiments, with moderate standard deviation values in all metrics. This indicates that the cascade forest structure possesses good robustness and generalization potential under small-sample and heterogeneous engineering geological conditions. Overall, the proposed DeepForest model demonstrated stronger competitiveness compared with other classification models.

5.3. Ablation Study on External Feature St

The DeepForest model for classifying slope rock mass quality was developed using a collected dataset containing six feature parameters. In addition to five inherent rock mass parameters (UCS, RQD, Sd, Kv, and W), an external feature, St, was employed to distinguish between engineering scenarios (slopes and non-slopes). Generally, rock mass classification should be independent of the specific engineering scenario. The same rock mass should be classified into the same quality grade regardless of whether it occurs in tunnels, slopes, or other engineering environments. However, the dataset used in this study was compiled from multiple literature sources involving different engineering scenarios. In practical engineering, slope rock masses are commonly influenced by weathering, unloading effects, stress redistribution, and joint opening processes, which may lead to systematic differences in the statistical characteristics of slope and non-slope rock mass datasets. Consequently, St was incorporated as an external feature to help the model better capture the distribution heterogeneity among multi-source datasets.

To quantitatively evaluate the contribution of St, an ablation study was conducted. Specifically, the original training set (80%) obtained through stratified random sampling was divided into two versions: one including St and the other without St. The unoptimized DeepForest model was trained separately on these two datasets, while the independent testing set (20%) was used for performance evaluation. Accuracy, precision, recall, and F1-score were adopted as evaluation metrics. To ensure a fair and unbiased comparison, the average results of 10 independent runs were reported. The comparison results are presented in Table 9.

Table 9 shows that incorporating St can improve the classification performance of the model. Compared with the model without St, the model including St achieved improvements of 1.2%, 2.9%, 1.4%, and 2.4% in accuracy, precision, recall, and F1-score, respectively. These results indicate that although St is not an inherent parameter of the RMR system, it can serve as an effective auxiliary contextual feature for enhancing the generalization capability of intelligent classification models trained on multi-source datasets.

5.4. Model Explanation

Model interpretability is essential for developing rock mass quality classification models. In particular, feature importance analysis can help explain the contribution of input variables to the classification outcomes. To further investigate the significance of different features and their impact mechanisms in rock mass quality classification, the Shapley additive explanations (SHAP) method [35] was employed to capture the nonlinear relationships between features and the target variable. SHAP is a game theory-based interpretability method that effectively quantifies the marginal contribution of each input to the model output. From a statistical perspective, it reveals the mean marginal effect of appending one feature to other feature subsets. Mathematically, the SHAP value can be defined as follows:

ϕ_{i} = \sum_{S \subseteq N ∖ \{i\}} \frac{|S|! (p - |S| - 1)!}{p!} [v (S \cup \{i\}) - v (S)]

(18)

where

ϕ_{i}

is the SHAP value of feature i;

N

is the complete feature set;

S

is the feature subset excluding feature i; p is the total number of features; and

v (S)

is the model output value for the feature subset

S

.

Considering the superior performance of the BBO-DeepForest model, SHAP values were employed during the classification process to quantify the contributions of the six feature parameters: UCS, RQD, Sd, Kv, W, and St. The overall contribution of each input feature was evaluated by computing the mean absolute SHAP values, and SHAP analysis was further used to illustrate how these factors influence the classification output. The detailed results are presented in Figure 15 and Figure 16, respectively. The sign of the SHAP value signifies whether a feature has a positive or negative impact on the model prediction. The larger the absolute value of SHAP, the greater the impact of input features on the classification output.

According to Figure 15, RQD and UCS exhibit relatively high mean SHAP values, indicating that these parameters contribute substantially to the prediction behavior of the BBO-DeepForest model. Sd, W, and Kv also show noticeable contributions to the classification process. Although St presents comparatively lower average SHAP values, it still provides useful contextual information for distinguishing between slope and non-slope rock masses. These results suggest that the model prediction is strongly associated with multiple geological parameters, particularly RQD and UCS. However, due to the existence of correlations among several rock mass parameters, the SHAP values should be interpreted as model-specific feature attribution results rather than strict indicators of independent physical importance.

The analysis results in Figure 16 further illustrate the sensitivity of the BBO-DeepForest model to variations in geological parameters. Among all features, RQD and UCS exhibit comparatively higher SHAP contributions, indicating that the model prediction is more responsive to changes in these variables. Additionally, Sd demonstrates a noticeable influence on the classification results. In contrast, the SHAP distributions of W, Kv, and St are relatively more dispersed, suggesting comparatively weaker global attribution effects within the trained model. Overall, the SHAP analysis provides an interpretable description of the prediction behavior of the proposed model from a data-driven perspective. Nevertheless, because several geological parameters exhibit moderate to strong multicollinearity, the SHAP-based feature contributions should not be directly interpreted as independent geomechanical dominance or causal relationships.

To further evaluate the correlations among the geological parameters, variance inflation factor (VIF) analysis was conducted after missing value imputation, as shown in Table 10. The results indicated that several variables exhibited moderate to strong multicollinearity, which is expected because many rock mass parameters are intrinsically coupled in geological environments. Since the proposed BBO-DeepForest model is based on ensemble tree learning, the presence of correlated variables mainly affects the interpretation of feature attribution rather than the predictive capability of the classifier itself.

5.5. Engineering Verification

To verify the feasibility of the proposed model in slope rock masses, three independent engineering cases from the Luming Molybdenum Mine in Yichun, China, were employed for external validation. The validation dataset was completely excluded from model training, hyperparameter optimization, and cross-validation, and was used solely for independent testing of the trained models.

The Luming molybdenum mine is a large-scale mining project undertaken by China Railway Group in the Lesser Khingan Range. The mine is situated within the Luming Forest Farm of the Tieli Forestry Bureau in Heilongjiang Province, approximately 2 km northeast of the forest farm. The mining area covers about 4.6 km², with an exploitation depth ranging from an elevation of 640 m to 0 m. Based on the engineering geological conditions and the mechanical properties of the rock mass, the slope angle was determined to be 42°. Except for the loose rock group, the engineering geological rock units in the deposit consist predominantly of hard, massive monzonitic granite. The region has an average annual precipitation of 638 mm, with a maximum daily rainfall of 60.2 mm. The slope rock mass is influenced by weathering, tectonic activity, and unloading effects, leading to the well-developed presence of joints and fractures. Consequently, it is vital to accurately classify the slope rock mass quality to ensure slope stability and mining safety at the Luming molybdenum mine.

According to the investigation of slope stability conditions at the Luming molybdenum mine, rock mass quality at three bench levels in the eastern pit was selected for evaluation, as shown in Figure 17. To obtain accurate feature parameters, both field investigations and laboratory tests were conducted to characterize the slope rock mass in the eastern sector of the pit. Specifically, RQD, Sd, Kv, and W were obtained through core drilling, joint surveys, wave velocity testing, and seepage analysis, respectively. The UCS of the rock was determined via uniaxial compression tests on samples collected from the study area, and St was labeled as slope rock mass. The detailed feature parameters for each slope are listed in Table 11.

The classification results of the slope rock mass quality in the eastern open pit are shown in Table 12. It can be observed that the BBO-DeepForest model correctly classified the rock mass quality of all three bench levels as Grade IV, which is consistent with field observations. The DeepForest and SSA-DeepForest models exhibit similar performance, both misclassifying one Grade IV sample as Grade V. The TSO-DeepForest model also produces a misclassification, assigning one Grade IV sample to Grade III. In summary, all three optimized models proposed in this study demonstrate good feasibility and reliability in practical applications, particularly the BBO-DeepForest model.

5.6. Limitations

Despite the satisfactory performance of the proposed framework, there are still some limitations requiring further improvement:

First, the database was compiled from heterogeneous literature sources collected under different geological conditions, engineering scenarios, and investigation standards. Although basic quality-control procedures and parameter standardization were implemented during dataset construction, it remains difficult to completely eliminate the influence of source heterogeneity.

Second, several variables in the dataset exhibited relatively high missing ratios, particularly the parameter Sd. Although sensitivity analysis demonstrated that the proposed framework does not excessively rely on heavily imputed variables, uncertainty associated with missing-data mechanisms may still affect model robustness.

Third, although independent engineering cases from the Luming Molybdenum Mine were employed for external validation, the current validation dataset remains limited in scale and mainly consists of Grade IV samples. Therefore, the generalization capability of the proposed framework still requires further validation through broader geological environments or engineering scenarios.

Therefore, the proposed framework should be regarded as a feasibility study for intelligent rock mass quality classification under heterogeneous multi-source conditions. Future study will focus on establishing larger-scale databases with unified testing standards and conducting further engineering validation.

6. Conclusions

Rock mass quality classification is fundamental to slope stability analysis. AI technology has proven to be a reliable approach for accurately classifying slope rock mass quality. However, single classifiers often exhibit suboptimal performance when dealing with complex rock mass datasets, particularly in the presence of missing and imbalanced data. Therefore, developing more effective models is of great significance for slope rock mass quality classification. In this study, three hybrid models were developed by coupling metaheuristic optimization algorithms (BBO, TSO, and SSA) with the DeepForest model for classifying slope rock mass quality. The principal conclusions are presented as follows:

(1): According to performance evaluation, BBO-DeepForest was identified as the most satisfactory model on the independent test set, yielding accuracy, precision, recall, F1-score, and average AUC values of 0.878, 0.682, 0.678, 0.678, and 0.961, respectively.
(2): Compared with seven well-known imputation algorithms, including Mean, Median, KNN, MissForest, EM, GAIN, and MIDA, MICE can provide better imputation quality and help the DeepForest model achieve improved classification performance on rock mass quality datasets with missing values. In addition, SHAP-based model interpretation reveals that RQD and UCS are the most influential feature parameters for rock mass quality classification.
(3): Three independent engineering cases from the Luming molybdenum mine were employed for external validation of the proposed models. The results demonstrated that the BBO-DeepForest model is a feasible and better classification model, outperforming the baseline model and other optimized models in terms of accuracy.

Although the classification performance and external validation results are encouraging, the generalization capability of the proposed model still requires broader validation due to the limited external cases. In the future, we will collect more slope cases from diverse geological environments and engineering scenarios to further validate the generalization capability of the proposed model.

Author Contributions

Conceptualization, R.C. and D.L.; methodology, R.C. and J.S.; software, J.S.; validation, R.C., J.S., and J.C.; formal analysis, J.S.; investigation, D.L.; resources, D.L., J.C., and T.Z.; data curation, J.S.; writing—original draft preparation, R.C. and C.Z.; writing—review and editing, J.S. and D.L.; visualization, J.S.; supervision, D.L.; project administration, D.L. and J.C.; funding acquisition, D.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Luming Molybdenum Mine–University Industry Collaborative Project (grant number LM(2024)-F-108).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available upon request from the corresponding author.

Conflicts of Interest

Authors Rongjian, Jianfu Cao, and Tong Zhou were employed by the company China Railway Resources Group Co., Ltd. and Yichun Luming Mining Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Pantelidis, L. An alternative rock mass classification system for rock slopes. Bull. Eng. Geol. Environ. 2010, 69, 29–39. [Google Scholar] [CrossRef]
Terzaghi, K. Rock defects and loads on tunnel supports. In Rock Tunneling with Steel Supports; Commercial Shearing and Stamping Company: Youngstown, OH, USA, 1946. [Google Scholar]
Deere, D.U. Technical description of rock cores for engineering purpose. Rock Mech. Eng. Geol. 1964, 1, 17–22. [Google Scholar]
Bieniawski, Z.T. Engineering classification of jointed rock masses. Civ. Eng. Siviele Ingenieurswese 1973, 1973, 335–343. Available online: https://journals.co.za/doi/pdf/10.10520/AJA10212019_17397 (accessed on 7 January 2026).
Barton, N.; Lien, R.; Lunde, J. Engineering classification of rock masses for the design of tunnel support. Rock Mech. 1974, 6, 189–236. [Google Scholar] [CrossRef]
Hoek, E.; Brown, E.T. Practical estimates of rock mass strength. Int. J. Rock Mech. Min. Sci. 1997, 34, 1165–1186. [Google Scholar] [CrossRef]
Wu, L.; Li, S.; Zhang, M.; Zhang, L. A new method for classifying rock mass quality based on MCS and TOPSIS. Environ. Earth Sci. 2019, 78, 199. [Google Scholar] [CrossRef]
Dai, B.; Li, D.; Zhang, L.; Liu, Y.; Zhang, Z.; Chen, S. Rock Mass Classification Method Based on Entropy Weight–TOPSIS–Grey Correlation Analysis. Sustainability 2022, 14, 10500. [Google Scholar] [CrossRef]
Wang, J.; Guo, J. Research on rock mass quality classification based on an improved rough set–cloud model. IEEE Access 2019, 7, 123710–123724. [Google Scholar] [CrossRef]
Wu, L.; Liu, D.; Cao, P. A new method for evaluating rock mass quality of slopes based on interval continuous mathematical models. Bull. Eng. Geol. Environ. 2020, 79, 1357–1364. [Google Scholar] [CrossRef]
Fan, A.; Wang, W.; Li, S.; Zhang, B.; Liu, Y.; Wu, H. Evaluation of Rock Mass Quality of Phosphorite Mines by Topology Based on Optimal Combination Weight. Gold Sci. Technol. 2024, 32, 132–143. [Google Scholar] [CrossRef]
Liu, K.; Liu, B.; Fang, Y. An intelligent model based on statistical learning theory for engineering rock mass classification. Bull. Eng. Geol. Environ. 2019, 78, 4533–4548. [Google Scholar] [CrossRef]
Santos, A.E.M.; Lana, M.S.; Pereira, T.M. Rock Mass Classification by Multivariate Statistical Techniques and Artificial Intelligence. Geotech. Geol. Eng. 2021, 39, 2409–2430. [Google Scholar] [CrossRef]
Santos, A.E.M.; Lana, M.S.; Pereira, T.M. Evaluation of machine learning methods for rock mass classification. Neural Comput. Appl. 2022, 34, 4633–4642. [Google Scholar] [CrossRef]
Sheng, D.; Yu, J.; Tan, F.; Tong, D.; Yan, T.; Lv, J. Rock mass quality classification based on deep learning: A feasibility study for stacked autoencoders. J. Rock Mech. Geotech. Eng. 2023, 15, 1749–1758. [Google Scholar] [CrossRef]
Wang, H.; Gao, Y.; Xie, Y.; Wu, S.; Sun, J.; Zhou, Y.; Xiong, P. Hybrid Prediction Model of Engineering Classification of Slope Rock Mass Based on DCWA-EO-AdaBoost Model and BQ Method. KSCE J. Civ. Eng. 2024, 28, 3722–3740. [Google Scholar] [CrossRef]
Zhou, J.; Dai, Y.; Huang, S.; Armaghani, D.J.; Qiu, Y. Proposing several hybrid SSA—Machine learning techniques for estimating rock cuttability by conical pick with relieved cutting modes. Acta Geotech. 2023, 18, 1431–1446. [Google Scholar] [CrossRef]
Li, X.; Wang, Q. Prediction of surrounding rock classification of highway tunnel based on PSO-SVM. In Proceedings of the 2019 International Conference on Robots & Intelligent System (ICRIS), Haikou, China, 15–16 June 2019; pp. 443–446. [Google Scholar]
Hu, J.; Zhou, T.; Ma, S.; Yang, D.; Guo, M.; Huang, P. Rock mass classification prediction model using heuristic algorithms and support vector machines: A case study of Chambishi copper mine. Sci. Rep. 2022, 12, 928. [Google Scholar] [CrossRef]
Yang, B.; Liu, Y.; Liu, Z.; Zhu, Q.; Li, D. Classification of rock mass quality in underground rock engineering with incomplete data using XGBoost model and zebra optimization algorithm. Appl. Sci. 2024, 14, 7074. [Google Scholar] [CrossRef]
Zhou, Z.-H.; Feng, J. Deep forest. Natl. Sci. Rev. 2018, 6, 74–86. [Google Scholar] [CrossRef] [PubMed]
Prakash, T.; Singh, P.P.; Singh, V.P.; Singh, S.N. A novel brown-bear optimization algorithm for solving economic dispatch problem. In Advanced Control & Optimization Paradigms for Energy System Operation and Management; River Publishers: Gistrup, Denmark, 2023; pp. 137–164. [Google Scholar]
Xie, L.; Han, T.; Zhou, H.; Zhang, Z.-R.; Han, B.; Tang, A. Tuna Swarm Optimization: A Novel Swarm-Based Metaheuristic Algorithm for Global Optimization. Comput. Intell. Neurosci. 2021, 2021, 9210050. [Google Scholar] [CrossRef]
Xue, J.; Shen, B. A novel swarm intelligence optimization approach: Sparrow search algorithm. Syst. Sci. Control Eng. 2020, 8, 22–34. [Google Scholar] [CrossRef]
Zhang, Y. Research on Comprehensive Evaluation Method of Rock Massquality on Slope of Open-Pit Mine. Master’s Thesis, Ningbo University, Ningbo, China, 2023. [Google Scholar]
Kong, D. Study on Stability Evaluation of Rock Slope Based on BP Neural Network. Ph.D. Thesis, Nanchang Hangkong University, Nanchang, China, 2017. [Google Scholar]
Li, S.; Shen, Y.; Lin, P.; Xie, J.; Tian, S.; Lv, Y.; Ma, W. Classification method of surrounding rock of plateau tunnel based on BP neural network. Front. Earth Sci. 2023, 11, 1283520. [Google Scholar] [CrossRef]
Wu, S.; Yang, S.; Du, X. A model for evaluation of surrounding rock stability based on D-S evidence theory and error-eliminating theory. Bull. Eng. Geol. Environ. 2021, 80, 2237–2248. [Google Scholar] [CrossRef]
Liu, A.; Su, L.; Zhu, X.; Zhao, G. Rock quality evaluation based on distance discriminant analysis and fuzzy mathematic method. J. Min. Saf. Eng. 2011, 28, 462–467. [Google Scholar]
Li, F.; Liu, Y.; Qin, S.; Yu, J.; Zhang, D.; Ye, T. Research on rock mass quality optimization classification and support based on weighted membership matrix. J. Min. Strata Control Eng. 2024, 6, 52–61. [Google Scholar] [CrossRef]
GB/T 50218-2014; Standard for Engineering Classification of Rock Mass. Ministry of Housing and Urban-Rural Development of the People’s Republic of China (MOHURD: Beijing, China, 2014; p. 95.
Van Buuren, S.; Groothuis-Oudshoorn, K. Mice: Multivariate Imputation by Chained Equations in R. J. Stat. Softw. 2011, 45, 1–67. [Google Scholar] [CrossRef]
Xu, H.; Pang, G.; Wang, Y.; Wang, Y. Deep isolation forest for anomaly detection. IEEE Trans. Knowl. Data Eng. 2023, 35, 12591–12604. [Google Scholar] [CrossRef]
Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
Lundberg, S.M.; Lee, S.-I. A unified approach to interpreting model predictions. Proc. Adv. Neural Inf. Process. Syst. 2017, 30, 4765–4774. [Google Scholar] [CrossRef]

Figure 1. Schematic diagram of the DeepForest algorithm.

Figure 2. Overall workflow for constructing hybrid optimization models.

Figure 3. Proportion of rock mass quality samples in each grade.

Figure 4. Scatter matrix of the six parameters in the rock mass quality dataset.

Figure 5. Correlation analysis heatmap of the rock mass quality dataset.

Figure 6. Distribution consistency validation on the training set: (a) Comparison on feature Sd; (b) Comparison on feature Kv.

Figure 7. Outlier detection results on the training set: (a) Original distribution; (b) Outlier distribution.

Figure 8. Sensitivity analysis of the contamination parameter.

Figure 9. Calculation of evaluation metrics.

Figure 10. Iterative convergence process of hyperparameter optimization for the DeepForest model using three optimization algorithms: (a) BBO-DeepForest; (b) TSO-DeepForest; (c) SSA-DeepForest.

Figure 11. Confusion matrices of four models on the test set: (a) DeepForest; (b) BBO-DeepForest; (c) TSO-DeepForest; (d) SSA-DeepForest.

Figure 12. Ranking scores of the four models on the test set.

Figure 13. ROC curves of four models on the test set: (a) DeepForest; (b) BBO-DeepForest; (c) TSO-DeepForest; (d) SSA-DeepForest.

Figure 14. Comparison of classification accuracy using different imputation methods: (a) DeepForest; (b) BBO-DeepForest.

Figure 15. Feature importance based on mean absolute SHAP values.

Figure 16. Summary scatter plot of SHAP values for feature variation.

Figure 17. Slope benches of the eastern pit of the Luming molybdenum mine: (a) Case 1 +420 slope; (b) Case 2 +435 slope; (c) Case 3 +450 slope.

Table 1. Rock mass quality classification criteria [4,31].

Rock Mass Quality Grades	Quantitative Criteria						Qualitative Description
	UCS (MPa)	RQD (%)	Sd (cm)	Kv (-)	W (-)
	UCS (MPa)	RQD (%)	Sd (cm)	Kv (-)	Tunnel (L × (min × 10 m)⁻¹)	Slope
I	>120	>90	>200	>0.75	0 → 15	Dry → 15	Extremely hard rock with very high integrity; intact rock masses with minimal discontinuities.
II	60~120	75~90	60~200	0.45~0.75	10 → 10	Damp → 10	Hard to extremely hard rock; rock mass generally intact with minor discontinuities; overall good integrity.
III	30~60	50~75	20~60	0.3~0.45	10~25 → 7	Wet → 7	Moderately hard rock with medium integrity; noticeable joints or fractures; rock mass moderately broken but still maintains overall stability.
IV	15~30	25~50	6~20	0.2~0.3	25~125 → 4	Dripping → 4	Soft to moderately soft rock or significantly fractured hard rock; rock mass shows low integrity with well-developed discontinuities.
V	<15	<25	<6	<0.2	>125 → 0	Flowing → 0	Soft to extremely soft or highly fractured rock; rock mass extremely broken with very poor integrity and lowest stability.

Table 2. Descriptive statistics of the rock mass quality dataset.

Statistic	UCS (MPa)	RQD (%)	Sd (cm)	Kv (-)	W (-)	St (-)
Mean	83.63	51.70	70.94	0.48	8.62	0.25
Standard deviation	61.66	25.29	136.13	0.19	3.73	0.43
Median	75.20	52.0	36.0	0.45	7.0	0.0
Minimum	0.80	0.0	2.60	0.05	0.0	0.0
Maximun	250.0	100.0	985.0	1.0	15.0	1.0
Skew	1.42	−0.34	5.46	0.34	0.06	1.16
Kurtosis	1.65	−0.69	34.87	−0.14	−0.06	−0.65
Count	204	204	61	181	204	204

Table 3. Optimization ranges of DeepForest hyperparameters.

Hyper-Parameter	Range	Explanation
max_layers	(1, 8)	The maximum number of cascade layers in the deep forest
n_estimators	(1, 6)	The number of estimators in each cascade layer
n_trees	(50, 300)	The number of trees in each estimator
max_depth	(2, 20)	The maximum depth of each tree
min_samples_split	(2, 30)	The minimum number of samples required to split an internal node
min_samples_leaf	(1, 10)	The minimum number of samples required to be at a leaf node
criterion	{gini, entropy}	The function to measure the quality of a split
n_bins	(10, 255)	The number of bins used for non-missing values

Table 4. Results of hyperparameter optimization for DeepForest.

Hyper-Parameter	Optimal Value
Hyper-Parameter	BBO-DeepForest	TSO-DeepForest	SSA-DeepForest
max_layers	3	5	7
n_estimators	4	3	6
n_trees	105	83	286
max_depth	19	10	16
min_samples_split	8	3	5
min_samples_leaf	1	2	2
criterion	entropy	gini	entropy
n_bins	18	107	148

Table 5. Evaluation results of four models on the test set.

Model	Accuracy	Precision	Recall	F1-Score
DeepForest	0.780	0.629	0.622	0.624
BBO-DeepForest	0.878	0.682	0.678	0.678
TSO-DeepForest	0.805	0.642	0.638	0.638
SSA-DeepForest	0.829	0.656	0.650	0.651

Table 6. Performance comparison of the model with and without the Sd feature.

Model	Accuracy	Precision	Recall	F1-Score
With Sd	0.798 ± 0.052	0.778 ± 0.108	0.742 ± 0.106	0.753 ± 0.104
Without Sd	0.817 ± 0.053	0.796 ± 0.104	0.757 ± 0.108	0.768 ± 0.106

Table 7. Performance comparison of different imputation algorithms.

Missing Ratio	10%		20%		30%
Metrics	RMSE	JSD	RMSE	JSD	RMSE	JSD
Mean	0.1175 ± 0.0000	0.0913 ± 0.0000	0.1308 ± 0.0000	0.0947 ± 0.0000	0.1452 ± 0.0000	0.1194 ± 0.0000
Median	0.1280 ± 0.0000	0.0933 ± 0.0000	0.1351 ± 0.0000	0.0928 ± 0.0000	0.1504 ± 0.0000	0.1219 ± 0.0000
KNN	0.0847 ± 0.0000	0.0779 ± 0.0000	0.0961 ± 0.0000	0.0728 ± 0.0000	0.1072 ± 0.0000	0.1044 ± 0.0000
MissForest	0.0832 ± 0.0005	0.0653 ± 0.0008	0.0968 ± 0.0027	0.0600 ± 0.0035	0.1059 ± 0.0092	0.1016 ± 0.0033
EM	0.5058 ± 0.0000	0.3266 ± 0.0000	0.4943 ± 0.0000	0.3640 ± 0.0000	0.4984 ± 0.0000	0.3747 ± 0.0000
GAIN	0.4106 ± 0.0194	0.3821 ± 0.0116	0.4875 ± 0.0287	0.4803 ± 0.0239	0.5229 ± 0.0149	0.4963 ± 0.0162
MIDA	0.2113 ± 0.0095	0.0131 ± 0.1779	0.2707 ± 0.0041	0.2354 ± 0.0019	0.2440 ± 0.0260	0.2524 ± 0.0956
MICE	0.1143 ± 0.0148	0.0778 ± 0.0119	0.0955 ± 0.0038	0.0627 ± 0.0071	0.0925 ± 0.0105	0.0956 ± 0.0078

Note: Bold values indicate the minimum value in each column.

Table 8. Performance comparison of different classification models.

Model	Accuracy	Precision	Recall	F1-Score
DeepForest	0.805 ± 0.052	0.783 ± 0.117	0.760 ± 0.109	0.761 ± 0.109
SVM	0.700 ± 0.064	0.670 ± 0.082	0.715 ± 0.103	0.665 ± 0.094
ANN	0.681 ± 0.062	0.656 ± 0.059	0.745 ± 0.081	0.660 ± 0.070
GBDT	0.756 ± 0.079	0.733 ± 0.124	0.715 ± 0.137	0.706 ± 0.137
XGBoost	0.773 ± 0.066	0.738 ± 0.118	0.739 ± 0.118	0.722 ± 0.113

Note: Bold values indicate the minimum value in each column.

Table 9. Performance comparison of different configurations involving the external feature St.

Model	Accuracy	Precision	Recall	F1-Score
With St	0.795 ± 0.064	0.762 ± 0.114	0.738 ± 0.123	0.739 ± 0.120
Without St	0.783 ± 0.072	0.733 ± 0.134	0.724 ± 0.130	0.715 ± 0.129

Table 10. VIF analysis of six feature parameters.

Feature	UCS	RQD	Sd	Kv	W	W
VIF	7.65	5.57	1.91	14.99	11.76	2.09

Table 11. Rock mass feature parameters of the eastern pit slopes at Luming molybdenum mine.

Slope	UCS (MPa)	RQD (%)	Sd (cm)	Kv (-)	W (-)	St (-)
+420 platform	19.37	40.1	12.2	0.43	15	1
+435 platform	45.68	29.6	7.5	0.35	15	1
+450 platform	51.23	25.9	5.2	0.26	15	1

Table 12. Classification results of rock mass quality for the eastern pit slopes of Luming molybdenum mine.

Slope	Actual Grade	DeepForest	BBO-DeepForest	TSO-DeepForest	SSA-DeepForest
+420 platform	IV	V	IV	III	V
+435 platform	IV	IV	IV	IV	IV
+450 platform	IV	IV	IV	IV	IV

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, R.; Li, D.; Sun, J.; Cao, J.; Zhou, T.; Zhang, C. Slope Rock Mass Classification Using Deep Forest Optimized by Three Metaheuristic Algorithms: A Case Study of Luming Molybdenum Mine. Appl. Sci. 2026, 16, 5275. https://doi.org/10.3390/app16115275

AMA Style

Chen R, Li D, Sun J, Cao J, Zhou T, Zhang C. Slope Rock Mass Classification Using Deep Forest Optimized by Three Metaheuristic Algorithms: A Case Study of Luming Molybdenum Mine. Applied Sciences. 2026; 16(11):5275. https://doi.org/10.3390/app16115275

Chicago/Turabian Style

Chen, Rongjian, Diyuan Li, Jiahao Sun, Jianfu Cao, Tong Zhou, and Chen Zhang. 2026. "Slope Rock Mass Classification Using Deep Forest Optimized by Three Metaheuristic Algorithms: A Case Study of Luming Molybdenum Mine" Applied Sciences 16, no. 11: 5275. https://doi.org/10.3390/app16115275

APA Style

Chen, R., Li, D., Sun, J., Cao, J., Zhou, T., & Zhang, C. (2026). Slope Rock Mass Classification Using Deep Forest Optimized by Three Metaheuristic Algorithms: A Case Study of Luming Molybdenum Mine. Applied Sciences, 16(11), 5275. https://doi.org/10.3390/app16115275

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Slope Rock Mass Classification Using Deep Forest Optimized by Three Metaheuristic Algorithms: A Case Study of Luming Molybdenum Mine

Abstract

1. Introduction

2. Methodology

2.1. Deep Forest Algorithm

2.2. Brown Bear Optimizer

2.3. Tuna Swarm Optimizer

2.4. Sparrow Search Algorithm

2.5. Hybrid Optimization Model Based on DeepForest

3. Data

3.1. Data Collection and Description

3.2. Data Analysis

3.3. Data Preprocessing

4. Model Development

4.1. Evaluation Metrics

4.2. Model Construction

5. Result and Discussion

5.1. Model Evaluation

5.2. Model Comparison

5.3. Ablation Study on External Feature St

5.4. Model Explanation

5.5. Engineering Verification

5.6. Limitations

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI