SVR-Based Framework for Predicting Stability of Circular-Failure Slopes with Small Sample Size

Hu, Shengming; Mao, Zhibin; Deng, Lijun; Wang, Qinghua; Liu, Xuanchi; Wang, Zhou

doi:10.3390/math14122074

Open AccessArticle

SVR-Based Framework for Predicting Stability of Circular-Failure Slopes with Small Sample Size

by

Shengming Hu

^1,2,*

,

Zhibin Mao

¹,

Lijun Deng

²

,

Qinghua Wang

¹,

Xuanchi Liu

³ and

Zhou Wang

¹

National and Provincial Joint Engineering Laboratory for the Hydraulic Engineering Safety and Efficient Utilization of Water Resources of Poyang Lake Basin, Jiangxi University of Water Resources and Electric Power, Nanchang 330099, China

²

Department of Civil and Environmental Engineering, University of Alberta, Edmonton, AB T6G 1H9, Canada

³

Wenzhou Zhongyuan Engineering Project Management Co., Ltd., Wenzhou 325006, China

^*

Author to whom correspondence should be addressed.

Mathematics 2026, 14(12), 2074; https://doi.org/10.3390/math14122074

Submission received: 13 May 2026 / Revised: 27 May 2026 / Accepted: 8 June 2026 / Published: 10 June 2026

(This article belongs to the Special Issue Advances in Numerical Computation and Mathematical Modelling for Mechanics and Dynamics in Geotechnical Engineering)

Download

Browse Figures

Versions Notes

Abstract

Reliable prediction of the factor of safety (Fs) of circular-failure soil slopes is critical to geotechnical practice. Data-driven models developed on small slope-stability datasets are, however, prone to overfitting, data leakage, and optimistic bias, which can lead to overestimated predictive performance. This study presents a small-sample-oriented, leakage-aware support vector regression (SVR) framework with a radial basis function (RBF) kernel for Fs prediction. A database of 80 published circular-failure slope cases was compiled, and six predictors were adopted: soil unit weight, slope height, pore pressure ratio, cohesion, internal friction angle, and slope angle. To improve reliability under limited-data conditions, preprocessing, hyperparameter tuning, and performance evaluation were all embedded within a repeated nested cross-validation framework. The proposed SVR model was benchmarked against the back-propagation neural network (BPNN) and radial basis function neural network (RBFNN) models under identical validation partitions and evaluation settings. The results indicated that SVR achieved the best predictive performance among the three candidate models. For case-level illustration, a single representative hold-out split was reported in addition to the repeated nested cross-validation results, on which the SVR model attained an R² of 86.56%, an RMSE of 0.07497, an MAE of 0.0666, and an MRE of 5.29%. In this test subset, all SVR predictions exhibited relative errors below 10%, indicating more stable predictive behaviour than the benchmark models. The main contribution of this study is thus a validated SVR framework for small-sample conditions.

Keywords:

support vector regression; nested cross-validation; machine learning; slope stability; generalization performance

MSC:

62J02; 68T05; 62P30; 86A60

1. Introduction

Slope stability assessment is a fundamental task in geotechnical engineering as it underpins landslide hazard evaluation, slope design, and engineering risk management [1]. For soil slopes and embankments, circular sliding is one of the most common and representative failure modes. Slope stability is commonly quantified by the factor of safety (Fs), which depends on the combined effects of slope geometry, soil strength, and hydraulic conditions. In practice, variables such as unit weight, slope height, pore pressure ratio, cohesion, internal friction angle, and slope angle interact in a highly nonlinear manner, which makes reliable Fs prediction challenging.

To address this problem, machine-learning methods have been increasingly applied to slope stability prediction. Early studies showed that neural networks and other data-driven models can estimate slope safety from geotechnical input variables [2,3,4], and subsequent work extended this comparison to a broader range of machine-learning paradigms for slope stability prediction [5,6,7,8,9]. Collectively, these studies indicate that data-driven methods can effectively capture the nonlinear relationship between slope parameters and Fs. Support-vector-based methods have attracted particular attention because they are well-suited to nonlinear regression problems with limited samples, which is a common characteristic of slope-stability case databases.

Several studies have already explored SVR-based approaches to slope stability prediction. Xue [10] developed a PSO-LSSVM model for slope stability prediction, showing that support-vector-based models can achieve competitive predictive performance. Sari et al. [11] applied SVR directly to factor-of-safety prediction and demonstrated its feasibility for this task. Wei et al. [12] further examined hybrid SVR-based models and also reported favourable results. Together, these studies established the applicability of SVR to slope stability prediction. However, they focused primarily on predictive accuracy—that is, on whether SVR or its hybrid variants could outperform competing models—rather than on the reliability of the reported SVR performance under small-sample conditions.

This distinction matters because slope-stability datasets derived from published case histories are usually limited in size. Under such conditions, model development based on a single train-test split or ordinary k-fold cross-validation can readily produce optimistic estimates if hyperparameter tuning and final performance evaluation are not strictly separated [13]. Consequently, apparently high predictive accuracy does not necessarily reflect reliable out-of-sample performance. From this perspective, the key gap in existing SVR-based slope-stability studies is the lack of a rigorous framework for evaluating the reliability of SVR in data-scarce settings. In other words, the central issue is not whether SVR can fit the available data well, but whether it can deliver stable, reproducible, and trustworthy generalization performance.

To address this gap, this study develops a small-sample-oriented, leakage-aware SVR framework for predicting the factor of safety of circular-failure soil slopes. Based on a database of 80 published cases [14,15], the proposed model uses six input variables: soil unit weight, slope height, pore pressure ratio, cohesion, internal friction angle, and slope angle. To reduce optimistic bias, preprocessing, hyperparameter tuning, and model evaluation are all embedded within a repeated nested cross-validation procedure, in which the inner loop is used for model selection and the outer loop is reserved for out-of-sample assessment. The proposed SVR framework is further benchmarked against back-propagation neural networks (BPNN) and radial basis function neural networks (RBFNN) under identical validation partitions and evaluation settings. The main contribution of this work is to shift the emphasis from accuracy-oriented SVR application to reliability-oriented SVR evaluation, thereby providing a more robust and engineering-relevant framework for preliminary slope stability assessment under limited-data conditions.

2. Basic Principle of Support Vector Regression

2.1. Regression Function in Feature Space

SVR is a regression extension of support vector machine (SVM) theory. Unlike classification-oriented SVM, which seeks an optimal separating hyperplane between classes, SVR constructs a regression function that predicts continuous target values accurately while preserving good generalization capability. In slope stability prediction, the target variable —the factor of safety—is a continuous quantity. Therefore, support vector regression provides the appropriate theoretical basis for this study rather than support vector classification [11,16].

Given a training dataset

\{(x_{i}, y_{i})}_{i = 1}^{n}, x_{i} \in R^{m}, y_{i} \in R,

(1)

where

x_{i}

denotes the input vector composed of slope characteristic variables and

y_{i}

denotes the corresponding target output, the objective of SVR is to determine a regression function of the form

f (x) = w^{T} ϕ (x) + b

(2)

where

ϕ (x)

is a nonlinear mapping from the input space to a high-dimensional feature space,

w

is the weight vector in the feature space, and

b

is the bias term.

The core idea of SVR is to find a regression function that combines high predictive accuracy with good generalization. Compared with ordinary empirical fitting methods, SVR emphasizes structural risk minimization, which enables the model to achieve a favourable balance between fitting performance and model complexity. This property makes SVR well-suited for small-sample nonlinear regression problems in geotechnical engineering [10].

2.2. ε-Insensitive Loss and Optimization Formulation

To achieve both prediction accuracy and model simplicity, SVR introduces the ε-insensitive loss function. The key idea is that errors falling within a predefined tolerance band are not penalized, whereas only deviations greater than ε contribute to the loss. This strategy improves the robustness and generalization of the regression model, especially for small-sample nonlinear prediction problems.

The

ε

-insensitive loss function is defined as

L_{ε} (y, f (x)) = \{\begin{matrix} 0, & ∣ y - f (x) ∣ \leq ε, \\ ∣ y - f (x) ∣ - ε, & ∣ y - f (x) ∣ > ε . \end{matrix}

(3)

To allow for samples that fall outside the

ε

-insensitive tube, two nonnegative slack variables,

ξ_{i}

and

ξ_{i}^{*}

, are introduced. The primal optimization problem of SVR can then be written as

\underset{w, b, ξ_{i}, ξ_{i}^{*}}{m i n} \frac{1}{2} ∥ w ∥^{2} + C \sum_{i = 1}^{n} (ξ_{i} + ξ_{i}^{*})

(4)

subject to

y_{i} - w^{T} ϕ (x_{i}) - b \leq ε + ξ_{i}

(5)

w^{T} ϕ (x_{i}) + b - y_{i} \leq ε + ξ_{i}^{*}

(6)

ξ_{i} \geq 0, ξ_{i}^{*} \geq 0, i = 1,2, \dots, n

(7)

In Equation (4), the term

\frac{1}{2} ∥ w ∥^{2}

controls the flatness or complexity of the regression function, whereas the second term penalizes prediction errors outside the

ε

-insensitive interval. The parameter

C

is the penalty coefficient, which determines the trade-off between model smoothness and empirical fitting error. A larger

C

places more emphasis on fitting the training samples, while a smaller

C

favours a smoother regression function with potentially better generalization under limited-data conditions.

Figure 1 illustrates the basic idea of support vector regression. The central solid line represents the regression function

f (x)

, while the two dashed lines denote the upper and lower bounds of the

ε

-insensitive tube, i.e.,

f (x) + ε

and

f (x) - ε

, respectively. Samples located inside the tube do not contribute to the loss, whereas samples outside the tube are penalized through the slack variables

ξ_{i}

and

ξ_{i}^{*}

.

2.3. Kernel Function and RBF-SVR Model

By introducing Lagrange multipliers and solving the corresponding dual problem, the regression function can be expressed as

f (x) = \sum_{i = 1}^{n} (α_{i} - α_{i}^{*}) K (x_{i}, x) + b

(8)

where

α_{i}

and

α_{i}^{*}

are Lagrange multipliers, and

K (x_{i}, x)

is the kernel function. The kernel function replaces the inner product in the high-dimensional feature space and enables nonlinear regression without explicitly computing the nonlinear mapping

ϕ (x)

. Only the samples associated with nonzero

(α_{i}− α_{i}^{*})

contribute to the final regression function; these samples are called support vectors [16].

Among the various kernel functions, the RBF kernel is one of the most widely used in nonlinear regression problems owing to its strong nonlinear fitting capability and relatively simple form [11]. The RBF kernel is defined as

K (x_{i}, x_{j}) = e x p (- γ {∥ x_{i} - x}_{j} ∥^{2})

(9)

where

γ

is the kernel parameter controlling the influence range of individual samples. A larger

γ

implies a narrower influence range and a more flexible regression surface, whereas a smaller

γ

yields a smoother regression function.

In practical applications, the performance of an SVR model is mainly controlled by three key parameters, namely the penalty parameter

C

, the kernel parameter

γ

, and the insensitive loss width

ε

. Parameter

C

determines the trade-off between fitting accuracy and generalization ability,

γ

governs the nonlinear mapping characteristics of the RBF kernel, and

ε

specifies the width of the no-penalty tube around the regression function. Careful selection of these parameters is essential for obtaining a reliable regression model [9,12].

In this study, the

ε

-SVR model with the RBF kernel is adopted to predict the factor of safety of circular-failure soil slopes. The SVR framework is well-suited for this problem because it can effectively handle nonlinear relationships among slope variables and typically exhibits good generalization performance under limited sample conditions. This provides a sound theoretical basis for constructing the subsequent slope stability prediction model.

3. Construction of Slope Stability Prediction Model Based on SVR

3.1. Problem Definition and Dataset

The objective of this study is to construct a regression framework for predicting Fs of circular-failure soil slopes. The modelling dataset consisted of 80 circular-failure slope cases compiled from published sources. The principal references used for data extraction and cross-checking were Feng [14] and Ma et al. [15]. The cases were reorganized into a unified dataset containing soil unit weight (

γ_{s}

), slope height (

H

), pore pressure ratio (

r_{u}

), cohesion (

c^{'}

), internal friction angle (φ), slope angle (β), and Fs.

As illustrated in Figure 2, soil unit weight, cohesion, and internal friction angle characterize the fundamental strength properties of the slope material, whereas slope height and slope angle describe the geometric conditions of the slope. The pore pressure ratio reflects the hydraulic effect on effective stress and therefore directly influences slope stability.

From a geotechnical perspective, each input variable governs a distinct aspect of circular-failure slope stability. Soil unit weight (

γ_{s}

) acts as the principal driver of gravitational stress and therefore increases the destabilizing shear force, although its net influence on Fs is moderated by the simultaneous increase in normal stress mobilized on the slip surface. Slope height (H) directly amplifies the gravitational driving force and, for a fixed geometry, tends to reduce Fs roughly in proportion to H. The pore-pressure ratio (

r_{u}

) reduces the effective normal stress on the failure surface and consequently diminishes the mobilized frictional resistance; even modest increases in

r_{u}

can produce substantial reductions in Fs under otherwise identical conditions. Effective cohesion (

c^{'}

) provides the stress-independent component of shear strength and exerts a clear positive influence on Fs, particularly in shallow slopes and short slip surfaces where the cohesive contribution dominates. The internal friction angle (φ) controls the stress-dependent component of shear strength; larger φ improves Fs primarily on deeper or longer slip surfaces where the normal stress is large. Slope angle (β) defines the geometry of the slip mass and exerts the most direct geometric control on Fs: steeper slopes simultaneously reduce the available resisting moment and increase the driving moment, producing a strongly nonlinear reduction in stability. The six predictors therefore jointly describe strength (

γ_{s}

,

c^{'}

, φ), geometry (H, β), and pore pressure (

r_{u}

)—the three classical components of limit-equilibrium slope-stability analysis—and their combined nonlinear interaction motivates the use of a kernel-based regression framework.

The compiled dataset was used for model development, methodological comparison, and performance evaluation under small-sample conditions. Appendix A provides the full compiled dataset used in the present study. Distributions of the six input variables are shown in Figure 3.

As shown in Figure 3, the six predictors span a wide range characteristic of published circular-failure case histories. Soil unit weight (

γ_{s}

) is concentrated between 18 and 31 kN/m³, reflecting the typical range of cohesive and granular soils encountered in slope-stability studies. Slope height (H) is strongly right-skewed, ranging from 3.66 m for small embankments to 511 m for large hydropower-related slopes; this asymmetry reflects the over-representation of medium-height cases in the published literature. Effective cohesion (

c^{'}

) shows the widest relative variability (0–150 kPa, std 29.5 kPa), capturing both nearly cohesionless granular materials and stiff cohesive soils. The internal friction angle (φ) is concentrated around 30–40°, consistent with typical geotechnical materials. Slope angle (β) ranges from 8° to 53° with a bimodal pattern reflecting both gentle natural slopes and steep engineered cuts. The compiled database (N = 80) was assembled from two recognized geotechnical compilations [14,15] that together cover a representative cross-section of soil, rock-fill, and weathered-rock slopes; although it does not exhaustively sample all geological settings, the input ranges in Figure 3 substantially overlap those encountered in standard preliminary slope-stability assessment, and the framework can be re-trained or extended whenever a domain-specific extension is required. Two limitations of this database should nevertheless be acknowledged. First, the cases are drawn from only two published sources and may therefore inherit selection biases inherent to those sources (e.g., a preference for well-documented failures over routine stable slopes). Second, the pore-pressure ratio (

r_{u}

) is markedly concentrated between 0.25 and 0.35 (mean 0.31, std 0.07), with only a small fraction of cases outside this band; consequently, predictions for slopes under markedly different hydraulic conditions (e.g., fully saturated or strongly drained slopes) should be interpreted with caution, as discussed further in Section 4.5.

3.2. Data Inspection and Preprocessing

Before model development, the dataset was subjected to a data-quality inspection, including checks for missing values, abnormal records, variable ranges, and basic descriptive statistics. Missing values, if any, were handled only within the training data in each validation cycle so as to avoid information leakage from the test data. Outlier inspection was first performed by examining the physical plausibility of each record and cross-checking the original data source. Statistical outlier screening was used only as an auxiliary procedure, and no sample was removed solely on the basis of model residuals.

All continuous variables were standardized using statistics derived exclusively from the training folds. For a given variable x, the standardized value z was obtained as

z = (x - u_{t r a i n}) / σ_{t r a i n}

(10)

where

u_{t r a i n}

and

σ_{t r a i n}

denote the mean and standard deviation calculated from the corresponding training fold. The same transformation was then applied to the associated validation or test fold. This fold-wise preprocessing strategy ensured that the reported prediction performance reflected genuine generalization rather than inadvertent reuse of information from held-out data.

3.3. Correlation Analysis and Variable Relevance

To gain an initial understanding of the dataset, pairwise Pearson correlation coefficients and scatter plots were used to examine the linear associations among the input variables. The Pearson correlation coefficient r is defined as

r = \sum [(x_{i} - \bar{x}) (y_{i} - \bar{y})] / \sqrt{Σ {(x_{i} - \bar{x})}^{2} \cdot Σ {(y_{i} - \bar{y})}^{2}}

(11)

where

x_{i}

and

y_{i}

are the observed values of two variables, and

\bar{x}

and

\bar{y}

are their corresponding sample means. Figure 4 presents the pairwise scatter distributions together with the Pearson correlation matrix of the six predictors.

It should be noted that a low pairwise correlation does not imply statistical independence. Therefore, the correlation analysis in this study was used only for descriptive exploration and preliminary screening of variable relationships, rather than as a formal test of independence.

Quantitative inspection of Figure 4 reveals several patterns useful for interpreting the modelling results that follow. Among the six predictors, the strongest positive Pearson correlations are observed between

γ_{s}

and H (r = +0.71),

γ_{s}

and

c^{'}

(r = +0.54),

γ_{s}

and φ (r = +0.53), and φ and β (r = +0.55). These reflect the over-representation in the database of dense, deep, and high-strength rock-fill slopes that combine high unit weight, large height, and elevated strength parameters. Moderate correlations between

c^{'}

and H (r = +0.45) and between

c^{'}

and β (r = +0.33) likewise stem from sampling characteristics rather than from any inherent mechanical coupling. The pore-pressure ratio

r_{u}

shows weak negative correlations with

c^{'}

(r = −0.32), H (r = −0.31), and

γ_{s}

(r = −0.29), consistent with the tendency of lower-saturation cases to be reported for stiffer, deeper slopes. The Pearson coefficient, however, measures only linear association and is therefore not fully aligned with the strongly nonlinear nature of the slope-stability problem. To complement the linear analysis, the Spearman rank correlation and a histogram-based normalized mutual-information (MI) matrix were additionally computed on the same 80-case data. The Spearman coefficients confirm the patterns observed under Pearson (

γ_{s}

–H: ρ = +0.78; φ–β: ρ = +0.55;

r_{u}

–

c^{'}

: ρ = −0.42), indicating that the dependencies are monotonic and not driven by extreme observations. The MI matrix further identifies

γ_{s}

as carrying the highest mutual information with Fs (I = 0.40), followed by H (I = 0.30), β (I = 0.26),

r_{u}

(I = 0.26),

c^{'}

(I = 0.25), and φ (I = 0.20). This ordering reveals that

γ_{s}

and H are the most informative predictors under nonlinear dependence—a finding that is partially masked when only the weaker Pearson correlations between

γ_{s}

/H and Fs are considered. These observations motivate the kernel-based regression framework adopted in Section 3.4, which can capture such nonlinear input-output dependencies without prescribing a parametric form.

3.4. Model Development

SVR was adopted as the core regression model in this study. Unlike classification-oriented support vector machines, SVR seeks to determine a regression function that deviates from the observed target values by no more than a predefined ε-insensitive margin while maintaining model flatness. By introducing slack variables and a penalty parameter

C

, SVR balances model complexity against empirical fitting error. To capture nonlinear relationships between the geotechnical variables and Fs, the RBF kernel was adopted.

Three hyperparameters govern the behaviour of the RBF-SVR model: the penalty parameter

C

, the kernel parameter γ, and the ε-insensitive loss width ε. Parameter

C

controls the trade-off between model smoothness and fitting error, γ determines the influence range of individual samples in the transformed feature space, and ε defines the tolerance margin for regression errors. Owing to its strong nonlinear mapping capability and good small-sample generalization, the RBF-SVR model is well-suited to the present slope stability prediction problem.

3.5. Hyperparameter Optimization and Reproducible Validation

To improve robustness and reproducibility, hyperparameter tuning and model evaluation were conducted within a nested cross-validation framework. The outer loop was used for unbiased performance estimation, whereas the inner loop was used for hyperparameter selection. Specifically, a repeated five-fold outer cross-validation design was adopted, and the random partitioning process was repeated ten times to reduce the influence of any single split on the final evaluation. All models and the validation framework were implemented in MATLAB R2018b+ LIBSVM 2.6.

Within each outer-loop training subset, an inner five-fold grid search was performed to determine the optimal hyperparameters of the RBF-SVR model. All preprocessing procedures, including standardization and optional data cleaning operations, were performed using training data only within each fold. After the optimal hyperparameters had been identified in the inner loop, the SVR model was retrained on the full outer-loop training subset and then evaluated on the corresponding outer-loop test subset. This procedure yielded repeated out-of-sample estimates of predictive performance and thereby reduced the optimism associated with a single random train–test split. Because repeated outer-loop results generate a large number of fold-wise predictions, a single representative hold-out split was additionally selected for graphical presentation and sample-wise error comparison in Section 4, whereas the repeated nested cross-validation framework remained the primary procedure for robust model development and validation. The overall workflow of model construction, hyperparameter tuning, and validation is summarized in Figure 5, and the main model settings and search ranges are listed in Table 1.

To illustrate the advantage of the nested CV mechanism over conventional validation protocols commonly used in small-sample geotechnical machine learning, a methodological comparison is provided in Table 2. Nested cross-validation separates hyperparameter tuning (inner loop) from performance evaluation (outer loop) to yield unbiased error estimates, which is essential for rigorous evaluation in small-sample scenarios.

3.6. Benchmark Models and Evaluation Metrics

Two benchmark models were considered: the back-propagation neural network (BPNN) and the radial basis function neural network (RBFNN). Both were implemented under the same validation protocol as the SVR model. All three models used the same input variables and outer-loop data partitions, so that the comparison reflected differences in modelling strategy rather than differences in data usage. Crucially, the two benchmark models also underwent identical inner-loop grid-search hyperparameter tuning within the nested cross-validation framework, ensuring that the reported performance ranking is not biased by differential tuning effort. The RBFNN grid covered n_centres ∈ {10, 15, 20, 25} (initialized by k-means clustering), Gaussian-RBF inverse-width γ ∈ {2⁻⁷, 2⁻⁵, 2⁻³, 2⁻¹}, and ridge regularization α ∈ {10⁻³, 10⁻², 10⁻¹} (48 candidates per fold); the BPNN grid covered n_hidden ∈ {8, 16, 24}, weight-decay α ∈ {10⁻⁴, 10⁻³, 10⁻²}, and Adam learning rate lr ∈ {0.01, 0.02} with 400 training epochs (18 candidates per fold). These configurations are reported in the revised Table 1 for reproducibility, addressing the requirement that all three models be tuned to comparable rigour. The Wilcoxon signed-rank test of Section 3.8 should also be interpreted in this context: with only 20 paired observations from a single representative hold-out split, the test has limited statistical power and the reported p-values must be regarded as descriptive evidence rather than confirmatory inference. The 50-fold nested-CV results in Supplementary Material S1 provide the principal evidence on which the reliability claim is based.

Model performance was assessed using four metrics: the coefficient of determination (R²), root mean square error (RMSE), mean absolute error (MAE), and mean relative error (MRE). In general, a larger R² and smaller RMSE, MAE, and MRE indicate better model performance. The repeated outer-loop evaluation served as the principal robustness check during model development, whereas a representative hold-out split was additionally used for graphical illustration and sample-wise comparison in Section 4.

3.7. Qualitative Engineering-Consistency Assessment

To enhance engineering credibility while remaining fully consistent with the available evidence, the fitted model was examined from the perspective of physically meaningful response tendencies rather than through formal post hoc interpretability tools [17]. The assessment therefore focused on whether the dominant response directions implied by the prediction results were broadly compatible with established geotechnical principles for circular-failure slopes, particularly the expected roles of cohesion, internal friction angle, pore pressure, slope angle, and slope height.

This engineering-consistency assessment was used as a qualitative credibility check rather than as a complete feature-attribution analysis. In other words, a model was regarded as more convincing when its predictive behaviour remained compatible with the mechanics of shear resistance and driving-force balance, rather than merely producing favourable numerical scores on a particular data split.

3.8. Statistical Comparison Based on Paired Sample-Wise Errors

To determine whether the performance differences among competing models were statistically meaningful rather than incidental to the selected test samples, a non-parametric Wilcoxon signed-rank test was performed on paired prediction errors derived from the representative hold-out test subset reported in Section 4. Absolute error (AE) and relative error (RE) vectors for the 20 hold-out cases were used for pairwise comparison among SVR, RBFNN, and BPNN. This analysis was intended as a descriptive statistical comparison on a common illustrative split, rather than as the sole basis for model validation.

Accordingly, the methodological framework established in this section comprises problem definition and dataset specification, leakage-aware preprocessing, rigorous model development, benchmark comparison, qualitative engineering interpretability assessment, and non-parametric statistical testing of paired prediction errors.

4. Results and Discussion

This section presents the prediction results obtained from a representative hold-out split and discusses the comparative behaviour of SVR, RBFNN, and BPNN for circular-failure slope stability prediction. The purpose of this representative split is to provide a case-level illustration of model performance, including fitted trends, sample-wise errors, and paired error differences. These results should be interpreted as complementary evidence to the repeated nested cross-validation framework described in Section 3, rather than as a substitute for the overall robustness evaluation.

4.1. Representative Prediction Performance of SVR

Figure 6 shows the prediction results of the SVR model on the representative training-test split. The training subset is used to illustrate the fitting behaviour of the model, whereas the hold-out test subset provides a direct visual assessment of its predictive response for unseen cases within the same partition.

For the training subset, the SVR model achieved an R² of 97.58% and an RMSE of 0.0518, indicating that the predicted safety factors closely matched the observed values. This result indicates that the nonlinear mapping between the six input variables and the factor of safety was effectively captured by the SVR model. More importantly, the model also maintained favourable predictive performance on the hold-out test subset, with an R² of 86.56%, RMSE of 0.07497, MAE of 0.0666, and MRE of 5.29%. The predicted trend remained consistent with the measured values, and no test case produced a relative error greater than 10%; the maximum relative error was 9.29%.

These results indicate that the SVR model achieved a good balance between fitting capability and out-of-sample prediction accuracy on the representative split. However, because the dataset is small, the hold-out result should not be interpreted in isolation. Its primary role is to provide a transparent and visual demonstration of model behaviour, whereas the reliability of the modelling strategy is primarily supported by the repeated nested validation design. The approximately 11-percentage-point gap between the training R² (97.58%) and the test R² (86.56%) on the representative split warrants an explicit comment. Such a gap is consistent with mild overfitting: the RBF-SVR with

C

= 8 and γ = 0.5 (the typical optimal setting selected by the inner-loop search) is sufficiently flexible to fit small local patterns of the 60 training cases more tightly than its true ability to generalize to unseen slopes. The smaller train–test gap observed under repeated nested CV, where the training R² averages 0.92 against an outer-test R² of 0.41 across 50 folds, further indicates that the larger 11-pp gap on this single split is partly stochastic and that any individual hold-out estimate should be interpreted alongside the full distribution rather than in isolation.

4.2. Comparative Performance of SVR, RBFNN, and BPNN

The RBFNN and BPNN models were evaluated using the same representative hold-out split as the SVR model. As shown in Figure 7 and Figure 8, both neural-network-based models reproduced the general trend of the data to some extent, but their predictive performance on the test subset was less stable than that of SVR.

For the RBFNN model, the training-set performance was acceptable, with an R² of 92.84% and an RMSE of 0.0891. However, its test-set performance declined markedly, with an R² of 70.15%, RMSE of 0.11838, MAE of 0.1085, and MRE of 8.88%. Eight test cases produced relative errors greater than 10%, and the largest relative error reached 15.70%. This indicates that the RBFNN model approximated the overall data trend but was more sensitive to local deviations in the hold-out subset.

The BPNN model exhibited a similar pattern. Its training-set performance was close to that of RBFNN, with an R² of 92.63% and an RMSE of 0.0906. On the test subset, the BPNN model achieved an R² of 71.05%, RMSE of 0.10955, MAE of 0.0978, and MRE of 7.91%. Six test cases yielded relative errors greater than 10%, and the maximum relative error reached 16.93%. Compared with SVR, BPNN exhibited larger local prediction errors and weaker consistency between predicted and observed safety factors.

The sample-wise error statistics in Table 3 further support this comparison. SVR generally produced smaller absolute and relative errors across the 20 hold-out test cases, whereas RBFNN and BPNN exhibited several cases with noticeably larger deviations. The summary metrics in Table 4 further confirm that SVR achieved the highest R² and the lowest RMSE, MAE, and MRE among the three models. Therefore, within this representative partition, SVR provided the most stable and accurate prediction of the factor of safety.

4.3. Pairwise Error Comparison and Statistical Evidence

To further examine whether the observed differences among the three models were reflected at the case level, Wilcoxon signed-rank tests were performed on the paired absolute and relative errors of the 20 hold-out test cases. A non-parametric test was chosen because the comparison was based on paired errors from the same test cases and does not require the assumption of normally distributed error differences.

As summarized in Table 5, SVR produced significantly lower absolute and relative errors than RBFNN and BPNN on the representative hold-out subset. The differences between SVR and RBFNN were significant for both AE and RE, and the same pattern was observed for the comparison between SVR and BPNN. In contrast, the difference between RBFNN and BPNN was not statistically significant at the 0.05 level.

These results provide additional evidence that the advantage of SVR was not limited to the overall performance metrics but was also reflected in paired case-wise errors. Nevertheless, the statistical test was conducted on a single representative hold-out split and should therefore be interpreted cautiously. Its purpose is to support the case-level comparison among models, whereas the broader assessment of generalization performance should rely on the repeated nested cross-validation framework.

4.4. Performance Rationale Under Small-Sample Nonlinear Slope-Stability Prediction

The favourable performance of SVR can be explained by the compatibility between its learning mechanism and the characteristics of the present slope-stability problem. The factor of safety of circular-failure soil slopes is controlled by nonlinear interactions among soil strength parameters, slope geometry, and pore-pressure-related stress reduction. These interactions are difficult to represent using a simple linear regression form, especially when the available data are limited.

The RBF-kernel-based SVR model is well suited to this type of problem because it can project the original input variables into a high-dimensional feature space and approximate nonlinear relationships without explicitly prescribing the functional form of the slope-stability equation. At the same time, the structural-risk-minimization principle and the regularization term in SVR help control model complexity. This is particularly important in small-sample geotechnical datasets, where excessive flexibility can lead to apparently good training performance but poor generalization to unseen cases.

The comparison with RBFNN and BPNN supports this interpretation. Although the two neural-network-based models fitted the training data reasonably well, their test-set performance deteriorated more substantially. This suggests that they were more vulnerable to local sample characteristics and partition-dependent fluctuations. In contrast, SVR maintained lower test errors and fewer large local deviations, indicating a more favourable balance between nonlinear fitting capacity and model restraint. Therefore, the advantage of SVR in this study should not be understood as higher numerical accuracy, but as stronger generalization behaviour under limited-data conditions.

This point is central to the contribution of the present work. For small-sample slope-stability prediction, a model with a slightly lower apparent training fit but stronger out-of-sample stability is more valuable than a highly flexible model that performs well only on selected partitions. The proposed SVR framework therefore shifts the emphasis from accuracy-oriented model application to reliability-oriented model evaluation.

4.5. Geotechnical Consistency and Applicability of the Proposed Framework

The prediction results are also consistent with the basic mechanical understanding of circular-failure slope stability. In general, increases in cohesion and internal friction angle are expected to improve shear resistance and increase the factor of safety. In contrast, increases in slope height, slope angle, and pore pressure ratio tend to reduce slope stability by increasing driving effects or reducing effective stress. The fact that the SVR model produced stable predictions within this mechanically meaningful input space strengthens confidence that the learned relationship was not merely a statistical artifact.

This geotechnical consistency is important because machine-learning-based slope-stability models should not be judged only by error metrics. In engineering applications, a useful predictive model should also behave in a way that is compatible with established slope-stability mechanisms. The present results suggest that the proposed SVR framework can capture the combined influence of strength, geometry, and pore-pressure variables in a manner that is broadly consistent with the expected response of circular-failure soil slopes.

However, the applicability of the proposed model should be clearly bounded. The framework is most suitable for rapid preliminary assessment, early-stage screening, and comparative evaluation of circular-failure soil slopes whose input variables fall within or close to the range of the training database. It should not be used as a direct replacement for detailed site investigation, limit-equilibrium analysis, numerical modelling, or design-code-based assessment. Its reliability may decrease when applied to slopes with markedly different geological conditions, non-circular or progressive failure mechanisms, strong spatial heterogeneity, complex three-dimensional geometry, or coupled seepage-deformation processes that are not represented in the present database.

Therefore, the proposed SVR framework should be regarded as a data-driven decision-support tool rather than an independent design method. Its main value lies in providing a robust and transparent preliminary prediction of the factor of safety under limited-data conditions. When used together with engineering judgement, site-specific investigation, and conventional stability analysis, it can help improve the efficiency and reliability of early-stage slope stability assessment. A complementary analysis comprising (i) the full distribution of test-set metrics across the 50 outer folds, (ii) a direct comparison with standard 5-fold cross-validation, (iii) partial-dependence plots of the SVR model providing quantitative evidence of geotechnical consistency, and (iv) a brief discussion of the computational cost of the proposed framework is provided in Supplementary Material S1.

5. Conclusions

This study evaluated an SVR-based framework for predicting the factor of safety of circular-failure soil slopes under small-sample conditions. The following conclusions can be drawn.

(1): Among the three evaluated models (SVR, RBFNN, and BPNN), the SVR model achieved the best overall predictive performance. On the representative hold-out split, it produced the highest $R^{2}$ and the lowest RMSE, MAE, and MRE among the three models.
(2): The SVR model exhibited more stable sample-wise prediction behaviour than the two neural-network benchmark models. In the representative test subset, all SVR predictions had relative errors below 10%, whereas both RBFNN and BPNN produced multiple cases with larger deviations. The Wilcoxon signed-rank test further confirmed that the AE and RE values of SVR were significantly lower than those of both benchmark models.
(3): The prediction behaviour of the SVR model remained broadly consistent with established geotechnical expectations for circular-failure slopes. This result, together with its stronger comparative performance, indicates that SVR provided the most credible predictions among the three evaluated models for the present dataset.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/math14122074/s1, Supplementary Material S1: Repeated nested cross-validation, comparison with standard k-fold CV, and partial-dependence analysis.

Author Contributions

Conceptualization, Z.M.; methodology, S.H. and Z.M.; software, S.H.; validation, Q.W.; formal analysis, Z.W.; investigation, S.H.; data curation, X.L.; writing—original draft preparation, S.H.; writing—review and editing, L.D.; visualization, S.H.; supervision, L.D.; project administration, S.H.; funding acquisition, S.H. All authors have read and agreed to the published version of the manuscript.

Funding

The research was supported by the National Natural Science Foundation of China (No. 42162023) and Research Project on Teaching Reform in Higher Education Institutions in Jiangxi Province (No. JXJG-23-18-23). The first author was supported by the “Yuanhang Project” of the Jiangxi Association for Science and Technology for visiting the University of Alberta, Canada.

Data Availability Statement

The data presented in this study are included in the article and Appendix A. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

Xuanchi Liu was employed by Wenzhou Zhongyuan Engineering Project Management Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix A. Source Note

Eighty circular-failure slope cases were compiled from Feng (2000) [14] and Ma et al. (2022) [15] and selected in the present study. Detailed information about these cases is summarized in Table A1.

Table A1. Detailed parameters of selected slope cases for the present analyses.

Num.	$γ_{s}$ (kN/m³)	$c^{'}$ (kPa)	$φ$ (°)	$β$ (°)	$H$ (m)	$r_{u}$	$F s$
1	12	0	30	35	8	0.32	0.86
2	23.47	0	32	37	214	0.32	1.08
3	16	70	20	40	115	0.32	1.11
4	20.41	24.91	13	22	10.67	0.35	1.40
5	19.63	11.97	20	22	12.19	0.41	1.35
6	21.82	8.62	32	28	12.8	0.49	1.03
7	20.41	33.52	11	16	45.72	0.20	1.28
8	18.84	15.32	30	25	10.67	0.38	1.63
9	18.84	0	20	20	7.62	0.45	1.05
10	25	120	45	53	120	0.32	1.30
11	25	55	36	45	239	0.25	1.71
12	25	63	32	44.5	239	0.25	1.49
13	25	63	32	46	300	0.25	1.45
14	25	48	40	45	330	0.25	1.62
15	31.3	68.60	37	47.5	262.5	0.25	1.20
16	31.3	68.60	37	47	270	0.25	1.20
17	31.3	58.80	35.5	47.5	438.5	0.25	1.20
18	31.30	58.80	35.5	47.5	502.7	0.25	1.20
19	31.30	68	37	47	360.5	0.25	1.20
20	31.30	68	37	8	305.5	0.25	1.20
21	18.68	26.34	15	35	8.23	0.32	1.11
22	16.50	11.49	0	30	3.66	0.32	1.0
23	18.84	14.36	25	20	30.5	0.32	1.88
24	18.84	57.46	20	20	30.5	0.32	2.05
25	28.44	29.42	35	35	100	0.32	1.78
26	28.44	39.23	38	35	100	0.32	1.99
27	20.60	16.28	26.5	30	40	0.32	1.25
28	14.80	0	17	20	50	0.32	1.13
29	14	11.97	26	30	88	0.32	1.02
30	21.43	0	20	20	61	0.50	1.03
31	19.06	11.71	28	35	21	0.11	1.09
32	18.84	14.36	25	20	30.5	0.45	1.11
33	21.51	6.94	30	31	76.81	0.38	1.01
34	14	11.97	26	30	88	0.45	0.63
35	18	24	30.15	45	20	0.12	1.12
36	23	0	20	20	100	0.30	1.20
37	22.40	100	45	45	15	0.25	1.80
38	22.40	10	35	45	10	0.40	0.90
39	20	20	36	45	50	0.50	0.83
40	20	0	36	45	50	0.25	0.79
41	20	0	36	45	50	0.50	0.67
42	22	0	40	33	8	0.35	1.45
43	24	0	40	33	8	0.30	1.58
44	20	0	24.5	20	8	0.35	1.37
45	18	5	30	20	8	0.30	2.05
46	27	40	35	43	420	0.25	1.15
47	27	50	40	42	407	0.25	1.44
48	27	35	35	42	359	0.25	1.27
49	27	37.50	35	37.8	320	0.25	1.24
50	27	32	33	42.6	301	0.25	1.16
51	27	32	33	42.4	289	0.25	1.30
52	27.30	14	31	41	110	0.25	1.25
53	27.30	31.50	29.7	41	135	0.32	1.25
54	27.30	16.80	28	50	90.50	0.32	1.25
55	27.30	26	31	50	92	0.32	1.25
56	27.30	10	39	41	511	0.32	1.43
57	27.30	10	39	40	470	0.32	1.42
58	25	46	35	47	443	0.32	1.28
59	25	46	35	44	435	0.32	1.37
60	25	46	35	46	432	0.32	1.23
61	26	150	45	30	200	0.32	1.20
62	18.50	25	0	30	6	0.32	1.09
63	18.50	12	0	30	6	0.32	0.78
64	22.40	10	35	30	10	0.32	2.0
65	21.40	10	30.34	30	20	0.32	1.70
66	22	20	36	45	50	0.32	1.02
67	22	0	36	45	50	0.32	0.89
68	12	0	30	45	4	0.32	1.46
69	12	0	30	45	8	0.32	0.80
70	12	0	30	45	4	0.32	1.44
71	31.30	68	37	49	200.5	0.32	1.20
72	20	20	36	45	50	0.32	0.96
73	27	40	35	47.1	292	0.32	1.15
74	25	46	35	50	284	0.32	1.34
75	31.30	68	37	46	366	0.32	1.20
76	25	46	36	44.5	299	0.32	1.55
77	27.30	10	39	40	480	0.32	1.45
78	25	46	35	46	393	0.32	1.31
79	25	48	40	49	330	0.32	1.49
80	31.30	68.60	37	47	305	0.32	1.20

References

Hu, S.; Lu, Y.; Liu, X.; Huang, C.; Wang, Z.; Huang, L.; Zhang, W.; Li, X. Stability prediction of circular sliding failure soil slopes based on a genetic algorithm optimization of random forest algorithm. Electron. Res. Arch. 2024, 32, 6120–6139. [Google Scholar] [CrossRef]
Lu, P.; Rosenbaum, M.S. Artificial neural networks and grey systems for the prediction of slope stability. Nat. Hazards 2003, 30, 383–398. [Google Scholar] [CrossRef]
Das, S.K.; Biswal, R.K.; Sivakugan, N.; Das, B. Classification of slopes and prediction of factor of safety using differential evolution neural networks. Environ. Earth Sci. 2011, 64, 201–210. [Google Scholar] [CrossRef]
Liu, Z.; Shao, J.; Xu, W.; Chen, H.; Zhang, Y. An extreme learning machine approach for slope stability evaluation and prediction. Nat. Hazards 2014, 73, 787–804. [Google Scholar] [CrossRef]
Tien Bui, D.; Moayedi, H.; Gör, M.; Jaafari, A.; Foong, L.K. Predicting slope stability failure through machine learning paradigms. ISPRS Int. J. Geo-Inf. 2019, 8, 395. [Google Scholar] [CrossRef]
Mahmoodzadeh, A.; Mohammadi, M.; Hama Ali, H.F.; Ibrahim, H.H.; Abdulhamid, S.N.; Nejati, H.R. Prediction of safety factors for slope stability: Comparison of machine learning techniques. Nat. Hazards 2022, 111, 1771–1799. [Google Scholar] [CrossRef]
Khajehzadeh, M.; Keawsawasvong, S. Predicting slope safety using an optimized machine learning model. Heliyon 2023, 9, e23012. [Google Scholar] [CrossRef] [PubMed]
Kurnaz, T.F.; Erden, C.; Dağdeviren, U.; Demir, A.S.; Kökçam, A.H. Comparison of machine learning algorithms for slope stability prediction using an automated machine learning approach. Nat. Hazards 2024, 120, 6991–7014. [Google Scholar] [CrossRef]
Tun, S.H.; Zeng, C.; Jamil, F. Prediction of slope stability based on five machine learning techniques approaches: A comparative study. Multiscale Multidiscip. Model. Exp. Des. 2025, 8, 224. [Google Scholar] [CrossRef]
Xue, X. Prediction of slope stability based on hybrid PSO and LSSVM. J. Comput. Civ. Eng. 2017, 31, 04016041. [Google Scholar] [CrossRef]
Sari, P.A.; Suhatril, M.; Osman, N.; Mu’azu, M.A.; Dehghani, H.; Sedghi, Y.; Safa, M.; Hasanipanah, M.; Wakil, K.; Khorami, M.; et al. An intelligent based-model role to simulate the factor of safe slope by support vector regression. Eng. Comput. 2019, 35, 1521–1531. [Google Scholar] [CrossRef]
Wei, W.; Li, X.; Liu, J.; Zhou, Y.; Li, L.; Zhou, J. Performance evaluation of hybrid WOA-SVR and HHO-SVR models with various kernels to predict factor of safety for circular failure slope. Appl. Sci. 2021, 11, 1922. [Google Scholar] [CrossRef]
Kumar, C.; Walton, G.; Santi, P.; Luza, C. Random cross-validation produces biased assessment of machine learning performance in regional landslide susceptibility prediction. Remote Sens. 2025, 17, 213. [Google Scholar] [CrossRef]
Feng, X.T. Introduction to Intelligent Rock Mechanics; Science Press: Beijing, China, 2000. (In Chinese) [Google Scholar]
Ma, J.; Jiang, S.; Liu, Z.; Ren, Z.; Lei, D.; Tan, C.; Guo, H. Machine learning models for slope stability classification of circular mode failure: An updated database and automated machine learning (AutoML) approach. Sensors 2022, 22, 9166. [Google Scholar] [CrossRef] [PubMed]
Samui, P. Support vector classifier analysis of slope. Geomat. Nat. Hazards Risk 2013, 4, 1–12. [Google Scholar] [CrossRef]
Pei, T.; Qiu, T. Machine learning with monotonic constraint for geotechnical engineering applications: An example of slope stability prediction. Acta Geotech. 2024, 19, 3863–3882. [Google Scholar] [CrossRef]

Figure 1. Schematic diagram of support vector regression with the ε-insensitive tube.

Figure 2. Definition of input variables for circular-failure slope stability modelling.

Figure 3. Statistical distributions of six input variables in the compiled dataset: (a–f) correspond to soil unit weight (

γ_{s}

), slope height (

H

), pore pressure ratio (

r_{u}

), cohesion (

c^{'}

), internal friction angle (φ), and slope angle (

β

).

Figure 3. Statistical distributions of six input variables in the compiled dataset: (a–f) correspond to soil unit weight (

γ_{s}

), slope height (

H

), pore pressure ratio (

r_{u}

), cohesion (

c^{'}

), internal friction angle (φ), and slope angle (

β

).

Figure 4. Pairwise scatter plots and Pearson correlation structure of input variables.

Figure 5. Flowchart of the SVR modelling and hyperparameter optimization procedure.

Figure 6. Illustrative prediction results of the SVR model on a representative training–test split. (a) Fit on the training subset (training R² = 97.58%, RMSE = 0.0518); (b) prediction on the hold-out test subset (test R² = 86.56%, RMSE = 0.07497, MAE = 0.0666, MRE = 5.29%). Hollow circles denote the observed factor of safety, and filled red circles denote the SVR prediction.

Figure 7. Illustrative prediction results of the RBFNN model on a representative training–test split. (a) Fit on the training subset (training R² = 92.84%, RMSE = 0.0891); (b) prediction on the hold-out test subset (test R² = 70.15%, RMSE = 0.11838, MAE = 0.1085, MRE = 8.88%). Hollow circles denote the observed factor of safety, and filled red circles denote the RBFNN prediction.

Figure 8. Illustrative prediction results of the BPNN model on a representative training–test split. (a) Fit on the training subset (training R² = 92.63%, RMSE = 0.0906); (b) prediction on the hold-out test subset (test R² = 71.05%, RMSE = 0.10955, MAE = 0.0978, MRE = 7.91%). Hollow circles denote the observed factor of safety, and filled red circles denote the BPNN prediction. (Note: metrics in this caption refer to the originally reported single hold-out split; updated values from the repeated nested cross-validation procedure are provided in the revised Section 4).

Table 1. Configuration of the SVR model and hyperparameter search space.

Option	Description	Setting/Search Range
SVR type	Regression formulation	ε-SVR
Kernel function	Nonlinear kernel	RBF
Penalty parameter $C$	Trade-off between flatness and fitting error	2⁻⁵ to 2¹⁵
Kernel parameter γ	Influence range in the RBF kernel	2⁻¹⁵ to 2³
Insensitive width ε	Tolerance band for regression error	10⁻³ to 10⁻¹
Validation protocol	Outer-loop evaluation and inner-loop tuning	Repeated 5-fold outer CV × 10 repeats; inner 5-fold grid search

Table 2. Comparison of validation strategies and their applicability to small-sample slope data.

Validation Strategy	Mechanism Description	Applicability	Leakage Risk	Effectiveness for Small-Sample Geotech
Train-Test Split	Randomly split data once. Hyperparameters tuned on the test set.	Big data or baselines.	Extreme	Very poor; highly susceptible to optimistic bias.
Standard k-fold CV	Data divided into k folds and used iteratively for validation. Best average hyperparameters selected.	Static model evaluation without hyperparameter tuning.	High	Poor; hyperparameters absorb all data information, leading to pseudo-generalization.
Nested CV	Dual-loop structure: inner loop for grid search, outer loop for isolated generalization test.	Hyperparameter-sensitive and data-scarce scenarios.	Near Zero	Excellent; the gold standard for unbiased model selection and evaluation with N = 80.

Table 3. Detailed prediction results and error statistics for the representative hold-out test subset.

Num.	Actual Value	SVR Model			RBFNN Model			BPNN Model
Num.	Actual Value	Predicted Value	AE	RE (%)	Predicted Value	AE	RE (%)	Predicted Value	AE	RE (%)
1	1.70	1.542	0.1580	9.29	1.5347	0.1653	9.72	1.5041	0.1959	11.52
2	1.45	1.5519	0.1019	7.03	1.3683	0.0817	5.63	1.4055	0.0445	3.07
3	1.00	0.9287	0.0713	7.13	0.9066	0.0934	9.34	0.9245	0.0755	7.55
4	1.34	1.4102	0.0702	5.24	1.5108	0.1708	12.75	1.2508	0.0892	6.66
5	1.35	1.2467	0.1033	7.65	1.427	0.0770	5.70	1.4102	0.0602	4.46
6	1.20	1.1851	0.0149	1.24	1.08	0.1200	10.00	1.2907	0.0907	7.56
7	1.45	1.4206	0.0294	2.03	1.3453	0.1047	7.22	1.5266	0.0766	5.28
8	1.43	1.4883	0.0583	4.08	1.4094	0.0206	1.44	1.3373	0.0927	6.48
9	1.27	1.1857	0.0843	6.64	1.1953	0.0747	5.88	1.3197	0.0497	3.91
10	1.25	1.2771	0.0271	2.17	1.1185	0.1315	10.52	1.1989	0.0511	4.09
11	1.40	1.4936	0.0936	6.69	1.195	0.205	14.64	1.1977	0.2023	14.45
12	1.20	1.2963	0.0963	8.03	1.3181	0.1181	9.84	1.3213	0.1213	10.11
13	0.96	1.0227	0.0627	6.53	1.1107	0.1507	15.70	1.0442	0.0842	8.77
14	1.12	1.075	0.0450	4.02	0.9843	0.1357	12.12	1.1774	0.0574	5.12
15	1.08	1.1076	0.0276	2.56	1.1299	0.0499	4.62	1.0110	0.0690	6.39
16	1.37	1.439	0.0690	5.04	1.4461	0.0761	5.55	1.5202	0.1502	10.96
17	1.20	1.2245	0.0245	2.04	1.2367	0.0367	3.06	1.2739	0.0739	6.16
18	1.12	1.185	0.0650	5.80	0.9643	0.1557	13.90	1.1974	0.0774	6.91
19	1.20	1.1096	0.0904	7.53	1.0651	0.1349	11.24	1.4031	0.2031	16.93
20	0.79	0.8295	0.0395	5.00	0.8582	0.0682	8.63	0.8827	0.0927	11.73

Table 4. Overall performance of the three prediction models on the representative hold-out test subset.

Prediction Model	R²	RMSE	MAE	MRE
SVR	86.56%	0.07497	0.0666	5.29%
RBFNN	70.15%	0.11838	0.1085	8.88%
BPNN	71.05%	0.10955	0.0978	7.91%

Table 5. Summary of Wilcoxon signed-rank test results based on paired case-wise errors of the representative hold-out test subset.

Pairwise Comparison	AE p-Value	RE p-Value	Conclusion
SVR vs. RBFNN	0.0014	0.0008	SVR significantly lower errors
SVR vs. BPNN	0.0047	0.0036	SVR significantly lower errors
RBFNN vs. BPNN	0.3488	0.3118	No significant difference

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hu, S.; Mao, Z.; Deng, L.; Wang, Q.; Liu, X.; Wang, Z. SVR-Based Framework for Predicting Stability of Circular-Failure Slopes with Small Sample Size. Mathematics 2026, 14, 2074. https://doi.org/10.3390/math14122074

AMA Style

Hu S, Mao Z, Deng L, Wang Q, Liu X, Wang Z. SVR-Based Framework for Predicting Stability of Circular-Failure Slopes with Small Sample Size. Mathematics. 2026; 14(12):2074. https://doi.org/10.3390/math14122074

Chicago/Turabian Style

Hu, Shengming, Zhibin Mao, Lijun Deng, Qinghua Wang, Xuanchi Liu, and Zhou Wang. 2026. "SVR-Based Framework for Predicting Stability of Circular-Failure Slopes with Small Sample Size" Mathematics 14, no. 12: 2074. https://doi.org/10.3390/math14122074

APA Style

Hu, S., Mao, Z., Deng, L., Wang, Q., Liu, X., & Wang, Z. (2026). SVR-Based Framework for Predicting Stability of Circular-Failure Slopes with Small Sample Size. Mathematics, 14(12), 2074. https://doi.org/10.3390/math14122074

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

SVR-Based Framework for Predicting Stability of Circular-Failure Slopes with Small Sample Size

Abstract

1. Introduction

2. Basic Principle of Support Vector Regression

2.1. Regression Function in Feature Space

2.2. ε-Insensitive Loss and Optimization Formulation

2.3. Kernel Function and RBF-SVR Model

3. Construction of Slope Stability Prediction Model Based on SVR

3.1. Problem Definition and Dataset

3.2. Data Inspection and Preprocessing

3.3. Correlation Analysis and Variable Relevance

3.4. Model Development

3.5. Hyperparameter Optimization and Reproducible Validation

3.6. Benchmark Models and Evaluation Metrics

3.7. Qualitative Engineering-Consistency Assessment

3.8. Statistical Comparison Based on Paired Sample-Wise Errors

4. Results and Discussion

4.1. Representative Prediction Performance of SVR

4.2. Comparative Performance of SVR, RBFNN, and BPNN

4.3. Pairwise Error Comparison and Statistical Evidence

4.4. Performance Rationale Under Small-Sample Nonlinear Slope-Stability Prediction

4.5. Geotechnical Consistency and Applicability of the Proposed Framework

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Source Note

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI