Article

A Shallow Foundation Settlement Prediction Method Considering Uncertainty Based on Machine Learning and CPT Data

1 School of Civil Engineering and Water Resources, Qinghai University, Xining 810016, China
2 Key Laboratory of Energy-Saving Building Materials and Engineering Safety, Xining 810016, China
* Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(22), 12174; https://doi.org/10.3390/app152212174
Submission received: 13 October 2025 / Revised: 14 November 2025 / Accepted: 15 November 2025 / Published: 17 November 2025

Abstract

In the field of geotechnical engineering, predicting foundation settlement is a critical topic. Traditional settlement prediction methods struggle to accurately reflect settlement under complex geological conditions. This study uses cone penetration test (CPT) data collected from 46 geotechnical engineering sites reported in the literature. Gradient Boosting Decision Tree (GBDT), Extreme Gradient Boosting (XGBoost), Deep Neural Network (DNN), Support Vector Machine (SVM), and Random Forest (RF) models are established individually, and an ensemble model is proposed to predict the shallow foundation settlement S_t. The results show that the proposed ensemble model exhibits the best predictive performance, providing a reference for practical engineering projects. The predictions of the optimal model are compared with those of the single models and traditional methods, and the uncertainty of the model predictions is quantified using Monte Carlo Simulation (MCS). Sensitivity analyses are conducted using feature importance analysis and the SHAP method to assess the influence of the input parameters on the prediction results. Finally, Generative Adversarial Networks (GANs) are introduced to generate new data to validate the generalization capability of the model.

1. Introduction

The prediction of settlement for shallow foundations is directly related to the safety and service life of buildings and is crucial in the field of civil engineering. The vast majority of structures are supported by shallow foundations, and settlement is the primary controlling factor considered in the design of shallow foundations [1]. Settlement of shallow foundations can lead to cracks, tilting, deformation, and even structural failure of buildings, severely affecting their functionality and safety. With the acceleration of urbanization, the number and scale of buildings continue to increase, raising higher demands on foundation engineering. Especially in soft soil regions, settlement problems are more pronounced due to the high compressibility and low bearing capacity of the foundation soil layers. Therefore, accurate settlement analysis and estimation are important topics in the design of shallow foundations.
Traditional settlement prediction methods are heavily data-dependent, requiring a large amount of field test data, which is time-consuming, labor-intensive, and costly to obtain. Due to the complexity and variability of geological conditions, there remain many uncertainties in geotechnical engineering [2], posing significant challenges to the practical application of traditional settlement prediction methods. These methods struggle to accurately reflect settlement behavior, thus impacting the reliability of engineering design and construction. Therefore, accurately predicting the settlement of shallow foundations remains a challenging task.
In shallow foundation design, the fundamental equation for calculating foundation settlement is based on soil stiffness (E), foundation width (B), contact stress (q), and the stress distribution with depth. These parameters are influenced by soil type, time, and loading conditions. Using a single soil property value as the average stiffness of all soil layers can lead to bias in settlement estimation [3]. By improving the quality and quantity of field investigation tests, the uncertainty in foundation settlement analysis can be reduced. With the rapid development of in situ testing technologies, they have played an important role in geotechnical engineering design [4,5]. Among these, CPT (Cone Penetration Testing) has become a crucial tool for characterizing foundation soil layers due to its high resolution and accuracy [6]. CPT data can provide information about soil stratigraphy, density, and strength, which is vital for accurately predicting foundation settlement [7,8]. However, accurately and effectively integrating CPT data with machine learning models and quantifying the uncertainty of predictions remains a topic worthy of further research [9].
In recent years, with the development of machine learning techniques, an increasing number of researchers have attempted to apply them to settlement prediction to improve accuracy and reliability. Methods such as Gradient Boosting Decision Tree (GBDT), Extreme Gradient Boosting (XGBoost), Support Vector Machine (SVM), Artificial Neural Network (ANN), and Decision Tree (DT) have been widely applied in the field of civil engineering [10,11], for example, in intelligent seismic risk assessment [12], intelligent prediction of tunnel deterioration [13], and geological layer classification [14].
Among them, GBDT is an ensemble learning method that builds multiple decision trees sequentially, with each tree learning the residuals of the preceding trees [15]. XGBoost is an efficient implementation of GBDT that optimizes speed and performance using techniques such as second-order Taylor expansion, regularization terms, and parallel processing [16]. Deep neural networks (DNNs) are a type of artificial neural network architecture that extract and combine features of input data layer by layer through multiple layers of neurons, enabling automatic modeling and prediction of complex nonlinear relationships [17]. Support vector machines (SVMs) are a commonly used supervised learning method suitable for handling high-dimensional data and complex boundary problems [18], using nonlinear kernel functions to capture the complex relationships between input parameters and settlement. Random forest (RF) is a decision tree–based ensemble learning method with strong feature selection capability and resistance to overfitting [19]. By aggregating multiple decision trees, random forest can effectively improve model stability and predictive accuracy [20]. However, single machine learning models may have certain limitations when handling complex problems. For example, SVMs can have high computational complexity when processing large-scale data [21], while random forest—despite its stability—may perform inadequately on time-series data. Therefore, combining the strengths of multiple models to develop hybrid algorithms may be an effective approach to addressing settlement prediction problems [18,22].
This paper proposes a shallow foundation settlement prediction method based on machine learning and CPT data. Forty-six datasets from different geological conditions were collected from the literature. GBDT, XGBoost, DNN, SVM, and RF models were developed, and an ensemble model was proposed. Multiple model architectures were systematically compared using 5-fold nested cross-validation to determine the “best” ensemble; exhaustive grid searches were used for hyperparameter tuning, and the nested cross-validation employed outer folds for performance assessment and inner folds for parameter tuning to eliminate bias [23]. A small independent test set was reserved, reflecting the limited validation-data scenarios encountered in engineering practice. The test set was completely independent of the training process and provided a true external validation of performance. Comparing the predictive accuracy of each model showed that the SVM-ensemble RF model outperformed the other models. To validate the effectiveness of the proposed ensemble, the SVM-ensemble RF model’s predictions were compared with those from traditional settlement prediction methods, and Monte Carlo simulation (MCS) was used to quantify uncertainty in the ensemble model’s predictions [24,25]. Results indicate that the SVM-ensemble RF model surpasses the other models and traditional methods in both predictive accuracy and stability. In addition, to further analyze the influence of input parameters on predictions, feature importance and SHAP (Shapley Additive Explanations) values were used to conduct a sensitivity analysis of the model [26,27]. This revealed the contribution of each input parameter to settlement prediction and provided guidance for parameter selection and optimization in engineering practice. Finally, to verify the model’s generalization capability, 50 new datasets were generated using generative adversarial networks (GANs) based on the 46 datasets collected from the literature and used with the proposed ensemble model.
With the continuous advancement of data acquisition technologies and the ongoing development of machine learning algorithms, future research can further explore how to integrate more types of data (such as remote sensing data, seismic data, etc.) with machine learning models to improve the accuracy and applicability of settlement prediction [28,29,30]. Additionally, incorporating uncertainty analysis and risk assessment into the models is also a worthwhile direction for in-depth study. Through the research presented in this paper, we hope to provide a machine learning-based settlement prediction method for shallow foundations that is more accurate and widely applicable, thereby promoting technological progress and application development in this field.

2. Traditional Method

A significant amount of research has been conducted on predicting shallow foundation settlement using empirical correlations based on CPT records. These relationships primarily consider the soil's elastic modulus (E_s), foundation width (B), contact pressure (q), and cone tip resistance (q_c). A summary of the most common analytical methods reported in the literature is presented in Table 1. From Table 1, it can be seen that Janbu (1967) [31], Schmertmann (1978) [32], and Berardi and Lancellotta (1991) [33] provided fundamental formulas for calculating settlement values from static cone penetration test results. More recently, Malekdoost and Eslami (2011) [34] and Valikhah and Eslami (2019) [3] modified Janbu's equation by adding several other terms to the modulus (m). It is worth noting that the accuracy of the formulas in Table 1 is relatively high when approaching the ultimate bearing capacity or within the range of S_t/B = 0.1 [35]. All relationships in Table 1 were obtained from various case studies through plate load tests (PLT). Based on PLT results, the loadings that lead to plate failure can be determined; conversely, the allowable bearing capacity can be calculated by permitting a certain amount of settlement.
Although the recommended formulas in Table 1 vary due to factors such as soil type, time, and loading conditions, there is still a lack of a universal method capable of accurately predicting foundation settlement in geotechnical engineering practice. Predicting foundation settlement involves complex nonlinear characteristics and the coupled effects of multiple variables, making the development of a model that can capture the complexity of foundation soil layers and improve prediction accuracy highly significant.
To address the aforementioned issues, this study employs various machine learning models and ensemble models to predict shallow foundation settlement and compares the prediction results of each model. The proposed ensemble model, alongside single models and traditional methods, has been validated through Monte Carlo simulation uncertainty analysis, demonstrating its capability to accurately predict shallow foundation settlement with different geotechnical parameters under continuous CPT records.

3. Case Study Records

Table 2 provides an overview of the database of 46 cases used for model development and analysis in this study. As shown in Table 2, the cases include 22 square, 17 circular, and 7 rectangular foundations. B represents the foundation width, D_f the foundation depth, q the foundation load, q_t the corrected q_c, R_f the friction ratio, and S_t the settlement amount. The foundation embedment depth ranges from 0 to 2.35 m, and the measured settlements in these cases range from 25 to 260 mm, where S_t is approximately 2% to 10% of the foundation width B. According to the soil and loading conditions described in Schmertmann (1978) [32], the effective depth under the foundation is approximated as 2B. Table 2 therefore considers the arithmetic averages of the CPT-related parameters (i.e., R_f and q_t) down to a depth of 2B beneath the foundation. It is noteworthy that in all cases of the present study, the loads are assumed to correspond to S_t/B = 0.1, which is the quality standard for PLTs in sand. Previous studies, including Malekdoost and Eslami (2011) [34] and Valikhah and Eslami (2019) [3], have detailed the validity of the data referenced in this paper.
It is worth noting that the database can be expanded so that, at each stage of the loading test, the corresponding settlement for each load on the foundation can be regarded as part of the database, and in fact, more datasets can be envisioned. However, when the soil fails, the soil conditions beneath the footing will be similar to those recorded in the CPT data (such as q t ). Therefore, the load on the foundation corresponding to the CPT record is the ultimate load (q) [39]. This provides a theoretical basis for the input parameters of the model.
In this study, the settlement S_t is considered to be influenced by the foundation width B, the corrected cone tip resistance q_t, and the foundation load q. Although other parameters such as R_f, I_c, OCR, and D_r may provide incremental improvements in specific geological settings, this trio captures the fundamental mechanics of foundation settlement in cohesionless soils. The demonstrated performance indicates that this simplified approach offers a powerful practical tool for preliminary design and rapid assessment; therefore, B, q_t, and q are used as the input parameters for the SVM-integrated RF model. The relationship between these three input parameters and the output parameter is shown in Figure 1.

4. Modeling Process

4.1. Gradient Boosting Decision Tree (GBDT)

Gradient Boosting Decision Tree (GBDT) is an ensemble learning algorithm [15]. Its core idea is to iteratively train a new weak learner (usually a decision tree) each round to correct the residuals of the previous model, using gradient descent to minimize the loss function. Each round learns the negative gradient direction of the loss function, and the final prediction is obtained by weighted summation of all weak learners’ outputs. GBDT has the following characteristics: it can handle nonlinear relationships, is relatively robust to outliers, does not require feature standardization, and offers relatively good interpretability.

4.2. Extreme Gradient Boosting (XGBoost)

Extreme Gradient Boosting (XGBoost) is an efficient implementation of GBDT [16] with several improvements: L1 and L2 regularization terms are added to the objective function to effectively prevent overfitting; although boosting itself is a sequential process, feature selection and split finding are parallelized; missing values are handled automatically; post-pruning is used for greater precision; and the data structures and algorithms are cache-optimized for faster training. XGBoost therefore offers excellent performance, high computational efficiency, built-in cross-validation, and support for custom objective functions.
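As a brief illustration (not the tuned configuration used in this study), the two boosting models can be instantiated with scikit-learn and the xgboost package as follows; the data and hyperparameter values are placeholders.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from xgboost import XGBRegressor

X = np.random.rand(46, 3)        # placeholder features: B, q_t, q
y = np.random.rand(46) * 200     # placeholder settlement S_t (mm)

# GBDT: sequential trees fitted to the residuals of the previous rounds
gbdt = GradientBoostingRegressor(n_estimators=300, learning_rate=0.05, max_depth=3)

# XGBoost: same boosting idea with L1/L2 regularization on the objective
xgb = XGBRegressor(n_estimators=300, learning_rate=0.05, max_depth=3,
                   reg_alpha=0.1, reg_lambda=1.0)

gbdt.fit(X, y)
xgb.fit(X, y)
```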

4.3. Deep Neural Network (DNN)

A Deep Neural Network (DNN) is a neural network model composed of multiple neurons and hidden layers, featuring a multilayer structure. It is a supervised learning model proficient in handling high-dimensional data and complex nonlinear problems. The core idea of a DNN is to automatically extract features and construct mapping relationships through data-driven methods. It is widely applied in tasks such as classification, regression, image recognition, and sequence prediction. The model structure is shown in Figure 2 [17].
Suppose there is a neural network with L layers, where the computation of the neurons in layer l can be represented as:
$$z^{l} = W^{l} a^{l-1} + b^{l}$$
$$a^{l} = f\left(z^{l}\right)$$
$$\hat{y} = a^{L}$$
where z^l is the linear output of the l-th layer, W^l is the weight matrix of the l-th layer, b^l is the bias vector of the l-th layer, a^(l−1) is the nonlinear activation output of the (l−1)-th layer, and ŷ is the predicted output of the neural network.
A loss function is used to measure the error between the predicted values and the true values:
$$L = \frac{1}{m}\sum_{i=1}^{m}\left(\hat{y}_i - y_i\right)^{2}$$
where ŷ_i is the predicted value of the i-th training sample and y_i is the true value (label) of the i-th training sample.
The Adam optimizer is used to update the parameters based on the gradients, minimizing the loss function through the optimization algorithm:
$$W^{l} := W^{l} - \alpha \frac{\partial L}{\partial W^{l}}, \qquad b^{l} := b^{l} - \alpha \frac{\partial L}{\partial b^{l}}$$
$$m_t = \beta_1 m_{t-1} + \left(1 - \beta_1\right) g_t, \qquad v_t = \beta_2 v_{t-1} + \left(1 - \beta_2\right) g_t^{2}$$
$$W := W - \alpha \frac{m_t}{\sqrt{v_t} + \epsilon}$$
where α is the learning rate, g_t is the gradient at step t, m_t is the first-order momentum, and v_t is the second-order momentum.
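A minimal NumPy sketch of the forward pass, the MSE loss, and a single Adam step on the output-layer weights is given below; the layer sizes and data are illustrative, the momentum terms start from zero, and Adam's bias correction is omitted for brevity.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.random((5, 3))            # 5 samples, 3 features (B, q_t, q)
y = rng.random((5, 1)) * 200      # placeholder settlement targets

W1, b1 = rng.standard_normal((8, 3)) * 0.1, np.zeros((8, 1))
W2, b2 = rng.standard_normal((1, 8)) * 0.1, np.zeros((1, 1))

# Forward pass: z^l = W^l a^(l-1) + b^l, a^l = f(z^l)
a0 = X.T                          # (3, 5)
z1 = W1 @ a0 + b1
a1 = np.maximum(z1, 0.0)          # ReLU activation
y_hat = (W2 @ a1 + b2).T          # linear output layer, shape (5, 1)

loss = np.mean((y_hat - y) ** 2)  # MSE loss

# Gradient of the loss w.r.t. W2 (chain rule through the linear output layer)
dL_dyhat = 2.0 * (y_hat - y) / len(y)
grad_W2 = dL_dyhat.T @ a1.T       # (1, 8)

# One Adam step; m and v would persist across iterations in a full training loop
alpha, beta1, beta2, eps = 1e-3, 0.9, 0.999, 1e-8
m = (1 - beta1) * grad_W2
v = (1 - beta2) * grad_W2 ** 2
W2 -= alpha * m / (np.sqrt(v) + eps)
```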

4.4. Support Vector Machines (SVM)

Support Vector Machine (SVM) is a supervised learning model used for classification and regression tasks. Its core idea is to find an optimal hyperplane in the feature space to effectively separate and predict data points. In classification tasks, SVM improves the model’s generalization ability by maximizing the margin of the decision boundary, while in regression tasks, SVM aims to find a function f(x) whose deviation from the actual target value y at each training point does not exceed ϵ , while keeping the function as flat as possible.
The goal of Support Vector Regression (SVR) is to achieve predictions by optimizing the following function [18]:
$$f(x) = \langle \omega, x \rangle + b$$
where ⟨ω, x⟩ represents the inner product of ω and x, x is the input feature vector, ω is the weight vector that defines the normal vector to the hyperplane learned by the SVR model, and b is the bias term. To achieve this goal, SVR solves the following optimization problem:
$$\min_{\omega,\, b,\, \xi,\, \xi^{*}} \; \frac{1}{2}\lVert \omega \rVert^{2} + C \sum_{i=1}^{n}\left(\xi_i + \xi_i^{*}\right)$$
subject to the constraints:
$$y_i - \langle \omega, x_i \rangle - b \le \epsilon + \xi_i$$
$$\langle \omega, x_i \rangle + b - y_i \le \epsilon + \xi_i^{*}$$
$$\xi_i,\ \xi_i^{*} \ge 0$$
In this context, ξ_i and ξ_i* are slack variables used to handle deviations beyond ϵ, while C is the regularization parameter that controls the trade-off between the model's tolerance for errors and its complexity.
To handle nonlinear data, the SVM model introduces a kernel function, which maps the original feature space to a higher-dimensional feature space, thereby finding a linear hyperplane in the high-dimensional space to solve the nonlinear problem.
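As a brief, hedged illustration of the ε-insensitive formulation above, the sketch below fits an RBF-kernel SVR with scikit-learn; the data are placeholders, and C and ε are illustrative values rather than the grid-searched ones used later.

```python
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR

X = np.random.rand(46, 3)        # placeholder features: B, q_t, q
y = np.random.rand(46) * 200     # placeholder settlement S_t (mm)

# The RBF kernel maps the inputs to a higher-dimensional space; C and epsilon control
# the regularization strength and the width of the epsilon-insensitive tube.
svr = make_pipeline(StandardScaler(), SVR(kernel="rbf", C=10.0, epsilon=0.1))
svr.fit(X, y)
print(svr.predict(X[:3]))
```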

4.5. Random Forest (RF)

Random Forest (RF) is an ensemble learning method widely used for classification and regression tasks. Its core idea is to improve the accuracy and stability of the model by constructing multiple decision trees and combining their prediction results. In regression tasks, Random Forest generates the final prediction value by averaging the predictions of all decision trees, effectively reducing the overfitting problem that may occur with individual decision trees. The prediction formula for the random forest is as follows [19]:
$$\hat{y} = \frac{1}{T}\sum_{t=1}^{T} h_t(x)$$
In this context, T represents the number of decision trees, and h_t(x) is the predicted value of the t-th tree for the input x. By averaging the predictions from multiple trees, random forests can significantly enhance the model's generalization ability, particularly excelling in handling complex datasets. In this study, random forests are used not only to improve prediction accuracy but also to provide important support for the model's interpretability through feature importance analysis.
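A minimal sketch of the averaging formula above, on placeholder data; the impurity-based feature importances printed at the end are the same quantities used for interpretability later in the paper.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

X = np.random.rand(46, 3)        # placeholder features: B, q_t, q
y = np.random.rand(46) * 200     # placeholder settlement S_t (mm)

# Each of the T trees returns h_t(x); the forest prediction is their average.
rf = RandomForestRegressor(n_estimators=200, random_state=0).fit(X, y)
print(rf.predict(X[:3]))
print(rf.feature_importances_)   # relative contribution of B, q_t, q
```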

4.6. Integration of SVM and RF

Figure 3 illustrates the complete process of the improved SVM-integrated RF model for predicting foundational settlement. First, the raw data undergo preprocessing, including standardization to eliminate the impact of differing feature scales, ensuring that all features contribute on the same scale. Next, principal component analysis (PCA) is applied to the standardized data for dimensionality reduction. The number of principal components is determined based on a cumulative explained variance threshold of ≥95%, effectively extracting the main features while reducing redundant information.
During the model training phase, a rigorous nested cross-validation framework is employed: the outer 5-fold cross-validation is used for performance evaluation, and the inner 3-fold cross-validation is for hyperparameter optimization. Grid search is conducted to systematically tune the SVM’s regularization parameter C, the insensitive parameter epsilon, and the number of trees in the random forest. In the ensemble architecture design, the SVR first learns the nonlinear patterns in the data, and its output predictions are used as input features for the RF model, forming a cascade ensemble structure. This design not only enhances the model’s ability to capture complex nonlinear relationships but also further corrects residuals through the RF’s ensemble mechanism, significantly improving prediction accuracy.
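For readers who want to reproduce the framework, the sketch below assembles the standardization, PCA (≥95% explained variance), inner 3-fold grid search, and outer 5-fold evaluation with scikit-learn. A plain SVR is used as a stand-in final estimator; in the study, the SVM-integrated RF estimator described in Section 4.7 occupies that slot, and the grid values shown are illustrative rather than the actual search ranges.

```python
import numpy as np
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.svm import SVR
from sklearn.model_selection import GridSearchCV, KFold, cross_validate

X = np.random.rand(46, 3)        # placeholder features: B, q_t, q
y = np.random.rand(46) * 200     # placeholder settlement S_t (mm)

pipe = Pipeline([
    ("scale", StandardScaler()),
    ("pca", PCA(n_components=0.95)),   # keep components explaining >= 95% of variance
    ("model", SVR(kernel="rbf")),      # stand-in for the SVM-integrated RF estimator
])

param_grid = {"model__C": [1, 10, 100], "model__epsilon": [0.01, 0.1, 1.0]}

inner_cv = KFold(n_splits=3, shuffle=True, random_state=0)   # hyperparameter tuning
outer_cv = KFold(n_splits=5, shuffle=True, random_state=0)   # performance estimation

search = GridSearchCV(pipe, param_grid, cv=inner_cv,
                      scoring="neg_root_mean_squared_error")
scores = cross_validate(search, X, y, cv=outer_cv,
                        scoring=("r2", "neg_root_mean_squared_error"))
print(scores["test_r2"].mean(), scores["test_r2"].std())
```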

4.7. Code Implementation Framework and Development Environment

The custom programming implementation in this study was developed using Python 3.9 and employed key scientific computing libraries, including scikit-learn (version 1.2.2), NumPy (1.24.3), pandas (1.5.3), and SHAP (0.41.0). The integrated SVM-RF model was implemented as a custom estimator class that inherits from scikit-learn’s BaseEstimator and RegressorMixin base classes, ensuring compatibility with the scikit-learn ecosystem and adherence to its API standards.
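The internal structure of that class is not reproduced in the paper; the following is a minimal sketch of how such a cascade estimator could be written under the description in Section 4.6 (SVR output fed to the RF stage). The class name SVMIntegratedRF and the choice to append the SVR prediction to the original features are illustrative assumptions, not the authors' exact implementation.

```python
import numpy as np
from sklearn.base import BaseEstimator, RegressorMixin
from sklearn.ensemble import RandomForestRegressor
from sklearn.svm import SVR

class SVMIntegratedRF(BaseEstimator, RegressorMixin):
    """Illustrative SVR -> RF cascade regressor following the scikit-learn API."""

    def __init__(self, C=10.0, epsilon=0.1, n_estimators=200):
        self.C = C
        self.epsilon = epsilon
        self.n_estimators = n_estimators

    def fit(self, X, y):
        # Stage 1: SVR captures the nonlinear trend in the data.
        self.svr_ = SVR(kernel="rbf", C=self.C, epsilon=self.epsilon).fit(X, y)
        # Stage 2: RF refines the result; here the SVR prediction is appended
        # to the original features (an assumption of this sketch).
        svr_pred = self.svr_.predict(X).reshape(-1, 1)
        self.rf_ = RandomForestRegressor(
            n_estimators=self.n_estimators, random_state=0
        ).fit(np.hstack([X, svr_pred]), y)
        return self

    def predict(self, X):
        svr_pred = self.svr_.predict(X).reshape(-1, 1)
        return self.rf_.predict(np.hstack([X, svr_pred]))
```

Because it follows the scikit-learn API, such a class can be dropped directly into the pipeline and nested cross-validation framework sketched in Section 4.6.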

5. Results

5.1. Evaluation Indicators

In the modeling and validation phase, evaluating and comparing the efficiency of models is a crucial step in ensuring predictive performance. This study assesses the predictive performance of the SVM-ensemble RF model using various statistical metrics, including the coefficient of determination ( R 2 ), root mean square error (RMSE), mean absolute percentage error (MAPE), and mean absolute deviation (MAD), as defined in Formulas (14)–(17). The coefficient of determination ( R 2 ) quantifies the correlation or degree of collinearity between predicted and actual values, RMSE reflects the extent of deviation between predicted values and actual observations, MAPE measures the error between predicted and actual values, while MAD assesses the average absolute difference between predicted values and target values. These metrics evaluate the model’s predictive accuracy and error levels from different perspectives, providing a comprehensive basis for performance validation.
$$R^{2} = 1 - \frac{\sum_{1}^{M}\left(S_{at} - S_{pt}\right)^{2}}{\sum_{1}^{M}\left(S_{at} - \bar{S}_{at}\right)^{2}} \quad (14)$$
$$RMSE = \sqrt{\frac{1}{M}\sum_{1}^{M}\left(S_{at} - S_{pt}\right)^{2}} \quad (15)$$
$$MAPE = \frac{1}{M}\sum_{1}^{M}\left|\frac{S_{at} - S_{pt}}{S_{at}}\right| \times 100 \quad (16)$$
$$MAD = \frac{\sum_{1}^{M}\left|S_{at} - S_{pt}\right|}{M} \quad (17)$$
In Equations (14)–(17), S_at is the actual S_t, S_pt is the predicted S_t, S̄_at is the mean of the actual values, and M is the total number of data points.
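A small helper implementing Equations (14)–(17) might look as follows (the R² here uses the conventional mean-centred total sum of squares, and MAPE is returned in percent):

```python
import numpy as np

def settlement_metrics(s_actual, s_pred):
    """Compute R2, RMSE, MAPE (%), and MAD for measured vs. predicted settlements."""
    s_actual = np.asarray(s_actual, dtype=float)
    s_pred = np.asarray(s_pred, dtype=float)
    resid = s_actual - s_pred
    ss_res = np.sum(resid ** 2)
    ss_tot = np.sum((s_actual - s_actual.mean()) ** 2)
    return {
        "R2": 1.0 - ss_res / ss_tot,
        "RMSE": np.sqrt(np.mean(resid ** 2)),
        "MAPE": np.mean(np.abs(resid) / s_actual) * 100.0,
        "MAD": np.mean(np.abs(resid)),
    }
```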

5.2. St Prediction

5.2.1. GBDT, XGBoost and DNN Model

Figure 4, Figure 5 and Figure 6 show the results of predicting settlement St using the GBDT, XGBoost, and DNN models, respectively. In the figures, the blue curve represents the actual measured values, while the red curve represents the model predictions. In terms of performance metrics, the GBDT model achieved R 2 = 0.876, RMSE = 22.982, MAD = 15.846, and MAPE = 20.12%; the XGBoost model achieved R 2 = 0.928, RMSE = 17.533, MAD = 11.941, and MAPE = 14.86%; and the DNN model achieved R 2 = 0.826, RMSE = 27.284, MAD = 22.432, and MAPE = 27.03%. Across all evaluation metrics, the XGBoost model demonstrated superior performance. These metrics indicate that the XGBoost model can accurately predict settlement and possesses strong generalization ability. This may be because tree-based methods (GBDT and XGBoost) typically perform well when handling structured data and nonlinear relationships, whereas DNNs might be better suited for complex data such as images and text. Since predicting St in this task likely involves structured data, tree-based models have an advantage.

5.2.2. SVM Model and RF Model

Figure 7 and Figure 8 show the results of predicting settlement St using the SVM model and the RF model, respectively. In terms of performance metrics, the SVM model’s prediction results are better, with R 2 = 0.916, RMSE = 18.968, MAD = 12.517, and MAPE = 12.79%. In comparison, the RF model’s prediction results are slightly inferior, with R 2 = 0.901, RMSE = 20.570, MAD = 13.458, and MAPE = 15.87%. SVM often performs well on small to medium-sized, low-dimensional datasets, especially when there is a clear boundary or a complex nonlinear relationship between features and the target variable (captured through kernel functions). The current results align with these characteristics.

5.2.3. Integrated Model

The prediction results of the SVM-integrated RF model are shown in Figure 9. The actual and predicted values almost coincide, indicating that the model's predictions are very close to the actual values and demonstrating a high degree of fit and prediction accuracy. The statistical indicators confirm this, with R² = 0.978, RMSE = 3.764, MAD = 3.171, and MAPE = 5.02%; over the outer folds of the nested cross-validation, the performance (mean ± standard deviation) is R² = 0.978 ± 0.005, RMSE = 3.764 ± 0.45, MAD = 3.171 ± 0.38, and MAPE = 5.02% ± 0.32%. These results indicate that the ensemble model established with the SVM-ensemble RF approach proposed in this study achieves a very high fit to the data, performs well on complex ground settlement problems, effectively captures the nonlinear relationships in the data, and provides high-precision predictions.
To provide a comprehensive model comparison, the nested cross-validation statistical results of this study are shown in the table below (all results are reported as mean ± standard deviation), as presented in Table 3. Specifically, the SVM-ensemble RF model’s metrics are significantly superior to those of the other five models, demonstrating very strong generalization ability and accuracy. The models ranked after the SVM-ensemble RF in terms of performance are XGBoost, SVM, RF, GBDT, and DNN, respectively. By combining the strengths of SVM and RF, the ensemble model can more effectively capture nonlinear relationships in the data and substantially reduce prediction error. The SVM performs excellently on high-dimensional data and complex nonlinear problems, while the RF has advantages in feature selection and resistance to overfitting. Integrating the two further improves the model’s predictive accuracy and stability.

5.3. Comparison Between the Integrated Model and the Existing Equations

In this section, the SVM-ensemble RF model with the best predictive performance is selected and compared with traditional empirical-formula prediction methods. Table 1 summarizes the traditional methods relying on empirical formulas, including the work of Janbu (1967) [31], Schmertmann (1978) [32], Berardi and Lancellotta (1991) [33], Malekdoost and Eslami (2011) [34], and Valikhah and Eslami (2019) [3]. Thirteen of the 46 datasets used in this study were randomly selected for validation. Figure 10 compares the prediction results of the previous methods and the SVM-integrated RF model and reports the R², RMSE, MAD, and MAPE values for each method, allowing a quantitative comparison between the previous methods and the current model.
As shown by the results in Figure 10, the SVM-integrated RF model significantly outperforms previous methods in prediction accuracy. For example, the integrated model proposed in this study has a mean absolute deviation (MAD) value of 3.171, which is much lower than the MAD values of earlier methods, indicating a clear advantage in prediction error. Additionally, the RMSE and MAPE values of the SVM-integrated RF model are also significantly lower than those of other methods, further confirming its high accuracy in settlement prediction. In contrast, traditional methods, which rely on empirical formulas or linear assumptions, struggle to fully capture complex nonlinear relationships, resulting in certain limitations in prediction performance. Notably, compared to previous approaches, the SVM-integrated RF model proposed in this study not only demonstrates superior prediction accuracy but also offers greater versatility and practicality. Traditional methods often entail complex calculations or depend on specific soil conditions, whereas the integrated model developed here leverages machine learning techniques to more efficiently handle multidimensional input parameters, simplifying the computation process. Furthermore, the SVM-integrated RF model is more flexible in its input parameter requirements, making it suitable for a wider range of geological conditions and engineering scenarios.

5.4. Uncertainty Analysis of Integrated Models

Because the SVM-integrated RF model introduces a manually selected kernel function, the impact of the integrated model architecture on the prediction boundary remains unclear, and the randomness of the training and testing splits affects the model's stability, quantifying the uncertainty of the prediction model is crucial for improving its credibility. This paper employs Monte Carlo Simulation (MCS) to quantify the uncertainty in the predictions of the proposed integrated model. Monte Carlo Simulation is a numerical method based on random sampling that generates predictive distributions by sampling the uncertainties in model parameters and structure, and further calculates the predictive mean, standard deviation, and confidence intervals, thereby providing a more comprehensive reference for model predictions.
Assuming the number of samples in the test set is N and the number of simulations is S, the dimension of the prediction matrix Y is N × S. The formulas for calculating the prediction mean, standard deviation, and confidence interval are given by Equations (18)–(20).
$$\hat{y}_i = \frac{1}{S}\sum_{j=1}^{S} y_{ij}, \quad i = 1, 2, \ldots, N \quad (18)$$
$$\sigma_i = \sqrt{\frac{1}{S-1}\sum_{j=1}^{S}\left(y_{ij} - \hat{y}_i\right)^{2}}, \quad i = 1, 2, \ldots, N \quad (19)$$
$$CI_{95\%} = \left[\hat{y}_i - 1.96\,\sigma_i,\ \hat{y}_i + 1.96\,\sigma_i\right] \quad (20)$$
In this context, ŷ_i represents the predicted mean, y_ij is the predicted value of the i-th sample in the j-th simulation, σ_i is the standard deviation of the predictions, and CI_95% is the 95% confidence interval, i.e., the upper and lower bounds of the predicted value at a 95% confidence level, which provides a credibility range for the predictions.
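The text does not prescribe how the S simulations are generated; the sketch below shows one common realization in which the model is re-fitted on bootstrap resamples of the training data, yielding the N × S prediction matrix from which Equations (18)–(20) are evaluated. The stand-in base model and simulation count are illustrative.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

def mcs_prediction_bounds(X_train, y_train, X_test, n_sim=100, seed=0):
    """Monte Carlo uncertainty: mean, std, and 95% CI over n_sim bootstrap refits."""
    rng = np.random.default_rng(seed)
    preds = np.empty((len(X_test), n_sim))                 # N x S prediction matrix
    for j in range(n_sim):
        idx = rng.integers(0, len(X_train), len(X_train))  # bootstrap resample
        model = RandomForestRegressor(n_estimators=200, random_state=j)
        model.fit(X_train[idx], y_train[idx])
        preds[:, j] = model.predict(X_test)
    mean = preds.mean(axis=1)                              # Eq. (18)
    std = preds.std(axis=1, ddof=1)                        # Eq. (19)
    ci_low, ci_high = mean - 1.96 * std, mean + 1.96 * std # Eq. (20)
    return mean, std, ci_low, ci_high
```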
Through Monte Carlo simulation, this study quantified the uncertainty of the predictions for the test-set samples. Figure 11 shows the actual values, predicted means, and 95% confidence intervals of the test-set samples. In the figure, the prediction for each sample is represented by a vertical interval whose ends correspond to the upper and lower limits of the confidence interval. The prediction intervals are [102.0, 102.4] for sample 1, [60.0, 60.8] for sample 2, and [64.0, 65.0] for sample 3. The narrow confidence intervals of all samples indicate that the model predicts these samples with a high degree of certainty, further validating the accuracy of the integrated machine learning prediction model proposed in this study.

5.5. Sensitivity Analysis of Integrated Models

In order to comprehensively evaluate the impact of the input variables on the prediction results of the SVM-integrated RF model, this paper adopts two sensitivity analysis methods, namely feature importance analysis and SHAP (SHapley Additive exPlanations) values. For the feature importance analysis, model-based feature importance (obtained from a random forest model) and a custom perturbation analysis were used; for the SHAP analysis, the SHAP library was used. Figure 12 presents the feature importance values of the three input parameters of the model (foundation width B, foundation load q, and corrected cone tip resistance q_t). The results indicate that the feature importance value of B is 14.036, making it the variable with the greatest influence on the prediction results, followed by q with a feature importance value of 5.0694 and q_t with a feature importance value of 4.5948, indicating a relatively smaller impact. This suggests that the foundation width B is the most critical input parameter in the model, contributing significantly more to the settlement prediction results than the other variables. Although q and q_t also influence the prediction results, their importance is significantly lower than that of B. This finding aligns with the physical mechanisms of foundation settlement, as the foundation width directly affects the stress distribution and settlement of the foundation soil.
The second sensitivity analysis method used in this study is the SHAP (SHapley Additive exPlanations) value. Figure 13 quantifies the impact of the three input parameters B, q, and q_t on the model output using the SHAP method. From Figure 13, it can be seen that higher B values have a positive impact on the model output (positive SHAP values), indicating that an increase in B leads to an increase in the predicted value, while lower B values have a negative impact (negative SHAP values), indicating that a decrease in B leads to a decrease in the predicted value. This distribution is consistent with the physical mechanism of foundation settlement, since the foundation width directly affects the stress distribution and settlement of the foundation, and a larger foundation width usually leads to greater settlement. In contrast, the SHAP values of the load q and of q_t have narrower distributions concentrated around zero, indicating that q and q_t have a smaller impact on the model output. Although variations in q have some effect on the prediction results, their influence range is limited, and their positive and negative contributions to the model output are relatively balanced. This is consistent with the preceding feature importance analysis, further verifying the secondary role of q and q_t in the model.
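As an illustration of both analyses, the sketch below extracts impurity-based feature importances from a stand-in random forest and computes SHAP values with the model-agnostic KernelExplainer from the shap library (which could equally wrap the cascade model's predict function); the data, model, and background-sample size are placeholders.

```python
import numpy as np
import shap
from sklearn.ensemble import RandomForestRegressor

X = np.random.rand(46, 3)                 # placeholder features: B, q, q_t
y = np.random.rand(46) * 200              # placeholder settlement S_t (mm)
feature_names = ["B", "q", "q_t"]

# Model-based feature importance from a stand-in random forest
rf = RandomForestRegressor(n_estimators=200, random_state=0).fit(X, y)
print(dict(zip(feature_names, rf.feature_importances_)))

# Model-agnostic SHAP values (a small background sample keeps the explainer fast)
explainer = shap.KernelExplainer(rf.predict, shap.sample(X, 20))
shap_values = explainer.shap_values(X)
shap.summary_plot(shap_values, X, feature_names=feature_names)
```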
To gain a deeper understanding of the nonlinear feature effects captured by the model, partial dependence (PDP) and individual conditional expectation (ICE) analyses were conducted. Figure 14 shows the nonlinear influence of the foundation width B and the load q on the settlement prediction. The PDP curve shows that the foundation width B has a pronounced nonlinear effect on the settlement prediction within the range of 0.5–2.0 m. When B increases from 0.5 m to about 1.2 m, the predicted settlement decreases rapidly, by about 40% on average; beyond B = 1.2 m the settlement prediction stabilizes, indicating a clear threshold effect of the foundation width. The spread of the ICE curves shows considerable heterogeneity in how different samples respond to changes in foundation width, with some samples exhibiting stronger width sensitivity. The influence of the load q on the settlement prediction exhibits even stronger nonlinear characteristics: in the low-load range (200–600 kPa), settlement grows relatively gently, while beyond 600 kPa the predicted settlement accelerates, reflecting a typical nonlinear soil response. The ICE curves are less dispersed along the load dimension, indicating a high consistency of the load effect across samples. These findings provide a theoretical basis for optimizing foundation dimensions and load control in engineering practice and demonstrate the ability of the ensemble learning model to capture the nonlinear characteristics of complex geotechnical problems.
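A hedged sketch of the PDP/ICE computation using scikit-learn's inspection module is given below; the fitted model and feature scales are placeholders standing in for the trained ensemble.

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.ensemble import RandomForestRegressor
from sklearn.inspection import PartialDependenceDisplay

# Placeholder data with rough B (m), q (kPa), q_t (kPa) scales
X = np.random.rand(46, 3) * [2.0, 1500.0, 10000.0]
y = np.random.rand(46) * 200
model = RandomForestRegressor(n_estimators=200, random_state=0).fit(X, y)

# kind="both" overlays the averaged PDP curve on the individual ICE curves
PartialDependenceDisplay.from_estimator(model, X, features=[0, 1],
                                        feature_names=["B", "q", "q_t"], kind="both")
plt.show()
```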

6. Model Validation

In order to verify the generalization capability of the integrated model proposed in this study, this section employs a Generative Adversarial Network (GAN) to generate 50 new datasets based on the 46 groups of data collected from the literature. The open-source library CTGAN (version 0.8.0) was used to generate the synthetic data. The model was trained for 2000 epochs with a batch size of 100, and discrete variables in the dataset were identified automatically to ensure that the generated tabular data conform to the original data structure. The GAN comprises two main components: a generator and a discriminator. The generator (G) is responsible for generating "fake" data in an attempt to make it indistinguishable from real data, while the discriminator (D) determines whether the input data are real (from the actual dataset) or generated by the generator (fake). The objective of the GAN is to minimize the loss of the generator while maximizing the loss of the discriminator, which can be formalized as follows:
$$\min_{G}\max_{D} V(D, G) = \mathbb{E}_{x \sim p_{\mathrm{data}}(x)}\left[\log D(x)\right] + \mathbb{E}_{z \sim p_{z}(z)}\left[\log\left(1 - D\left(G(z)\right)\right)\right]$$
where x represents real data and z represents random noise.
Figure 15 shows the prediction results of the integrated model on the 50 new datasets generated by the GAN, with R² = 0.806, RMSE = 5.390, MAD = 4.932, and MAPE = 8.83%. The errors are relatively small and the prediction results are satisfactory, indicating that the integrated model proposed in this study has good generalization ability.
To improve the authenticity of the generated data, the following steps were taken: (1) during GAN training, adversarial loss functions and regularization techniques were used to encourage alignment between the generated and real data distributions; (2) the distributional similarity between the generated and real data was checked with statistical tests (such as the Kolmogorov–Smirnov test); (3) based on domain knowledge, the generated data were visually inspected and the ranges of the key parameters were validated to ensure physical plausibility. We emphasize that the GAN-generated data are used only for auxiliary validation and do not replace testing on real data. In future work, we will expand the real dataset to improve model reliability.
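A minimal sketch of this augmentation and the K-S check, assuming the open-source ctgan package (CTGAN class with epochs and batch_size arguments) and SciPy's two-sample Kolmogorov–Smirnov test; the DataFrame below is a random placeholder for the 46-case table.

```python
import numpy as np
import pandas as pd
from ctgan import CTGAN
from scipy.stats import ks_2samp

# Placeholder for the 46 real cases (rough B, q_t, q, S_t scales)
real = pd.DataFrame(np.random.rand(46, 4) * [2.0, 10000.0, 1500.0, 260.0],
                    columns=["B", "q_t", "q", "S_t"])

gan = CTGAN(epochs=2000, batch_size=100)
gan.fit(real)                  # no discrete columns in this purely numeric table
synthetic = gan.sample(50)     # 50 new synthetic cases

# Kolmogorov-Smirnov test: compare each generated column with the real distribution
for col in real.columns:
    stat, p = ks_2samp(real[col], synthetic[col])
    print(f"{col}: KS statistic={stat:.3f}, p-value={p:.3f}")
```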

7. Conclusions

This study utilized 46 sets of data collected from the literature to establish GBDT, XGBoost, DNN, SVM, and RF models and an ensemble model for predicting shallow foundation settlement, and compared the predictive performance of each model. The following are the main conclusions and discussions of this study:
(1)
The SVM-ensemble RF model proposed in this study predicts values that closely match the actual values and outperforms the other models, followed by XGBoost, SVM, RF, GBDT, and DNN. Therefore, the proposed ensemble model has very high predictive capability and can effectively capture the complex nonlinear relationships between soil layer characteristics and settlement.
(2)
This study compared the SVM-ensemble RF model with traditional methods and single machine learning models. The results showed that the SVM-ensemble RF model performed best across all evaluation metrics. This further validates that the integrated model proposed in this study can achieve more realistic and accurate settlement predictions, providing a reference for geotechnical engineering practice.
(3)
This study employs Monte Carlo simulation to quantify the uncertainty of the ensemble model’s prediction results and conducts a sensitivity analysis. The results of uncertainty quantification all fall within the 95% confidence interval. The sensitivity analysis shows that, when using an SVM-integrated RF model to predict settlement values (St), the foundation width (B) has the greatest influence, followed by foundation load (q), and finally corrected cone tip resistance ( q t ).
(4)
Finally, based on the 46 sets of data collected from the literature, 50 new datasets were generated using a Generative Adversarial Network (GAN) and applied to the ensemble model proposed in this study. The results indicate that the ensemble model demonstrates good generalization ability.
Based on the aforementioned research content, future studies could explore how to integrate more types of data (such as remote sensing data and seismic data) with existing models. Through data fusion, a more comprehensive description of the characteristics of the foundation soil layers can be achieved, thereby improving the accuracy and applicability of predictions. Additionally, by utilizing big data technologies and cloud computing platforms, large-scale datasets can be processed and analyzed more effectively, enhancing the model’s generalization capability. Furthermore, while ensuring prediction accuracy, research should focus on simplifying the model structure to improve its computational efficiency and interpretability. Uncertainty analysis and prior assessments can also be incorporated into the model to enhance the reliability and practicality of the prediction results.

Author Contributions

Conceptualization, R.Z. and W.Z.; Methodology, R.Z. and W.Z.; Software, R.Z. and W.Z.; Validation, R.Z. and W.Z.; Formal analysis, R.Z. and W.Z.; Investigation, R.Z. and W.Z.; Resources, W.Z.; Data curation, R.Z. and W.Z.; Writing—original draft, R.Z. and W.Z.; Writing—review & editing, R.Z. and W.Z.; Visualization, R.Z. and W.Z.; Supervision, W.Z.; Project administration, W.Z.; Funding acquisition, W.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (No. 52468054), the Qinghai Provincial Science and Technology Cooperation Project (No. 2023-HZ-806) and the Fund of China Scholarship Council (No. 202308630007).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Holtz, R.D.; Kovacs, W.D.; Sheahan, T.C. An Introduction to Geotechnical Engineering; Prentice-hall: Englewood Cliffs, NJ, USA, 1981. [Google Scholar]
  2. Paikowsky, S.G.; Fu, Y.; Amatya, S.; Canniff, M.C. Uncertainty in shallow foundations settlement analysis and its utilization in SLS design specifications. In Proceedings of the 17th International Conference on Soil Mechanics and Geotechnical Engineering (Volumes 1, 2, 3 and 4), Alexandria, Egypt, 5–9 October 2009; pp. 1317–1320. [Google Scholar]
  3. Valikhah, F.; Eslami, A. CPT-Based Nonlinear Stress–Strain Approach for Evaluating Foundation Settlement: Analytical and Numerical Analysis. Arab. J. Sci. Eng. 2019, 44, 8819–8834. [Google Scholar] [CrossRef]
  4. Omer, J.R.; Delpak, R.; Robinson, R.B. A new computer program for pile capacity prediction using CPT data. Geotech. Geol. Eng. 2006, 24, 399–426. [Google Scholar] [CrossRef]
  5. Moshfeghi, S.; Eslami, A. Reliability-based assessment of drilled displacement piles bearing capacity using CPT records. Mar. Georesources Geotechnol. 2019, 37, 67–80. [Google Scholar] [CrossRef]
  6. Robertson, P.K. Soil behaviour type from the CPT: An update. In Proceedings of the 2nd International Symposium on Cone Penetration Testing, Huntington Beach, CA, USA, 9–11 May 2010; Volume 2, pp. 575–583. [Google Scholar]
  7. Mayne, P.W. Cone Penetration Testing; Transportation Research Board: Washington, DC, USA, 2007. [Google Scholar]
  8. Lunne, T.; Powell, J.J.M.; Robertson, P.K. Cone Penetration Testing in Geotechnical Practice; CRC Press: Boca Raton, FL, USA, 2002. [Google Scholar]
  9. Jefferies, M.G.; Davies, M.P. Use of CPTU to estimate equivalent SPT N 60. Geotech. Test. J. 1993, 16, 458–468. [Google Scholar] [CrossRef]
  10. Samui, P. Support vector machine applied to settlement of shallow foundations on cohesionless soils. Comput. Geotech. 2008, 35, 419–427. [Google Scholar] [CrossRef]
  11. Kaveh, A. Applications of Artificial Neural Networks and Machine Learning in Civil Engineering; Studies in Computational Intelligence; Springer: Cham, Switzerland, 2024; Volume 1168. [Google Scholar]
  12. Kourehpaz, P.; Molina Hutt, C. Machine learning for enhanced regional seismic risk assessments. J. Struct. Eng. 2022, 148, 04022126. [Google Scholar] [CrossRef]
  13. Ahmed, M.O.; Khalef, R.; Ali, G.G.; El-Adaway, I.H. Evaluating deterioration of tunnels using computational machine learning algorithms. J. Constr. Eng. Manag. 2021, 147, 04021125. [Google Scholar] [CrossRef]
  14. Faraz Athar, M.; Khoshnevisan, S.; Sadik, L. CPT-Based Soil Classification through Machine Learning Techniques. In Proceedings of the Geo-Congress 2023, Los Angeles, CA, USA, 26–29 March 2023; pp. 277–292. [Google Scholar]
  15. Bentéjac, C.; Csörgő, A.; Martínez-Muñoz, G. A comparative analysis of gradient boosting algorithms. Artif. Intell. Rev. 2021, 54, 1937–1967. [Google Scholar] [CrossRef]
  16. Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
  17. Schmidhuber, J. Deep learning in neural networks: An overview. Neural Netw. 2015, 61, 85–117. [Google Scholar] [CrossRef]
  18. Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction; Springer: New York, NY, USA, 2009. [Google Scholar]
  19. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  20. Ho, T.K. Random decision forests. In Proceedings of the 3rd International Conference on Document Analysis and Recognition, Montreal, QC, Canada, 14–16 August 1995; Volume 1, pp. 278–282. [Google Scholar]
  21. Cervantes, J.; Garcia-Lamont, F.; Rodríguez-Mazahua, L.; Lopez, A. A comprehensive survey on support vector machine classification: Applications, challenges and trends. Neurocomputing 2020, 408, 189–215. [Google Scholar] [CrossRef]
  22. Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
  23. Asgarkhani, N.; Kazemi, F.; Jankowski, R.; Formisano, A. Dynamic ensemble-learning model for seismic risk assessment of masonry infilled steel structures incorporating soil-foundation-structure interaction. Reliab. Eng. Syst. Saf. 2025, 267, 111839. [Google Scholar] [CrossRef]
  24. Benker, M.; Furtner, L.; Semm, T.; Zaeh, M.F. Utilizing uncertainty information in remaining useful life estimation via Bayesian neural networks and Hamiltonian Monte Carlo. J. Manuf. Syst. 2021, 61, 799–807. [Google Scholar] [CrossRef]
  25. Padarian, J.; Minasny, B.; McBratney, A.B. Assessing the uncertainty of deep learning soil spectral models using Monte Carlo dropout. Geoderma 2022, 425, 116063. [Google Scholar] [CrossRef]
  26. Nguyen, H.T.T.; Cao, H.Q.; Nguyen, K.V.T.; Pham, N.D.K. Evaluation of explainable artificial intelligence: Shap, lime, and cam. In Proceedings of the FPT AI Conference, Ha Noi, Vietnam, 6–7 May 2021; pp. 1–6. [Google Scholar]
  27. Lundberg, S.M.; Erion, G.G.; Lee, S.I. Consistent individualized feature attribution for tree ensembles. arXiv 2018, arXiv:1802.03888. [Google Scholar]
  28. Cracknell, M.J.; Reading, A.M. Geological mapping using remote sensing data: A comparison of five machine learning algorithms, their response to variations in the spatial distribution of training data and the use of explicit spatial information. Comput. Geosci. 2014, 63, 22–33. [Google Scholar] [CrossRef]
  29. Wang, W.; Xue, C.; Zhao, J.; Yuan, C.; Tang, J. Machine learning-based field geological mapping: A new exploration of geological survey data acquisition strategy. Ore Geol. Rev. 2024, 166, 105959. [Google Scholar] [CrossRef]
  30. Ishihara, K.; Yoshimine, M. Evaluation of settlements in sand deposits following liquefaction during earthquakes. Soils Found. 1992, 32, 173–188. [Google Scholar] [CrossRef]
  31. Janbu, N. Settlement Calculations Based on the Tangent Modulus Concept; Technical University of Norway: Trondheim, Norway, 1967. [Google Scholar]
  32. Schmertmann, J.H. Guidelines for Cone Test, Performance, and Design; Report FHWA-TS-78209; Federal Highway Administration: Washington, DC, USA, 1978; p. 145. [Google Scholar]
  33. Berardi, R.; Lancellotta, R. Stiffness of granular soils from field performance. Geotechnique 1991, 41, 149–157. [Google Scholar] [CrossRef]
  34. Malekdoost, M.; Eslami, A. Application of CPT data for estimating foundations settlement-case histories. Sharif J. Civ. Eng. 2011, 1, 75–85. [Google Scholar]
  35. MolaAbasi, H.; Saberian, M.; Khajeh, A.; Li, J.; Chenari, R.J. Settlement predictions of shallow foundations for non-cohesive soils based on CPT records-polynomial model. Comput. Geotech. 2020, 128, 103811. [Google Scholar] [CrossRef]
  36. Eslami, A.; Gholami, M. Bearing capacity analysis of shallow foundations from CPT data. In Proceedings of the 16th International Conference on Soil Mechanics and Geotechnical Engineering, Osaka, Japan, 12–16 September 2005; pp. 1463–1466. [Google Scholar]
  37. Mayne, P.W.; Illingworth, F. Direct CPT method for footing response in sands using a database approach. In Proceedings of the 2nd International Symposium on Cone Penetration Testing, Huntington Beach, CA, USA, 9–11 May 2010. [Google Scholar]
  38. Briaud, J.L.; Gibbens, R. Behavior of five large spread footings in sand. J. Geotech. Geoenviron. Eng. 1999, 125, 787–796. [Google Scholar] [CrossRef]
  39. Eslami, A.A.; Gholami, A.M. Analytical model for the ultimate bearing capacity of foundations from cone resistance. Sci. Iran. 2006, 13, 223–233. [Google Scholar]
Figure 1. The relationship between input and output parameters. (a) B- S t ; (b) q- S t ; (c) q t - S t .
Figure 2. Structure of the DNN Model.
Figure 3. Algorithm of SVM-integrated RF model to estimate the settlement of the foundation.
Figure 4. Comparison between the actual values and predicted values of St using the GBDT.
Figure 5. Comparison between the actual values and predicted values of St using the XGBoost.
Figure 6. Comparison between the actual values and predicted values of St using the DNN Model.
Figure 7. Comparison between the actual values and predicted values of St using the SVM model.
Figure 8. Comparison between the actual values and predicted values of St using the RF model.
Figure 9. Comparison between the actual values and predicted values of St using the SVM-integrated RF method.
Figure 10. Comparison between the actual and predicted St by previous methods and the current model. (a) Janbu (1967) [31]; (b) Schmertmann (1978) [32]; (c) Berardi and Lancellotta (1991) [33]; (d) Malekdoost and Eslami (2011) [34]; (e) Valikhah and Eslami (2019) [3]; (f) The SVM-integrated RF model.
Figure 11. Quantifying uncertainty in prediction results of the test set using Monte Carlo methods.
Figure 12. Visualization of feature importance for the three input parameters B, q and q t .
Figure 13. Quantify the impact of three input parameters on the model results using SHAP method.
Figure 14. The Mechanism of Influence of Foundation Width and Load on Settlement Prediction Based on PDP and ICE.
Figure 15. Comparison between the actual values and predicted values of St for GAN-generated data using the SVM-integrated RF method.
Table 1. The calculation method for foundation settlement proposed by predecessors.
No. 1 — Janbu (1967) [31]
Equation: S_t = (1/(m·j)) [((σ_0 + Δσ)/σ_r)^j − (σ_0/σ_r)^j] × H
Remarks: S_t: foundation settlement; ε: strain induced by the effective stress increase; σ_0: initial effective stress; Δσ: increase in effective stress under the applied stress; m: modulus number; H: thickness of the target layer; j: stress exponent = 0.5; σ_r: constant reference stress equal to 100 kPa.

No. 2 — Schmertmann (1978) [32]
Equation: S_t = C_1 C_2 q Σ_0^{2B} (I_z/E) ΔH
Remarks: C_1: correction factor for the depth of foundation embedment = 1 − 0.5(σ_0/q); C_2: correction factor to account for creep in soil (t is time in years) = 1 + 0.2 log(t/0.1); I_z: strain influence factor; E = 2 q_c.

No. 3 — Berardi and Lancellotta (1991) [33]
Equation: S_t = (q B / E) I_s
Remarks: E = K_E σ_r [(σ_0 + Δσ/2)/σ_r]^0.5; K_E: modulus number; I_s = 0.63, 0.69, and 0.88 for circular, square, and rectangular foundations, respectively.
No. 4 — Malekdoost and Eslami (2011) [34]
Equation: S_t = (1/(m·j)) [((σ_0 + Δσ)/σ_r)^j − (σ_0/σ_r)^j] × H
Remarks: m = 2 q_c; j: stress exponent expressed as a function of q_c and R_f; q_c: cone tip resistance; R_f: friction ratio = f_s/q_c.

No. 5 — Valikhah and Eslami (2019) [3]
Equation: S_t = (1/(m·j)) [((σ_0 + Δσ)/σ_r)^j − (σ_0/σ_r)^j] × H
Remarks: m: modified modulus number expressed in terms of q_c, the foundation width B, and the penetration cone diameter b; j: stress exponent expressed in terms of q_c, R_f, and σ_0.
Table 2. A summary of the data employed in this research.
Soil Type | Case No. | Footing Shape | q_t (kPa) | R_f | D_f (m) | B (m) | q (kPa) | S_t (mm) | Reference
Silt1Square25000.50130098Eslami and Gholami (2005) [36]
2 28000.50132597
Silt Sand3Square70000.500.6126055
4 10,0000.500.6128055
Silt Clay5Circular14000.600.4517040
6 17000.600.617055
7 20000.600.617055
Silt Clay8Circular31000.61.50.652060
9 46000.61.50.631055
10 54000.61.50.631060
11 60000.61.50.669060
Glaciofluvial Sand12Rectangular10,7200.510.40.6174059Mayne and Illingworth (2010) [37]
13 10,7200.510.61.21740119
14 10,7200.510.81.71740170
15 10,7200.511.12.41740245.8
Siliceous Sand16Square34400.440.50.548051
Sand, Silty Sand17Square75200.650.7611540100Briaud and Gibbens (1999) [38]
18 75200.650.761.51540154
Silt19Square17000.501375115Eslami and Gholami (2006) [39]
20 20000.501370100
Silt Sand21Square30000.500.6126060
Silt Clay22Circular5000.600.317033
23 9000.600.317025
Silt Clay24Circular10000.61.50.660072
25 17000.61.50.660072
26 25000.61.50.660060
White Fine Sand27Square36600.5400.6962065Mayne and Illingworth (2010) [37]
Glaciofluvial Sand28Rectangular40100.6301840102.4
29 40100.6301840102.4
30 32000.631.12.4640260
Compacted Fill31Square8800.5300.4615047
32 38600.4800.6358064
33 28700.5800.852082
Alluvial Sand34Circular67200.62.22.21280250
35 67200.62.22.21280250
36 10,4600.522.352.351730245
37 10,4600.522.352.351730245
Dune Sand38Square40100.6600.784071.7
39 40100.6600.784071.7
40 40100.6601840102.4
41 40100.6601840102.4
42 40100.6601840102.4
Silty Sand43Circular17100.550.61.821710186
Siliceous Dune Sand44Square4800.440.50.548051
45 4800.4411480102
46 4800.4411480102
Table 3. Comparison of nested cross validation performance of GBDT, XGBoost, DNN, SVM, RF, and SVM-integrated RF models (mean ± standard deviation).
Model | R² | RMSE | MAD | MAPE
GBDT | 0.876 ± 0.009 | 22.982 ± 0.43 | 15.846 ± 0.43 | 20.12% ± 0.40%
XGBoost | 0.926 ± 0.007 | 17.533 ± 0.48 | 11.914 ± 0.39 | 14.86% ± 0.35%
DNN | 0.826 ± 0.015 | 27.284 ± 0.68 | 22.432 ± 0.52 | 27.03% ± 0.48%
SVM | 0.916 ± 0.008 | 18.968 ± 0.52 | 12.517 ± 0.41 | 12.79% ± 0.38%
RF | 0.901 ± 0.012 | 20.570 ± 0.61 | 13.458 ± 0.49 | 15.87% ± 0.45%
SVM-integrated RF | 0.978 ± 0.005 | 3.764 ± 0.45 | 3.171 ± 0.38 | 5.02% ± 0.32%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
