Prediction of the Vertical Bearing Capacity of Piles in Cold Saline Environments by a Multi-Dimensional Machine Learning Approach

Jia, Yuhan; Li, Zhaochao

doi:10.3390/buildings16102042

Open AccessArticle

Prediction of the Vertical Bearing Capacity of Piles in Cold Saline Environments by a Multi-Dimensional Machine Learning Approach

by

Yuhan Jia

¹ and

Zhaochao Li

^2,*

¹

School of Civil and Environmental Engineering, Hunan University of Technology, Zhuzhou 412007, China

²

Intelligent Control of Safety and Risk for Existing Engineering Structures, Key Laboratory of Hunan Province, Hunan University of Technology, Zhuzhou 412007, China

^*

Author to whom correspondence should be addressed.

Buildings 2026, 16(10), 2042; https://doi.org/10.3390/buildings16102042

Submission received: 8 April 2026 / Revised: 8 May 2026 / Accepted: 20 May 2026 / Published: 21 May 2026

(This article belongs to the Topic Digital Twins and Artificial Intelligence for Advancing Smart Green Building and City Resilience)

Download

Browse Figures

Versions Notes

Abstract

This study proposes an innovative methodology that integrates finite element simulation, machine learning, and interpretable model analysis to predict the vertical bearing capacity of piles in cold saline environments. Initially, Python scripts are developed to drive the ABAQUS platform, and LHS (Latin Hypercube Sampling) is employed to generate random parameter combinations to construct a multi-dimensional ML (machine learning) database. Six ML models, including XGBoost and LightGBM, are developed with hyperparameters optimized by cross-validation and grid search. Model performance is evaluated by five metrics (R², MSE, RMSE, MAE, and MAPE). Finally, parametric sensitivity is analyzed by the SHAP (SHapley Additive exPlanations) method. The study demonstrates that: (1) the XGBoost and LightGBM models achieve optimal performance on the test set, and the generalization ability significantly exceeds other models; (2) pile diameter is the primary factor influencing vertical bearing capacity, and corrosion depth exhibits higher sensitivity than corrosion thickness; and (3) the bearing capacity of the pile is predicted by using the automated parametric modeling method based on Python (3.8)-ABAQUS (2022). The automated modeling and prediction framework may serve as a reference for pile design in similarly complex environments.

Keywords:

vertical bearing capacity; cold saline environments; finite element-machine learning integration; XGBoost; LightGBM; SHapley Additive exPlanations (SHAP)

1. Introduction

Salt-laden soil is a distinct geological formation in cold marshland areas. Pile foundations may be corroded by salt-laden soil under seasonal freeze–thaw cycles, and this corrosion reduces the bearing capacity of the pile foundations. Thus, it is necessary to consider the effect of erosion on the pile foundation of highways and bridges in salt marsh environments. However, it remains a major challenge to accurately predict the load-bearing capacity of piles in cold marsh saline soils [1,2,3]. Shao et al. developed a numerical method to simulate the damage and degradation of concrete piles in sulfate saline soil [4]. Yunusaliyev et al. studied the effects of environmental aggressiveness and changes in the properties of saline soils on horizontally loaded piles [5]. However, a standardized theoretical framework has not been established. Traditional methods, such as static load tests and empirical formulae, often fail to capture the nonlinear dependency between these multiple factors and the ultimate bearing capacity. Finite Element Analysis (FEA) may simulate soil–pile interactions, but it is limited by calculation efficiency and adaptation to different site conditions. A robust data-driven approach has the potential to improve the accuracy of prediction since the rapid advancement of machine learning (ML) has shown robust applications in pile foundation engineering. Momeni et al. employed a genetic algorithm (GA) for optimizing a Artificial Neural Network (ANN) model to predict the bearing capacity of piles [6]. Fattahi et al. used two optimization algorithms, Fruit Fly Optimization (FFO) and Invasive Weed Optimization (IWO), to predict the load-bearing capacity of piles [7]. Sun et al. highlighted a Support Vector Regression (SVR) model optimized by the Grey Wolf Optimizer (GWO) algorithm to predict the ultimate bearing capacity of embedded rock piles [8]. Chen et al. developed a hybrid model based on four ML algorithms and Ensemble Learning (EL) to predict the ultimate bearing capacity of rock-socketed shafts [9]. However, there are still limitations in current predictions of the ultimate bearing capacity of piles: (1) Few existing models have been specifically developed for cold and swampy regions. (2) It is challenging to build a comprehensive database for prediction models due to the limited field data [10]. The data samples from field tests often exhibit high correlation, leading to overfitting and misleading results [11].

Tree boosting is a highly effective and widely used machine learning method [12]. The Extreme Gradient Boosting (XGBoost) and the Light Gradient Boosting Machine (LightGBM) algorithms are efficient gradient boosting frameworks for managing high-dimensional nonlinear data and capturing complex feature interactions. In recent years, XGBoost and LightGBM have successfully addressed several challenges in geotechnical engineering. Zhang et al. employed an efficient reliability analysis framework utilizing an XGBoost surrogate model to evaluate the failure probability of unsaturated slopes experiencing rapid decline. The model considered the depth-dependent variability of spatially heterogeneous soil [13]. Zhang et al. calculated the reliability of the time-dependent characteristics of the Bazimen Landslide in the Three Gorges Reservoir Area, utilizing XGBoost and LightGBM algorithms [14]. Wang et al. proposed a tree-based model, LightGBM, to predict the intensity of rock bursts [15].

This study proposes a hybrid approach integrating numerical simulation with an ML model. Python scripts are utilized to systematically modify parameters such as pile geometry, material properties, and soil characteristics in the ABAQUS package. A diverse dataset is created for ML by automating model generation. Subsequently, the XGBoost and LightGBM models are developed and employed to predict the vertical bearing capacity of corroded piles in cold marsh environments. The ABAQUS model is verified successfully by the experimental results. The XGBoost and LightGBM models achieve better fitting performance compared to other models, and the two models exhibit the highest accuracy and superior generalization ability. The approach offers a significant reduction in research costs and time compared to traditional field test-based methods by automating parameter variations and computational processes.

2. Numerical Simulations

The pile foundation deteriorates because it is corroded in salt marsh environments. The corrosion is simulated by reducing the elastic modulus of the pile body [16]. The spalling thickness and corrosion depth represent lateral and vertical corrosion of the pile, respectively. It was found that corrosion primarily occurs within 8 m below the surface, exhibiting a corrosion depth of 0–8 m and a spalling thickness of 0–16 cm [17].

The corrosion effects in cold saline environments are captured through two key parameters: corrosion depth (h, ranging from 0.1 to 8.0 m) and spalling thickness (δ, ranging from 0.01 to 0.20 m). These parameters represent the cumulative structural consequences of temperature fluctuations, salinity concentration, and freeze–thaw cycles on the pile body. In the finite element model, the corroded zone is assigned a separate material with a negligibly small elastic modulus (E = 300 Pa) rather than reducing the elastic modulus of the original pile material by a percentage. The current model addresses the structural effects of corrosion at a given degradation state rather than modeling the electrochemical corrosion process itself. Each simulation represents a steady-state condition at a specific corrosion level; by varying h and δ across wide ranges, the dataset captures the pile response at different stages of corrosion progression.

The developed 2D model is compared with experimental data to validate the proposed numerical framework [18]. In this model, the stress–strain behavior of the soil is described by the Mohr–Coulomb constitutive model. The corroded region is modeled with a reduced elastic modulus, scaled by a small factor (1 × 10⁻⁶ or less) to prevent matrix singularity. As the normal pile–soil interface is defined by hard contact, appropriate local mesh refinement is applied around the contact area to ensure computational accuracy and minimize boundary effects. Figure 1 shows the mesh of the model, and Figure 2 illustrates the load–displacement curve of the simulation. It is found the bearing capacity is 614 kN in the numerical model. The experimental bearing is 587.51 kN [18]. The difference is about 4%, indicating that the proposed 2D numerical model is accurate in predicting the bearing capacity of the pile foundation in salt marsh environments.

The model employs a three-step analysis procedure: (1) a geostatic step to establish the initial stress field under gravity loading, during which the pile elements are deactivated using the Model Change technique; (2) a static step to activate the pile elements and establish the pile–soil contact (surface-to-surface contact with penalty friction coefficient μ = 0.25); and (3) a static step with displacement-controlled loading (vertical displacement u₂ = −0.5 m applied at the pile head, with nlgeom = ON) to simulate the pile loading process. The bearing capacity is defined as the reaction force (RF2) at the pile head node when the vertical displacement reaches 0.05 m. The soil domain is modeled as a 25 m × 70 m rectangle with CAX4 elements (4-node and 3-node axisymmetric elements). Boundary conditions include fixed bottom (u₁ = u₂ = 0), roller sides (u₁ = 0), and axisymmetric axis constraints.

It should be noted that the 2D axisymmetric model inherently assumes that the pile geometry, soil stratigraphy, loading, and corrosion distribution are all symmetric about the pile’s central axis. In reality, corrosion may occur non-uniformly around the pile circumference. The present model therefore represents a conservative or idealized scenario, and its extension to fully 3D non-symmetric cases is a potential direction for future work.

In the ABAQUS model, ‘corrosion depth’ is implemented by assigning the reduced elastic modulus to the pile elements located within a specified depth range from the ground surface (0–8 m, as noted in the manuscript). ‘Spalling thickness’ is simulated by deactivating or assigning a negligibly small stiffness to a corresponding layer of elements at the perimeter of the pile within the corroded depth. The Python 3.8 script automatically modifies the material assignments and element sets based on the sampled parameter values.

3. Methodology

The methodology consists of three components. Figure 3 presents the detailed workflow. Initially, the random function of Python is utilized to generate the dataset. Following data collection and preprocessing, a dataset with a multi-dimensional feature space is established and split into a training set (80%) and a testing set (20%). Subsequently, six ML models are developed, trained, and evaluated. Specifically, the training phase incorporates a systematic hyperparameter optimization process, utilizing grid search combined with 5-fold cross-validation to identify the optimal hyperparameters. The performance of each model is then assessed using the R-squared, RMSE, MAE, and MAPE metrics. Finally, Mean Decrease in Impurity (MDI) and SHapley Additive exPlanations (SHAP) analyses are applied to conduct feature importance ranking and global interpretations of the developed models. In this context, ‘the term multi-dimensional’ refers to the eight-dimensional input feature space defining the pile–soil system in cold saline environments. This space consists of eight specific parameters, pile diameter (D), pile length (L), spalling thickness (δ), corrosion depth (h), elastic modulus of pile (Ep), elastic modulus of soil (Es), internal friction angle (φ), and cohesion (c), which are collectively used to predict a single target output: the vertical bearing capacity (Q).

3.1. Data Preparation

The dataset is generated from numerical simulations using the ABAQUS finite element platform. This study selects eight key parameters as shown in Table 1. The eight input parameters are selected based on their established influence on pile bearing capacity: pile diameter (D, 1.0–2.0 m) and pile length (L, 30–60 m) directly determine shaft resistance and end-bearing capacity; spalling thickness (δ, 0.01–0.20 m) and corrosion depth (h, 0.1–8.0 m) characterize corrosion damage extent; elastic modulus of pile (Ep, 15–40 GPa) represents the structural stiffness of the intact pile body; and the Mohr–Coulomb soil parameters including elastic modulus (Es, 15–40 MPa), internal friction angle (φ, 10–30°), and cohesion (c, 10–30 kPa) govern the soil’s shear strength and deformation behavior. The current model assumes a homogeneous soil layer, which is a simplification adopted for the parametric study. Pore water conditions are not explicitly modeled as separate features because the Mohr–Coulomb model uses effective stress parameters.

A Python-based ABAQUS script is employed to develop a numerical model and establish a database, which includes eight parameters and the corresponding bearing capacities. Specifically, a settlement of 50 mm corresponds to the vertical ultimate bearing capacity of the pile. An improved Latin Hypercube Sampling (LHS) method [19] is employed to combine parameters. First, samples are obtained by identifying equal-probability intervals within the variable domain for each variable [20]. The stratified unit order is determined via random permutation in each dimension, and a uniformly distributed random offset is superimposed to mitigate the effect of grid alignment. Furthermore, the standardized samples are mapped onto the actual project to ensure that the generated parameter combinations align with the engineering constraints of corrosion width. The method improves spatial coverage and reduces computational redundancy by the introduction of physically constrained randomness. The improvement is achieved by dimension-independent permutations coupled with uniform perturbation mechanisms. A total of 600 simulations were performed, representing a balance between adequate coverage of the eight-dimensional parameter space via Latin Hypercube Sampling and the available computational resources for batch processing in ABAQUS. Six hundred models are batch-generated and computed by executing a Python-based finite element script. Subsequently, all corresponding parameters are saved as CSV files. The vertical bearing capacity corresponding to a 50 mm pile-top settlement is extracted for all ODB files and saved as a CSV file by using a separate data extraction script written in Python. Figure 4 presents the frequency distribution and correlation of the generated data.

3.2. Overview of Machine Learning

3.2.1. XGBoost

The XGBoost model is a powerful and scalable tree-boosting ML framework with a sparse-aware algorithm [21]. Figure 5 illustrates the schematic of XGboost. The algorithm incorporates advanced techniques such as tree pruning, column sampling, and parallelization to accelerate training without compromising accuracy. The core strength lies in the ability to enhance model generalization by regularization strategies and an efficient split-point determination mechanism. A pre-sorting algorithm is employed to perform a global scan of feature values, resulting in an optimal split-point selection. Additionally, an automatic missing-value handling mechanism is employed to manage missing data directly without preprocessing [22]. XGBoost supports multi-threaded parallel computation and user-defined loss functions, demonstrating high accuracy and stability on small- to medium-sized datasets. Consequently, XGBoost has been widely adopted in various domains [23].

The model is written as a sum of multiple trees:

{\hat{y}}_{i} = \sum_{k = 1}^{K} f_{k} (x_{i})

(1)

where

f_{k} (x_{i})

is the

k

th tree. A loss function may be defined to quantify the discrepancy between the predicted (true) values and the objective function:

L (\emptyset) = \sum_{i = 1}^{N} l (y_{i}, {\hat{y}}_{i}) + \sum_{k = 1}^{K} Ω (f_{k})

(2)

where

l (y_{i}, {\hat{y}}_{i})

is the loss function;

Ω (f_{k})

is the regularization term to prevent overfitting, defined as:

Ω (f_{k}) = Υ T + \frac{1}{2} λ {‖W‖}^{2}

(3)

where

T

is the number of leaves in the tree;

W

is the weight vector of the leaves;

Υ

and

λ

are the regularization parameters, respectively. The second-order Taylor expansion is employed to approximate the objective function of XGBoost:

L^{(t)} \approx \sum_{i = 1}^{N} [g_{i} f_{t} (x_{i}) + \frac{1}{2} h_{i} f_{t}^{2} (x_{i})] + Ω (f_{t})

(4)

where

g_{i} = \frac{\partial l (y_{i}, {\hat{y}}_{i}^{(t - 1)})}{\partial {\hat{y}}_{i}^{(t - 1)}}

,

h_{i} = \frac{\partial^{2} l (y_{i}, {\hat{y}}_{i}^{(t - 1)})}{\partial {\hat{y}}_{i}^{(t - 1)}^{2}}

. Each leaf node in the tree is assigned by a weight

w_{j}

, and the corresponding loss function at the leaf nodes is given by:

L^{(t)} = \sum_{j = 1}^{T} [G_{j} w_{j} + \frac{1}{2} H_{j} w_{j}^{2}] + Υ T + \frac{1}{2} λ \sum_{j = 1}^{T} w_{j}^{2}

(5)

where

G_{i} = \sum_{i \in I_{j}} g_{i}

,

H_{i} = \sum_{i \in I_{j}} h_{i}

. The optimal leaf weights

w_{j}

is obtained from the optimization equation:

w_{j}^{*} = - \frac{G_{i}}{H_{j} + λ}

(6)

The weight is combined with the objective function to obtain the objective value of the optimal segmentation:

L^{(t)} = - \frac{1}{2} \sum_{j = 1}^{T} \frac{G_{j}^{2}}{H_{j} + λ} + Υ T

(7)

Finally,

L^{(t)}

may be used to guide the selection of the best tree structure.

3.2.2. LightGBM

LightGBM is an algorithm developed by Microsoft Research Asia based on the GBDT framework [24]. The core principle is to build an ensemble of weak models, and each iteration aims to minimize the gradient of the loss function. The gradient boosting algorithm is employed to combine a histogram-based approach with a leaf-growth strategy. A maximum depth is limited on the tree to enhance the training efficiency and reduce memory usage [25]. Figure 6 depicts the level-wise and leaf-wise growth strategies. All leaves at the same depth are split simultaneously according to the level-wise growth approach. Only the leaves with the maximum information gained at the same depth are selected for splitting in the direction of growth. The strategy enhances parallel optimization and effectively regulates model complexity [23].

The general form of the loss function for the LightGBM model may be expressed as:

L (y, F (x))

, where

y

is the true value and

F (x)

is the predicted value of the model. A new base model may be found for the

i

th iteration to minimize the loss function:

h_{i} (x) = a r g m i n L (y, F_{i - 1} (x) + h (x))

(8)

where

F_{i - 1} (x)

is the integration of the first

i - 1

models and

h (x)

is the predicted value of the

i

th model. Each iteration must calculate the residuals between the predicted and true values of the model to fit the next base model. The residual

r_{i}

is calculated as:

r_{i} = y - F_{i - 1} (x)

(9)

A decision tree is chosen as the basic model to fit the residuals. The process is written as:

h_{i} (x) = \arg m i n \sum_{i} L (y_{i}, F_{i - 1} (x_{i}) + h (x_{i}))

(10)

A growth strategy, known as “Leaf-wise”, is employed to construct decision trees. The reduction in the loss function is computed after partitioning the sample into left and right child nodes at each leaf node, and the split yielding the highest gain is chosen. The process is expressed by:

G a i n = \frac{1}{2} [\frac{G_{L}^{2}}{H_{L} + λ} + \frac{G_{R}^{2}}{H_{R} + λ} - \frac{{(G_{L} + G_{R})}^{2}}{H_{L} + H_{R} + λ}] - λ

(11)

where

G_{L}

and

G_{R}

are the sum of the first-order gradients of the left and right child nodes;

H_{L}

and

H_{R}

are the sum of the second-order gradients of the left and right child nodes;

λ

is the regularization parameter; and

Υ

is the minimum gain threshold for splitting. LightGBM may efficiently process data and deliver accurate predictions with the aforesaid steps.

3.2.3. Grid Search and Cross-Validation

Grid search (GS) and k-fold cross-validation (CV) are employed for the systematic tuning and validation of hyperparameters. As illustrated in Figure 7, GS explores a predefined subset of hyperparameter combinations to identify the configuration that optimizes the performance of the model [26]. A predefined hyperparameter space is systematically traversed to comprehensively evaluate the effect of various parameter configurations on the performance of the model. This study incorporates a k-fold cross-validation mechanism to enhance model robustness and mitigate sensitivity for the stochastic distribution of training data. As shown in Figure 8, the mechanism divides the training set into mutually exclusive k subsets. Each subset serves as the validation set in turn. The remaining subsets constitute the training set, and the process is executed iteratively [27]. The results from each iteration provide an evaluation of model performance for different data partitions, and the final performance metric is derived as the mean validation score over kth iterations. The approach effectively eliminates assessment bias induced by a single data partition and ensures more reliable and representative hyperparameter tuning results. The integration of GS with k-fold CV facilitates effective hyperparameter tuning, improves performance on the training and validation sets, and strengthens the generalization capability on unseen data to ensure the efficiency and reliability of the model.

3.3. Development of the Model

At this stage, the ML database is obtained. The dataset is split into training (80%) and test (20%) sets using a random state of 42 (a widely adopted fixed seed in machine learning to ensure deterministic data splitting) for reproducibility. Subsequently, the training uses six ML models, including Decision Tree (DT), Random Forest (RF), Extreme random Tree (ET), K-Nearest Neighbors (KNN), XGBoost, and LightGBM, respectively. The Scikit-learn library and other auxiliary libraries are used to build and train the algorithmic models in Python. A 5-fold CV method is employed to assess the stability of the models, and hyperparameters are optimized by GS to enhance the performance of the model. The two best models are selected by comparing the differences and fitting abilities of the different models. Special attention is given to overfitting and underfitting issues during the training process. These issues are mitigated by adjusting model complexity and selecting appropriate regularization methods. Additionally, hyperparameter tuning is a critical step in improving the performance of the model. The parameter space is systematically explored by GS to identify the optimal combination of the parameters. Different models require distinct tuning strategies. The tree-based models may adjust the number of trees, maximum depth, etc. The linear models may improve the coefficients of the regularization terms.

The complete hyperparameter search spaces and optimal values for all six models are summarized in Table 2. The scoring metric for GS is negative mean squared error. For tree-based models, the key hyperparameters include maximum tree depth and minimum sample constraints; for boosting models, the learning rate and number of estimators are additionally tuned. Advanced optimization strategies, such as genetic algorithms (GAs) and particle swarm optimization (PSO), have been applied for hyperparameter tuning of ensemble ML models in structural engineering applications [28]. In this study, GS combined with k-fold CV is adopted, primarily due to the manageable size of the hyperparameter space and the deterministic nature of the search process, thereby ensuring full reproducibility of the results.

3.4. Evaluation Metrics

This study employs five distinct indicators to comprehensively quantify the accuracy and robustness of the regression model: the coefficient of determination (R²), mean square error (MSE), root mean squared error (RMSE), mean absolute error (MAE), and mean absolute percentage error (MAPE). R² is a well-established metric in classical regression analysis. It is defined as the proportion of variance “explained” by a regression model and used as a measure in predicting the dependency of the variables based on the independent variables [29]. MSE represents the mean squared prediction error and is frequently used in loss function design to penalize large errors. RMSE denotes the mean of the square roots of the squared errors between the fitted data and the original data [30]. RMSE retains the sensitivity of MSE to large errors, while restoring the error metric to the original data scale through a square root transformation. MAE is another widely used metric to evaluate the performance of the model [31]. The metric provides an intuitive, unbiased estimation of the error and is well-suited for scenarios where the effect of squared errors needs to be avoided. MAPE indicates the relative error of the predicted value compared to the true value [32]. The error is normalized to percentages to facilitate cross-scale comparisons. The formulae of five metrics are presented as follows [33]:

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - p_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - p_{m})}^{2}}

(12)

M S E = \frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - p_{i})}^{2}

(13)

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - p_{i})}^{2}}

(14)

M A E = \frac{1}{n} \sum_{i = 1}^{n} |y_{i} - p_{i}|

(15)

M A P E = \frac{1}{n} \sum_{i = 1}^{n} |\frac{y_{i} - p_{i}}{y_{i}}|

(16)

where

p_{m}

,

y_{i}

, and

p_{i}

are the mean, experimental, and model values, respectively;

n

is the number of datasets.

4. Results and Discussion

4.1. Comparison of Regression

Figure 9 illustrates the regression performance of the model, where the blue dots represent the training set and the red dots represent the test set, respectively. The regression slopes are fitted to assess the performance of the model by plotting the true values on the x-axis and the predicted values on the y-axis, respectively. The previous studies [34] found that a slope greater than 0.80 reflects a strong agreement between the true and predicted values. The results indicate that the DT and KNN models show inferior performance, and the top-performing models are XGBoost and LightGBM. XGBoost and LightGBM exhibit a slope of 0.976 and 0.973 in the test set respectively, indicating an excellent one-to-one correspondence between the predicted and true values. A slope close to unity further suggests that both models exhibit a strong correlation between the predicted and true values. The XGBoost and LightGBM models are more robust than the other models, resulting in a better fit to the real data with lower errors.

4.2. Analysis of Test Sets

Figure 10 illustrates the predictive performance of six regression models on the test set by comparing the alignment between the predicted and true values. The results indicate that the DT and KNN models perform weaker than the other models. Significant deviations are observed between the predicted and true values in both DT and KNN models. The RF and ET models are better than the DT and KNN models, but lower than XGBoost and LightGBM models. It is observed that XGBoost and LightGBM exhibit the best performances on the test set, and their prediction curves closely align with the true curves, indicating that these two models effectively capture complex data patterns and maintain high predictive accuracy. The GBDT framework is employed to minimize errors by iterative optimization and effectively mitigating the overfitting. Additionally, XGBoost incorporates L1 and L2 regularizations to enhance the stability of the model when it deals with complex data. LightGBM enables efficient data processing and maintains robust predictive capability through the learning approach of histogram-based decision trees. The results indicate that XGBoost and LightGBM outperform all other models and show strong generalization capabilities.

4.3. Statistical Evaluation

Figure 11 presents the variations in evaluation indicators on the training and test sets. DT and RF, as representative tree-based models, perform well on the training set but exhibit limited generalization ability on the test set. The R² of DT reaches 0.866 in the training set but drops to 0.759 in the test set. The error also increases dramatically in DT. In contrast, RF maintains 0.927 for R² in the test set, and all error metrics are considerably better than for DT. ET and KNN exhibit slightly inferior performance compared to RF. ET achieves 0.920 for R² in the test set, indicating superior generalization ability. KNN exhibits higher errors due to the reliance on local samples. The MAE and MAPE of KNN are relatively higher, suggesting that KNN is less capable of handling complex data structures. However, XGBoost and LightGBM perform well on all evaluation indicators, and show strong fitting capability and generalization performance. XGBoost achieves an exceptional R² of 0.998 in the training set, indicating an almost perfect fit to the data. The R² is 0.976 in the test set, showing effective generalization to the unseen data. The test set errors (MSE = 20.329, RMSE = 4.509, MAE = 3.496, MAPE = 3.974%) are substantially lower than the other models, highlighting the robustness of the data variability. LightGBM closely parallels XGBoost since the values of R² are 0.996 and 0.973 in the training and test set, respectively. The error metrics are marginally higher, but the overall difference is minimal, suggesting the strongest generalization ability. The results show that XGBoost and LightGBM exhibit exceptional predictive performance. XGBoost marginally outperforms LightGBM in accuracy, and the two models are well-suited for high-performance regression tasks.

Several measures are employed to address the risk of overfitting. First, 5-fold cross-validation during hyperparameter tuning reduces overfitting to specific train–test splits. Second, regularization is incorporated in XGBoost (L1 and L2 regularization parameters), LightGBM (num_leaves constraint), and Decision Tree (cost-complexity pruning via ccp_alpha). The comparison between training and test performance serves as an overfitting diagnostic: XGBoost shows training R² = 0.998 vs. test R² = 0.976, and LightGBM shows training R² = 0.996 vs. test R² = 0.973, indicating modest and acceptable performance gaps. In contrast, DT exhibits the weakest generalization (training R² = 0.866 vs. test R² = 0.759), which reflects limited model capacity for this problem rather than overfitting. Formal uncertainty quantification methods (e.g., prediction intervals) are not implemented in the current study and could be explored in future work.

4.4. SHAP and MDI Analysis

SHAP analysis is a unified approach to interpreting ML models and introduces the concept of Shapely Additive interpretation [35]. Recent studies have demonstrated the effectiveness of SHAP for improving model interpretability in civil engineering applications. Lv et al. utilized an innovative neural network and SHAP to predict the shear strength of UHPC beams [36]. Abdollahi et al. leveraged SHAP values to elucidate an artificial intelligence (AI)-assisted slope stability analysis [37]. These studies confirm that SHAP provides actionable insights beyond traditional feature importance metrics.

In this study, global SHAP analysis is performed for XGBoost and LightGBM. Figure 12 and Figure 13 show the feature ranking and summary plots, respectively. Figure 12a and Figure 13a depict the average impact of each feature on the output of the model. Figure 12b and Figure 13b are summary plots that illustrate the distribution of SHAP for each feature and the direction of the impact on the model output. It is seen that the varying effects of changes in feature values on the model output are visible by changing the color scale from blue to red. The pile diameter has the greatest effect on the vertical load. In addition, soil characteristics exert a greater influence on the vertical bearing capacity of the pile. The corrosion depth and spalling thickness have relatively small effects on the vertical bearing capacity, possibly due to the lower variability in the data preparation process. However, SHAP analysis yields consistent results in both models, showing that the corrosion depth has a slightly greater effect than the lateral corrosion. Therefore, the erosion resistance of concrete at certain depths is of particular importance in practical engineering.

As shown in Figure 14, Mean Decrease in Impurity (MDI) analysis is performed for XGBoost and LightGBM models to further validate the SHAP-based feature importance rankings (Figure 14). The MDI-based feature ranking for XGBoost is: D (0.3854) > c (0.3142) > Es (0.1545) > φ (0.1212) > h (0.0135) > δ (0.0075) > L (0.0022) > Ep (0.0015). For LightGBM it is: c (0.1747) > φ (0.1747) > D (0.1724) > Es (0.1548) > h (0.1185) > δ (0.0913) > L (0.0686) > Ep (0.0448). Both MDI and SHAP methods identify pile diameter (D) and soil properties (c, Es, φ) as highly influential features. The primary difference is that MDI tends to assign higher relative importance to features involved in early tree splits (e.g., cohesion c ranks very high in MDI), while SHAP provides a more nuanced attribution accounting for feature interactions and considers each feature’s contribution across all predictions. It is worth noting that the ABAQUS finite element model is itself a deterministic numerical formulation: given the same input parameters, it produces the same output. Traditional bearing capacity formulas (e.g., Meyerhof, Vesic) are explicit and direct deterministic relationships. The ABAQUS model, while also deterministic, involves implicit, multi-stage numerical procedures where the relationships between input parameters and output are intrinsically complex. The SHAP analysis applied to ML models trained on this deterministic simulation data thus serves to reveal the implicit interaction effects between corrosion and geotechnical parameters that are embedded within the numerical solution but cannot be directly observed from the formulation itself.

The SHAP analysis reveals that corrosion depth (h) exerts a stronger influence on the vertical bearing capacity than spalling thickness (δ). From a physical perspective, corrosion depth determines the length of the pile section assigned the degraded material, directly affecting the shaft resistance along the corroded zone. A deeper corrosion penetration means a longer pile segment with near-zero stiffness, leading to greater loss of shaft friction capacity and reduced overall bearing capacity. In contrast, spalling thickness primarily affects the cross-sectional area at a localized level. The global bearing capacity is more sensitive to the extent (depth) of degradation than to its intensity (thickness) at any given cross-section. The SHAP summary plots further reveal that the effect of corrosion depth on bearing capacity depends on the pile diameter—larger diameter piles are less sensitive to corrosion depth than smaller diameter piles, indicating a nonlinear interaction between these features.

5. Conclusions

This study predicts the vertical bearing capacity of pile foundations in cold saline and alkaline environments. Datasets are batch-generated by establishing a 2D numerical model. Automated scripts are developed by combining Python. The ML database is constructed with multiple factors such as pile–soil materials and corrosion parameters. Subsequently, six ML models are developed and analyzed. The following conclusions are drawn below:

(1): The ML database is developed by using Python to script ABAQUS models, and the LHS method is used for the automatic generation of random parameters.
(2): XGBoost and LightGBM show the best performance of the six models. Moreover, XGBoost and LightGBM have potential in predictions because they are effectively generalized to new data.
(3): The pile diameter exerts the greatest effect on the vertical bearing capacity of the pile.
(4): The depth of corrosion has a more significant effect than the corrosion thickness on the vertical bearing capacity of the pile. Therefore, the depth of corrosion is prioritized for the design of the piles in cold saline environments.

Several limitations of this study should be acknowledged. First, the dataset is entirely generated from 2D axisymmetric FE simulations; direct validation against field measurements is needed. Second, the soil is modeled as a single homogeneous layer with Mohr–Coulomb behavior; multi-layered soils and groundwater effects are not considered. Third, corrosion is simulated by assigning a separate material with a negligibly small elastic modulus to the corroded zone; other degradation effects such as strength loss and reinforcement corrosion are not modeled. Fourth, the analysis considers steady-state conditions without time-dependent corrosion progression. Future work could extend to 3D models with non-uniform corrosion, incorporate field test data, develop time-dependent prediction models, and implement formal uncertainty quantification methods such as prediction intervals.

Author Contributions

Conceptualization, Y.J.; methodology, Y.J.; software, Y.J.; validation, Y.J.; formal analysis, Y.J.; investigation, Y.J.; data curation, Y.J.; writing—original draft preparation, Y.J.; writing—review and editing, Y.J. and Z.L.; visualization, Y.J.; supervision, Z.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

All data, models, and code generated or used during the study appear in the submitted article.

Acknowledgments

The authors thank the Natural Science Foundation of Hunan Province, China (Grant No. 2025JJ70034).

Conflicts of Interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

References

Zou, X.; Zhao, M. Axial bearing behavior of super-long piles in deep soft clay over stiff layers. J. Cent. South Univ. 2013, 20, 2008–2016. [Google Scholar] [CrossRef]
Budi, G.S.; Kosasi, M.; Wijaya, D.H. Bearing capacity of pile foundations embedded in clays and sands layer predicted using PDA test and static load test. Procedia Eng. 2015, 125, 406–410. [Google Scholar] [CrossRef]
Józefiak, K.; Zbiciak, A.; Maślakowski, M.; Piotrowski, T. Numerical modelling and bearing capacity analysis of pile foundation. Procedia Eng. 2015, 111, 356–363. [Google Scholar] [CrossRef]
Shao, W.; Shi, D. Numerical simulation of degradation behavior of concrete piles in sulfate saline soils. KSCE J. Civ. Eng. 2022, 26, 183–192. [Google Scholar] [CrossRef]
Yunusaliyev, E.; Makhsimov, K.; Makhsimov, K. Horizontally loaded piles in saline soils. E3S Web Conf. 2023, 452, 01048. [Google Scholar] [CrossRef]
Momeni, E.; Nazir, R.; Armaghani, D.J.; Maizir, H. Prediction of pile bearing capacity using a hybrid genetic algorithm-based ANN. Measurement 2014, 57, 122–131. [Google Scholar] [CrossRef]
Fattahi, H.; Ghaedi, H.; Malekmahmoodi, F.; Armaghani, D.J. Optimizing pile bearing capacity prediction: Insights from dynamic testing and smart algorithms in geotechnical engineering. Measurement 2024, 230, 114563. [Google Scholar] [CrossRef]
Sun, Z.; Liu, F.; Han, Y.; Min, R. Prediction of ultimate bearing capacity of rock-socketed piles based on GWO-SVR algorithm. Structures 2024, 61, 106039. [Google Scholar] [CrossRef]
Chen, H.; Zhang, L. A machine learning-based method for predicting end-bearing capacity of rock-socketed shafts. Rock Mech. Rock Eng. 2022, 55, 1743–1757. [Google Scholar] [CrossRef]
Chen, L.; Fu, D. Survey on machine learning methods for small sample data. Comput. Eng. 2022, 48, 1–13. [Google Scholar]
Kokol, P.; Kokol, M.; Zagoranski, S. Machine learning on small size samples: A synthetic knowledge synthesis. Sci. Prog. 2022, 105, 00368504211029777. [Google Scholar] [CrossRef] [PubMed]
Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; ACM: New York, NY, USA, 2016; pp. 785–794. [Google Scholar]
Zhang, W.; Ran, B.; Gu, X.; Zhang, Y.; Zou, Y.; Wang, P. Efficient reliability analysis of unsaturated slope stability under rapid drawdown using XGBoost-based surrogate model. Soils Found. 2024, 64, 101539. [Google Scholar] [CrossRef]
Zhang, W.; Wu, C.; Tang, L.; Gu, X.; Wang, L. Efficient time-variant reliability analysis of Bazimen landslide in the Three Gorges Reservoir Area using XGBoost and LightGBM algorithms. Gondwana Res. 2023, 123, 41–53. [Google Scholar] [CrossRef]
Wang, K.; He, B.; Samui, P.; Zhou, J. Predicting Rock Burst in Underground Engineering Leveraging a Novel Metaheuristic-Based LightGBM Model. CMES-Comput. Model. Eng. Sci. 2024, 140, 229–253. [Google Scholar] [CrossRef]
Deng, Y.; Zhang, K.; Feng, Z.; Zhang, W.; Zou, X.; Zhao, H. Machine learning based prediction model for the pile bearing capacity of saline soils in cold regions. Structures 2024, 59, 105735. [Google Scholar]
Yao, X.; Feng, Z. Numerical simulation and research on the vertical ultimate bearing capacity impact of highway bridge pile foundation in salt marshes corrosion. Highway 2017, 1, 60–66. [Google Scholar]
Feng, Z.; Huo, J.; Hu, H.; Li, T.; Yao, X.; Xu, Z.; Wang, F.; Liu, N. Corrosion damage and bearing characteristics of bridge pile foundations under dry-wet-freeze-thaw cycles in alpine salt marsh areas. J. Traffic Transp. Eng. 2020, 20, 135–147. [Google Scholar]
Loh, W.-L. On Latin hypercube sampling. Ann. Stat. 1996, 24, 2058–2080. [Google Scholar] [CrossRef]
Huntington, D.; Lyrintzis, C. Improvements to and limitations of Latin hypercube sampling. Probabilistic Eng. Mech. 1998, 13, 245–253. [Google Scholar] [CrossRef]
Zhu, X.; Chu, J.; Wang, K.; Wu, S.; Yan, W.; Chiam, K. Prediction of rockhead using a hybrid N-XGBoost machine learning framework. J. Rock Mech. Geotech. Eng. 2021, 13, 1231–1245. [Google Scholar] [CrossRef]
Ebrahimzadeh, M.; Mahmoudian, A.; Tajik, N.; Taleshi, M.M.; Ashtari, M.; Shakiba, M.; Bazli, M. Interpretable machine learning models for predicting flexural bond strength between FRP/steel bars and concrete. Structures 2025, 74, 108587. [Google Scholar] [CrossRef]
Alabdullah, A.A.; Iqbal, M.; Zahid, M.; Khan, K.; Amin, M.N.; Jalal, F.E. Prediction of rapid chloride penetration resistance of metakaolin based high strength concrete using light GBM and XGBoost models by incorporating SHAP analysis. Constr. Build. Mater. 2022, 345, 128296. [Google Scholar] [CrossRef]
Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.-Y. Lightgbm: A highly efficient gradient boosting decision tree. In Advances in Neural Information Processing Systems; NIPS: San Diego, CA, USA, 2017; Volume 30. [Google Scholar]
Liang, W.; Luo, S.; Zhao, G.; Wu, H. Predicting hard rock pillar stability using GBDT, XGBoost, and LightGBM algorithms. Mathematics 2020, 8, 765. [Google Scholar] [CrossRef]
Hartono, N.T.P.; Thapa, J.; Tiihonen, A.; Oviedo, F.; Batali, C.; Yoo, J.J.; Liu, Z.; Li, R.; Marrón, D.F.; Bawendi, M.G.; et al. How machine learning can help select capping layers to suppress perovskite degradation. Nat. Commun. 2020, 11, 4172. [Google Scholar] [CrossRef] [PubMed]
Stone, M. Cross-validation: A review. Stat. J. Theor. Appl. Stat. 1978, 9, 127–139. [Google Scholar]
Asgarkhani, N.; Kazemi, F.; Jankowski, R.; Formisano, A. Dynamic ensemble-learning model for seismic risk assessment of masonry infilled steel structures incorporating soil-foundation-structure interaction. Reliab. Eng. Syst. Saf. 2025, 267, 111839. [Google Scholar] [CrossRef]
Nagelkerke, N.J. A note on a general definition of the coefficient of determination. Biometrika 1991, 78, 691–692. [Google Scholar] [CrossRef]
Yang, H.; Liu, Z.; Li, Y.; Wei, H.; Huang, N. CatBoost–bayesian hybrid model adaptively coupled with modified theoretical equations for estimating the undrained shear strength of clay. Appl. Sci. 2023, 13, 5418. [Google Scholar] [CrossRef]
Chai, T.; Draxler, R.R. Root mean square error (RMSE) or mean absolute error (MAE)?–Arguments against avoiding RMSE in the literature. Geosci. Model Dev. 2014, 7, 1247–1250. [Google Scholar] [CrossRef]
Khair, U.; Fahmi, H.; Al Hakim, S.; Rahim, R. Forecasting error calculation with mean absolute deviation and mean absolute percentage error. J. Phys. Conf. Ser. 2017, 930, 012002. [Google Scholar] [CrossRef]
Sun, Z.; Wang, X.; Huang, H.; Yang, Y.; Wu, Z. Predicting compressive strength of fiber-reinforced coral aggregate concrete: Interpretable optimized XGBoost model and experimental validation. Structures 2024, 64, 106516. [Google Scholar] [CrossRef]
Jalal, F.E.; Xu, Y.; Iqbal, M.; Jamhiri, B.; Javed, M.F. Predicting the compaction characteristics of expansive soils using two genetic programming-based algorithms. Transp. Geotech. 2021, 30, 100608. [Google Scholar] [CrossRef]
Lundberg, S.M.; Lee, S.-I. A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems; NIPS: San Diego, CA, USA, 2017; Volume 30. [Google Scholar]
Lv, W.; Jia, J.; Chen, X.; Yao, X.; Bai, Y. Predicting shear strength in UHPC beams through an innovative neural network with SHAP interpretation. Case Stud. Constr. Mater. 2024, 20, e03211. [Google Scholar] [CrossRef]
Abdollahi, A.; Li, D.; Deng, J.; Amini, A. An explainable artificial-intelligence-aided safety factor prediction of road embankments. Eng. Appl. Artif. Intell. 2024, 136, 108854. [Google Scholar] [CrossRef]

Figure 1. Mesh of the 2D axisymmetric finite element model showing the pile, surrounding soil domain, and boundary conditions.

Figure 2. Vertical load-settlement of the pile.

Figure 3. Workflow of the proposed methodology: (1) FEM data generation and dataset preparation; (2) hyperparameter optimization and multi-model evaluation framework; (3) feature importance and model interpretation by SHAP and MDI analyses.

Figure 4. The diagram of data correlation. (*** indicates statistical significance at p < 0.001).

Figure 5. The schematic of the XGBoost regression.

Figure 6. Construction of decision tree.

Figure 7. Flowchart of grid search.

Figure 8. Illustration of cross-validation technique.

Figure 9. Comparison of regression slopes of models.

Figure 10. Comparison of true and predicted values of the test set.

Figure 11. Scatterplot of evaluation indicators: (a) comparison of the training set; (b) comparison of the test set.

Figure 12. Global interpretations of XGBoost model: (a) feature importance of SHAP; and (b) summary plot of SHAP.

Figure 13. Global interpretations of LightGBM model: (a) feature importance of SHAP; and (b) summary plot of SHAP.

Figure 14. The result of MDI analysis.

Table 1. The geometric and material parameters of pile and soil.

	Diameter (m)	Length (m)	Density (kN/m³)	Elastic Modulus (MPa)	Poisson’s Ratio	Cohesion (kPa)	Friction Angle (°)
Pile	1.0~2.0	30.0~60.0	25.0	(1.5~4.0) × 10⁴	0.2	/	/
Soil	/	/	18.0	15.0~40.0	0.3	10.0~30.0	10~30

Table 2. The parameter space.

Model	Hyperparameter	Search Space	Optimal Value
DT	max_depth	[3, 4, 5]	5
	min_samples_split	[10, 15, 20]	10
	min_samples_leaf	[5, 10, 15]	10
	max_features	[0.6, 0.8]	0.8
	ccp_alpha	[0.6, 0.8]	0.01
RF	n_estimators	[50, 100	100
RF	max_depth	[3, 5, None]	None
ET	n_estimators	[50, 100]	100
	max_depth	[5, 7]	7
	min_samples_split	[5, 10]	10
	min_samples_leaf	[5, 10]	5
	max_features	[0.7, 0.8]	0.8
KNN	n_neighbors	[3, 5, 7]	7
	weights	[uniform]	uniform
	p	[1]	1
XGBoos	learning_rate	[0.01, 0.1]	0.1
	max_depth	[3, 5]	3
	n_estimators	[100, 300]	300
LightGBM	num_leaves	[31, 63]	63
	learning_rate	[0.01, 0.1]	0.1
	n_estimators	[50, 100]	100

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Jia, Y.; Li, Z. Prediction of the Vertical Bearing Capacity of Piles in Cold Saline Environments by a Multi-Dimensional Machine Learning Approach. Buildings 2026, 16, 2042. https://doi.org/10.3390/buildings16102042

AMA Style

Jia Y, Li Z. Prediction of the Vertical Bearing Capacity of Piles in Cold Saline Environments by a Multi-Dimensional Machine Learning Approach. Buildings. 2026; 16(10):2042. https://doi.org/10.3390/buildings16102042

Chicago/Turabian Style

Jia, Yuhan, and Zhaochao Li. 2026. "Prediction of the Vertical Bearing Capacity of Piles in Cold Saline Environments by a Multi-Dimensional Machine Learning Approach" Buildings 16, no. 10: 2042. https://doi.org/10.3390/buildings16102042

APA Style

Jia, Y., & Li, Z. (2026). Prediction of the Vertical Bearing Capacity of Piles in Cold Saline Environments by a Multi-Dimensional Machine Learning Approach. Buildings, 16(10), 2042. https://doi.org/10.3390/buildings16102042

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Prediction of the Vertical Bearing Capacity of Piles in Cold Saline Environments by a Multi-Dimensional Machine Learning Approach

Abstract

1. Introduction

2. Numerical Simulations

3. Methodology

3.1. Data Preparation

3.2. Overview of Machine Learning

3.2.1. XGBoost

3.2.2. LightGBM

3.2.3. Grid Search and Cross-Validation

3.3. Development of the Model

3.4. Evaluation Metrics

4. Results and Discussion

4.1. Comparison of Regression

4.2. Analysis of Test Sets

4.3. Statistical Evaluation

4.4. SHAP and MDI Analysis

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI