Prediction and Interpretation of Shear Capacity of FRP-RC Beams Using Heterogeneous Weighted Ensemble Model and Shapley Additive Explanation Analysis

Xiong, Chaohua; Fang, Yuqing; Chen, Shuang; Zhao, Hongguo

doi:10.3390/buildings16112162

by Chaohua Xiong ¹,², Yuqing Fang ¹, Shuang Chen ¹,* and Hongguo Zhao ³

¹ School of Civil and Architectural Engineering, Guangxi University of Science and Technology, Liuzhou 545006, China
² University Engineering Research Center of Digital and Intelligent Construction and Management, Liuzhou 545006, China
³ College of Information Science and Technology, Zhengzhou Normal University, Zhengzhou 450044, China
* Author to whom correspondence should be addressed.

Buildings 2026, 16(11), 2162; https://doi.org/10.3390/buildings16112162

Submission received: 18 April 2026 / Revised: 19 May 2026 / Accepted: 25 May 2026 / Published: 28 May 2026

(This article belongs to the Section Building Structures)

Abstract

To accurately predict the shear capacity of fiber-reinforced polymer (FRP) bar-reinforced concrete (RC) beams and overcome the poor prediction stability of conventional machine learning models, this study develops a heterogeneous weighted ensemble prediction model, namely the MLP-XGBoost model, by integrating the Multi-Layer Perceptron (MLP) and Extreme Gradient Boosting (XGBoost) algorithms. The MLP-XGBoost model is compared with conventional baseline models and traditional empirical formulas. The results demonstrate that all machine learning models outperform the traditional empirical formulas in predictive accuracy and that, among the machine learning models, the proposed MLP-XGBoost model achieves the best accuracy and stability. Furthermore, to address the “black-box” nature of machine learning models, the Shapley Additive Explanations (SHAP) method is employed to quantitatively assess the contribution of each input feature to the shear capacity of FRP-RC beams. Based on the interpretable results of the SHAP analysis, a prediction formula for the shear capacity of FRP-RC beams with high predictive accuracy is also proposed. This approach provides a more reliable tool for evaluating the shear performance of FRP-RC beams and offers valuable guidance for the optimal design of engineering structures.

Keywords:

FRP-RC beams; shear capacity; machine learning; weighted ensemble; SHAP

1. Introduction

Fiber-Reinforced Polymer (FRP) bars, known for their advantages such as light weight, high strength, and excellent corrosion resistance, have been widely adopted as internal longitudinal tensile reinforcement in concrete beams [1,2,3]. However, compared to steel bars, FRP bars have lower axial stiffness, which weakens their dowel action and reduces the crack resistance of FRP-RC beams under shear loading, making them more susceptible to brittle failure [4,5,6]. Therefore, understanding the mechanical behavior of FRP-RC beams under shear loading and accurately predicting their shear capacity is crucial for advancing the use of FRP bars in structural engineering, as well as ensuring the reliable design and performance evaluation of FRP-RC beams.

Although extensive experimental and theoretical research has been conducted on the shear behavior of beam members, most existing predictive formulas still rely on empirical derivations, limiting their predictive effectiveness [7]. Among the current prediction methods, models based on the mechanical properties of materials often require complex iterative calculations due to the highly nonlinear interactions among parameters [8,9,10]. This complexity makes these models difficult to apply in practical engineering calculations [11,12,13]. In contrast, the empirical formula method offers a straightforward calculation process and has been incorporated into several standards and codes [14,15]. However, it fails to fully capture the load-transfer mechanism of beams. Therefore, developing shear capacity prediction models that combine mechanical interpretability with excellent predictive accuracy remains a critical area for further research.

The rise of data-driven machine learning (ML), fueled by advancements in artificial intelligence, has led to its growing application in addressing complex structural engineering problems. ML techniques excel at capturing intricate relationships between input and output variables, enabling the development of robust and accurate predictive models [16,17,18,19,20]. Machine learning has been applied to various scenarios, including, but not limited to, predicting the load capacity and failure modes of RC walls and columns [21,22,23], estimating the shear capacity of conventional RC beams [24,25,26], assessing the delamination of concrete surface layers [27], evaluating the bond strength between concrete and longitudinal reinforcement [28], analyzing seismic damage in RC structures and bridges [29,30,31], and forecasting the mechanical properties of concrete [32].

Owing to this predictive capability, ML has also been widely employed in studies on the shear behavior of beam components. To predict the shear capacity of FRP-reinforced concrete beams without stirrups, Alam et al. [33] developed a hybrid model integrating Support Vector Regression with a Bayesian optimization algorithm. Trained on a database of 216 samples with a shear span-to-depth ratio (a/d) greater than 2.5, the model achieved a coefficient of determination (R²) of 0.9773, demonstrating excellent agreement with experimental results. Phan Duy et al. [34] proposed an extended Beam Arch Action (BAA) model to predict the shear capacity of deep FRP-RC beams without stirrups. This hybrid approach integrates mechanical modeling with data-driven analysis. By employing constrained optimization, they derived correction factors for arch action. Additionally, two data-driven models, namely M5P and Artificial Neural Network (ANN), were employed as benchmark models for validating the extended BAA model. The results demonstrated that the extended model achieved higher predictive accuracy than conventional machine learning models. Zhao Jitao et al. [35] utilized two algorithms, ANN and XGBoost, to construct a shear capacity prediction model for FRP-RC beams based on 455 sets of experimental data. The model predictions were compared with the values calculated from three code-specified formulas. The predictive accuracy of the machine learning models was significantly superior to that of the current code formulas, with the R² of ANN and XGBoost on the test set reaching 0.877 and 0.879, respectively.

Although ML techniques have demonstrated great potential in predicting the shear capacity of FRP-RC beams, clear technical gaps remain in current research. Conventional ML models can improve fitting ability, but their prediction stability and generalization performance are highly sensitive to hyperparameters and data fluctuations. Existing ensemble methods mostly rely on simple averaging or voting strategies, which neither achieve dynamic optimal fusion of complementary models tailored to the characteristics of FRP-RC shear problems nor focus on the weighted ensemble of heterogeneous models. As a result, it is difficult to overcome the performance bottleneck of conventional models through complementary advantages, and the demand for highly accurate and strongly generalizable prediction models in engineering practice remains unmet.

This study aims to propose a shear capacity prediction method for FRP-RC beams that offers both interpretability and high accuracy. To this end, a database containing 634 sets of shear test data of FRP-RC beams was constructed. Five ML models were trained and their comprehensive performance was evaluated; based on this comparison, MLP and XGBoost were selected as the base learners. By adaptively weighting and fusing the prediction results of the two models, an MLP-XGBoost ensemble model was established to balance prediction accuracy and stability. To clarify the contribution of each input feature to the shear capacity and overcome the “black-box” limitation of the machine learning model, the Shapley Additive Explanations (SHAP) method was adopted to interpret the model. Furthermore, based on the SHAP analysis results and the mechanical model of shear capacity, a mechanically interpretable prediction formula for the shear capacity of FRP-RC beams was proposed.

2. Database

2.1. Feature Selection and Database Development

The shear resistance mechanism of FRP-RC beams is illustrated in Figure 1. As shown, the shear capacity mainly consists of three components: the contribution of concrete to the shear capacity, the contribution of longitudinal reinforcement, and the contribution of stirrups. Based on this, the study selected 10 parameters, classified into three categories, as input features for training the machine learning model. These parameters are classified as follows:

(1): Concrete-related parameters: the shear span-to-depth ratio (a/d), the effective depth (d₀), the section width (b), and the concrete strength (f_cu).
(2): Longitudinal reinforcement-related parameters: the ultimate tensile strength of longitudinal reinforcement (f_fu), the longitudinal reinforcement ratio (ρ_f), and the elastic modulus of longitudinal reinforcement (E_f).
(3): Stirrup-related parameters: the ultimate tensile strength of stirrups (f_fv), the stirrup ratio (ρ_v), and the elastic modulus of stirrups (E_fv).

It is worth noting that the concrete strength used in this study refers to the standard cubic compressive strength. During the data collection process, data on the axial compressive strength of concrete were converted according to Equation (1) [34].

f_c = 0.8 f_cu

(1)

where f_c is the compressive strength of concrete cylinders.

To avoid data heterogeneity caused by differences in loading protocols and boundary conditions, all shear capacity data in this study were collected from four-point bending static loading tests under uniformly applied simply supported boundary conditions. The database contains a total of 299 beams with stirrups and 335 beams without stirrups. All experimental beams were of rectangular cross-section and failed in shear, with shear-span ratios ranging from 0.5 to 7. Considering that key material parameters may also contribute to data heterogeneity, statistical analysis was conducted on all input variables, especially the critical material parameters, as presented in Table 1. All sources of the database are summarized in Appendix A. GFRP reinforcement was the most common, constituting 57.22% of the samples. Geometrically, 53.89% of the beams had a/d between 1 and 2.5, while only 8% had d₀ exceeding 300 mm. Material-wise, over 90% of the specimens used concrete with compressive strengths ranging from 30 to 60 MPa. Approximately 70% of the samples had ρ_f ranging from 0.5% to 4%. For beams with stirrups, about 80% had ρ_v between 0.3% and 0.5%, and 70% had f_fv ranging from 500 to 1500 MPa. Table 1 provides a statistical summary of the database, including the mean, standard deviation, maximum, and minimum values for all input features. Figure 2 shows the histogram and cumulative probability plot of the sample distribution.

It is worth noting that Figure 2 shows that most parameters are reasonably well covered within the typical range of engineering applications, while a slight sample imbalance exists in some extreme-value intervals. The normality test results in Table 1 indicate that none of the parameters follow a normal distribution. This deviation from normality contradicts the basic assumptions of classical machine learning algorithms, such as linear regression and Gaussian Naive Bayes [36]. In this study, a grouped cross-validation strategy was adopted to mitigate the influence of sample imbalance. Meanwhile, typical machine learning models, including MLP and XGBoost, were selected for predicting the shear capacity, ensuring that the established model exhibits stable predictive performance and satisfactory generalization ability.

2.2. Feature Correlation Analysis

To comprehensively evaluate the relationships among input features, both Pearson and Spearman correlation analyses were conducted in this study. The Pearson correlation coefficient was used to quantify the linear dependence between variables, while the Spearman rank correlation coefficient was used to capture monotonic, nonlinear relationships. The latter is more suitable for characterizing the complex mechanical behavior of shear capacity. To ensure model accuracy, multicollinearity analysis was performed using Pearson correlation matrices to identify and eliminate highly correlated variables. A correlation coefficient threshold of |r| = 0.85 [37] was adopted to detect multicollinearity. As shown in Figure 3, all correlation coefficients remained below this threshold, confirming the physical independence and informational diversity of the selected input features, which ensures reliable modeling. When the Pearson correlation coefficient is approximately equal to the Spearman rank correlation coefficient, the variables exhibit an inherent linear relationship. The comparison of correlation heat maps reveals that the coefficients of the same feature differ noticeably between the two analytical methods. This indicates that the factors affecting the shear capacity of FRP-RC beams present weak interdependence and a highly nonlinear relationship.
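The distinction between the two coefficients can be illustrated with a minimal Python sketch. It is not the authors' implementation, and the data are synthetic values chosen only for illustration: for a monotonic but nonlinear relationship, the Spearman coefficient reaches 1 while the Pearson coefficient stays below 1. The function names `pearson`, `ranks`, and `spearman` are hypothetical helpers.

```python
import math

def pearson(x, y):
    """Pearson coefficient: strength of the linear dependence between x and y."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result

def spearman(x, y):
    """Spearman coefficient: Pearson coefficient computed on the ranks."""
    return pearson(ranks(x), ranks(y))

# Synthetic example (not from the database): y grows monotonically but nonlinearly.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [v ** 3 for v in x]
r_pearson = pearson(x, y)
r_spearman = spearman(x, y)
print(f"Pearson = {r_pearson:.3f}, Spearman = {r_spearman:.3f}")
```

A gap between the two values, as in this toy case, is the kind of signal the heat-map comparison relies on to indicate nonlinearity.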

3. Methodology

The overall research framework of this paper is illustrated in Figure 4. First, all input features are subjected to Z-score standardization, as illustrated in Equation (2) [38]. This procedure eliminates the feature dominance problem caused by significant differences in dimensional scales and improves the convergence efficiency of the machine learning model.

X_{stand} = \frac{X - μ}{\sqrt{σ^{2}}}

(2)

where X, μ, and σ represent the specific values, mean values, and standard deviations of the input features, respectively.
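As a minimal sketch of Equation (2), the following Python snippet standardizes a single feature column. Two choices here are assumptions rather than details stated in the paper: the population standard deviation is used, and the statistics are estimated on the training data only and then reused for unseen samples. The numeric values are illustrative.

```python
import statistics

def fit_standardizer(column):
    """Estimate the mean and (population) standard deviation of one feature."""
    mu = statistics.fmean(column)
    sigma = statistics.pstdev(column)  # assumption: population, not sample, std
    return mu, sigma

def standardize(column, mu, sigma):
    """Apply X_stand = (X - mu) / sigma element-wise."""
    return [(x - mu) / sigma for x in column]

# Illustrative training values for one feature (not taken from the database).
train_column = [30.0, 40.0, 50.0, 60.0]
mu, sigma = fit_standardizer(train_column)
z_train = standardize(train_column, mu, sigma)

# Assumption: unseen samples reuse the training statistics to avoid leakage.
z_new = standardize([45.0], mu, sigma)
```

After this transformation the training column has zero mean and unit standard deviation, so features measured in MPa, mm, or percent contribute on a comparable scale.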

Next, five supervised machine learning models are introduced. During model construction, the dataset is randomly divided into a training set, a validation set, and a test set at a ratio of 7:1:2.

The training set (70%) is used for model training and hyperparameter optimization.

The validation set (10%) is used to evaluate candidate hyperparameter configurations.

The test set (20%) is used for final performance evaluation and is never involved in the entire process of model training or hyperparameter tuning.
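The 7:1:2 partition can be sketched in Python as follows. The function name `split_indices`, the seed, and the rounding rule are assumptions made for illustration; the exact subset sizes used by the authors are not stated in the text.

```python
import random

def split_indices(n_samples, ratios=(0.7, 0.1, 0.2), seed=0):
    """Shuffle sample indices and cut them into train/validation/test parts."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    n_train = round(ratios[0] * n_samples)
    n_val = round(ratios[1] * n_samples)
    train = indices[:n_train]
    val = indices[n_train:n_train + n_val]
    test = indices[n_train + n_val:]  # remainder, roughly 20%
    return train, val, test

# 634 is the database size reported in the paper.
train_idx, val_idx, test_idx = split_indices(634)
print(len(train_idx), len(val_idx), len(test_idx))
```

Keeping the test indices fixed from the start is what allows the test set to remain untouched during training and hyperparameter tuning.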

Most existing studies in this field partition the dataset only into a training set and a test set, without describing a separate validation set. For consistency with that convention, except in Section 3.1.6, the training set and validation set are collectively referred to as the training set throughout this paper. For hyperparameter optimization, the caterpillar fungus optimizer (CFO) algorithm [39] is adopted. It is combined with repeated 10-fold cross-validation (5 repetitions) performed on the training set, which reduces the risk of overfitting and enhances model stability. The validation set is used to further evaluate the candidate hyperparameter combinations so that the optimal configuration is selected. After the optimal hyperparameters are determined, each model is retrained on the full training set. Through comparative analysis, the two best-performing models, MLP and XGBoost, are selected, and their predictions are weighted and combined to construct the MLP-XGBoost weighted ensemble model. The MLP-XGBoost model is then compared with the other machine learning models and with theoretical formulas to assess its performance in predicting the shear capacity of FRP-RC beams. Finally, Leave-One-Study-Out (LOSO) cross-validation is conducted on the entire dataset to evaluate the model’s generalization ability across different experimental programs. Furthermore, the SHAP algorithm is employed to interpret the MLP-XGBoost model and thereby address its “black-box” nature. Through quantitative analysis of the SHAP values, the relationship between each input feature and the shear capacity of FRP-RC beams is elucidated. Based on these SHAP values, a prediction equation for the shear capacity of FRP-RC beams is established.

3.1. Machine Learning Algorithms

3.1.1. Multilayer Perceptron (MLP)

The Multilayer Perceptron (MLP) is a fully connected, feedforward artificial neural network. It consists of an input layer, hidden layers, and an output layer. Each layer comprises multiple neurons, with each neuron fully connected to all neurons in the previous layer via weighted connections [40]. The input layer receives the raw features, and its number of neurons equals the dimensionality of the feature space. The hidden layers perform hierarchical transformations and feature abstraction of the inputs through weight matrices, bias vectors, and nonlinear activation functions, subsequently propagating the processed signals forward [41]. Finally, the output layer maps the resulting high-level features to the final predictions according to the specific task. The overall working principle of the MLP can be represented as a composite function, as shown in Equation (3):

\hat{y} = g (W_{o} \cdot f (W_{h} \cdot x + b_{h}) + b_{o})

(3)

where x denotes the input vector; W_h and W_o are the weight matrices of the hidden and output layers, respectively; b_h and b_o are the bias vectors of the hidden and output layers, respectively; f and g are the activation functions of the hidden and output layers, respectively; and \hat{y} is the model’s output prediction vector.
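The composite function in Equation (3) can be traced with a minimal single-hidden-layer forward pass in Python. The ReLU hidden activation, the identity output activation, and all numeric weights are assumptions chosen for illustration; the paper does not specify them in this section.

```python
def relu(vector):
    """Assumed hidden-layer activation f."""
    return [max(0.0, v) for v in vector]

def identity(vector):
    """Assumed output activation g (common for regression)."""
    return vector

def affine(weights, x, bias):
    """Compute W . x + b for a weight matrix given as a list of rows."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]

def mlp_forward(x, W_h, b_h, W_o, b_o, f=relu, g=identity):
    """y_hat = g(W_o . f(W_h . x + b_h) + b_o), as in Equation (3)."""
    hidden = f(affine(W_h, x, b_h))
    return g(affine(W_o, hidden, b_o))

# Illustrative weights: 2 inputs, 2 hidden neurons, 1 output.
W_h = [[1.0, -1.0], [0.5, 0.5]]
b_h = [0.0, -1.0]
W_o = [[2.0, 1.0]]
b_o = [0.5]
y_hat = mlp_forward([3.0, 1.0], W_h, b_h, W_o, b_o)
print(y_hat)  # hidden layer is [2.0, 1.0], so the output is [5.5]
```

In the actual model the input vector would hold the ten standardized features and the weights would be learned by backpropagation rather than set by hand.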

3.1.2. Decision Tree (DT)

The Decision Tree (DT) regression model is a supervised machine learning method. It operates by recursively partitioning a dataset into subsets based on input features. The primary objective is to construct a reliable predictive model by inferring key decision rules from the features. The fundamental structure of a DT comprises a root node, internal nodes, branches, and leaf nodes. The root node represents the feature and corresponding threshold used for the first data split. Internal nodes contain features, which are attributes used to determine subsequent splits. The branches connecting the nodes illustrate the relationships between the values of independent variables and the target variable. Finally, leaf nodes, which are the terminal points of the tree, hold the predicted value of the target variable.

The primary objective of DT regression is to identify the optimal split points that partition the data into distinct regions R_{1}, R_{2}, \dots, R_{n}, where the predicted value for each region is the mean of the target values of the samples within that region. This is achieved by minimizing the objective function shown in Equation (4):

\min \sum_{i = 1}^{n} \sum_{x_{j} \in R_{i}} (y_{j} - \bar{y}_{R_{i}})^{2}

(4)

where y_{j} is the true value of sample j, and \bar{y}_{R_{i}} is the mean of the target values within the region R_{i}.
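The objective in Equation (4) can be illustrated for the simplest case of one feature and one split, which produces two regions. The Python sketch below searches all candidate thresholds and keeps the one with the smallest total squared error. The function names and data are hypothetical, and a full tree would apply this search recursively over all features.

```python
def sse(values):
    """Sum of squared deviations from the region mean."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)

def best_split(x, y):
    """Return (threshold, total_sse) of the best single split on one feature."""
    pairs = sorted(zip(x, y))
    best_threshold, best_total = None, float("inf")
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold can separate identical feature values
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
        left = [target for _, target in pairs[:i]]
        right = [target for _, target in pairs[i:]]
        total = sse(left) + sse(right)
        if total < best_total:
            best_threshold, best_total = threshold, total
    return best_threshold, best_total

# Illustrative data with an obvious jump between x = 3 and x = 4.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [10.0, 12.0, 11.0, 30.0, 31.0, 29.0]
threshold, total = best_split(x, y)
print(threshold, total)  # 3.5 4.0
```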

3.1.3. Random Forest (RF)

The Random Forest (RF) regression model is an ensemble model that uses decision trees (DTs) as base learners [42]. Its core principle is to integrate the Bagging ensemble technique with a random feature selection strategy, combining multiple individual DTs to perform predictions. Specifically, it generates multiple training subsets via Bootstrap sampling and constructs a decision tree for each subset. During the node-splitting stage of each decision tree, the algorithm selects the optimal splitting feature from a randomly chosen subset of m features. This randomness effectively reduces inter-tree correlation, thereby improving the model’s generalization and robustness. For regression, the number of candidate features m (the max_features hyperparameter) is typically set to m = p/3, where p is the total number of input features. The final prediction of the model is the arithmetic mean of the predictions of the individual decision trees, as shown in Equation (5).

p (x) = \frac{1}{N} \sum_{i = 1}^{N} p_{i} (x)

(5)

where N represents the number of DTs, and p_i(x) represents the prediction result of the i-th DT.
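The three ingredients described above (Bootstrap sampling, a random subset of m features, and the averaging in Equation (5)) can be sketched in Python as follows. The helper names are hypothetical, the integer rounding of p/3 is an assumption, and the per-tree predictions are made-up numbers rather than outputs of fitted trees.

```python
import random

def bootstrap_indices(n_samples, rng):
    """Draw n_samples row indices with replacement (Bootstrap sampling)."""
    return [rng.randrange(n_samples) for _ in range(n_samples)]

def random_feature_subset(n_features, rng):
    """Pick m = p/3 candidate features for one split (rounded down, at least 1)."""
    m = max(1, n_features // 3)
    return sorted(rng.sample(range(n_features), m))

def forest_predict(tree_predictions):
    """Equation (5): arithmetic mean of the individual tree predictions."""
    return sum(tree_predictions) / len(tree_predictions)

rng = random.Random(0)
rows = bootstrap_indices(8, rng)            # training rows for one tree
features = random_feature_subset(10, rng)   # p = 10 input features in this study
p_x = forest_predict([100.0, 110.0, 96.0])  # illustrative outputs of N = 3 trees
print(rows, features, p_x)
```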

3.1.4. Light Gradient Boosting Machine (LightGBM)

Light Gradient Boosting Machine (LightGBM) is a gradient boosting framework specifically designed for high computational speed and efficiency [43]. Built on decision tree algorithms, it significantly reduces training time and memory consumption while maintaining high predictive accuracy. Its core principle is based on an additive process, in which multiple weak learners are sequentially combined to construct a high-performance predictive model. The mathematical formulation for the inference process is provided below.

The model prediction is the sum of the outputs of the individual regression trees, as shown in Equation (6):

\hat{y}_{i} = \sum_{k = 1}^{K} f_{k} (x_{i}), \quad f_{k} \in ℝ

(6)

where f_{k} denotes the k-th regression tree, ℝ represents the space of all possible regression trees, and K indicates the total number of trees.

3.1.5. eXtreme Gradient Boosting (XGBoost)

eXtreme Gradient Boosting (XGBoost) is an ensemble machine learning model based on decision trees [44]. As a tree-based ensemble method, XGBoost employs gradient boosting to enhance performance by combining multiple weak learners (decision trees). The training process is iterative: at each iteration, a decision tree is added to correct errors from previous iterations and improve accuracy. The mathematical model of XGBoost can be expressed by Equation (7):

{\hat{y}}_{i} = \sum_{k = 1}^{K} f_{k} (x_{i}), f_{k} \in ℝ

(7)

Here, \hat{y}_{i} denotes the predicted value for sample i; K is the number of weak learners; f_k denotes the output value of the k-th weak learner; ℝ represents the function space of all possible decision trees; and x_i is the feature vector of sample i.
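The additive form shared by Equations (6) and (7) can be illustrated with a deliberately simplified gradient boosting loop in Python. This is a sketch of the general idea for squared-error loss, not the LightGBM or XGBoost algorithm itself: it uses one feature, depth-one trees (stumps), a constant initial prediction, and a learning rate, all of which are assumptions made for illustration, and it omits regularization and second-order gradient information.

```python
def fit_stump(x, residuals):
    """Fit a one-split regression tree to the current residuals."""
    best = None
    levels = sorted(set(x))
    for low, high in zip(levels, levels[1:]):
        t = (low + high) / 2
        left = [r for xi, r in zip(x, residuals) if xi <= t]
        right = [r for xi, r in zip(x, residuals) if xi > t]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        error = (sum((r - left_mean) ** 2 for r in left)
                 + sum((r - right_mean) ** 2 for r in right))
        if best is None or error < best[0]:
            best = (error, t, left_mean, right_mean)
    _, t, left_mean, right_mean = best
    return lambda xi: left_mean if xi <= t else right_mean

def boost(x, y, n_trees=20, learning_rate=0.3):
    """Add trees one at a time, each fitted to the errors of the current model."""
    base = sum(y) / len(y)  # assumed constant starting prediction
    trees = []
    predictions = [base] * len(y)
    for _ in range(n_trees):
        residuals = [yi - pi for yi, pi in zip(y, predictions)]
        tree = fit_stump(x, residuals)
        trees.append(tree)
        predictions = [pi + learning_rate * tree(xi)
                       for pi, xi in zip(predictions, x)]
    # Final model: base value plus the (shrunken) sum over all K trees.
    return lambda xi: base + learning_rate * sum(tree(xi) for tree in trees)

x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [10.0, 12.0, 11.0, 30.0, 31.0, 29.0]
model = boost(x, y)
mean_y = sum(y) / len(y)
mse_mean = sum((yi - mean_y) ** 2 for yi in y) / len(y)
mse_boost = sum((yi - model(xi)) ** 2 for xi, yi in zip(x, y)) / len(y)
print(mse_mean, mse_boost)
```

Each added tree corrects part of the remaining error, which is the iterative behavior described for XGBoost above.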

3.1.6. Model Ensemble MLP-XGBoost

Ensemble learning is an established paradigm in machine learning that enhances the predictive performance and generalization of a model by leveraging the strengths of multiple base learners. The essence of this approach is to combine multiple weak learners through sample weighting and learner weighting, resulting in an ensemble learner that achieves superior performance and stronger generalization compared to any individual weak learner. An effective ensemble model requires that the base learners possess both a certain level of accuracy and sufficient diversity.

The dataset is first split into a training set, a validation set, and a test set at a ratio of 7:1:2. The MLP and XGBoost models are trained on the training set, and their hyperparameters are optimized using the Caterpillar Fungus Optimizer (CFO) algorithm [39]. Let the actual shear capacity in the validation set be y_v. The predicted shear capacities of the MLP and XGBoost models on the validation set are denoted as y_v,MLP and y_v,XGBoost, respectively. The mean absolute errors between the actual and predicted values on the validation set, MAE_MLP and MAE_XGBoost, are calculated according to Equation (8), where n is the number of samples in the validation set. Finally, the ensemble weights of the two models are determined from these validation errors according to Equation (9), so that the model with the smaller error receives the larger weight.

MAE_{MLP} = \frac{1}{n} \sum |y_{v, MLP} - y_{v}|, \quad MAE_{XGBoost} = \frac{1}{n} \sum |y_{v, XGBoost} - y_{v}|

(8)

W_{MLP} = \frac{MAE_{XGBoost}}{MAE_{XGBoost} + MAE_{MLP}}, \quad W_{XGBoost} = \frac{MAE_{MLP}}{MAE_{XGBoost} + MAE_{MLP}}

(9)

When a significant discrepancy exists between the validation errors of the two base models, the better-performing model is selected as the final output to ensure higher accuracy. To implement this strategy, an error threshold, a = 0.1, is defined. Relative error analysis of the MLP and XGBoost models indicates that more than 50% of the relative errors between the predicted values and the experimental values fall within the range of −0.1 to 0.1. Therefore, a = 0.1 is selected as the threshold parameter. Functionally, this threshold avoids two extreme scenarios: if a is set too small, the strategy will frequently assign weights of 0 or 1, effectively reducing the ensemble to a single-model selection process and undermining the stability of weighted fusion; conversely, if a is too large, the model will rarely trigger the selection mechanism, forcing a weighted average even when there is a significant performance gap between the two base models, which may degrade prediction accuracy. Selecting a = 0.1 therefore balances the stability of weight computation against the rationality of the ensemble decision. Accordingly, when the relative difference between the errors of the two models exceeds this threshold, the weights are assigned as shown in Equation (10).

\text{if } \frac{|MAE_{MLP} - MAE_{XGBoost}|}{\max (MAE_{MLP}, MAE_{XGBoost})} > a: \quad \begin{cases} W_{MLP} = 0, \; W_{XGBoost} = 1, & MAE_{MLP} > MAE_{XGBoost} \\ W_{MLP} = 1, \; W_{XGBoost} = 0, & MAE_{MLP} < MAE_{XGBoost} \end{cases}

(10)

On the test set, the MLP and XGBoost predictions are y_t,MLP and y_t,XGBoost, respectively. The final prediction result after model ensembling is given by:

y_{t} = W_{MLP} \cdot y_{t, MLP} + W_{XGBoost} \cdot y_{t, XGBoost}
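The weighting scheme of Equations (8) to (10) and the final fusion step can be summarized in a short Python sketch. It is an illustrative reading of the equations, not the authors' code: the function names are hypothetical, the denominator of the threshold test is taken to be the larger of the two MAE values, and the equal-weight fallback for two zero errors is an added assumption to avoid division by zero.

```python
def mae(y_true, y_pred):
    """Mean absolute error, as in Equation (8)."""
    return sum(abs(p - t) for t, p in zip(y_true, y_pred)) / len(y_true)

def ensemble_weights(mae_mlp, mae_xgb, a=0.1):
    """Return (W_MLP, W_XGBoost) from the validation errors."""
    largest = max(mae_mlp, mae_xgb)
    if largest == 0.0:
        return 0.5, 0.5  # assumption: both models are perfect, weight equally
    relative_gap = abs(mae_mlp - mae_xgb) / largest
    if relative_gap > a:
        # Equation (10): keep only the model with the smaller error.
        return (0.0, 1.0) if mae_mlp > mae_xgb else (1.0, 0.0)
    # Equation (9): inverse-error weighting.
    total = mae_mlp + mae_xgb
    return mae_xgb / total, mae_mlp / total

def ensemble_predict(y_mlp, y_xgb, w_mlp, w_xgb):
    """y_t = W_MLP * y_t,MLP + W_XGBoost * y_t,XGBoost."""
    return [w_mlp * m + w_xgb * x for m, x in zip(y_mlp, y_xgb)]

# Illustrative validation errors (kN): the relative gap is 0.05, so both are kept.
w_close = ensemble_weights(4.0, 3.8)
# Relative gap is 0.4, so only the better model (XGBoost here) is used.
w_far = ensemble_weights(5.0, 3.0)
y_t = ensemble_predict([100.0, 200.0], [110.0, 190.0], *w_close)
print(w_close, w_far, y_t)
```

Because the weights are fixed from validation errors, the test set plays no part in choosing them.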

To verify the rationality of the selected a value, a systematic sensitivity analysis was conducted. Five candidate values, 0.05, 0.10, 0.15, 0.20, and 0.25, were selected for comparison. Key evaluation metrics, including the coefficient of determination (R²), root mean square error (RMSE), and mean absolute error (MAE), were compared across the threshold levels, and the results are presented in Table 2. When a varied within the range of 0.05 to 0.25, the prediction metrics fluctuated only within a very limited range, with R² varying by merely 0.7%, and no significant improvement or deterioration in performance was observed. The best performance across all metrics was obtained at a = 0.10. These results indicate that the predictive performance of the ensemble model is relatively insensitive to this parameter, and that the outputs of the two base models can be integrated smoothly over a relatively wide range of a without overreacting to local fluctuations.

3.2. Performance Indicators

This study uses five evaluation metrics to quantitatively assess model performance: Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), Root Mean Square Error (RMSE), Mean Square Error (MSE), and the Coefficient of Determination (R²). MAPE measures the percentage deviation between true values and predicted values, with smaller values indicating lower prediction errors. RMSE reflects the standard deviation of prediction errors, where smaller values denote higher predictive accuracy. R² evaluates the overall goodness-of-fit of the model; values closer to 1 indicate a better fit. The formulas for all metrics are summarized in Table 3.
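For reference, the five metrics can be computed with the short Python function below. It uses the standard textbook definitions, which are assumed to match the formulas in Table 3 (not reproduced in this section); MAPE is expressed as a percentage, and the sample values are illustrative.

```python
import math

def regression_metrics(y_true, y_pred):
    """Return MAE, MAPE (%), RMSE, MSE and R2 under the standard definitions."""
    n = len(y_true)
    errors = [p - t for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n
    mse = sum(e * e for e in errors) / n
    rmse = math.sqrt(mse)
    # MAPE requires non-zero true values, which holds for shear capacities.
    mape = 100.0 * sum(abs(e / t) for e, t in zip(errors, y_true)) / n
    mean_true = sum(y_true) / n
    ss_total = sum((t - mean_true) ** 2 for t in y_true)
    ss_residual = sum(e * e for e in errors)
    r2 = 1.0 - ss_residual / ss_total
    return {"MAE": mae, "MAPE": mape, "RMSE": rmse, "MSE": mse, "R2": r2}

# Illustrative measured and predicted shear capacities (kN).
metrics = regression_metrics([100.0, 200.0, 300.0], [110.0, 190.0, 300.0])
print(metrics)
```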

3.3. Performance Optimization

3.3.1. 10-Fold Cross-Validation

To mitigate overfitting, 10-fold cross-validation was employed during model training. This method strikes an optimal balance between computational efficiency and predictive accuracy and is widely adopted in machine learning applications [45,46,47]. The training set is randomly divided into ten equally sized folds. In each iteration, one fold serves as the validation set, while the remaining nine form the training set. This process is repeated ten times, ensuring that each fold is used as the validation set once. The final performance is averaged across all iterations. Figure 5 illustrates the principle of 10-fold cross-validation. To verify the robustness of the MLP-XGBoost model, a repeated cross-validation strategy was employed in this study, specifically conducting five independent runs of 10-fold cross-validation. Table 4 presents the performance metrics of the MLP-XGBoost model from the repeated cross-validation. Figure 6a shows the cross-validation MAE values of the MLP-XGBoost ensemble model for FRP-RC beams, with the MAE values for all folds remaining within 15% of the mean, confirming that the model exhibits good stability and strong generalization ability during training.
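The repeated 10-fold procedure (five repetitions) can be sketched as an index generator in Python. The function name, the seed, and the sample count of 100 are assumptions for illustration; in each repetition the data are reshuffled before being cut into folds.

```python
import random

def repeated_kfold(n_samples, n_splits=10, n_repeats=5, seed=0):
    """Yield (repeat, fold, train_indices, validation_indices)."""
    rng = random.Random(seed)
    for repeat in range(n_repeats):
        indices = list(range(n_samples))
        rng.shuffle(indices)
        # Fold sizes differ by at most one sample.
        folds = [indices[i::n_splits] for i in range(n_splits)]
        for k, validation in enumerate(folds):
            training = [j for f, fold in enumerate(folds) if f != k for j in fold]
            yield repeat, k, training, validation

splits = list(repeated_kfold(100))
print(len(splits))  # 5 repetitions x 10 folds = 50 train/validation pairs
```

Averaging a metric such as MAE over all 50 pairs, and inspecting its spread, gives the kind of stability check reported in Table 4 and Figure 6a.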

To further verify the generalization ability of the model, leave-one-study-out cross-validation was also performed. First, all collected literature sources were sorted chronologically by publication year, and the entire dataset was partitioned into ten subsets with comparable sample sizes. During grouping, all experimental data from the same independent study were kept in the same subset without random splitting, which ensures the independence of data sources between the training samples and the test samples. For multiple independent studies published in the same year, the corresponding data were allocated to different subsets. Where only a limited number of studies were available in a single year, those datasets were merged with experimental data from subsequent years to form an integrated subset. The results of the leave-one-study-out cross-validation are presented in Figure 6b. They show that the proposed MLP-XGBoost hybrid model maintains stable and reliable prediction accuracy across the different data folds. Figure 7 illustrates the flowchart of dataset partitioning and application in this study.
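The key constraint of this validation, that samples from one study never appear on both sides of a split, can be sketched in Python as follows. The greedy size-balancing rule used here is an assumption and does not reproduce the chronological grouping described above; the study labels are hypothetical.

```python
from collections import defaultdict

def grouped_folds(study_ids, n_folds=10):
    """Assign whole studies to folds so that no study is split across folds."""
    by_study = defaultdict(list)
    for index, study in enumerate(study_ids):
        by_study[study].append(index)
    folds = [[] for _ in range(n_folds)]
    # Assumption: place the largest studies first, each into the smallest fold.
    for study in sorted(by_study, key=lambda s: (-len(by_study[s]), s)):
        min(folds, key=len).extend(by_study[study])
    return folds

# Hypothetical labels: 12 beams drawn from four studies A to D.
study_ids = ["A"] * 4 + ["B"] * 3 + ["C"] * 3 + ["D"] * 2
folds = grouped_folds(study_ids, n_folds=2)
for k, test_indices in enumerate(folds):
    train_indices = [i for f, fold in enumerate(folds) if f != k for i in fold]
    print(k, sorted(test_indices), sorted(train_indices))
```

Each fold then serves once as the held-out test subset while the model is trained on the remaining folds.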

3.3.2. Caterpillar Fungus Optimizer Intelligent Optimization Algorithm

During hyperparameter tuning, the independent validation set was used to evaluate the candidate configurations. To ensure optimal performance, the Caterpillar Fungus Optimizer (CFO) algorithm was applied to perform a global search for the optimal combination of hyperparameters [39]. The optimized hyperparameters for all models are summarized in Table 5, while hyperparameters not listed are set to their default values in the Statistics and Machine Learning Toolbox. The flowchart of the CFO algorithm is illustrated in Figure 8.

The Caterpillar Fungus Optimizer (CFO) is a novel bio-inspired metaheuristic algorithm inspired by the parasitic propagation and spore dispersal behavior of Ophiocordyceps sinensis. The algorithm performs optimization through two core phases: global exploration and local exploitation. In the exploration phase, individuals conduct adaptive search using two complementary operators: the wave advance operator for extensive spatial exploration and the spiral ascent operator for refined local neighborhood search. These operators dynamically adjust the search step sizes and trajectories to maintain population diversity. In the parasitic exploitation phase, CFO implements two types of parasitic behavior: re-parasitism, which enhances global search and helps escape local optima, and optimal parasitism, which strengthens convergence toward high-quality solutions. The position update mechanism integrates random walk, adaptive step size adjustment, and elite individual guidance, thereby achieving a balance between exploration and exploitation throughout the iterative process. In this study, the maximum number of iterations is set to 50, and the population size is 15. Figure 9 presents the convergence curve of hyperparameter optimization. To evaluate the algorithm’s convergence behavior, the fitness value of the best individual in each generation was monitored, confirming stable convergence within 50 iterations. Compared with traditional grid search and random search, the CFO optimization algorithm offers higher computational efficiency, avoiding the inherent curse of dimensionality in grid search and the blind nature of random search. Its guided search can progressively converge toward the global optimum through iterations, demonstrating stronger global optimization capability and effectively avoiding local optima [39].

4. Prediction Performance Comparison

4.1. Model Performance Evaluation

Although the dataset scale is relatively limited, considering the feature correlation analysis presented earlier, the factors influencing the shear capacity prediction exhibit nonlinear and complex feature coupling. Therefore, a model with sufficient expressive capacity is required to capture these intrinsic relationships. We comprehensively compared the proposed ensemble model with a series of representative baseline models with varying levels of complexity, including Decision Tree (DT), Random Forest (RF), LightGBM, standalone MLP, and standalone XGBoost. The results consistently demonstrate that the proposed ensemble framework achieves superior accuracy and stability without showing obvious overfitting. This confirms that the selected model complexity is reasonable and well-balanced with the available data.

In this study, the proposed MLP-XGBoost ensemble model was compared with five conventional machine learning models in predicting the shear capacity of FRP-RC beams. Table 6 presents a comprehensive summary of the performance metrics for all six models, while Figure 10 and Figure 11 illustrate their predictive performance on the training and testing datasets, respectively. To further assess the prediction reliability and potential systematic bias of the MLP-XGBoost model relative to the traditional models, a residual distribution analysis with 95% confidence intervals was conducted. As shown in Figure 10h, the vast majority of residuals for the MLP-XGBoost model lie within the 95% confidence interval and fluctuate randomly around zero, showing no apparent trend or heteroscedasticity. This indicates that the model does not systematically overestimate or underestimate the shear capacity across the entire dataset. Moreover, the confidence interval is relatively narrow, demonstrating that the model’s prediction uncertainty remains low and stable regardless of the magnitude of shear capacity. Collectively, these results indicate that the proposed model possesses satisfactory generalization ability and statistical robustness within the scope of the existing dataset, with its prediction errors consistently constrained within a reasonable confidence interval. Figure 11 presents multiple performance metrics of each model on both the training and testing datasets. Among the traditional models, XGBoost achieves the highest predictive accuracy, followed by MLP, both outperforming other machine learning models, including DT, RF, and LightGBM.

The superior performance of XGBoost can be attributed to its gradient-boosting framework, which builds the model iteratively. In each iteration, the optimization algorithm fits the residuals from the previous iteration. This iterative process allows the model to progressively learn complex patterns in the data, ultimately approximating the true regression function. Meanwhile, the MLP model also demonstrated competitive performance, surpassing other models such as DT, RF, and LightGBM on this dataset.

Overall, the MLP-XGBoost ensemble, which integrates two high-accuracy base models, achieved the best predictive accuracy and generalization capability among all models. Therefore, the MLP-XGBoost model can be regarded as the preferred choice for predicting the shear capacity of FRP-RC beams. To further verify the reliability, soundness, and generalization capability of the model, this study performed the Kolmogorov–Smirnov (K-S) statistical test on the relative errors between the actual and predicted values of the MLP-XGBoost model. The results of the statistical test are shown in Figure 10g, with the corresponding p-values for the training and test sets being 0.243 and 0.369, respectively. Because both p-values exceed 0.05, the null hypothesis of normality cannot be rejected at the 0.05 significance level; the relative errors of both sets are therefore consistent with a normal distribution, which supports the reliability of the model’s predictions.
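A K-S normality check of this kind can be sketched with the standard library alone. The sketch below is an illustration on synthetic relative errors, not the study's data or software. It assumes that the reference normal distribution uses the sample mean and standard deviation, and it uses the asymptotic Kolmogorov series with Stephens' small-sample correction for the p-value. When the parameters are estimated from the same sample, this p-value tends to be conservative.

```python
import math
import random


def normal_cdf(x, mu, sigma):
    """Cumulative distribution function of N(mu, sigma^2)."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def ks_normality(sample):
    """One-sample K-S statistic and approximate p-value against a fitted normal."""
    n = len(sample)
    mu = sum(sample) / n
    sigma = math.sqrt(sum((v - mu) ** 2 for v in sample) / (n - 1))
    d_stat = 0.0
    for i, x in enumerate(sorted(sample)):
        cdf = normal_cdf(x, mu, sigma)
        # Largest gap between the empirical and the theoretical CDF.
        d_stat = max(d_stat, (i + 1) / n - cdf, cdf - i / n)
    # Asymptotic Kolmogorov distribution with Stephens' correction factor.
    lam = (math.sqrt(n) + 0.12 + 0.11 / math.sqrt(n)) * d_stat
    series = sum(
        (-1) ** (k - 1) * math.exp(-2.0 * k * k * lam * lam) for k in range(1, 101)
    )
    p_value = min(1.0, max(0.0, 2.0 * series))
    return d_stat, p_value


rng = random.Random(0)
errors = [rng.gauss(0.0, 0.05) for _ in range(200)]  # synthetic relative errors
d_stat, p_value = ks_normality(errors)
```

A p-value above 0.05 means only that normality is not rejected; it does not prove that the errors are normally distributed.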

4.2. Comparison with the Empirical Model

To conduct a more comprehensive evaluation of the MLP-XGBoost model’s predictive performance and strengthen the reliability of the evaluation results, this study compared the model with existing prediction formulas based on the complete dataset. The specific formulas are detailed in Table 7. The comparison results are presented in Figure 12, and the performance evaluation metrics for the formulas and models are summarized in Table 8. As shown in Figure 12, the existing prediction formulas show noticeable deviations from the actual trends, with their accuracy significantly lower than that of the MLP-XGBoost model. Specifically, the predictions of the ACI 440.1R-15 [48], GB 50608-2020 [49], and Model I [50] formulas are relatively conservative, potentially leading to underutilization of member capacity in practical engineering applications. In contrast, the predictions of the CSA S806-12 [51] formula often exceed the actual shear capacity, which could pose considerable safety risks if applied directly in engineering practice. The data in Table 8 show that the R² values of the existing formulas are below 0.35, indicating notable limitations in their ability to integrate parameters. Furthermore, the MAE, RMSE, MSE, and MAPE values of these formulas are all substantially higher than those of the MLP-XGBoost model. Even Model I, which has the highest prediction accuracy among the existing formulas, performs substantially worse than the MLP-XGBoost model on every metric; in particular, its MAE is more than seven times that of the MLP-XGBoost model. In conclusion, the MLP-XGBoost model proposed in this study outperforms existing prediction formulas in both parameter integration and prediction accuracy.
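For reference, the five metrics named above can be computed as in the following sketch. It assumes the conventional textbook definitions (the paper's own definitions are given elsewhere and are not reproduced here), and the four experimental and predicted values are made-up numbers used only to exercise the function.

```python
import math


def regression_metrics(y_true, y_pred):
    """R2, MAE, MSE, RMSE and MAPE (in percent) under their usual definitions."""
    n = len(y_true)
    errors = [p - t for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n
    mse = sum(e * e for e in errors) / n
    rmse = math.sqrt(mse)
    # MAPE requires non-zero experimental values.
    mape = 100.0 * sum(abs(e / t) for e, t in zip(errors, y_true)) / n
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    r2 = 1.0 - sum(e * e for e in errors) / ss_tot
    return {"R2": r2, "MAE": mae, "MSE": mse, "RMSE": rmse, "MAPE": mape}


y_exp = [100.0, 200.0, 300.0, 400.0]  # hypothetical experimental capacities (kN)
y_hat = [110.0, 190.0, 310.0, 390.0]  # hypothetical predictions (kN)
metrics = regression_metrics(y_exp, y_hat)
```

Note that R² computed this way can be negative for a formula that predicts worse than the mean of the experimental values, which is one reason code formulas can score very low on it.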

In addition, engineering design codes such as ACI 440.1R-15, CSA S806-12, and GB 50608-2020 prioritize structural safety. Their empirical formulas are deliberately made conservative and adjusted using partial safety factors, resulting in relatively conservative predictions of shear capacity. These formulas are calibrated to ensure structural safety and control failure probability rather than to maximize the accuracy of fitting experimental data. In contrast, the proposed machine learning model achieves higher predictive accuracy and lower bias. Nevertheless, appropriate safety calibration and reliability evaluation are still required before its direct application in engineering practice, in order to balance predictive accuracy with structural safety.

Existing code formulas and empirical formulas are mechanism-driven approaches, derived through mechanical analysis and parameter fitting. The inaccuracy of mechanism-driven formulas for predicting the shear behavior of reinforced concrete beams can be attributed to several factors: (1) The shear resistance mechanism is extremely complex, with highly nonlinear and coupled relationships among various parameters. (2) Numerous factors affect shear capacity, and the practical range of parameter values is wide. Traditional formulas are calibrated based on limited experimental data under specific conditions, resulting in a narrow applicability range. (3) Many assumptions and simplifications are introduced during formula derivation, making it impossible to capture threshold effects, saturation characteristics, and complex interactions among input variables. (4) For practical engineering safety, code formulas generally incorporate sufficient safety margins, leading to relatively conservative predictions. In contrast, machine learning algorithms are data-driven approaches that can directly capture complex mapping relationships between input and output variables, enabling the construction of more accurate predictive models. It should be noted that the predictive accuracy and reliability of machine learning models strongly depend on the size and quality of the dataset.

5. Interpretation of the SHAP Model

5.1. SHapley Additive Explanations (SHAP) Method

SHapley Additive exPlanations (SHAP) is a game-theoretic approach grounded in the concept of Shapley values from cooperative game theory. It quantifies the contribution of each feature to the model’s prediction by treating the features as players and distributing the prediction fairly among them [52]. SHAP is categorized as a post hoc explanation method: by perturbing the input features and measuring the resulting changes in predictions, it can reveal the specific effect of each feature on the model output in any black-box model. Its mathematical formulation is presented in Equation (11).

f(x) = φ_{0} + \sum_{j = 1}^{M} φ_{j} \bar{x}_{j}

(11)

where f(x) and φ_{0} are the mapping function and the average predicted value of the model, respectively; φ_{j} is the SHAP value (contribution) of the j-th input variable, and \bar{x}_{j} is the corresponding entry of the joint vector, indicating whether the j-th feature is included; M is the number of input variables.
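The additive form of Equation (11) can be checked on a small example by computing exact Shapley values through enumeration of all feature coalitions. The sketch below is an illustration only: `toy_model` is a hypothetical three-feature function, and features outside a coalition are assumed to be replaced by fixed baseline values, which is one common convention rather than necessarily the one used in this study.

```python
from itertools import combinations
from math import factorial


def toy_model(x):
    """Hypothetical nonlinear model with an interaction term."""
    return 2.0 * x[0] + x[1] * x[2] - 0.5 * x[2]


def shapley_values(model, x, baseline):
    """Exact Shapley values by enumerating every coalition of features."""
    m = len(x)

    def value(coalition):
        # Features outside the coalition take their baseline values.
        z = [x[j] if j in coalition else baseline[j] for j in range(m)]
        return model(z)

    phi = [0.0] * m
    for j in range(m):
        others = [k for k in range(m) if k != j]
        for size in range(m):
            weight = factorial(size) * factorial(m - size - 1) / factorial(m)
            for subset in combinations(others, size):
                s = set(subset)
                # Weighted marginal contribution of feature j to coalition s.
                phi[j] += weight * (value(s | {j}) - value(s))
    phi0 = value(set())  # prediction with every feature at its baseline
    return phi0, phi


x = [1.0, 2.0, 3.0]
baseline = [0.0, 0.0, 0.0]
phi0, phi = shapley_values(toy_model, x, baseline)
```

With all features present, the base value plus the three contributions reproduces `toy_model(x)` exactly, and the interaction term is split equally between the two features involved. Exact enumeration grows exponentially with the number of features, which is why practical SHAP implementations rely on approximations or model-specific algorithms.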

5.2. Global Interpretation

In this study, the SHAP method was employed to interpret the MLP-XGBoost model using the test set. The interpretation results are presented in Figure 13, with Figure 13a showing the SHAP beeswarm plot and Figure 13b presenting the feature contribution bar and donut charts. As shown in Figure 13a, the input features are arranged along the y-axis according to their contribution to the shear capacity of the structural members, with the most influential factor placed at the top of the plot. Each point represents the SHAP value of a specific feature for an individual sample from the dataset. The color of each point encodes the feature value according to the scale on the right, with the transition from red to blue indicating an increase in that value. For the shear span-to-depth ratio, the blue points are concentrated on the negative semi-axis, indicating a negative correlation with the shear capacity of the members. The bar chart in Figure 13b presents the mean absolute SHAP value of each input feature across the full sample pool, which is used to assess feature importance: a higher value indicates a greater impact on member shear capacity. The donut chart shows the share of each feature in the total of these SHAP values across all samples, providing an intuitive representation of the relative influence of each input feature. Overall, the shear span-to-depth ratio has the greatest impact on the shear capacity of members, followed by the effective depth. According to the ranking of mean absolute SHAP values, the shear span-to-depth ratio, effective depth, stirrup ratio, concrete strength, and longitudinal reinforcement ratio are identified as the five key parameters influencing the shear capacity of FRP-RC beams.
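The quantities behind a bar chart and a donut chart of this kind can be computed as in the sketch below. The SHAP matrix is a made-up four-sample example with three illustrative feature labels; it does not reproduce the values in Figure 13.

```python
def shap_importance(shap_matrix, feature_names):
    """Mean |SHAP| per feature, percentage share, and importance ranking."""
    n = len(shap_matrix)
    mean_abs = {
        name: sum(abs(row[j]) for row in shap_matrix) / n
        for j, name in enumerate(feature_names)
    }
    total = sum(mean_abs.values())
    share = {name: 100.0 * v / total for name, v in mean_abs.items()}
    ranking = sorted(feature_names, key=lambda name: mean_abs[name], reverse=True)
    return mean_abs, share, ranking


# Rows are samples, columns are features (hypothetical SHAP values).
features = ["a/d", "d0", "rho_v"]
shap_matrix = [
    [-0.40, 0.20, 0.10],
    [0.60, -0.10, 0.00],
    [-0.50, 0.30, -0.20],
    [0.50, 0.20, 0.10],
]
mean_abs, share, ranking = shap_importance(shap_matrix, features)
```

Taking absolute values before averaging matters: a feature whose positive and negative contributions cancel on average can still be highly influential for individual samples.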

5.3. Parametric Analysis and Predictive Equation

SHAP analysis quantifies the statistical importance and correlation trends of each input feature, rather than directly reflecting inherent physical causality. The identified feature influences and variation trends represent statistical patterns derived from the experimental dataset. The interpretation of the physical mechanism is supported by integrating the SHAP statistical results with classical structural mechanics and shear resistance theory.

Figure 14 shows the variation patterns of SHAP values with respect to each input feature. Figure 14a illustrates the relationship between the shear span-to-depth ratio and SHAP values. It can be observed that the SHAP values decrease initially at a rapid rate and then gradually as the shear span-to-depth ratio increases, indicating a negative nonlinear correlation between the shear span-to-depth ratio and the beam’s shear capacity. This observation is consistent with the findings of Razaqpur and Isgor [53]. Figure 14b shows the variation in SHAP values with effective depth. As shown in the figure, the SHAP values increase as the effective depth rises, suggesting a significant size-dependent effect on shear capacity and a positive correlation between effective depth and shear capacity. Figure 14c and Figure 14d display the influence of stirrup ratio and concrete strength on SHAP values, respectively. In both cases, the SHAP values increase with the corresponding input features. Figure 14c shows that when the stirrup ratio is 0, the SHAP values are negative, indicating that the presence of stirrups significantly enhances the shear capacity of FRP-RC beams. Figure 14e shows the influence of the longitudinal reinforcement ratio on the SHAP values. The trend of the fitted curve clearly indicates that the SHAP values first increase and then decrease as the reinforcement ratio increases. This suggests that, within a certain range, increasing the reinforcement ratio can effectively improve the shear capacity of FRP-RC beams. However, when the longitudinal reinforcement ratio becomes too high, the interaction between concrete and FRP bars deteriorates, leading to stress concentration around the longitudinal reinforcement and a subsequent decrease in shear capacity [54]. Since FRP bars are inherently linear elastic materials, an over-reinforced condition exacerbates the brittle failure characteristics of FRP-RC beams. 
As shown in Figure 14f, the strength of the longitudinal reinforcement has a negligible effect on the shear capacity of FRP-RC beams, and the associated SHAP values exhibit considerable dispersion. Figure 14g illustrates the effect of section width on the distribution of SHAP values. Compared with effective depth, section width has a relatively smaller impact on shear capacity. As shown in Figure 14g, within the sample range considered in this study, SHAP values increase with section width. Figure 14h and Figure 14i show the influence of the elastic modulus of longitudinal reinforcement and stirrups on SHAP values, respectively. It can be observed that over 90% of the SHAP value points fall within the range of −0.12 to 0.12. Figure 14j illustrates the effect of stirrup strength on SHAP values. As shown, more than 90% of the SHAP values for beams without web reinforcement are negative, while those for beams with web reinforcement increase with stirrup strength. However, the SHAP values scatter considerably around the fitted curve for this parameter.

To address the insufficient prediction accuracy caused by the limited parameters in existing equations, this study employed a multiple regression model to perform linear regression analysis on 10 input features, resulting in Equation (12). Furthermore, based on the shear resistance mechanism of FRP-RC beams and the analysis of the influence of various parameters on SHAP values and shear capacity, a prediction equation for the shear capacity of FRP-RC beams was developed. This equation takes into account the variation patterns of SHAP values with respect to the parameters. A comparative study was then conducted between this equation, the traditional MLR model, and Model I, which exhibits the highest prediction accuracy among existing equations. It should be noted that Equation (12) does not consider the physical significance of each parameter.

\begin{array}{l} MLR (V_{u}) = 0.163 \cdot b + 0.4659 \cdot d_{0} + 0.239 \cdot f_{cu} - 0.3584 \cdot (\frac{a}{d}) + 0.063 \cdot ρ_{f} \\ - 0.0285 \cdot E_{f} + 0.0444 \cdot f_{fu} + 0.0582 \cdot ρ_{v} + 0.0027 \cdot E_{fv} + 0.0414 \cdot f_{fv} \end{array}

(12)

Based on the SHAP analysis results shown in Figure 13, the five input features with the largest mean absolute SHAP values were identified as the key factors influencing the shear capacity of beams and were selected as the independent variables for formula derivation. Subsequently, Figure 14 presents a detailed analysis of the distribution characteristics and variation trends of SHAP values for each parameter. For each feature, functions consistent with the scatter distribution trends were chosen to fit the relationship with the SHAP values, as detailed below. The SHAP values corresponding to the shear span-to-depth ratio exhibit a skewed distribution, showing a trend of initially rapid decrease followed by gradual decline. This indicates that, as the shear span-to-depth ratio increases, the shear-resisting mechanism of the beam gradually transitions from arch action to beam action, and the trend of the Bradley function can accurately capture this transformation. Accordingly, the Bradley function is employed for fitting in this study, yielding Equation (13).

SHAP (a / d) = 0.38043 \cdot \ln (1.97888 \cdot \ln (a / d))

(13)

The SHAP values of the effective depth show a monotonically increasing trend. As the effective depth increases, both the shear cross-sectional height and the internal lever arm of the beam increase. This not only enlarges the shear-resisting area of the concrete but also extends the load transfer path of diagonal cracks, thereby enhancing the inherent shear resistance of the concrete [55]. Therefore, the Hill function was selected for fitting, as shown in Equation (14).

SHAP (d_{0}) = \frac{- 0.31 + (3.156 + 0.309) \cdot {d_{0}}^{2}}{{863.2}^{2} + {d_{0}}^{2}}

(14)

The SHAP values of the stirrup ratio, concrete strength, and longitudinal reinforcement ratio all exhibit a clear increasing trend, although with greater dispersion than the first two features. Therefore, quadratic or cubic polynomial functions were employed to fit these features, as presented in Equations (15)–(17).

SHAP (ρ_{v}) = - 0.0987 + 0.664 \cdot ρ_{v} - 0.063 \cdot {ρ_{v}}^{2}

(15)

SHAP (f_{cu}) = - 0.0673 + 0.0021 \cdot f_{cu} + 0.083 \cdot {f_{cu}}^{2}

(16)

SHAP (ρ_{f}) = - 0.178 + 0.077 \cdot ρ_{f} + 0.018 \cdot {ρ_{f}}^{2} - 0.00342 \cdot {ρ_{f}}^{3}

(17)

It is worth noting that the R² values of all fitting curves exceed 0.7, confirming the presence of stable and continuous functional relationships between each parameter and its corresponding SHAP contribution. Finally, the five fitted SHAP values were employed as inputs in a multiple linear regression analysis. Based on the additivity of SHAP values, a closed-form prediction equation for the shear capacity of FRP-RC beams was established, as shown in Equation (18).

\begin{array}{l} SHAP (V_{u}) = 484.9 + 18.6 \times SHAP (a / d) + 1.827 \times SHAP (d_{0}) + 7.78 \times SHAP (ρ_{f}) \\ + 2.92 \times SHAP (f_{cu}) + 0.63 \times SHAP (ρ_{v}) \end{array}

(18)

The coefficients in Equation (18) are determined using the ordinary least squares (OLS) method. This regression establishes a linear additive relationship between the shear capacity V_u of the beam and the SHAP contributions of the key features. The OLS algorithm solves for the optimal coefficients by minimizing the sum of squared residuals between the predicted and observed V_u values. The formula retains only the five features with the highest mean absolute SHAP values and discards the remaining variables as redundant. Low-complexity functions, namely the Hill function, the Bradley function, and low-order polynomials, are employed for the univariate fits, whose R² values range from 0.7 to 0.81. The final formula adopts a simple linear superposition form, which limits the risk of overfitting.
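The OLS step can be sketched as follows, using the normal equations solved by Gaussian elimination. The data are synthetic and noise-free, with two hypothetical SHAP-type inputs and arbitrarily chosen coefficients, so the example only demonstrates that the procedure recovers known coefficients; it does not reproduce the values in Equation (18).

```python
import random


def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        tail = sum(aug[i][c] * x[c] for c in range(i + 1, n))
        x[i] = (aug[i][n] - tail) / aug[i][i]
    return x


def ols_fit(features, target):
    """Return [intercept, b1, ..., bk] minimising the sum of squared residuals."""
    design = [[1.0] + list(row) for row in features]  # prepend intercept column
    k = len(design[0])
    xtx = [[sum(r[i] * r[j] for r in design) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(design, target)) for i in range(k)]
    return solve_linear_system(xtx, xty)


rng = random.Random(0)
true_coefs = [5.0, 2.0, -1.0]  # arbitrary synthetic intercept and slopes
shap_inputs = [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(50)]
v_u = [true_coefs[0] + true_coefs[1] * s1 + true_coefs[2] * s2 for s1, s2 in shap_inputs]
coefs = ols_fit(shap_inputs, v_u)
```

With real, noisy data the recovered coefficients would only approximate the underlying relationship, and the normal equations can become ill-conditioned when the inputs are strongly correlated.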

The SHAP-based prediction equation is established based on standardized machine learning features and dimensionless SHAP values. Essentially, this equation constitutes an empirical, data-driven statistical model; it is constructed by fitting the characteristic trends of influencing factors identified through SHAP interpretability analysis, rather than being a theoretical formula rigorously derived from first principles of structural mechanics. The equation does not satisfy strict dimensional homogeneity in the mechanical sense and is valid only under a specific unit system (mm, MPa, kN). All parameters in the formula are defined with fixed units corresponding to the training dataset, and high numerical accuracy is guaranteed only if these units are applied consistently. The feasibility of this method has been validated in the relevant literature [56,57]. Furthermore, the prediction accuracy and generalizability of this equation are inherently constrained by the parameter distribution and sample coverage of the existing experimental database.

The predictive performance of Equations (12) and (18), and Model I was compared, with the results presented in Figure 15. As shown in Figure 15a, Equation (18) achieves the highest R² and demonstrates the best overall fitting performance, confirming the effectiveness of the MLP-XGBoost model in selecting key predictive parameters. Figure 15b indicates that Equation (18) yields lower values of MAE, RMSE, and MAPE for the shear capacity of FRP-RC beams. Compared to Model I (which has the lowest MAPE among existing formulas), Equation (18) reduces the MAE by 67.52%. It is noteworthy that the shear capacity prediction equation proposed in this study is based on SHAP analysis results, and its prediction logic closely aligns with the shear resistance mechanism of FRP-RC beams, achieving a coherent integration of model predictions with the mechanical behavior of the parameters.

6. Conclusions

This study focuses on constructing a reliable machine learning model based on a weighted ensemble of heterogeneous machine learning models to predict the shear capacity of FRP-RC beams. The ensemble model integrates the MLP and XGBoost algorithms. The results demonstrate that the proposed MLP-XGBoost ensemble model outperforms all conventional models and empirical formulas considered in this study in terms of prediction accuracy. The 10-fold cross-validation results indicate that the stability of the ensemble model is superior to that of the conventional base models. In addition, the SHAP interpretability framework was adopted to dissect the prediction mechanism of the MLP-XGBoost model and to identify the influence patterns of individual input features on shear capacity. Based on the findings derived from SHAP interpretation, this research developed an explicit formula for predicting the shear capacity of FRP-RC beams. The main conclusions are presented as follows:

(1) Compared with existing calculation formulas, all machine learning models exhibit significantly higher accuracy in predicting the shear capacity of FRP-RC beams. Among them, the weighted ensemble model MLP-XGBoost achieves the best predictive performance. Results from the cross-validation indicate that the predictive stability of the MLP-XGBoost model surpasses that of the conventional MLP and XGBoost models; across the 10 rounds of training, the model’s Mean Absolute Error (MAE) fluctuates within ±15% of its average MAE.
(2) The Shapley Additive Explanations (SHAP) algorithm can reveal the contribution of various input features to the shear capacity of FRP-RC beams. The analysis results indicate that the shear span-to-depth ratio has the greatest influence and is negatively correlated with the shear capacity. In comparison, all other influencing factors contribute positively to the shear capacity.
(3) Based on the SHAP interpretability analysis results, five key parameters, namely the shear span-to-depth ratio, effective depth, stirrup ratio, concrete strength, and longitudinal reinforcement ratio, were selected to establish an explicit prediction formula for the shear capacity of FRP-reinforced concrete beams with high prediction accuracy. Verification shows that the predicted values of this formula are in good agreement with the experimental data.

7. Practical Significance and Future Prospects

The research findings of this study have valuable practical significance for engineering design. Compared with conventional machine learning models and empirical formulas, the established MLP-XGBoost ensemble model achieves higher prediction accuracy and stability for the shear capacity of FRP-reinforced concrete beams, and can serve as a reliable tool for engineering evaluation. Through SHAP interpretability analysis, the influence of each parameter and the inherent shear-resistance mechanism are quantitatively clarified, providing a theoretical basis for the optimization of structural design parameters. Meanwhile, the derived high-precision explicit prediction formula for shear capacity can be conveniently applied by engineers for routine structural calculations, shear capacity verification, and component optimization design of FRP-RC beams in practical engineering.

It should be noted that the machine learning model proposed in this study, as well as the SHAP-derived predictive equation, can provide reasonably reliable predictions only within the applicable scope of the current database. Furthermore, this study focuses solely on the shear capacity of rectangular FRP-reinforced concrete beams, with parameter distributions mainly concentrated under conventional working conditions, while investigations under extreme conditions remain limited. In light of this, future research could supplement experimental tests under extreme conditions, combined with extended finite element analyses, to further expand the existing database, thereby better supporting the training and optimization of machine learning models.

Author Contributions

C.X.: Writing—review and editing, Investigation, Data curation. Y.F.: Writing—original draft, Methodology, Formal analysis, Data curation. S.C.: Writing—review and editing, Resources, Methodology, Funding acquisition, Data curation. H.Z.: Writing—original draft, Data curation. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Nature Science Foundation of Guangxi (Grant No. 2025GXNSFHA069213), the Science and Technology Project of Guangxi (Grant No. AD23026136), and the Doctoral Foundation of Guangxi University of Science and Technology (Grant No. 21Z58). The APC was funded by the Doctoral Foundation of Guangxi University of Science and Technology (Grant No. 21Z58).

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors hereby state that there are no competing financial or non-financial interests that could be perceived as influencing the work presented in this paper.

Appendix A. List of References for Database

Reference	Number of Specimens	Stirrups	Longitudinal Bar	Compressive Strength of Concrete (MPa)	V_exp (kN)
Evan C. Bentz [58]	21	with or without stirrups	GFRP	35–49	54.5–690
Ahmed S F [59]	4	without stirrups	GFRP and CFRP	38.7–39.3	1191–1906
Chung Ho Kim [60]	80	without stirrups	GFRP and CFRP	30–40.3	16.3–170.2
Douglas Tomlinson [61]	9	with or without stirrups	BFRP	51–60	23–106.9
Mohsen A [62]	12	with or without stirrups	BFRP	35.9	29–195.8
Mohamed Said [63]	10	with or without stirrups	GFRP	24.5–74.4	109.82–351
G.B. Maranan [64]	18	with or without stirrups	GFRP	31	126–723
Farid Abed [65]	10	without stirrups	BFRP	50	79–240.5
Ghazi B J [66]	19	with or without stirrups	BFRP	73.4	107.3–402.2
M. Krall [67]	17	with or without stirrups	GFRP	47.3	125.1–466.9
Tung T. Tran [68]	18	without stirrups	BFRP	28.7–66	28.7–53.5
Farid Abed [69]	13	without stirrups	BFRP	45–60	189.39–385.79
Abathar Al-H [70]	14	with or without stirrups	BFRP	44.5	54.7–142.2
Fei Peng [71]	11	without stirrups	GFRP	55.5–62.7	235–290
Zhiqiang Gu [72]	7	without stirrups	CFRP and GFRP	30	127.5–245.6
Fei Peng [73]	14	with or without stirrups	GFRP	40.3	1179–2045
Lingzhu Zhou [74]	9	with or without stirrups	GFRP	48–54	277–962
Baoqiang Liao [75]	9	with stirrups	BFRP	55	211–458
Hetao Qi [76]	5	with stirrups	CFRP	46.1	481–901
Kangkang Yang [4]	13	without stirrups	GFRP	52.8–59.7	122–235.1
A. Ghani R [77]	7	without stirrups	CFRP	40.5–49	36.11–96.18
M. S. Alam [78]	12	without stirrups	GFRP and CFRP	34.5–44.7	60.1–200.1
Ahmed El Refai [79]	10	without stirrups	BFRP	49	33.8–93.6
Omar Salman [80]	10	without stirrups	GFRP	144	348–914
Mahdi Nematzadeh [81]	15	without stirrups	GFRP	54–64	51–143
Ghazi B J [82]	22	without stirrups	BFRP	42.2–73.4	107–331
Ashraf F. Ashour [56]	6	without stirrups	CFRP	27–35	35.17–72.32
Koray Tureyen [83]	9	without stirrups	GFRP	39.7–43.6	94.7–203.7
Andrea Rizzo [84]	9	with stirrups	CFRP	29.3	244.3–352.8
Gyuseon Kim [85]	16	without stirrups	CFRP	34.7	62.5–182
Amr M.A. Moussa [86]	21	with or without stirrups	CFRP	48	113–261
Bo Song [87]	11	with stirrups	BFRP	34.6	130.3–735.4
Chenchen Li [88]	7	with stirrups	GFRP	47.8	1033.6–1954.7
Rui Zhou [89]	5	with stirrups	BFRP	54.26–62.68	267.86–568.45
Shoutan Song [90]	6	with stirrups	CFRP	118.5	361–460
Cheng Chen [91]	12	with stirrups	FFRP	42.8	319.6–443.9
Shiwen Han [92]	22	with or without stirrups	CFRP	37	137.1–177.7
Adel Younis [93]	6	with or without stirrups	GFRP	46–47.7	64.2–101.2
Abathar Al-H [94]	14	with stirrups	GFRP	35.05	99–196
Wenlong Li [50]	6	with stirrups	GFRP	28.67	159–206
Fen Zhou [95]	7	with stirrups	BFRP	128.15	250–497.02
Jiamei Lv [96]	4	with stirrups	GFRP	34.6–39.3	277–476
Yuan Ye [97]	14	with stirrups	GFRP	42	92.9–168.4
Zhiquan Xing [98]	70	without stirrups	GFRP	29.35–33.87	17.3–50.91

References

Duo, Y.; Liu, X.; Liu, Y.; Tafsirojjaman, T.; Sabbrojjaman, M. Environmental impact on the durability of FRP reinforcing bars. J. Build. Eng. 2021, 43, 102909.
Siddika, A.; Al Mamun, A.; Alyousef, R.; Amran, Y.M. Strengthening of reinforced concrete beams by using fiber-reinforced polymer composites: A review. J. Build. Eng. 2019, 25, 100798.
Ahmed, A.; Guo, S.; Zhang, Z.; Shi, C.; Zhu, D. A review on durability of fiber reinforced polymer (FRP) bars reinforced seawater sea sand concrete. Constr. Build. Mater. 2020, 256, 119484.
Yang, K.; Wu, Z.; Zheng, K.; Shi, J. Shear behavior of regular oriented steel fiber-reinforced concrete beams reinforced with glass fiber polymer (GFRP) bars. Structures 2024, 63, 106339.
Dhahir, M.K. Shear strength of FRP reinforced deep beams without web reinforcement. Compos. Struct. 2017, 165, 223–232.
El-Hacha, R. Prestressing Concrete Structures with FRP Tendons (ACI 440.4R-04). In Proceedings of the Structures Congress 2005: Metropolis and Beyond, New York, NY, USA, 20–24 April 2005; pp. 1–8.
Chowdhury, M.A.; Islam, M.M. Shear strength prediction of FRP-reinforced concrete beams: A state-of the-art review of available models. J. Civ. Environ. Eng. 2015, 5, 1–10.
Vecchio, F.J.; Collins, M.P. Predicting the response of reinforced concrete beams subjected to shear using modified compression field theory. ACI Struct. J. 1988, 85, 258–268.
Classen, M. Shear Crack Propagation Theory (SCPT)—The mechanical solution to the riddle of shear in RC members without shear reinforcement. Eng. Struct. 2020, 210, 110207.
Softened Membrane Model for Reinforced Concrete Elements in Shear. ACI Struct. J. 2002, 99, 460–469.
Liu, C.; Xu, D.; Duanmu, X. Analysis of shear strength influencing factors in reinforced concrete deep beams: A modified calculating model. J. Build. Eng. 2024, 95, 110243.
Ma, C.; Wang, W.; Wang, S.; Guo, Z.; Feng, X. Prediction of shear strength of RC slender beams based on interpretable machine learning. Structures 2023, 57, 105171.
Oller, E.; Marí, A.; Bairán, J.M.; Cladera, A. Shear design of reinforced concrete beams with FRP longitudinal and transverse reinforcement. Compos. Part B Eng. 2015, 74, 104–122.
ACI Committee. Building Code Requirements for Structural Concrete (ACI 318-08) and Commentary; American Concrete Institute: Farmington Hills, MI, USA, 2008.
50010-2010; Code for Design of Concrete Structures. Building Industry Press: Beijing, China, 2010.
Somala, S.N.; Karthikeyan, K.; Mangalathu, S. Time period estimation of masonry infilled RC frames using machine learning techniques. Structures 2021, 34, 1560–1566.
Nguyen, H.D.; Truong, G.T.; Shin, M. Development of extreme gradient boosting model for prediction of punching shear resistance of r/c interior slabs. Eng. Struct. 2021, 235, 112067.
Vu, Q.-V.; Truong, V.-H.; Thai, H.-T. Machine learning-based prediction of CFST columns using gradient tree boosting algorithm. Compos. Struct. 2021, 259, 113505.
Marani, A.; Nehdi, M.L. Machine learning prediction of compressive strength for phase change materials integrated cementitious composites. Constr. Build. Mater. 2020, 265, 120286.
Wakjira, T.G.; Alam, M.S.; Ebead, U. Plastic hinge length of rectangular RC columns using ensemble machine learning model. Eng. Struct. 2021, 244, 112808.
Inel, M. Modeling ultimate deformation capacity of RC columns using artificial neural networks. Eng. Struct. 2007, 29, 329–335.
Mangalathu, S.; Jang, H.; Hwang, S.-H.; Jeon, J.-S. Data-driven machine-learning-based seismic failure mode identification of reinforced concrete shear walls. Eng. Struct. 2020, 208, 110331.
Deng, C.; Xue, X. Hybrid particle swarm optimization and group method of data handling for the prediction of ultimate strength of concrete-filled steel tube columns. Adv. Eng. Softw. 2024, 195, 103708.
Naderpour, H.; Poursaeidi, O.; Ahmadi, M. Shear resistance prediction of concrete beams reinforced by FRP bars using artificial neural networks. Measurement 2018, 126, 299–308.
Chou, J.-S.; Pham, T.-P.; Nguyen, T.-K.; Pham, A.-D.; Ngo, N.-T. Shear strength prediction of reinforced concrete beams by baseline, ensemble, and hybrid machine learning models. Soft Comput. 2019, 24, 3393–3411.
Feng, D.; Wang, W.-J.; Mangalathu, S.; Hu, G.; Wu, T. Implementing ensemble learning methods to predict the shear strength of RC deep beams with/without web reinforcements. Eng. Struct. 2021, 235, 111979.
Moj, M.; Czarnecki, S. Comparative analysis of selected machine learning techniques for predicting the pull-off strength of the surface layer of eco-friendly concrete. Adv. Eng. Softw. 2024, 195, 103710. [Google Scholar] [CrossRef]
Ge, P.; Yang, O.; He, J.; Liu, Z.; Chen, H. Metaheuristic algorithms-optimized machine learning models for FRP-concrete interfacial bond strength prediction. Adv. Eng. Softw. 2025, 208, 103971. [Google Scholar] [CrossRef]
Hwang, S.-H.; Mangalathu, S.; Shin, J.; Jeon, J.-S. Machine learning-based approaches for seismic demand and collapse of ductile reinforced concrete building frames. J. Build. Eng. 2021, 34, 101905. [Google Scholar] [CrossRef]
Mangalathu, S.; Hwang, S.-H.; Choi, E.; Jeon, J.-S. Rapid seismic damage evaluation of bridge portfolios using machine learning techniques. Eng. Struct. 2019, 201, 109785. [Google Scholar] [CrossRef]
Mangalathu, S.; Sun, H.; Nweke, C.C.; Yi, Z.; Burton, H.V. Classifying earthquake damage to buildings using machine learning. Earthq. Spectra 2020, 36, 183–208. [Google Scholar] [CrossRef]
Zhang, C.; Liu, P.; Song, T.; He, B.; Li, W.; Peng, Y. Elastic Modulus Prediction of Ultra-High-Performance Concrete with Different Machine Learning Models. Buildings 2024, 14, 3184. [Google Scholar] [CrossRef]
Alam, M.S.; Sultana, N.; Hossain, S.M.Z. Bayesian optimization algorithm based support vector regression analysis for estimation of shear capacity of FRP reinforced concrete members. Appl. Soft Comput. 2021, 105, 107281. [Google Scholar] [CrossRef]
Nguyen, P.D.; Dang, V.H. Shear strength of FRP-reinforced concrete deep beams: Extension of beam and arch action model based on data-driven analysis. Structures 2025, 74, 108553. [Google Scholar] [CrossRef]
Zhao, J.; Zhu, M.; Xu, L.; Chen, M.; Shi, M. Prediction of Shear Capacity of Fiber-Reinforced Polymer-Reinforced Concrete Beams Based on Machine Learning. Buildings 2025, 15, 1908. [Google Scholar] [CrossRef]
Benavoli, A.; Corani, G.; Mangili, F. Should we really use post-hoc tests based on mean-ranks? J. Mach. Learn. Res. 2016, 17, 152–161. [Google Scholar]
XuanRui, Y. Developing an artificial neural network model to predict the durability of the RC beam by machine learning approaches. Case Stud. Constr. Mater. 2022, 17, e01382. [Google Scholar] [CrossRef]
Sujon, K.M.; Hassan, R.B.; Towshi, Z.T.; Othman, M.A.; Samad, A.; Choi, K. When to Use Standardization and Normalization: Empirical Evidence From Machine Learning Models and XAI. IEEE Access 2024, 12, 135300–135314. [Google Scholar] [CrossRef]
Yang, B.; Liang, B.; Zhou, S.; Qian, Y.; Zheng, R.; Shu, H.; He, P.; Wang, J.; Jiang, L.; Sang, Y.; et al. A novel bio-inspired caterpillar fungus (Ophiocordyceps sinensis) optimizer for SOFC parameter identification via GRNN. Renew. Energy 2026, 256, 123995. [Google Scholar] [CrossRef]
Hassoun, M.H. Fundamentals of Artificial Neural Networks; MIT Press: Cambridge, MA, USA, 1995. [Google Scholar]
Gardner, M.W.; Dorling, S.R. Artificial neural networks (the multilayer perceptron)—A review of applications in the atmospheric sciences. Atmos. Environ. 1998, 32, 2627–2636. [Google Scholar] [CrossRef]
Farnaaz, N.; Jabbar, M.A. Random forest modeling for network intrusion detection system. Procedia Comput. Sci. 2016, 89, 213–217. [Google Scholar] [CrossRef]
Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.Y. LightGBM: A highly efficient gradient boosting decision tree. Adv. Neural Inf. Process. Syst. 2017, 30, 3146–3154. [Google Scholar]
Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Association for Computing Machinery, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
Nguyen, H.; Vu, T.; Vo, T.P.; Thai, H.-T. Efficient machine learning models for prediction of concrete strengths. Constr. Build. Mater. 2021, 266, 120950. [Google Scholar] [CrossRef]
Bui, D.-K.; Nguyen, T.; Chou, J.-S.; Nguyen-Xuan, H.; Ngo, T.D. A modified firefly algorithm-artificial neural network expert system for predicting compressive and tensile strength of high-performance concrete. Constr. Build. Mater. 2018, 180, 320–333. [Google Scholar] [CrossRef]
Degtyarev, V.V. Neural networks for predicting shear strength of CFS channels with slotted webs. J. Constr. Steel Res. 2021, 177, 106443. [Google Scholar] [CrossRef]
ACI 440.1R-15; Guide for the Design and Construction of Structural Concrete Reinforced with Fiber-Reinforced Polymer (FRP) Bars. American Concrete Institute: Farmington Hills, MI, USA, 2015.
GB 50608-2020; Technical Standard for Fiber Reinforced Polymer (FRP) in Construction. Ministry of Housing and Urban-Rural Development of the People’s Republic of China: Beijing, China, 2020.
Li, W.; Huang, W.; Fang, Y.; Zhang, K.; Liu, Z.; Kong, Z. Experimental and theoretical analysis on shear behavior of RC beams reinforced with GFRP stirrups. Structures 2022, 46, 1753–1763. [Google Scholar] [CrossRef]
CSA S806-12; Design and Construction of Building Structures with Fibre-Reinforced Polymers. Canadian Standards Association: Ottawa, ON, Canada, 2012.
Yang, Y.; Zhou, C.; Peng, J.; Li, H.; Dong, Y.; Cai, C. Theory-informed deep neural network-based time-dependent flexural reliability assessment of corroded PC structures. Eng. Struct. 2025, 329, 119819. [Google Scholar] [CrossRef]
Razaqpur, A.G.; Isgor, O.B. Proposed Shear Design Method for FRP-Reinforced Concrete Members without Stirrups. ACI Struct. J. 2006, 103, 93. [Google Scholar] [CrossRef] [PubMed]
Rasheed, M.H.F.; Taha, B.O.; Agha, A.Z.S.; Arbili, M.M.; Abdulrahman, P.I. Shear Capacity of Fiber-Reinforced Polymer (FRP)–Reinforced Concrete (RC) Beams Without Stirrups: Comparative Modeling with FRP Modulus, Longitudinal Ratio, and Shear Span-to-Depth. J. Compos. Sci. 2025, 9, 554. [Google Scholar] [CrossRef]
Ashour, A.F.; Kara, I.F. Size effect on shear strength of FRP reinforced concrete beams. Compos. Part B Eng. 2014, 60, 612–620. [Google Scholar] [CrossRef]
Yu, Y.; Li, S.; Wang, L.; Xian, G. Prediction and interpretation of bond strength between FRP bars and fiber reinforced concrete using machine learning and Shapley Additive exPlanations Analysis. Constr. Build. Mater. 2025, 496, 143797. [Google Scholar] [CrossRef]
Tian, L.; Wang, L.; Xian, G. Machine learning prediction of interfacial bond strength of FRP bars with different surface characteristics to concrete. Case Stud. Constr. Mater. 2024, 21, e03984. [Google Scholar] [CrossRef]
Bentz, E.C.; Massam, L.; Collins, M.P. Shear Strength of Large Concrete Members with FRP Reinforcement. J. Compos. Constr. 2010, 14, 637–646. [Google Scholar] [CrossRef]
Farghaly, A.S.; Benmokrane, B. Shear Behavior of FRP-Reinforced Concrete Deep Beams without Web Reinforcement. J. Compos. Constr. 2013, 17, 10. [Google Scholar] [CrossRef]
Kim, C.H.; Jang, H.S. Concrete Shear Strength of Normal and Lightweight Concrete Beams Reinforced with FRP Bars. J. Compos. Constr. 2014, 18, 9. [Google Scholar] [CrossRef]
Tomlinson, D.; Fam, A. Performance of concrete beams reinforced with basalt FRP for flexure and shear. J. Compos. Constr. 2015, 19, 04014036. [Google Scholar] [CrossRef]
Issa, M.A.; Ovitigala, T.; Ibrahim, M. Shear Behavior of Basalt Fiber Reinforced Concrete Beams with and without Basalt FRP Stirrups. J. Compos. Constr. 2016, 20, 11. [Google Scholar] [CrossRef]
Said, M.; Adam, M.A.; Mahmoud, A.A.; Shanour, A.S. Experimental and analytical shear evaluation of concrete beams reinforced with glass fiber reinforced polymers bars. Constr. Build. Mater. 2016, 102, 574–591. [Google Scholar] [CrossRef]
Maranan, G.; Manalo, A.; Benmokrane, B.; Karunasena, W.; Mendis, P.; Nguyen, T. Shear behaviour of geopolymer-concrete beams transversely reinforced with continuous rectangular GFRP composite spirals. Compos. Struct. 2018, 187, 454–465. [Google Scholar] [CrossRef]
Abed, F.; El Refai, A.; Abdalla, S. Experimental and finite element investigation of the shear performance of BFRP-RC short beams. Structures 2019, 20, 689–701. [Google Scholar] [CrossRef]
Jumaa, G.B.; Yousif, A.R. Size effect on the shear failure of high-strength concrete beams reinforced with basalt FRP bars and stirrups. Constr. Build. Mater. 2019, 209, 77–94. [Google Scholar] [CrossRef]
Krall, M.; Polak, M. Concrete beams with different arrangements of GFRP flexural and shear reinforcement. Eng. Struct. 2019, 198, 109333. [Google Scholar] [CrossRef]
Tran, T.T.; Pham, T.M.; Hao, H. Effect of hybrid fibers on shear behaviour of geopolymer concrete beams reinforced by basalt fiber reinforced polymer (BFRP) bars without stirrups. Compos. Struct. 2020, 243, 112236. [Google Scholar] [CrossRef]
Abed, F.; Sabbagh, M.K.; Karzad, A.S. Effect of basalt microfibers on the shear response of short concrete beams reinforced with BFRP bars. Compos. Struct. 2021, 269, 114029. [Google Scholar] [CrossRef]
Al-Hamrani, A.; Alnahhal, W.; Elahtem, A. Shear behavior of green concrete beams reinforced with basalt FRP bars and stirrups. Compos. Struct. 2021, 277, 114619. [Google Scholar] [CrossRef]
Peng, F.; Xue, W. Shear Behavior of Post-Tensioned Concrete Beams with Draped FRP Tendons and without Transverse Reinforcement. J. Compos. Constr. 2021, 25, 13. [Google Scholar] [CrossRef]
Gu, Z.; Hu, Y.; Gao, D.; Wang, T.; Yang, L. Shear behavior and strength prediction of HFRP reinforced concrete beams without stirrups. Eng. Struct. 2023, 297, 117030. [Google Scholar] [CrossRef]
Peng, F.; Cai, Y.; Yi, W.; Xue, W. Shear behavior of two-span continuous concrete deep beams reinforced with GFRP bars. Eng. Struct. 2023, 290, 116367. [Google Scholar] [CrossRef]
Zhou, L.; Zheng, Y.; Di, B.; Lv, J.; Taylor, S. Shear behaviour of SWSS-SCC beams reinforced with GFRP bars and Stirrups: Experimental and analytical investigations. Structures 2023, 56, 104946. [Google Scholar] [CrossRef]
Liao, B.; Du, Y.; Zhou, R.; Rahman, M.Z.; Zhu, D. Shear Behavior of Seawater-Sea Sand Concrete Beams Reinforced with BFRP Bars and Stirrups. J. Compos. Constr. 2024, 28, 21. [Google Scholar] [CrossRef]
Qi, H.; Jiang, H.; Wang, B.; Zhuge, P. Experimental Study on Shear Performance of Concrete Beams Reinforced with Externally Unbonded Prestressed CFRP Tendons. Fibers 2024, 12, 23. [Google Scholar] [CrossRef]
Razaqpur, A.G.; Isgor, B.O.; Greenaway, S.; Selley, A. Concrete Contribution to the Shear Resistance of Fiber Reinforced Polymer Reinforced Concrete Members. J. Compos. Constr. 2004, 8, 452–460. [Google Scholar] [CrossRef]
Alam, M.S.; Hussein, A. Size Effect on Shear Strength of FRP Reinforced Concrete Beams without Stirrups. J. Compos. Constr. 2013, 17, 507–516. [Google Scholar] [CrossRef]
El Refai, A.; Abed, F. Concrete Contribution to Shear Strength of Beams Reinforced with Basalt Fiber-Reinforced Bars. J. Compos. Constr. 2016, 20, 13. [Google Scholar] [CrossRef]
Salman, O.; Abed, F.; Alhoubi, Y. Shear performance of GFRP reinforced UHPC short beams. Compos. Struct. 2024, 351, 118637. [Google Scholar] [CrossRef]
Nematzadeh, M.; Hosseini, S.-A.; Ozbakkaloglu, T. The combined effect of crumb rubber aggregates and steel fibers on shear behavior of GFRP bar-reinforced high-strength concrete beams. J. Build. Eng. 2021, 44, 102981. [Google Scholar] [CrossRef]
Jumaa, G.B.; Yousif, A.R. Size Effect in Shear Failure of High Strength Concrete Beams without Stirrup reinforced with Basalt FRP Bars. KSCE J. Civ. Eng. 2019, 23, 1636–1650. [Google Scholar] [CrossRef]
Tureyen, A.K.; Frosch, R.J. Shear Tests of FRP-Reinforced Concrete Beams without Stirrups. ACI Struct. J. 2002, 99, 427–434. [Google Scholar] [CrossRef]
Rizzo, A.; De Lorenzis, L. Behavior and capacity of RC beams strengthened in shear with NSM FRP reinforcement. Constr. Build. Mater. 2009, 23, 1555–1567. [Google Scholar] [CrossRef]
Kim, G.; Sim, J.; Oh, H. Shear strength of strengthened RC beams with FRPs in shear. Constr. Build. Mater. 2008, 22, 1261–1270. [Google Scholar] [CrossRef]
Moussa, A.M.; Said, H.O.; Khodary, F.; Hassanean, Y.A. Shear behavior of high-strength concrete beams reinforced with carbon fiber-reinforced polymer bars. Eng. Struct. 2025, 325, 119411. [Google Scholar] [CrossRef]
Song, B.; Jin, L.; Du, X.L. Experimental study and calculation of shear capacity of FRP-reinforced concrete short beams. J. Southeast Univ. (Nat. Sci. Ed.) 2024, 54, 1080–1088, (In Chinese with English Abstract). [Google Scholar] [CrossRef]
Li, C.; Wei, F.; Xing, X.; Zhu, H. Shear performance of GFRP bars reinforced basalt fiber concrete deep beams. J. Huazhong Univ. Sci. Tech. (Nat. Sci. Ed.) 2025, 53, 104–110, (In Chinese with English Abstract). [Google Scholar] [CrossRef]
Zhou, R.; Zhu, D.J. Shear behavior of BFRP bars reinforced seawater sea-sand concrete beam. J. Railw. Sci. Eng. 2023, 20, 3396–3405, (In Chinese with English Abstract). [Google Scholar] [CrossRef]
Song, S.T.; Cao, T.; Duan, N.; Liu, D.Y. Experiment and analysis on shear capacity of CFRP reinforced steel fiber concrete beams. J. Disaster Prev. Mitig. Eng. 2021, 41, 1012–1019, (In Chinese with English Abstract). [Google Scholar] [CrossRef]
Chen, C.; Li, X.; Li, C.; Zhou, Y.; Sui, L. Optimized flax FRP stirrup in reinforced concrete beam: Material property and shear performance. Compos. Struct. 2022, 302, 116219. [Google Scholar] [CrossRef]
Han, S.; Fan, C.; Zhou, A.; Ou, J. Shear behavior of concrete beams reinforced with corrosion-resistant and ductile longitudinal steel-FRP composite bars and FRP stirrups. Eng. Struct. 2022, 278, 115520. [Google Scholar] [CrossRef]
Younis, A.; El-Sherif, H.; Ebead, U. Shear strength of recycled-aggregate concrete beams with glass-FRP stirrups. Compos. Part C Open Access 2022, 8, 100257. [Google Scholar] [CrossRef]
Al-Hamrani, A.; Alnahhal, W. Shear behavior of basalt FRC beams reinforced with basalt FRP bars and glass FRP stirrups: Experimental and analytical investigations. Eng. Struct. 2021, 242, 112612. [Google Scholar] [CrossRef]
Zhou, F.; Chen, Y.M.; Zhu, D.J. Study on shear behavior of ultra-high performance seawater sea-sand concrete beams with FRP bars. J. Hunan Univ. (Nat. Sci.) 2023, 50, 159–168, (In Chinese with English Abstract). [Google Scholar] [CrossRef]
Lü, J.M.; Pan, J.R.; Di, B.; Zheng, Y.; Zhu, W.; Li, J.; Zhang, Z.; Zhou, L. Experimental study on shear capacity of seawater sea-sand and self-compacting concrete beams with GFRP bars and stirrups. Concrete 2021, 385, 53–57, (In Chinese with English Abstract). [Google Scholar] [CrossRef]
Yuan, Y.; Wang, Z.Y.; Wang, D.Y. Experimental study on the shear performance of concrete beams reinforced with new type closed winding GFRP stirrups. Acta Mater. Compos. Sin. 2022, 39, 5074–5085, (In Chinese with English Abstract). [Google Scholar] [CrossRef]
Xing, Z.; Zhu, Y.; Shao, Y.; Ma, E.; Chung, K.-F.; Chen, Y. Experimental and numerical research on shear performance of GFRP bar reinforced seawater sea-sand concrete deep beams without stirrups. Case Stud. Constr. Mater. 2024, 20, e03142. [Google Scholar] [CrossRef]

Figure 1. Schematic Diagram of Shear Mechanism.

Figure 2. Sample Distribution and Cumulative Percentage of FRP-RC Beams.

Figure 3. Pearson correlation coefficients.

Figure 4. Procedures of model development and prediction.

Figure 5. Schematic Diagram of 10-Fold CV.

Figure 6. Cross-validation with ten folds.

Figure 7. Flowchart of dataset partitioning.

Figure 8. Flowchart of the CFO optimization algorithm.

Figure 9. CFO Iterative Fitness Value.

Figure 10. Comparison of prediction accuracy of machine learning models.

Figure 11. Radar chart of the performance evaluation indicators for ML models.

Figure 12. Comparison of prediction accuracy of code formulas.

Figure 13. Summary graph of the SHAP values based on predictions by the MLP-XGBoost model.

Figure 14. The influence of input features on the SHAP values.

Figure 15. Comparison of the prediction equations SHAP, MLR, and model I.

Table 1. Statistical Distribution of Database Parameters.

Input Parameter	b/mm	d₀/mm	f_cu/MPa	a/d	ρ_f/%	E_f/GPa	f_fu/MPa	ρ_v/%	E_fv/GPa	f_fv/MPa
Max	500.00	1111.00	144.00	7.00	6.50	240	2438.67	3.35	300.00	2438.14
Min	100.00	80.00	24.50	0.50	0.02	37.00	397.00	0.071	40.00	160.00
Average	168.11	251.71	41.86	2.07	1.90	89.12	940.14	0.40	135.08	703.89
SD	60.90	146.43	19.44	0.95	1.37	62.05	410.76	0.38	73.52	338.94
Skewness	3.83	3.89	3.21	1.10	1.07	6.63	1.33	1.39	1.13	0.75
Kurtosis	21.74	21.23	14.83	5.89	4.12	59.32	4.90	3.77	2.98	1.88
Normality (H)	1	1	1	1	1	1	1	1	1	1

H = 0: The data follows a normal distribution. H = 1: The data does not follow a normal distribution.

Table 2. Parameter sensitivity analysis.

	R²	RMSE	MAE
a = 0.05	0.986	24.254	13.26
a = 0.1	0.992	20.198	11.131
a = 0.15	0.991	20.897	11.247
a = 0.2	0.987	21.24	12.335
a = 0.25	0.985	22.597	12.813

Table 3. Formulas for performance evaluation metrics.

Indicators	Formulas
MAE	$\mathrm{MAE} = \frac{1}{n} \sum_{i = 1}^{n} \left| V_{pred, i} - V_{\exp, i} \right|$
MAPE	$\mathrm{MAPE} = \frac{1}{n} \sum_{i = 1}^{n} \left| \frac{V_{pred, i} - V_{\exp, i}}{V_{\exp, i}} \right| \times 100\%$
MSE	$\mathrm{MSE} = \frac{1}{n} \sum_{i = 1}^{n} {(V_{pred, i} - V_{\exp, i})}^{2}$
RMSE	$\mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(V_{pred, i} - V_{\exp, i})}^{2}}$
R²	$R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(V_{\exp, i} - V_{pred, i})}^{2}}{\sum_{i = 1}^{n} {(V_{\exp, i} - {\bar{V}}_{\exp})}^{2}}$
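
The five indicators in Table 3 can be evaluated directly from paired experimental and predicted shear capacities. The following is a minimal sketch in plain Python, not the authors' implementation; the function name `regression_metrics` and the sample values are hypothetical. It assumes R² is computed with the mean of the experimental values in the denominator, which is the standard definition of the coefficient of determination.

```python
import math


def regression_metrics(v_exp, v_pred):
    """Return MAE, MAPE (in %), MSE, RMSE and R^2 for paired values.

    v_exp  : experimental shear capacities (must be non-zero for MAPE)
    v_pred : predicted shear capacities, same length as v_exp
    """
    n = len(v_exp)
    if n == 0 or n != len(v_pred):
        raise ValueError("inputs must be non-empty and of equal length")

    errors = [p - e for p, e in zip(v_pred, v_exp)]

    mae = sum(abs(err) for err in errors) / n
    mape = sum(abs(err / e) for err, e in zip(errors, v_exp)) / n * 100.0
    mse = sum(err ** 2 for err in errors) / n
    rmse = math.sqrt(mse)

    # R^2 compares the residual sum of squares with the spread of the
    # experimental values around their own mean.
    mean_exp = sum(v_exp) / n
    ss_res = sum(err ** 2 for err in errors)
    ss_tot = sum((e - mean_exp) ** 2 for e in v_exp)
    r2 = 1.0 - ss_res / ss_tot

    return {"MAE": mae, "MAPE": mape, "MSE": mse, "RMSE": rmse, "R2": r2}


# Hypothetical example values, in kN.
metrics = regression_metrics([100.0, 200.0, 300.0], [110.0, 190.0, 300.0])
```

Because RMSE is the square root of MSE, the two always rank models identically; MAPE is the only indicator here that is scale-free, so it is the one to compare across datasets with different capacity ranges.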

Table 4. Repeated cross-validation metrics of the MLP-XGBoost model.

Repetition	1	2	3	4	5
Average	32.89	34.97	29.27	33.15	34.56
Standard deviation	4.15	3.76	4.21	5.55	4.02
Coefficient of variation	0.13	0.11	0.14	0.17	0.12
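
The coefficient of variation reported in Table 4 is the standard deviation divided by the average for each repetition. The short sketch below recomputes it from the tabulated averages and standard deviations; the variable names are hypothetical, and the underlying fold-level scores are not given in the table, so only the ratio itself is checked.

```python
# (average, standard deviation, reported coefficient of variation)
# copied from Table 4, one tuple per repetition.
table4 = [
    (32.89, 4.15, 0.13),
    (34.97, 3.76, 0.11),
    (29.27, 4.21, 0.14),
    (33.15, 5.55, 0.17),
    (34.56, 4.02, 0.12),
]


def coefficient_of_variation(average, std_dev):
    """Return the ratio of the standard deviation to the average."""
    return std_dev / average


# Recompute the ratio for each repetition of the cross-validation.
recomputed = [coefficient_of_variation(avg, sd) for avg, sd, _ in table4]

# Each recomputed value should agree with the reported one to two decimals.
consistent = all(
    abs(cov - reported) <= 0.005
    for cov, (_, _, reported) in zip(recomputed, table4)
)
```

A coefficient of variation that stays in a narrow band across repetitions indicates that the model's error does not depend strongly on how the folds are drawn.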

Table 5. Hyperparameters of the machine learning model.

ML Model	Hyperparameter	Optimal Value
MLP	Layer Sizes	23, 4
MLP	Lambda	0.031749
DT	MinLeafsize	2
DT	max_depth	13
DT	max_features	9
DT	criterion	squared_error
RF	Tree_num	166
RF	MinLeafsize	2
LightGBM	Num_leaves	6
LightGBM	Max_depth	12
LightGBM	Learning_rate	0.9
LightGBM	Num_max_iter	63
LightGBM	Num_early_stop	11
XGBoost	maxiter	98
XGBoost	Depth_max	7
XGBoost	Min_child	3

Table 6. Performance evaluation metrics.

ML Model	MAE		MAPE		MSE		RMSE		R²
ML Model	Train	Test	Train	Test	Train	Test	Train	Test	Train	Test
MLP	17.614	20.760	20.7	20.9	1125.591	2195.472	33.550	46.856	0.986	0.961
DT	27.23	29.693	13.1	13.5	2309.271	3674.684	48.095	60.619	0.934	0.907
RF	28.992	35.625	17.9	29.2	2079.088	4595.462	45.597	67.790	0.972	0.916
LightGBM	23.442	24.552	10.7	11.5	1719.850	2631.534	41.471	51.298	0.978	0.956
XGBoost	12.944	14.525	8.01	8.5	528.689	858.343	22.993	29.297	0.990	0.985
MLP-XGBoost	10.994	11.693	6.8	7.16	380.228	528.790	19.499	22.995	0.994	0.987

Table 7. Code formulas.

Equation Source	Equations
ACI440.1R-15 [48]	$\frac{V_{u}}{b h_{0}} = \frac{2}{5} k \sqrt{f_{c}} + ρ_{sv} f_{fv}$ $k = \sqrt{2 ρ_{f} n_{f} + {(ρ_{f} n_{f})}^{2}} - ρ_{f} n_{f}$
CSA S806-12 [51]	$\frac{V_{u}}{b h_{0}} = 0.05 α k_{m} k_{r} k_{a} k_{s} {(f_{c})}^{\frac{1}{3}} + \cot θ ρ_{sv} f_{fv}$ $k_{m} = \sqrt{\frac{V h_{0}}{M}} \leq 1.0$ $k_{r} = 1 + {(E_{f} ρ_{f})}^{\frac{1}{3}}$ $1.0 \leq k_{a} = \frac{2.5 V h_{0}}{M} \leq 2.5$ $k_{s} = \frac{750}{450 + d} \leq 1.0$
GB 50608-2020 [49]	$\frac{V_{u}}{b h_{0}} = 0.86 k f_{t} + ρ_{sv} f_{yv}$ $k = \sqrt{2 ρ_{f} n_{f} + {(ρ_{f} n_{f})}^{2}} - ρ_{f} n_{f}$
Model I [50]	$V_{u} = \frac{1.75}{(a / d) + 1} f_{t} b h_{0} + (a / d) (- 0.19 \cdot {(a / d)}^{3} + 1.97 \cdot {(a / d)}^{2} - 6.48 \cdot (a / d) + 8.52) f_{fy} \frac{h_{0}}{s} A_{fv}$

Note: f_cu denotes the compressive strength of concrete; b represents the beam width; k is the ratio of the depth of the neutral axis to the depth of the reinforcement; ρ_f denotes the longitudinal reinforcement ratio; n_f represents the ratio of the elastic modulus of the longitudinal reinforcement to that of concrete; f_fv is the tensile strength of the FRP stirrup; f_t denotes the tensile strength of concrete; ε_x denotes the concrete strain at the depth of the interface; V and M represent the design shear force and design bending moment under load combination, respectively; θ is the angle of the diagonal compressive stress; d_v denotes the effective shear depth; k_m, k_r, k_a and k_s represent the influence factors accounting for the effects of interface moment, reinforcement stiffness, arch action, and member size on the shear capacity of the member, respectively; and α is the concrete density influence factor.
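
To make the role of the neutral axis ratio k concrete, the sketch below transcribes the ACI 440.1R-15 row of Table 7 into Python. It is an illustration of the tabulated expression only, not a design tool: the function names are hypothetical, no strength reduction or material factors are applied, ρ_f is assumed to be entered as a fraction rather than a percentage, and the modular ratio n_f = 3.0 used in the example is an assumed value because the concrete modulus is not tabulated here.

```python
import math


def neutral_axis_ratio(rho_f, n_f):
    """k = sqrt(2*rho_f*n_f + (rho_f*n_f)^2) - rho_f*n_f.

    rho_f : longitudinal reinforcement ratio as a fraction (1.9 % -> 0.019)
    n_f   : ratio of the FRP elastic modulus to the concrete elastic modulus
    """
    rn = rho_f * n_f
    return math.sqrt(2.0 * rn + rn ** 2) - rn


def aci_row_shear_stress(f_c, rho_f, n_f, rho_sv=0.0, f_fv=0.0):
    """Nominal shear stress V_u / (b * h_0) in MPa, as written in Table 7.

    f_c    : concrete compressive strength in MPa
    rho_sv : stirrup reinforcement ratio as a fraction (0 for no stirrups)
    f_fv   : tensile strength of the FRP stirrup in MPa
    """
    k = neutral_axis_ratio(rho_f, n_f)
    return 0.4 * k * math.sqrt(f_c) + rho_sv * f_fv


# Example with the average rho_f from Table 1 (1.90 %) and assumed
# values n_f = 3.0 and f_c = 36 MPa, for a beam without stirrups.
k_example = neutral_axis_ratio(0.019, 3.0)
stress_example = aci_row_shear_stress(36.0, 0.019, 3.0)
```

Because k grows with the product ρ_f·n_f, a stiffer or more heavily reinforced section has a deeper compression zone and therefore a larger concrete contribution in this expression; the same k appears in the GB 50608-2020 row.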

Table 8. Existing formulas.

Model	MAE	MAPE	MSE	RMSE	R²
MLP-XGBoost	12.409	0.073	672.132	25.926	0.988
ACI440.1R-15	150.987	0.711	70,395.881	265.322	0.166
CSA S806-12	189.109	1.030	102,780.348	320.594	0.232
GB 50608-2020	140.694	0.634	66,803.374	258.463	0.181
Model I	114.474	0.519	44,263.89	210.390	0.337

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

