Shear Strength Prediction of Slender Steel Fiber Reinforced Concrete Beams Using a Gradient Boosting Regression Tree Method

: For the design or assessment of concrete structures that incorporate steel ﬁber in their elements, the accurate prediction of the shear strength of steel ﬁber reinforced concrete (SFRC) beams is critical. Unfortunately, traditional empirical methods are based on a small and limited dataset, and their abilities to accurately estimate the shear strength of SFRC beams are arguable. This drawback can be reduced by developing an accurate machine learning based model. The problem with using a high accuracy machine learning (ML) model is its interpretation since it works as a black-box model that is highly sophisticated for humans to comprehend directly. For this reason, Shapley additive explanations (SHAP), one of the methods used to open a black-box machine learning model, is combined with highly accurate machine learning techniques to build an explainable ML model to predict the shear strength of SFRC slender beams. For this, a database of 330 beams with varying design attributes and geometries was developed. The new gradient boosting regression tree (GBRT) machine learning model was compared statistically to experimental data and current shear design models to evaluate its performance. The proposed GBRT model gives predictions that are very similar to the experimentally observed shear strength and has a better and unbiased predictive performance in comparison to other existing developed models. The SHAP approach shows that the beam width and effective depth are the most important factors, followed by the concrete strength and the longitudinal reinforcement ratio. In addition, the outputs are also affected by the steel ﬁber factor and the shear-span to effective depth ratio. The ﬁber tensile strength and the aggregate size have the lowest effect, with only about 1% on average to change the predicted value of the shear strength. By building an accurate ML model and by opening its black-box, future researchers can focus on some attributes rather than others. robust. This indicates that the proposed model can be employed with high conﬁdence within the ranges of independent variables used to construct the model.


Introduction
Shear failure of reinforced concrete beams is a significant concern due to its brittle and sudden nature [1]. Traditional steel stirrups used as shear reinforcement have been shown to enhance shear capacity efficiently as well as prevent concrete failure. However, incorporating stirrups in narrow, asymmetrical, or congested areas might be challenging. Placing concrete can become a concern when the spacing between stirrups is small, leading to voids in the concrete [2]. Furthermore, traditional stirrups need much labor effort, resulting in more significant building expenses.
In recent years, steel fibers (SF) have acquired significant impetus when utilized in suitable volume fractions due to their potential to replace minimum shear reinforcement [3,4]. Therefore, it has been suggested that implementing SF in the construction industry could provide several advantages. First, SF can enhance the shear resistance by reducing cracks' width due to the transmission of tensile loads across diagonal cracks, as indicated by Dinh [3]. Concrete shrinkage behavior and post-cracking toughness can also be improved using SF [5,6]. Additionally, the SF inclusion enhances the resistance of the dowel action, which is due to an increase in the tensile strength of the concrete along with the reinforcements in the splitting plane [7].
Various experimental and computational investigations on steel fiber reinforced concrete (SFRC) have been conducted in the literature to examine the shear strength capability [8]. Furthermore, an analysis of the ultimate behavior of SFRC beams was carried out using finite elements and experimental modeling approach utilizing ANSYS software [9]. Additionally, the shear strength of concrete beams with fibers was predicted using a basic physical design model developed by Spinella et al. [10], which considered crack width and shear crack slips. Moreover, utilizing the slenderness ratio, an equation based on fundamental mechanic principles has been proposed to predict the shear strength of SFRC beams [11]. Additionally, SFRC has been studied extensively by conducting several shear experiments on prismatic beams [12].
Several empirical formulas for estimating the shear strength of SFRC beams have been developed in prior research during the last four decades. References [13,14] provide an overview of the most current shear design models and design guidelines for SFRC beams without stirrups. However, the range of validity of such empirical models is limited, which constitute a significant disadvantage. These models are built on the basis of a small number of data specimens, and their accuracy, when applied to additional data cases that fall beyond their range of validity, is arguable. According to a comparison made by references [13,15], it is still challenging to accurately predict the shear capacity of SFRC beams using the various shear resistance models for SFRC. In addition, shear strength predictions from different methodologies differ from one another and still differ from experimentally determined shear strengths. Given these uncertainties, a proper assessment of the reliability of SFRC beams in the event of shear failure is essential. For this, the predictive model must show the highest possible accuracy and the lowest possible variability.
An increase in the use of artificial intelligence (AI) has taken place in recent years due to innovations in computing. Machine learning (ML) models are built using extensive databases. As a result, they can significantly increase generalization capacity and accuracy for measuring the strength of concrete built with various mixing proportions. Therefore, researchers have been inspired by ML and AI to develop new models that can effectively predict the shear capacity of SFRC beams, while overcoming the abovementioned disadvantages. However, despite the emergence of those models, some researchers have focused on some ML methods rather than others for predicting the shear strength of SFRC beams. Support vector machines and artificial neural networks (ANNs), for example, have been widely utilized to predict the mechanical strength of SFRC beams [5,[16][17][18][19][20]. Another popular technique utilized for modeling the shear strength of SFRC beams is gene expression programming (GEP) [21][22][23][24][25][26]. However, there is still room for improvement of the prediction of the shear capacity of SFRC beams, even though ANN and GEP algorithms have been widely utilized in various researches concerning the shear capacity prediction of SFRC beams.
In this research, a highly efficient and widely used ML technique, called gradient boosting regression tree (GBRT), is adopted to simulate the process of predicting the shear strength of SFRC beams. GBRT is a powerful ML technique that employs several weak learners and is specifically intended to minimize overfitting issues [27]. Recent research has shown that GBRT shows an excellent prediction performance when compared with other ML algorithms [28][29][30]. Many researches have used GBRT to tackle civil engineering challenges [31][32][33][34].
While some ML-based techniques, such as random forest, neural networks, and support vector machine, can effectively solve regression problems, their operation is difficult to comprehend as these models are often referred to as "black-box" models [35]. As a result, ML-based models should be better described or interpreted to help researchers to better grasp the underlying mechanisms, such as how input parameters impact outputs and increase the persuasiveness of the created models. The Shapley additive explanations (SHAP) framework introduced by Lundberg and Lee [36] can be utilized to understand ML models. In SHAP, features are quantified based on their impact on the predictions. In addition to being able to explain the ML models globally, SHAP can also explain the ML models locally by looking at how the features impact the outputs for a single sample. The research in [37] used GBRT and SHAP for "Understanding the Factors Influencing Pedestrian Walking Speed over Elevated Facilities". Three ensemble ML models were developed by [38] to predict the creep behavior of concrete, and SHAP was utilized to interpret the predictions of the models. Last but not least, the XGBoost model was developed in reference [39] for load-carrying capacity prediction, and SHAP was used to interpret the ML models.
This study proposes an explainable ML-based technique for predicting the shear strength of SFRC beams. To the best of the authors' knowledge, the GBRT model is used for the first time to forecast the shear strength of SFRC slender beams. Furthermore, by interpreting the ML model through SHAP, variables impacting the shear strength of the SFRC beams are quantitatively investigated. For this, a database with 330 beam tests, whose shear strength was reported in the literature, was prepared. In addition, the GBRT model's optimal hyperparameters were identified using a five-fold cross-validation procedure. Additionally, the GBRT model's performance was compared to other empirical and MLbased equation models presented by other researchers. Finally, the SHAP approach was used to interpret the GBRT model that was built. The effects of several factors on the GBRT model outputs were also explored. Figure 1 depicts the whole workflow used to develop the suggested approach. The development of a database with SFRC slender beams was the initial stage. After that, some engineering features and filtering methods were applied to the gathered data. The third stage consisted in randomly dividing the data into two sets: one for training and the other for testing. The GBRT model was trained using the training set, and the model was validated using the testing set. The appropriate hyperparameters of the GBRT model were determined by using the five-fold cross-validation procedure during the training stage, this being the fourth stage. The fifth stage includes using the testing set to verify the performance of the model after it has been optimized for the hyperparameters. If the model performance is satisfactory, it can be termed as a final predictive model. During the final process, the SHAP approach is used to understand the model. Quantitatively, the SHAP approach was used to examine how features impact GBRT model predictions on a large dataset (global interpretation) and on a single sample (sample-specific interpretation or local interpretation). As a result, it is possible to analyze the factors influencing the outputs.

Dataset
The shear strength of SFRC beams without shear stirrups has been studied in several experiments. Ref. [13] recently compiled a wide database with 488 experiments on SFRC beams without stirrups. Non-slender beams with a shear-span to effective depth ratio of a/d < 2.5 and beams with shear-flexural mode failure were filtered out of the initial 488 trials, leaving a subset containing 330 experimental tests. The database with 330 experiments was used to build the model and is summarized in Appendix A. The evaluation database contains rectangular and flanged slender beams. The database specimens failed substantially by shear compression and diagonal stress with an a/d ratio higher than 2.5. The experimental database includes shear beams with varying geometry and reinforcement. Based on several studies [10,11,13,14] and to build an efficient ML model, several critical parameters that affect the shear strength of SFRC beams were chosen. Table 1 shows the statistical properties of the evaluation database's primary parameters. The primary parameter is the shear strength V u as the output variable, whereas the beam effective depth d, beam width b w , longitudinal reinforcement ratio ρ, concrete compressive strength f c , aggregate size d a , shear span to effective depth ratio a/d, tensile strength of fiber f t , and steel fiber factor F sf were considered as predictors. The steel fiber factor depends on the percentage volume V f , diameter d f , and fiber length L f (Equation (1)).

Dataset
The shear strength of SFRC beams without shear stirrups has been studied in several experiments. Ref. [13] recently compiled a wide database with 488 experiments on SFRC beams without stirrups. Non-slender beams with a shear-span to effective depth ratio of a/d < 2.5 and beams with shear-flexural mode failure were filtered out of the initial 488 trials, leaving a subset containing 330 experimental tests. The database with 330 experiments was used to build the model and is summarized in Appendix A. The evaluation database contains rectangular and flanged slender beams. The database specimens failed substantially by shear compression and diagonal stress with an a/d ratio higher than 2.5. The experimental database includes shear beams with varying geometry and reinforcement. Based on several studies [10,11,13,14] and to build an efficient ML model, several critical parameters that affect the shear strength of SFRC beams were chosen. Table 1 shows the statistical properties of the evaluation database's primary parameters. The primary parameter is the shear strength Vu as the output variable, whereas the beam effective depth d, beam width bw, longitudinal reinforcement ratio ρ, concrete compressive strength fc, aggregate size da, shear span to effective depth ratio a/d, tensile strength of fiber ft, and steel fiber factor Fsf were considered as predictors. The steel fiber factor depends on the percentage volume Vf, diameter df, and fiber length Lf (Equation (1)).

=
(1)  The histograms for the input and output variables from the evaluation database are presented in Figure 2. In general, the database's range of parameters matches what can be found in real design scenarios, as illustrated in Table 1 and Figure 2. Despite the lack of data for beams with large sizes, the dataset is thought to be representative of most real-world applications and design conditions covered by existing design codes.

Data-Splitting Procedure
The developed database from the previous section was divided into two parts to implement the ML model: the training dataset and the testing dataset. The GBRT model was developed using the training database, whereas the same predictive model was evaluated using the testing database. As much as possible, a statistically significant association was ensured between inputs of the training and testing datasets while dividing the database into subsets. Most of the developed database (80% of 330 tests) was used for training, while the remaining part was used for model testing (66 tests).

Data-Splitting Procedure
The developed database from the previous section was divided into two parts to implement the ML model: the training dataset and the testing dataset. The GBRT model was developed using the training database, whereas the same predictive model was evaluated using the testing database. As much as possible, a statistically significant association was ensured between inputs of the training and testing datasets while dividing the database into subsets. Most of the developed database (80% of 330 tests) was used for training, while the remaining part was used for model testing (66 tests).
As can noticed from Figure 2, some of the predictors and outcome variables do not obey the normal distribution curve. As a result, these variables need a feature transformation to prevent larger numeric ranges from dominating smaller numeric ranges [40]. In this case, the log transformation was applied to bring right-or left-skewed distributions to approximately normal distributions. Figure 3 shows the distribution of beams' effective depths (d) before and after log transformation.
As can noticed from Figure 2, some of the predictors and outcome variables do not obey the normal distribution curve. As a result, these variables need a feature transformation to prevent larger numeric ranges from dominating smaller numeric ranges [40]. In this case, the log transformation was applied to bring right-or left-skewed distributions to approximately normal distributions. Figure 3 shows the distribution of beams' effective depths (d) before and after log transformation.

GBRT Model Development
GBRT uses a statistical boosting method to improve the classic decision tree approach. In this method, instead of creating a single "optimal" model, this strategy aggregates several "weak" models to generate a single "strong" consensus model [41]. When using GBRT, the existing residuals are used to build new decision trees sequentially. Fundamentally, this is a form of a functional gradient descent approach for creating sequential models. Adding a new tree at each stage reduces the loss function, thereby improving the prediction [42].
Training data are assumed to exist in the form of a training set {( , )} , in which represents the input features and represents the shear capacity. For example, the squared error, the absolute error, the Huber error, etc., are all possible loss functions ( , ( )) that can be used to measure how much the predicted ( ) differs from the true shear strength . The GBRT framework assumes that D decision trees will be built, and hence it begins with an initial model ( ). For each iteration d = 1, 2, ..., D, compensating the residues is equivalent to optimizing the expansion coefficients and as shown in Equation (2): where argmin is an operation that finds the argument that gives the minimum value from a target function, and , is a pre-selected feasible loss function measuring the amount of how the predicted value ( ) deviates from the true response y. The weighting coefficients and the base learners are fitted to the training data in a greedy manner as follows: Equation (2), on the other hand, is difficult to solve directly. Even so, since the gradient-boosting model is additive, ℎ( ; ) may be seen as an increment along ℎ( ; ). It is possible to find the optimum using the least squares method, based on the principle of gradient descent:

GBRT Model Development
GBRT uses a statistical boosting method to improve the classic decision tree approach. In this method, instead of creating a single "optimal" model, this strategy aggregates several "weak" models to generate a single "strong" consensus model [41]. When using GBRT, the existing residuals are used to build new decision trees sequentially. Fundamentally, this is a form of a functional gradient descent approach for creating sequential models. Adding a new tree at each stage reduces the loss function, thereby improving the prediction [42].
Training data are assumed to exist in the form of a training set {(x i , y i )} N i=1 , in which x i represents the input features and y i represents the shear capacity. For example, the squared error, the absolute error, the Huber error, etc., are all possible loss functions L(y, F(x)) that can be used to measure how much the predicted F(x) differs from the true shear strength y. The GBRT framework assumes that D decision trees will be built, and hence it begins with an initial model F 0 (x). For each iteration d = 1, 2, . . . , D, compensating the residues is equivalent to optimizing the expansion coefficients ρ d and α d as shown in Equation (2): where argmin is an operation that finds the argument that gives the minimum value from a target function, and L[y i , F d−1 ] is a pre-selected feasible loss function measuring the amount of how the predicted value F(x) deviates from the true response y. The weighting coefficients and the base learners are fitted to the training data x in a greedy manner as follows: Equation (2), on the other hand, is difficult to solve directly. Even so, since the gradient-boosting model is additive, ρh(x i ; α) may be seen as an increment along h(x i ; α). It is possible to find the optimum α d using the least squares method, based on the principle of gradient descent: where β is a weight factor and r i is the negative gradient evaluated using the previous model.
One-dimensional optimization can be used to further improve the gradient-descent step size or weight of the obtained decision tree: Finally, according to Equation (3), the prior model will be added to the newly evaluated residue model. Algorithm 1 represents the pseudocode for the generic gradient boosting.
Gradient boosting allows for a wide variety of smooth loss functions, including AdaBoost, LogitBoost, and L2Boosting [43]. Because of its simplicity and coherence in solving regression problems, the squared loss function is employed in this study: Regularization techniques are typically used during the training stage to reduce overfitting and to boost the model's generalization capacity. In the following equation, Gradient boosting uses a new variable called ν d to regulate the model's update rate, which is known as shrinkage or learning rate: The model is updated more slowly when ν d is smaller. According to [44], utilizing small learning rates leads to better model generalization without shrinkage; however, this comes at the cost of greater computing time because more decision trees are required. In addition, numerous additional parameters that are strongly related to the final tree's structure and model complexity, such as depths (maximum number of splits) and the number of trees D, must be fine-tuned to maximize the performance of the model.

Cross-Validation
The division of the complete dataset into three subsets-training, validation, and testing-is a standard approach for evaluating the performance of ML models. While the training set is used to complete the learning process, the validation set tracks the performance of the model. As a final step, the model's extrapolation skills are tested by running it through a set of samples that it has never seen before (testing set) [40]. However, dividing data into three subsets reduces the size of the dataset, which might result in an inadequately trained model. As a result, cross-validation is a typical strategy for avoiding over-reduction of the training set, particularly for small datasets [40]. Cross-validation is performed in various ways, the most common of which is omitting random data to verify the model. K-fold cross-validation was used in this research. Cross-validation with K-fold is a resampling technique that divides data into k subsets, one for validation and the other k-1 for training.

Hyperparameter Tuning
The tuning of hyperparameters is an essential step in developing reliable ML models. Tuning an ML model reduces overfitting and increases the model adaptability to new data [45]. Choosing the best hyperparameters is also a key component in improving the accuracy of the model [46]. Many ways to automate hyperparameter selection have been developed to prevent manual tuning, including grid search and random search hyperparameter optimization [47]. The domain of the possible values evaluated in the search effort distinguishes these techniques from each other. Random search methods choose distinct hyperparameter values randomly for a given number of iterations, while grid search investigates all potential values in a pre-defined domain for the hyperparameters [47]. The Scikit-learn package in Python [48] was used to explore possible values of hyperparameters using a grid search technique with five-fold cross-validation (GridSearchCV). Figure 4 depicts the five-fold cross-validation used in this work for training and for the hyperparameter selection of the model.
The tuning of hyperparameters is an essential step in developing reliable ML models. Tuning an ML model reduces overfitting and increases the model adaptability to new data [45]. Choosing the best hyperparameters is also a key component in improving the accuracy of the model [46]. Many ways to automate hyperparameter selection have been developed to prevent manual tuning, including grid search and random search hyperparameter optimization [47]. The domain of the possible values evaluated in the search effort distinguishes these techniques from each other. Random search methods choose distinct hyperparameter values randomly for a given number of iterations, while grid search investigates all potential values in a pre-defined domain for the hyperparameters [47]. The Scikit-learn package in Python [48] was used to explore possible values of hyperparameters using a grid search technique with five-fold cross-validation (GridSearchCV). Figure  4 depicts the five-fold cross-validation used in this work for training and for the hyperparameter selection of the model.

Model Performance Metrics
Various statistical measures, such as R 2 , mean absolute error (MAE), root mean squared error and (RMSE), were used to evaluate the performance of the built ML-based models. For example, the best model has an R 2 value close to 1, while RMSE and MAE values are close to zero. In order to obtain the MAE value, the absolute difference between actual and predicted values must be averaged. The equation for MAE is the following one: Various statistical measures, such as R 2 , mean absolute error (MAE), root mean squared error and (RMSE), were used to evaluate the performance of the built ML-based models. For example, the best model has an R 2 value close to 1, while RMSE and MAE values are close to zero. In order to obtain the MAE value, the absolute difference between actual and predicted values must be averaged. The equation for MAE is the following one: When the R 2 value is 1, the predicted and true/actual values are perfectly aligned. R 2 has the following mathematical representation: where y i obs and y i pre are the actual output and predicted values, respectively, and y −obs is the average of all observed data.
The difference between the predicted and actual values is the error and the RMSE is calculated as the square root of the average squared errors. The RMSE is computed as follows: The model uncertainty, or standard deviation (scatter) of the model error, and the mean (bias) are used to evaluate the built models. Due to the lack of knowledge of the problem, conservative assumptions, and mathematical simplifications, the model uncertainty is defined as a model inability to effectively reflect and express a physical phenomenon (in this case, the shear strength). The model uncertainty is described as a random variable with a standard deviation, mean value, and probability distribution in the structural reliability framework. Shear reliability analysis has been proven to be significantly impacted by it. In this work, the predictive model uncertainty related to beam x is equal to the ratio between the experimental and the predicted shear strength, as stated in Equations (12) and (13).
For a single beam test x, M x is the model uncertainty. The predictors for the GBRT model (a/d, d, b w , ρ, f c , f t , d a , and F sf ) are represented by X. The mean and standard deviation of the model uncertainty are represented by σ M and µ M , respectively.
It is better to choose a model with a mean µ M close to 1 and a standard deviation σ M close to 0 for the model uncertainty. µ M > 1 indicates that the model underestimates the shear capacity of the beam specimen and, consequently, underestimates its failure load. However, if µ M < 1, it suggests that the model overestimates the shear resistance.

Shapley Additive Explanations (SHAP) Framework
Lately, Explainable black-box ML models have attracted more study interest because they allow users to trust the created ML models by helping them to comprehend the ML models' involved mechanism. SHAP is a method for explaining "black-box" ML models. Lundberg and Lee [36] were the first to suggest SHAP, which is based on the notion of Shapley game theory. The SHAP seeks to assess the contribution of each input variable or feature to the observation, and it can determine whether the contribution of each feature is positive or negative. To help with the global and local explanation of ML models, SHAP can calculate the contribution from each feature for every observation. SHAP creates a model of explanation that can be written as: where z ∈ {0, 1} K and K represent the number of input features; φ j ∈ R is the SHAP value for the j-th feature; φ 0 is the constant if all inputs are missing. The SHAP value for the j-th feature can be calculated as: where F is the set of all features and x S is the value of the input.

Programming Languages and Softwares
In this research, the Python programming language combined with the Scikit-learn library was used to build the system for the estimation of the shear capacity and the interpretation of the GBRT model. Python is a high-level, easy-to-learn, open-source, extensible, and object-oriented programming language (OOP). Python is also an interpreted and versatile language widely used in many fields, such as for building independent programs using graphical interfaces and web applications. In addition, it can be used as a scripting language to control the performance of many programs. It is often recommended for beginners in programming to learn this language because it is among the fastest programming languages to learn [49].
On the other hand, Scikit-learn [48] is an ML library in Python. It contains many algorithms and methods used in the field of ML, such as classification, clustering, and regression, in addition to being used in the stages of data processing and model evaluation. It was built based on the libraries of Scipy, Numpy, Matplotlib, and many others. This study implemented the data preprocessing, filtering techniques, and GBRT modeling using the Python programming language and the Scikit-learn library. At the same time, the plots and figures were created using OriginLab software.

K-Fold Cross-Validation
Before running the model, there is the need to fine-tune several of GBRT's hyperparameters. The hyperparameters of the GBRT model were optimized using a grid search process and a five-fold cross-validation. The most critical hyperparameters for the GBRT model are the n estimators and the learning rate, representing the number of the model's weak learners and the weights assigned to each estimator, respectively. Additionally, the GBRT model prediction performance can be considerably affected by its max depth parameter, which indicates the complexity of each tree, and its subsample parameter, which represents the fraction of samples to be used for fitting the individual base learners [50]. The tuned values for each of the four hyperparameters are shown in Table 2. The coefficient of determination (R 2 ) was closely examined as a statistical error to obtain hyperparameters with the maximum accuracy while minimizing over-fitting. To execute GBRT modeling and tuning, the Scikit-learn program [48] was used. A total of 264 data records was used to train the GBRT model, and 66 samples were used to test it. The five-fold cross validation results are shown in Figure 5. Again, there is no noticeable fluctuation in the results of the five folds, and the overall accuracy remains excellent. For example, Fold 1 has a minimum R 2 value of 0.9580, and Fold 2 has a maximum R 2 value of 0.9852. Table 3

GBRT Performance on Testing Set
The prediction performance of the proposed method can be tested after the hyperparameters have been identified. The prediction findings are shown in Figure 6, with the X and Y axes representing the experimental and predicted shear strengths, respectively. The training and testing outcomes are represented by the blue dot and red triangle, respectively. In most cases, the difference between the predicted and actual shear strength is within a margin of error of 20% or less. There were three further iterations of the experiment, each using a different mix of training and testing datasets. The predictions for all four experiments are reported in Table 4. The testing RMSE and MAE were always less than 30 and 17, respectively. The mean absolute percentage error, or MAPE, was less than 14%. That is to say, for every sample and instance, the deviation between the predicted and actual shear strength was less than 17 kN (equivalently 14%). These results show that the GBRT approach can be considered an effective tool for estimating the shear capacity of SFRC beams.

GBRT Performance on Testing Set
The prediction performance of the proposed method can be tested after the hyperparameters have been identified. The prediction findings are shown in Figure 6, with the X and Y axes representing the experimental and predicted shear strengths, respectively. The training and testing outcomes are represented by the blue dot and red triangle, respectively. In most cases, the difference between the predicted and actual shear strength is within a margin of error of 20% or less. There were three further iterations of the experiment, each using a different mix of training and testing datasets. The predictions for all four experiments are reported in Table 5. The testing RMSE and MAE were always less than 30 and 17, respectively. The mean absolute percentage error, or MAPE, was less than 14%. That is to say, for every sample and instance, the deviation between the predicted and actual shear strength was less than 17 kN (equivalently 14%). These results show that the GBRT approach can be considered an effective tool for estimating the shear capacity of SFRC beams.     Figure 7 presents the histogram of the predicted shear strength (V pred ) from the GBRT model compared to the actual shear strength (V act ) (case 1). Again, most of the shear strengths predicted by GBRT are within a margin of 20% or less of error. The standard deviation (σ M ) and mean value (µ M ) of the ratio V act /V pred are taken into consideration when evaluating the accuracy of the GBRT model. The standard deviations (σ M ) for the training and testing data were 0.058 and 0.145, respectively, whereas the mean values (µ M ) were 1.002 and 0.980, respectively. A normally distributed relationship between the GBRTpredicted values and the experimental data shows that the error is dispersed randomly. deviation ( ) and mean value ( ) of the ratio Vact/Vpred are taken into consideration when evaluating the accuracy of the GBRT model. The standard deviations ( ) for the training and testing data were 0.058 and 0.145, respectively, whereas the mean values ( ) were 1.002 and 0.980, respectively. A normally distributed relationship between the GBRT-predicted values and the experimental data shows that the error is dispersed randomly.

The Reliability of the GBRT Model Prediction
For a total of 330 SFRC beams, the statistical evaluation of the various models [1, 21,26,[51][52][53] used to estimate the shear strength, and also for the model developed in the present study, can be found in Table 6. The formulas from the models used for the comparative analysis can be found in Table 7. Ashour et al. [52] proposed two sets of equations based on a regression model for observed data gathered from 18 high-strength SFRC beam specimens. An essential parameter, the fiber factor (F), which accounts for the influence of steel fiber size and shape, was incorporated into the first equation, taken from the ACI Building Code's shear equation. In addition, to account for the role of reinforcement and concrete in the shear capacity, the authors incorporated the shear span to effective depth (a/d) ratio in their equation. The second equation of Ashour et al. [52] is based on a modified version of Zsutty's equation [54], which includes the fiber factor. Deep beams ( < 2.5) and slender beams (

The Reliability of the GBRT Model Prediction
For a total of 330 SFRC beams, the statistical evaluation of the various models [1, 21,26,[51][52][53] used to estimate the shear strength, and also for the model developed in the present study, can be found in Table 5. The formulas from the models used for the comparative analysis can be found in Table 6. Ashour et al. [52] proposed two sets of equations based on a regression model for observed data gathered from 18 high-strength SFRC beam specimens. An essential parameter, the fiber factor (F), which accounts for the influence of steel fiber size and shape, was incorporated into the first equation, taken from the ACI Building Code's shear equation. In addition, to account for the role of reinforcement and concrete in the shear capacity, the authors incorporated the shear span to effective depth (a/d) ratio in their equation. The second equation of Ashour et al. [52] is based on a modified version of Zsutty's equation [54], which includes the fiber factor. Deep beams ( a d < 2.5) and slender beams (  [51] omits some of the essential parameters, such as the ratio l f /d f and F, which substantially impact the shear capacity of SFRC. The referred author used 41 experiments to validate his equation. Rather than incorporating the actual reinforcement ratio, the equation from Greenough and Nehdi [21] simplifies a formula derived from genetic programming by using a percentage for ρ. Additionally, 208 SFRC beam test results from earlier research were analyzed using multi-expression programming to obtain the formula presented by Sarveghadi et al. [26]. The authors have produced two sets of equations: one set contains expressions specific to high-strength concrete, and the other set is a composite equation for both types of concrete (normal-and high-strength). An equation for predicting the shear strength of SFRC beams based on 293 previous experiments was recently published by Sabetifar and Nematzadeh [53]. The previous two studies [26,53] built their models based on genetic programming (GP). GP is a machine learning-based approach for developing nonlinear regression. The Darwinian ideas of natural selection and genetic spreading of features chosen by biologically growing organisms are the foundations of GP. Even though both researches [26,53] were published recently, the datasets utilized to train and test the models were quite constrained, resulting in models with only limited application. The equations for the shear capacity of SFRC beams proposed in the previous referred studies are given in Table 6.  Table 6. Previous equations for the shear capacity of SFRC beams.
[53] Sabetifar and Nematzadeh where: f c and f t are the compressive and tensile strengths of concrete, respectively; F is the fiber factor; η o is the fiber orientation factor, τ is the average fiber-matrix interfacial bond stress.
The mean value for the ratio of the experimental values to the model predicted values (µ M ) and their variability (CV) was used to evaluate the performance of the model. The prediction is more accurate when µ M is near to 1 and when the CV is low. As can be observed in Table 5, five of the µ M values are higher than 1.00, implying that the models from [1, 21,51,52] underestimate the shear capacity of SFRC beams, while the models from [26,53] slightly overestimate the shear resistance. The explanation for this observation might be linked to the fact that the previously referred equations were produced based on a limited set of data with low variation between specimens' properties. Furthermore, the equations proposed in the referred literature omit some critical factors that contribute to the shear strength of SFRC.
With µ M = 0.996, STD = 0.08, and COV = 12%, it can be stated that the proposed model in this research beat all previous models. As a result, the model has a reduced error rate and a higher degree of linearity between the anticipated and actual values. Furthermore, the GBRT model has the lowest coefficient of variation (COV) when compared to the other models, indicating that its projected values have the slightest variance around the mean.
The experimental to prediction ratios for each input variable are presented in Figure 8, in order to check if a bias exists between the prediction of the GBRT model and one or more input variables. From Figure 8, it seems that a significant trend or preference toward these variables does not exist. The accuracy of the GBRT prediction for the shear strength seems robust. This indicates that the proposed model can be employed with high confidence within the ranges of independent variables used to construct the model. the mean.
The experimental to prediction ratios for each input variable are presented in Fig  8, in order to check if a bias exists between the prediction of the GBRT model and on more input variables. From Figure 8, it seems that a significant trend or preference tow these variables does not exist. The accuracy of the GBRT prediction for the shear stren seems robust. This indicates that the proposed model can be employed with high co dence within the ranges of independent variables used to construct the model.

Interpretation of the GBRT Model
The SHAP approach was used to understand the developed GBRT model and how its inputs impact its outputs. The summary of the SHAP values and the feature importance factor for all of the input features can be seen in Figure 9. Each dot on the graphs represents a dataset instance and its corresponding feature SHAP value. The x-axis indicates each feature's effectiveness on the dependent variable, while the y-axis shows the model's ranking of features by significance. A red dot denotes a high feature value, corresponding to a higher SHAP value. The significance of each feature is determined as the mean absolute SHAP values for the whole dataset, as shown in Figure 9a. Figure 9a shows that both the beam width (b w ) and the effective depth (d) are the most important factors, followed by the concrete strength (fc) and the longitudinal reinforcement ratio (ρ). In addition, the outputs are also affected by the steel fiber factor (F sf ) and the shear-span to effective depth ratio (a/d). Finally, the fiber tensile strength (f t ) and the aggregate size (d a ) have the lowest effect. According to the results in Figure 9a, b w and d have the ability to alter the estimated value of shear strength by an average of 16%, while f t and d a have the lowest ability with only about 1%. As can be seen in Figure 9b, most of the features mentioned above positively influence the model outcome, which indicates that when one of those features increases, the shear capacity of the SFRC slender beam increases. The only exception among those features is a/d. This observation can be explained due to the influence of the arch action, which depicts the compressive force created along with the beam supports and the loading points. Loads are borne in part by the arch action in the area of small shear spans. The applied shear is resisted by the arch action, which leads to a decreased shear for higher a/d [55]. The conclusions presented in this section can assist constructers and designers in determining the importance of each feature in SFRC slender beams for the output shear strength, and whether it is positive or negative.

Interpretation of the GBRT Model
The SHAP approach was used to understand the developed GBRT model and how its inputs impact its outputs. The summary of the SHAP values and the feature importance factor for all of the input features can be seen in Figure 9. Each dot on the graphs represents a dataset instance and its corresponding feature SHAP value. The x-axis indicates each feature's effectiveness on the dependent variable, while the y-axis shows the model's ranking of features by significance. A red dot denotes a high feature value, corresponding to a higher SHAP value. The significance of each feature is determined as the mean absolute SHAP values for the whole dataset, as shown in Figure 9a.
(a) (b) Figure 9. The relative importance of each feature and SHAP summary plot: (a) relative importance; (b) SHAP summary plot. Figure 9a shows that both the beam width (bw) and the effective depth (d) are the most important factors, followed by the concrete strength (fc) and the longitudinal reinforcement ratio (ρ). In addition, the outputs are also affected by the steel fiber factor (Fsf) and the shear-span to effective depth ratio (a/d). Finally, the fiber tensile strength (ft) and the aggregate size (da) have the lowest effect. According to the results in Figure 9a, bw and d have the ability to alter the estimated value of shear strength by an average of 16%, while ft and da have the lowest ability with only about 1%. As can be seen in Figure 9b, most of the features mentioned above positively influence the model outcome, which indicates that when one of those features increases, the shear capacity of the SFRC slender beam increases. The only exception among those features is a/d. This observation can be explained due to the influence of the arch action, which depicts the compressive force created along with the beam supports and the loading points. Loads are borne in part by the arch action in the area of small shear spans. The applied shear is resisted by the arch action, which leads to a decreased shear for higher a/d [55]. The conclusions presented in this section can assist constructers and designers in determining the importance of each feature in SFRC slender beams for the output shear strength, and whether it is positive or negative.

Conclusions
An investigation on the use of an explainable ML method for the prediction of the shear strength of SFRC slender beams was conducted in this study. Using SHAP to interpret the ML model, the factors impacting the shear strength were examined. A database with 330 SFRC slender beam tests was created and randomly divided into testing and training sets. Using a five-fold cross-validation procedure paired with a grid search strat-

Conclusions
An investigation on the use of an explainable ML method for the prediction of the shear strength of SFRC slender beams was conducted in this study. Using SHAP to interpret the ML model, the factors impacting the shear strength were examined. A database with 330 SFRC slender beam tests was created and randomly divided into testing and training sets. Using a five-fold cross-validation procedure paired with a grid search strategy, optimal hyperparameters of the GBRT model were found based on the training dataset. The testing dataset was used to validate the performance of the built GBRT model. Meanwhile, six empirical and machine learning-based equation models were chosen and compared to comprehensively analyze the performance of the GBRT model. Additionally, to analyze the GBRT model globally across the whole dataset, the SHAP approach was used. The SHAP values were used to discuss factors that impact the model results. The following are the main key conclusions that can be derived from the research findings:

•
The GBRT model predicts the shear capacity of SFRC slender beams with high accuracy. The model has R 2 values of 0.963 and 0.972 for the testing and training sets, respectively. In addition, both the training and testing sets of the GBRT model have low RMSE and MAE values, indicating that the prediction capability of the GBRT model can be trusted with high confidence; • A comparison between the predicted and experimental shear strengths was also performed, using previously established equations from the literature. The results show that the predicted values from previous models do not apply to a wide range of data and have a high variance; • Most of the proposed equations from the literature show a mean value for the model uncertainty larger than 1, implying that they all underestimate the shear capacity of the SFRC slender beams from the database; • With low error measurements and µ M near unity, the results showed that the GBRT method surpassed the other models mentioned in this study.

Conflicts of Interest:
The authors declare no conflict of interest.