The Efficiency of Hybrid Intelligent Models in Predicting Fiber-Reinforced Polymer Concrete Interfacial-Bond Strength

Fiber-reinforced polymer (FRP) has several benefits, in addition to excellent tensile strength and low self-weight, including corrosion resistance, high durability, and easy construction, making it among the most optimum options for concrete structure restoration. The bond behavior of the FRP-concrete (FRPC) interface, on the other hand, is extremely intricate, making the bond strength challenging to estimate. As a result, a robust modeling framework is necessary. In this paper, data-driven hybrid models are developed by combining state-of-the-art population-based algorithms (bald eagle search (BES), dynamic fitness distance balance-manta ray foraging optimization (dFDB-MRFO), RUNge Kutta optimizer (RUN)) and artificial neural networks (ANN) named “BES-ANN”, “dFDB-MRFO -ANN”, and “RUN-ANN” to estimate the FRPC interfacial-bond strength accurately. The efficacy of these models in predicting bond strength is examined using an extensive database of 969 experimental samples. Compared to the BES-ANN and dFDB-MRFO models, the RUN-ANN model better estimates the interfacial-bond strength. In addition, the SHapley Additive Explanations (SHAP) approach is used to help interpret the best model and examine how the features influence the model’s outcome. Among the studied hybrid models, the RUN-ANN algorithm is the most accurate model with the highest coefficient of determination (R2 = 92%), least mean absolute error (0.078), and least coefficient of variation (18.6%). The RUN-ANN algorithm also outperformed mechanics-based models. Based on SHAP and sensitivity analysis method, the FRP bond length and width contribute more to the final prediction results.


Introduction
Fiber-reinforced polymer (FRP) has been widely employed as a successful approach for reinforcing concrete structures [1][2][3] and steel structures [4,5].FRP comprises numerous layers, and essential performance advantages, such as excellent mechanical properties, fatigue, creep, and corrosion resistance [6,7].According to relevant studies, FRP composite is a potential steel substitute due to its lightweight, high strength, and improved fatigue and corrosion resistance [8].A considerable amount of research effort has been allotted to investigate the bearing performance of FRP reinforced concrete structures under static and dynamic conditions.For example, Bagheri et al. [9] performed seismic analyses to obtain the optimum length of the FRP on the retrofitted columns of reinforced concrete frames.Various bond strength models based on empirical or semi-empirical methodologies have been developed [10][11][12][13][14].Previous research has primarily relied on experimental findings from the single-lap shear test to develop bond strength formulations [15,16].Since the valuable bond length of the FRP was not taken into account, the prediction powers of some models were limited [17][18][19].
On the other hand, most empirical frameworks may yield considerable results when applied to new data because they are generated based on the available experimental results.Furthermore, some simplified hypotheses must be used to construct these empirical strength models.Machine learning (ML), on the other hand, is one of the six branches of artificial intelligence (AI) that allows us to train machines how to complete tasks by showing them how they should be performed [20].Several studies have recently been reported on machine learning algorithms for damage estimation and prediction in civil engineering.Based on the machine learning prediction outcomes, these studies can be divided into two groups [21,22]: (1) techniques based on regression, when the output is a continuous quantity (e.g., shear force), and (2) approaches based on classification, where the output is a discrete variable (e.g., failure mode).These algorithms can account for complex underlying interactions among input and output parameters.There is some research on the interfacial bond strength of predicting FRP-concrete (FRPC) using machine learning models [23][24][25].Su et al. [26] used three distinct ML approaches, such as a support vector machine (SVM) and a multiple linear regression (MLR), to establish the link between the influential variables and the bond strength.They utilized random and grid search to determine the best hyperparameters for the models.Chen et al. [27] developed an FRPC interfacial bond strength prediction model built on 520 tested samples using an ensemble learning approach called gradient boosted regression trees.Basaran et al. [28] employed ML techniques, such as Gaussian process regression, artificial neural networks (ANN), SVM regression, regression tree, and multiple linear regression for FRPC bond strength prediction and compared them with experimental results.Zhou et al. [29] built artificial neural networks for estimating fiber-reinforced polymer-concrete interfacial bond strength using the backpropagation neural network method.The carbon FRP steel interfacial bond strength is predicted by Chen et al. [30] using gradient enhanced decision trees and random forests.They used 113 carbon FRP steel single-shear test samples to train the data-driven model.While modern ML-based frameworks in structural engineering problems have produced good results, new hybrid intelligent models (ANN with optimization algorithm) for predicting FRPC interfacial bond strength have yet to be discovered.
The feed-forward neural network, which is trained using backpropagation, is the most popular of the ANNs, but they have a variety of drawbacks, such as falling into local minima and learning at a slow rate [31].According to the literature, such flaws can be solved by employing effective optimization strategies [32].As a result, the research reported in this paper aims to address the following issues: (1) providing an accurate and efficient machine learning model for estimating the interfacial bonding strength of FRPC using hybrid approaches; (2) Examining the forecasting accuracy of the best machine learning model against that of existing mechanics-based models and (3) using the SHapley Additive exPlanation approach [33] to describe the importance and participation of input variables that influence FRPC interfacial bond strength.Hybrid algorithms are created by combining state-of-the-art optimization algorithms, such as bald eagle search (BES), dynamic fitness distance balance (dFDB)-manta ray foraging optimization (MRFO) dFDB-MRFO, and RUNge Kutta optimizer (RUN) with AAN to achieve this purposes.

Data of Single-Lap Shear Tests
Nine hundred sixty-nine test data from available single-lap shear experiments (Figure 1) on FRPC interfacial bond strength have been employed to build appropriate hybrid ML models.This laboratory information was collected by Zhou et al. [29] from available studies [13,14,[34][35][36][37][38][39][40][41][42].The design parameters or input variables retrieved from the gathered specimens are used in machine learning models.However, while establishing input parameters, parameters with no practical value for practicing engineers and parameters without meaning for existing structures were not considered, such as loading rate.Besides the test results, the general form of many existing bond strength models based on either the effective bond length model or fracture mechanics was reviewed by Zhou et al. [29] when input parameters were determined.Here, six design criteria are considered as input variables in these tests, and they are divided into two groups; (1) geometric dimensions: the fiber-reinforced polymer sheet (FRPS) thickness (t f ), FRPS width (b f ), the bond length of the FRPS (l f ), and width of the concrete (b c ); (2) material properties: concrete compressive strength ( f c ) and FRPS elastic modulus (E f ).P u shown in Figure 1 is the ultimate bond strength of a sample.The experimental dataset is randomly divided into a testing set (20%) and a training set (80%) for the testing and training phase.To enhance the robustness of the database, the original repository has been processed according to the basic rules: (1) One sample should be eliminated from a set with the same test parameters if the ultimate bond strength differs by more than fifteen percent from the other test data and the other data points differ by less than fifteen percent.(2) If the difference between two samples in a set of data under the identical test parameters is greater than fifteen percent, the entire set of data must be discarded.The 969 samples were lowered to 855 using the filtering procedures outlined above.Table 1 presents the statistical properties of the dataset.Figure 2a shows statistical distributions of input variables.The correlation coefficient for the inputs is shown in Figure 2b.The degree of the relationship between two separate input variables is shown by each matrix element.Correlations have a very low absolute value.As a result, Multicollinearity is not a problem in this case.

Artificial Neural Network
Neurons or nodes generate the architecture of ANNs.Multiple inputs ar to each neuron, and these inputs are mixed, and after processing, the result of t nation is an output.The nodes are linked together, with each cell′s output ser input for the next cell.Additionally, hidden layers are the layers between the output layers.The connections between the neurons are weighted, reflecting bution amplitude of each neuron to the neurons in the adjacent layer to whic nected.When the network object is built, the weights are randomized at first, b modified when the ANN is trained.Here, to enhance the predictive accuracy the optimization algorithms are applied to fine-tune the weights and biases o model.

Overview of BES-ANN Algorithm
The bald eagle search (BES) algorithm [43] simulates the behavior of b

Artificial Neural Network
Neurons or nodes generate the architecture of ANNs.Multiple inputs are available to each neuron, and these inputs are mixed, and after processing, the result of that combination is an output.The nodes are linked together, with each cell s output serving as the input for the next cell.Additionally, hidden layers are the layers between the input and output layers.The connections between the neurons are weighted, reflecting the contribution amplitude of each neuron to the neurons in the adjacent layer to which it is connected.When the network object is built, the weights are randomized at first, but they are modified when the ANN is trained.Here, to enhance the predictive accuracy of ANNs, the optimization algorithms are applied to fine-tune the weights and biases of the ANN model.

Overview of BES-ANN Algorithm
The bald eagle search (BES) algorithm [43] simulates the behavior of bald eagles while hunting.As a result, this algorithm has three stages: picking the search space, searching inside the selected search area, and swooping.Within the designated search zone, bald eagles identify and select the ideal place (in terms of volume of food) where they can seek prey during the select stage.Equation ( 1) is a mathematical representation of this behavior.
where r is a random value in (0, 1), α is a constant with a value of 2, p best is the best position based on previous experience, p mean denotes that the eagles have consumed all of the data from the previous points, and p(i) is the position at iteration ith.Bald eagles look for prey within the designated search space and travel in various directions inside a spiral zone (Figure 3) to speed up their search in the search phase.The best swoop location is represented mathematically in Equation (2).
where rand is a random number in (0, 1].The flowchart of the BES-ANN algorithm is shown in Figure 4. ls 2022, 15, x FOR PEER REVIEW spiral zone (Figure 3) to speed up their search in the search pha is represented mathematically in Equation ( 2). ( Where rand is a random number in (0,1] .The flowchart of shown in Figure 4.

Overview of dFDB-MRFO-ANN Algorithm
A Manta ray foraging algorithm is a bio-inspired method that differs from other meta-heuristic search (MHS) techniques due to the nutritional tactics that are important in determining the life cycle of the search process.Manta ray foraging optimization (MRFO) uses a powerful selection method, so-called dynamic fitness-distance balance (dFDB), in its design called dFDB-MRFO [44].The dFDB features a dynamically changing weight parameter, which is the most essential characteristic of the dFDB algorithm.Consequently, the system is now capable of adjusting quickly and effectively to the search spaces for a variety of situations.The dFDB-MRFO has three powerful nutritional strategies [44]: (1) chain foraging: all of the manta rays are lined up.Those behind manta rays can, therefore, harvest plankton that the front manta ray cannot (Equation ( 3)); (2) cyclone foraging: Manta rays swim in deep water in a spiral shape toward plankton (Equation ( 4)); (3) somersault foraging: A food source′s location is chosen as a pivot point.Manta rays try to improve their position by rotating around this pivot (Equation ( 5)).

Overview of dFDB-MRFO-ANN Algorithm
A Manta ray foraging algorithm is a bio-inspired method that differs from other meta-heuristic search (MHS) techniques due to the nutritional tactics that are important in determining the life cycle of the search process.Manta ray foraging optimization (MRFO) uses a powerful selection method, so-called dynamic fitness-distance balance (dFDB), in its design called dFDB-MRFO [44].The dFDB features a dynamically changing weight parameter, which is the most essential characteristic of the dFDB algorithm.Consequently, the system is now capable of adjusting quickly and effectively to the search spaces for a variety of situations.The dFDB-MRFO has three powerful nutritional strategies [44]: (1) chain foraging: all of the manta rays are lined up.Those behind manta rays can, therefore, harvest plankton that the front manta ray cannot (Equation (3)); (2) cyclone foraging: Manta rays swim in deep water in a spiral shape toward plankton (Equation ( 4)); (3) somersault foraging: A food source s location is chosen as a pivot point.Manta rays try to improve their position by rotating around this pivot (Equation ( 5)).
where x d i (t) is a vector of a position for the dth dimension at a time t, r is randomly generated vectors in (0, 1], β is the weight coefficient, T is the total number of iterations, r i is a random number between (0, 1], and N is the search space dimension.The flowchart of the dFDB-MRFO-AAN algorithm is shown in Figure 5. of iterations, i r is a random number between (0,1] , and N is the search space dimen- sion.The flowchart of the dFDB-MRFO-AAN algorithm is shown in Figure 5.

Overview of RUN-ANN Algorithm
RUNge Kutta optimizer (RUN) [45] is a novel swarm-based framework with stochastic elements.RUN′s main concept is based on the Runge Kutta (RK) method′s calculated slope, which is given by Equation ( 6), where y is a function of x [46].
In the first step, for a population of size N, N positions are produced at random.Each member ( n x ) in the population is a D-dimensional solution to an optimization problem.
The RK technique is used by solutions to update their positions at each iteration (Equation ( 7)).

Overview of RUN-ANN Algorithm
RUNge Kutta optimizer (RUN) [45] is a novel swarm-based framework with stochastic elements.RUN's main concept is based on the Runge Kutta (RK) method's calculated slope, which is given by Equation ( 6), where y is a function of x [46].
In the first step, for a population of size N, N positions are produced at random.Each member (x n ) in the population is a D-dimensional solution to an optimization problem.The RK technique is used by solutions to update their positions at each iteration (Equation ( 7)).i f rand < 0.5 where rand is a random value between 0 and 1, x ri is a random position, x b is the best position that achieved so far, x lbest is the best solution of the iteration, a and b are two constant numbers, i is iteration number, Max(i) is the total iteration number, and ϕ is a random number in (0, 1].In the RUN algorithm, enhanced solution quality (ESQ) is utilized to increase the solutions' quality and prevent local optimum solutions in each algorithm iteration.In ESQ, the mean of three random choices and the best position are used to generate a new solution.The reader is directed to Ref. [45] for more information on the RUN method.The flowchart of the RUN-AAN algorithm is shown in Figure 6.
Materials 2022, 15, x FOR PEER REVIEW 9 of 19 Where rand is a random value between 0 and 1, ri x is a random position, b x is the best position that achieved so far, lbest x is the best solution of the iteration, a and b are two constant numbers, i is iteration number, () Max i is the total iteration number, and  is a random number in (0,1] .In the RUN algorithm, enhanced solution quality (ESQ) is utilized to increase the solutions′ quality and prevent local optimum solutions in each algorithm iteration.In ESQ, the mean of three random choices and the best position are used to generate a new solution.The reader is directed to [45] for more information on the RUN method.The flowchart of the RUN-AAN algorithm is shown in Figure 6.The mean absolute error (MAE, Equation ( 8)), the coefficient of determination (R 2 , Equation ( 9)), and coefficient of variation (C.O.V, Equation ( 10)) are utilized to assess the performance of hybrid algorithms.The mean absolute error (MAE, Equation ( 8)), the coefficient of determination (R 2 , Equation ( 9)), and coefficient of variation (C.O.V, Equation (10)) are utilized to assess the performance of hybrid algorithms.
where n presents the number of samples, y i is the recorded value, y i * is the predicted value, y is the mean of the total of samples, and σ is standard deviation results.

Results and Discussion
The neural network structure considered in this study has only three layers (input, hidden, and output layer).Parameters that must be determined by trial-and-error are the number of neurons in the hidden layer and the neural network activation function.ANNs with many neurons and functions in the middle layer are created and evaluated to discover the ideal structure for an ANN.Following the training and testing phases, networks with the Tansig function are selected for the middle and output layers (codes written in Matlab [47]).In addition, the number of neurons in the neural network s hidden layer is increased from two to 30.Changes in the number of neurons in the inner layer are significant because they show the interaction between input parameters.This means that increasing the number of neurons and creating more relationships if these relationships are not appropriate may reduce the efficiency of the neural network and the accuracy of network prediction.Figure 7 shows changes in models' performance (coefficient of determination-R 2 ) versus changes in the number of neurons for different hybrid models.The maximum value of R 2 for BES-ANN, dfDB-MRFO-ANN, and RUN-ANN is 0.91, 0.88, and 0.92 with 18, 10, and 30 neurons, respectively.Figure 7d shows the results of the feedforward neural network trained using backpropagation.All hybrid models outperform the feed-forward neural network trained using backpropagation.
Figure 8 shows the experimentally measured (P u,exp ) to analytically-predicted (P u,pre ) ratio of the interfacial bond strength as a function of the FRPS thickness (t f ) and the FRPS width (b f ). Figure 8 depicts how hybrid approaches suffer from variability when estimating the effect of FRPS thickness on interfacial bond strength.Figure 8 also reveals that the novel RUN-ANN model, on the other hand, minimizes such predictions by reducing diversity.This enhanced performance is further validated by monitoring the impact of FRPS width (b f ).Table 2 summarizes the average of the models' errors.The standard deviation is a number that expresses how far the values are spread out.A low standard deviation indicates that the majority of the data points are near to the average.Therefore, the coefficient of variation (C.O.V) also is computed.C.O.V represents the ratio of the standard deviation to the mean, and it is a useful statistic for comparing the degree of variation from one data series to another [20].The mean of the RUN-ANN algorithm s errors (P u,exp /P u,pre ) is quite near to a value of 1.The RUN-ANN also has the lowest standard deviation.The quantitative measures of the hybrid models indicate that the RUN-ANN algorithm is more in line with the experimental data.The value of standard deviation and C.O.V are high for the BES-ANN and dfDB-MRFO-ANN models, which means that there is a large amount of variability among the data points (P u,exp /P u,pre ).The RUN-ANN algorithm also has the lowest MAE (0.078), whereas MAE is 0.084 and 0.098 for the BES-ANN and dfDB-MRFO-ANN models.
icant because they show the interaction between the input parameters.This means that increasing the number of neurons and creating more relationships if these relationships are not appropriate may reduce the efficiency of the neural network and the accuracy of network prediction.Figure 7 shows changes in models' performance (coefficient of determination-2 R ) versus changes in the number of neurons for different hybrid models.The maximum value of 2 R for BES-ANN, dfDB-MRFO-ANN, and RUN-ANN is 0.91, 0.88, and 0.92 with 18, 10, and 30 neurons, respectively.Figure 7d shows the results of the feedforward neural network trained using backpropagation.All hybrid models outperform the feed-forward neural network trained using backpropagation.Figure 9 shows the Taylor diagram of the hybrid models.Taylor diagrams, which are based on the Pearson correlation coefficient, the root-mean-square (RMS) error, and the standard deviation, are statistical diagrams that reveal which of the various models is the most reliable [48,49].The more accurate a model predicts, the closer it is to the "observed" point.The standard deviation of models that lie on the solid red-arc is right (closer to the pattern of experimental data).In Figure 9 it can be seen that the RUN-ANN and dfDB-MRFO-ANN algorithms′ standard deviations are closer to the experimental data′s standard deviation.However, the RUN-ANN algorithm has a slightly higher correlation coef-  Figure 9 shows the Taylor diagram of the hybrid models.Taylor diagrams, which are based on the Pearson correlation coefficient, the root-mean-square (RMS) error, and the standard deviation, are statistical diagrams that reveal which of the various models is the most reliable [48,49].The more accurate a model predicts, the closer it is to the "observed" point.The standard deviation of models that lie on the solid red-arc is right (closer to the pattern of experimental data).In Figure 9 it can be seen that the RUN-ANN and dfDB-MRFO-ANN algorithms' standard deviations are closer to the experimental data's standard deviation.However, the RUN-ANN algorithm has a slightly higher correlation coefficient with experimental data.Although the RUN-ANN and BES-ANN models are the ones that most closely match experimental data (their points are near the point that shows as "observed" on the x-axis), the BES-ANN model's standard deviation is smaller than the experimental data's standard deviation.

Model Interpretations
For engineers and researchers, model interpretability is critical.As a result, engin and researchers would like to discover how the model makes decisions and how th tributes influence the model′s outcome.SHapley Additive Explanation (SHAP) is util in this section to assist with model interpretation.Game theory′s Shapley value insp the concept of the SHAPE method.The SHAP creates the EM() explanation model, w may be stated as:

Model Interpretations
For engineers and researchers, model interpretability is critical.As a result, engineers and researchers would like to discover how the model makes decisions and how the attributes influence the model's outcome.SHapley Additive Explanation (SHAP) is utilized in this section to assist with model interpretation.Game theory's Shapley value inspired the concept of the SHAPE method.The SHAP creates the EM() explanation model, which may be stated as: where z ∈ {0, 1} IF IF is the number of input parameters, φ j ∈ R is the SHAP value for the jth input parameter, and φ 0 stands for the baseline prediction, which is defined as all the features missing from the input space.The Shapley value specifies how the contributions should be distributed adequately among the attributes.The SHAP value for the jth input parameter can be determined using the following formula: where m i is an instance, |D| is the number of non-zero members of D, F m i represents the possible subsets that do not include m i , and the model outcome for the subset D is represented by v(D).According to Equation ( 12), the model must be retrained twice for each subset: once for the subset including m i and once it.The marginal contribution per variable is derived by multiplying the likelihood of occurrence for each subgroup without that variable by the different output with/without that variable [33].
Figure 10 shows the feature importance based on the best hybrid model (RUN-ANN).The parameters are listed in descending order on a bar-important plot based on their contribution.The top parameters have a higher predictive influence than the bottom ones, contributing more to the model prediction.In fact, the importance factor values in Figure 10 are the mean of the absolute SHAP values for features across the whole dataset.The most important variable is the FRPS width (b f ), which is followed by the bond length of the FRPS (l f ), FRPS elastic modulus (E f ), and width of the concrete (b c ).The SHAP approach can also illustrate how much each attribute impacts the target parameter, both positively and negatively, using the summary plot.The summary plot (Figure 11) is constructed from all of the samples in the test data.).The S approach can also illustrate how much each attribute impacts the target parameter, positively and negatively, using the summary plot.The summary plot (Figure 11) is structed from all of the samples in the test data.Figure 11 shows that

Comparison with Mechanics-Based Models
It is useful to compare the RUN-ANN to existing equations.For comparisons, three equations from the literature are used, which are given as follows (the parameters are introduced in Section 2):  Model 1: Wu et al. [50] Figure 11.Summary plot.

Comparison with Mechanics-Based Models
It is useful to compare the RUN-ANN to existing equations.For comparisons, three equations from the literature are used, which are given as follows (the parameters are introduced in Section 2): • Model 2: Lin et al. [12] P The tested bond strength versus computed values from the three empirical models and the RUN-ANN is shown in Figure 12.Table 3 summarizes the quantitative measures of the experimentally measured (P u,exp ) to analytically-predicted (P u,pre ) ratio for the models.It can be seen that selecting the best model from the three empirical models (Models 1-3) is difficult.The mean results of Model 1 and Model 2 are larger than 1, which reveals that the two models conservatively predict the bond strength.However, Model 3 overestimates the bond strength.The quantitative indicators of the RUN-ANN are notably better than that of the three empirical models.For example, the coefficient of variation of the tested to predict ratios from the empirical models is in the range of 34 to 43 percent, while for the RUN-ANN model, it will decrease to 18.6%.MAE of the RUN-ANN model is lower than 2.5 kN, while the MAE of the rest models is higher than 4 kN.These results demonstrate that the RUN-ANN model is more accurate than mechanics-based models.

Sensitivity Analysis Based on Sobol′s Method
Another conventional method for investigating the impact of the changes in input parameters on the model's outcome and vice versa is the use of the global sensitivity analysis (SA) based on Sobol's method [52].Compared with the local SA method suitable for linear models, the global SA technique is appropriate for complicated non-linear models [52].The most prominent advantage of Sobol′s variance-based approach is that it does not require any special analytic functions and can be used in a variety of situations.The Python sensitivity analysis library (SALib [53]) is used in this study.A bar graph of the total  The weakness of mechanics-based models can be due to the following reasons: (1) They were generated using a limited data set.As a result, when used to forecast experimental bond strength results, other than those utilized for calibration/validation, they show significant inconsistency and low accuracy.(2) Simple assumptions are considered in building these models because they are often constructed using standard mathematical programming and analytical or numerical methodologies.This makes complex nonlinear shear behaviors not well considered.In other words, despite their effectiveness in dealing with basic and idealized situations, similar strategies have exhibited significant flaws when dealing with complicated systems, primarily because of the simplifying assumptions that are taken into account when developing them.

Sensitivity Analysis Based on Sobol's Method
Another conventional method for investigating the impact of the changes in input parameters on the model's outcome and vice versa is the use of the global sensitivity analysis (SA) based on Sobol's method [52].Compared with the local SA method suitable for linear models, the global SA technique is appropriate for complicated non-linear models [52].The most prominent advantage of Sobol's variance-based approach is that it does not require any special analytic functions and can be used in a variety of situations.The Python sensitivity analysis library (SALib [53]) is used in this study.A bar graph of the total order index is shown in Figure 13. Figure 13 indicates that the FRPS bond length (l f ), width (b f ), elastic modulus (E f ), and thickness (t f ) are the dominant parameters that contribute to the interfacial-bond strength.These results are in agreement with Chen et al. [27], which also reported that FRP sample's width and bonding length were controlling factors in forecasting the interfacial-bond strength.

Conclusions
Fiber-reinforced polymers (FRPs) have been commonly reinforced concrete structures′ strength.The nonlinear rela variables and the bond strength make predicting the FRPC b this paper, novel hybrid approaches were developed for pr FRPC using an artificial neural network (ANN) coupled w such as bald eagle search (BES), dynamic fitness distance ba aging optimization (MRFO) dFDB-MRFO, and RUNge K weights and biases of the ANN model were adjusted usi SHAP (SHapley Additive exPlanation) was used to determ

Conclusions
Fiber-reinforced polymers (FRPs) have been commonly used as an option to enhance reinforced concrete structures' strength.The nonlinear relationships between the input variables and the bond strength make predicting the FRPC bond strength challenging.In this paper, novel hybrid approaches were developed for predicting the bond strength of FRPC using an artificial neural network (ANN) coupled with optimization algorithms, such as bald eagle search (BES), dynamic fitness distance balance (dFDB)-manta ray foraging optimization (MRFO) dFDB-MRFO, and RUNge Kutta optimizer (RUN).The weights and biases of the ANN model were adjusted using optimization algorithms.SHAP (SHapley Additive exPlanation) was used to determine the characteristics' relevance.
The following conclusions were reached as a result of this research: 1.
All hybrid models outperformed the feed-forward neural network trained using the backpropagation algorithm.

Figure 2 .
Figure 2. Correlation matrix and statistical distributions of input variables.(a) Distribu input variables.(b) Correlation matrix for input variables.

Figure 2 .
Figure 2. Correlation matrix and statistical distributions of input variables.(a) Distributions of input variables.(b) Correlation matrix for input variables.

Figure 3 .
Figure 3. Searching within a spiral space.

Figure 3 .
Figure 3. Searching within a spiral space.

Figure 6 .
Figure 6.Flowchart of the RUN-ANN algorithm.

Figure 6 .
Figure 6.Flowchart of the RUN-ANN algorithm.

Figure 7 .
Figure 7. Changes in model performance versus changes in the number of neurons.

Figure 8
Figure 8 shows the experimentally measured ( ,exp u P ) to analytically-predicted ( , u pre P ) ratio of the interfacial bond strength as a function of the FRPS thickness ( f t ) and the FRPS width ( f b ). Figure 8 depicts how hybrid approaches suffer from variability when estimating the effect of FRPS thickness on interfacial bond strength.Figure 8 also reveals that the novel RUN-ANN model, on the other hand, minimizes such predictions by reducing diversity.This enhanced performance is further validated by monitoring the impact of FRPS width ( f b ).Table 2 summarizes the average of the models′ errors.The standard deviation is a number that expresses how far the values are spread out.A low standard deviation indicates that the majority of the data points are near to the average.Therefore, the coefficient of variation (C.O.V) also is computed.C.O.V represents the ratio of the standard deviation to the mean, and it is a useful statistic for comparing the degree of variation from one data series to another [20].The mean of the RUN-ANN algorithm′s errors ( ,exp , / u u pre P P ) is quite near to a value of 1.The RUN-ANN also has the lowest standard deviation.The quantitative measures of the hybrid models indicate that the RUN-ANN algorithm is more in line with the experimental data.The value of standard deviation and C.O.V are high for the BES-ANN and dfDB-MRFO-ANN models, which means that there is a large amount of variability among the data points ( ,exp , / u u pre P P).The RUN-ANN algorithm also has the lowest MAE (0.078), whereas MAE is 0.084 and 0.098 for the BES-ANN and dfDB-MRFO-ANN models.

Figure 7 .Figure 8 .
Figure 7. Changes in model performance versus changes in the number of neurons.Materials 2022, 15, x FOR PEER REVIEW 12 of 19

Figure 8 .
Figure 8. Variability in predicting with respect to the FRPS thickness.

Figure 9 .
Figure 9.Taylor diagram of hybrid models.

Figure 9 .
Figure 9.Taylor diagram of hybrid models.
Figure 11 shows that b f and l f are positively correlated to the interfacial bond strength.In other words, increasing b f and l f leads to an increase in the bond strength.Materials 2022, 15, x FOR PEER REVIEW 1 10 are the mean of the absolute SHAP values for features across the whole dataset most important variable is the FRPS width ( f b ), which is followed by the bond leng the FRPS ( f l ), FRPS elastic modulus ( f E ), and width of the concrete ( c b correlated to the interfacial bond strength.In other words, increasing f b an leads to an increase in the bond strength.

Figure 10 .
Figure 10.Bar-important plot of the best model.

Figure 10 .
Figure 10.Bar-important plot of the best model.

Figure 10 .
Figure 10.Bar-important plot of the best model.

Figure 12 .
Figure 12.Comparisons of the Models.

Figure 12 .
Figure 12.Comparisons of the Models.

Table 1 .
Statistical properties of the dataset.

Table 1 .
Statistical properties of the dataset.

Table 2 .
Characteristics of the predict-to-test ratio.

Table 2 .
Characteristics of the predict-to-test ratio.