Enhancing Ultimate Bearing Capacity Prediction of Cohesionless Soils Beneath Shallow Foundations with Grey Box and Hybrid AI Models

: This study examines the potential of the soft computing technique, namely, multiple linear regression (MLR), genetic programming (GP), classiﬁcation and regression trees (CART) and GA-ENN (genetic algorithm-emotional neuron network), to predict the ultimate bearing capacity (UBC) of cohesionless soils beneath shallow foundations. For the ﬁrst time, two grey-box AI models, GP and CART, and one hybrid AI model, GA-ENN, were used in the literature to predict UBC. The inputs of the model are the width of footing (B), depth of footing (D), footing geometry (ratio of length to width, L/B), unit weight of sand ( γ d or γ (cid:48) ), and internal friction angle ( φ ). The results of the present model were compared with those obtained via two theoretical approaches and one AI approach reported in the literature. The statistical evaluation of results shows that the presently applied paradigm is better than the theoretical approaches and is competing well for the prediction of q u . This study shows that the developed AI models are a robust model for the q u prediction of shallow foundations on cohesionless soil. Sensitivity analysis was also carried out to determine the effect of each input parameter. The ﬁndings showed that the width and depth of the foundation and unit weight of soil ( γ d or γ (cid:48) ) played the most signiﬁcant roles, while the internal friction angle and L/B showed less importance in predicting q u .


Introduction
In the realm of geotechnical engineering, the consideration of ultimate bearing capacity (UBC) and allowable settlement continues to be of paramount importance when designing shallow foundations [1][2][3].These aspects are crucial factors that dictate the structural integrity and performance of any construction project.UBC, a fundamental parameter, is deeply intertwined with the shear strength characteristics of the underlying soil, and its accurate estimation is essential for ensuring the stability of foundations.Theoretical formulations proposed by eminent geotechnical scholars such as Terzaghi [4], Meyerhof [5], Hansen [6], and Vesic [7] have provided a foundation for calculating UBC.However, the application of these theories has revealed that various UBC equations exhibit significant disparities, primarily due to the inherent simplifications and assumptions employed in these approaches [8].
The pursuit of accurate UBC prediction has led to a proliferation of research studies that explore numerical and semi-empirical solutions.The works of Conte et al. [9] and Achmus and Thieken [10] stand as examples of such efforts, where researchers have sought to elucidate the complex bearing capacity phenomenon.These models are frequently evaluated through meticulously designed footing tests carried out on scale models.Yet, bridging the gap between laboratory-scale experiments and real-world, full-scale foundation scenarios remains a challenge.Scholars, cognizant of the potential scale effects, have undertaken initiatives to minimize discrepancies when extrapolating findings from controlled laboratory conditions to the complexities of real-world projects.This endeavor to account for scale effects, as seen in the works of de Beer [11] and Yamaguchi et al. [12], underscores the dedication within the geotechnical community to ensuring the accuracy and reliability of their predictions.
In a pursuit to unravel the intricate facets of bearing capacity, researchers have embarked on investigations that dissect the influence of factors such as particle size on UBC.The study carried out by Tatsuoka et al. [13] exemplifies this endeavor, as it unravels the nuances of soil behavior during model-scale footing experiments.Interestingly, the results drawn from large-scale tests conducted on compacted sand have unveiled pronounced variations in shearing strains along the slip line.These findings have challenged conventional formulas that solely rely on the concept of maximum friction angle (φ max ) [14].This revelation underscores the critical need for a nuanced understanding of the underlying mechanics and limitations of various theoretical equations, ultimately driving the demand for innovative approaches to enhance the accuracy of UBC predictions in real-world scenarios.
Yet, despite the remarkable strides made by AI in various fields [57][58][59], its penetration into the geotechnical engineering sphere remains somewhat limited, as evidenced by the existing literature.This research endeavor boldly pioneers the utilization of grey box AI methodologies and a hybrid AI approach to predict UBC in the context of shallow foundations on cohesionless soils.
This ambitious research journey encompasses a comprehensive ensemble of four distinct mathematical models, each contributing a unique facet to the overarching goal.The multiple linear regression (MLR) model forms the foundation as a statistical framework, shedding light on the significance of individual parameters.Concurrently, the integration of grey box AI models such as genetic programming (GP) and classification and regression tree (CART) methods introduces a novel dimension to UBC prediction, bridging the gap between traditional analytical approaches and cutting-edge AI methodologies.Then, the Genetic Algorithm-Emotional Neural Network (GA-ENN) method, a hybrid AI approach that harnesses the strengths of genetic algorithms and neural networks, was employed.This amalgamation of diverse approaches is harnessed for the very first time to unravel the enigma of predicting the ultimate bearing capacity of cohesionless soils beneath shallow foundations.
The culmination of this research is not merely a prediction tool; it represents a paradigm shift that opens new vistas for accurate and reliable foundation design.This shift comes with a rigorous evaluation of the significance of input parameters, accompanied by a comprehensive sensitivity analysis to ensure the robustness and applicability of the predictive models in real-world scenarios.As we peer into the future, the impact of this pioneering effort is poised to extend beyond the boundaries of geotechnical engineering, ushering in a new era of precision and innovation across the construction industry.

Database Collection and Processing
The collected database contains a total of 97 datasets comprising results from tests on square, rectangular, and strip footings of various sizes conducted in sand beds with different unit weights (dry unit weight, γ d or effective unit weight, γ ).To improve the model's performance, the data are evenly distributed, ensuring an equal number of samples for large and small-sized footings.
For the large-scale tests conducted at the DEGEBO test area in Berlin, submerged unit weights (γ d or γ ) are utilized due to the submerged conditions during testing [14,60].The angle of shearing resistance reported by the respective authors is adopted for the analysis, despite some differences in the mobilized angle of shearing resistance at failure between axisymmetric and plain strain conditions.Nevertheless, these differences do not exceed 10% [14,60].
In the case of laboratory model tests, the internal friction angles used are obtained from direct shear tests conducted at very low normal stresses, which also includes the effect of dilation.
Regarding the determination of ultimate load, for large-scale footings, it is defined as the load corresponding to the point where the slope of the load settlement curve is at its minimum.For smaller-sized model footings, it is defined as the load corresponding to the point of break of the load settlement curve in a log-log plot.Figure 1 shows the distribution of the database.
Figure 2 illustrates the influence of the foundation depth on the foundation load.From these diagrams, it can be observed that the bearing capacity of the shallow foundation increases with the increasing foundation depth.
Prior to utilizing the existing database, a thorough check for inappropriate data was conducted.To accomplish this, a box plot for the output parameter was generated, and Figure 3a displays the results.Following the examination, 22 datasets were identified as inappropriate and subsequently removed from the overall database.The refined database consists of 75 suitable datasets, and their corresponding box plot is depicted in Figure 3b.
Table 1 presents the distribution of the utilized database after the exclusion of outliers.The data reveal valuable insights into the statistical characteristics of each variable.The ultimate bearing capacity of the foundation, represented by q u , varies significantly across the datasets, ranging from 58.500 kPa to 464.000 kPa, with a mean value of 211.274 kPa and a standard deviation of 108.959 kPa.On the other hand, the foundation width (B) displays a relatively tighter distribution, ranging from 0.059 m to 0.600 m, with a mean of 0.250 m and a standard deviation of 0.195 m.Similarly, the foundation depth (D) exhibits a narrow range, varying from 0.000 m to 0.500 m, with a mean value of 0.098 m and a standard deviation of 0.106 m.The aspect ratio (L/B), calculated as the ratio of foundation length to width, shows wider variability, ranging from 1.000 to 6.000, with a mean of 3.409 and a standard deviation of 2.269.The unit weight of the soil (γ d or γ ) displays variations across the datasets, ranging from 9.850 kN/m 3 to 17.100 kN/m 3 , with a mean value of 14.615 kN/m 3 and a standard deviation of 2.625 kN/m 3 .Similarly, the internal friction angle of the soil (φ) exhibits moderate variability, ranging from 34.000 • to 42.500 • , with a mean value of 38.363 • and a standard deviation of 2.691 • .The information provided in Table 1 serves as a basis for further analysis, offering crucial insights into the key factors influencing surface foundation load behavior and supporting improved foundation design and performance assessment.Width of footing (B), depth of footing (D), footing geometry (L/B), unit weight of sand (γ d or γ ), and internal friction angle (φ).Furthermore, Figure 4 shows histograms for the output (q u ).   Figure 2 illustrates the influence of the foundation depth on the foundation load.From these diagrams, it can be observed that the bearing capacity of the shallow foundation increases with the increasing foundation depth.Prior to utilizing the existing database, a thorough check for inappropriate data was conducted.To accomplish this, a box plot for the output parameter was generated, and Figure 3a displays the results.Following the examination, 22 datasets were identified as inappropriate and subsequently removed from the overall database.The refined database consists of 75 suitable datasets, and their corresponding box plot is depicted in Figure 3b.Table 1 presents the distribution of the utilized database after the exclusion of outliers.The data reveal valuable insights into the statistical characteristics of each variable.The ultimate bearing capacity of the foundation, represented by qu, varies significantly across the datasets, ranging from 58.500 kPa to 464.000 kPa, with a mean value of 211.274 kPa and a standard deviation of 108.959 kPa.On the other hand, the foundation width (B) displays a relatively tighter distribution, ranging from 0.059 m to 0.600 m, with a mean of 0.250 m and a standard deviation of 0.195 m.Similarly, the foundation depth (D) exhibits  Prior to utilizing the existing database, a thorough check for inappropriate data was conducted.To accomplish this, a box plot for the output parameter was generated, and Figure 3a displays the results.Following the examination, 22 datasets were identified as inappropriate and subsequently removed from the overall database.The refined database consists of 75 suitable datasets, and their corresponding box plot is depicted in Figure 3b.Table 1 presents the distribution of the utilized database after the exclusion of outliers.The data reveal valuable insights into the statistical characteristics of each variable.The ultimate bearing capacity of the foundation, represented by qu, varies significantly across the datasets, ranging from 58.500 kPa to 464.000 kPa, with a mean value of 211.274 kPa and a standard deviation of 108.959 kPa.On the other hand, the foundation width (B) displays a relatively tighter distribution, ranging from 0.059 m to 0.600 m, with a mean of 0.250 m and a standard deviation of 0.195 m.Similarly, the foundation depth (D) exhibits

Multiple Linear Regression (MLR)
MLR (multiple linear regression) is a statistical technique used to predict one output variable from multiple independent input variables.This method is an improved version of linear regression that uses a single input variable and one output variable.MLR considers linear relationships between input parameters and output parameters.The data are also normalized in this method.
The best line in MLR is obtained by selecting regression coefficients so that the model has the lowest error.Prior to AI methods, MLR was employed to check the accuracy of linear regression, which is one of the simplest regression methods.

Multiple Linear Regression (MLR)
MLR (multiple linear regression) is a statistical technique used to predict one output variable from multiple independent input variables.This method is an improved version of linear regression that uses a single input variable and one output variable.MLR considers linear relationships between input parameters and output parameters.The data are also normalized in this method.
The best line in MLR is obtained by selecting regression coefficients so that the model has the lowest error.Prior to AI methods, MLR was employed to check the accuracy of linear regression, which is one of the simplest regression methods.

Classification and Regression Trees (CART)
Classification and Regression Trees (CART) constitute a fundamental machine learning technique that holds its roots in the realm of decision trees.This method embodies a versatile approach capable of addressing both classification and regression tasks.The origins of CART can be attributed to its introduction by Breiman et al. [64,65] in the 1984 book titled "Classification and Regression Trees," where they elaborated on its principles and applications.In essence, a CART algorithm partitions the feature space into a hierarchical structure of binary decisions, resulting in a tree-like structure wherein each internal node represents a feature and a threshold that conditions the data's traversal along one of its two branches [66].Through a recursive process, this partitioning culminates in terminal nodes, or leaves, which correspond to the predicted outcome for a given input instance.The efficacy of CART lies not only in its interpretability, owing to its intuitive representation, but also in its capacity to accommodate various data types and handle non-linear relationships between features and target variables [67,68].
The CART algorithm operates through an iterative procedure aimed at optimizing its splitting criteria, typically based on metrics such as Gini impurity for classification tasks and mean squared error for regression tasks [69].By evaluating potential splits and selecting the one that minimizes the chosen metric's value, the algorithm generates a hierarchy of decision rules that sequentially refine the prediction process.This process is governed by pruning strategies to mitigate overfitting and ensure generalizability.Although it is susceptible to certain limitations like instability and sensitivity to data variations, CART's inherent adaptability and ability to handle missing values contribute to its enduring relevance and applicability in diverse domains, ranging from medical diagnosis to financial forecasting.

Genetic Programming (GP)
John Koza [70], a computer scientist and researcher at Stanford University, developed the first genetic programming system in the early 1990s.Using this system, he developed computer programs capable of solving mathematical problems and controlling robots [69].Several genetic programming frameworks and toolkits were developed as a result of this development [71].
As a subfield of artificial intelligence and evolutionary computation, genetic programming (GP) is a method for solving complex problems using a process inspired by natural selection and genetics [72].In this approach, potential solutions are represented as trees of operations (also known as computer programs) and then improved using genetic algorithms [73].
The GP algorithm generates an initial population of computer programs at random and evaluates their fitness in accordance with a problem-specific objective function [74].Through genetic operators such as crossover and mutation, the best programs are selected to produce the next generation.The process continues until a satisfactory solution is found or a stopping criterion is met.
A wide range of problems can be solved using GP, including function approximation, symbolic regression, and even game play [75].There are several strengths of GP, including its ability to find complex solutions that are difficult or impossible to discover manually, as well as its ability to handle high levels of uncertainty and noise in the data.It can, however, be computationally expensive and may have difficulty finding solutions in a reasonable amount of time for very complex problems.

Genetic Algorithm-Emotional Neural Network (GA-ENN)
The GA-ENN (Genetic Algorithm-Emotional Neural Network) method is a hybrid approach that combines Genetic Algorithms (GA) and Ensemble Neural Networks (ENN).It is primarily used for classification tasks in machine learning and data mining [76].In the GA-ENN method, a Genetic Algorithm is employed to optimize the architecture and parameters of an Ensemble Neural Network.The Genetic Algorithm applies principles inspired by natural selection and genetic mechanisms, such as mutation, crossover, and selection, to iteratively evolve a population of potential neural network architectures [77].
Recently, Lotfi and Akbarzadeh [78] pioneered the development of a limbic-based Emotional Neural Network (ENN).This novel network architecture draws inspiration from the emotional processes in the brain and is characterized by a single-layer design.In contrast to artificial neural networks (ANNs) that imitate biological neurons, ENNs are crafted to replicate the intricate interactions among four specific neural regions within the emotional brain: the thalamus, sensory cortex, orbitofrontal cortex (OFC), and amygdala.
During each iteration of the Genetic Algorithm, a set of candidate neural network architectures is evaluated based on their fitness, which is typically defined by the classification accuracy on a validation set [79].The fittest architectures are selected to undergo genetic operators like crossover and mutation, which create new offspring architectures with modified characteristics [80].Once the genetic evolution process is complete, an ensemble of neural networks is constructed by training each candidate architecture on the training dataset.The ensemble combines the predictions of multiple neural networks to make the final classification decision.This aggregation of predictions from diverse models often leads to improved generalization performance and robustness.
The GA-ENN method offers several advantages.Firstly, it leverages the optimization power of Genetic Algorithms to search for optimal neural network architectures in a vast solution space.This helps to identify architectures that can capture complex patterns in the data and improve classification accuracy.Secondly, by creating an ensemble of neural networks, it reduces the risk of overfitting and enhances the model's ability to handle noise and variability in the data.

Database Preparation
In the database, individual variables were measured in their respective units.Data normalization is employed to decrease network error and enhance network training speed.The normalization linear function utilized in this study is presented below.
This equation has four terms, X max , X min , X and X norm , which are the maximum, minimum, actual and normalized values, respectively.
In all mathematical models, the training and testing databases were randomly divided.A total of 80% of the main database was used for training, while the remaining 20% was used for testing.
Evaluating statistical parameters for both databases is crucial, as it directly impacts the final model's accuracy.Based on Tables 2 and 3, the training and testing databases exhibited similar statistical parameters, including minimum, maximum, mean, and standard deviation values.The performance of a network can be evaluated through a variety of parameters such as coefficient of determination (R 2 ) and mean absolute error (MAE).Equations (2)- (7) show the definitions of Mean Absolute Error (MAE), Mean Square Error (MSE), Root Mean Square Error (RMSE), Mean Squared Logarithmic Error (MSLE), Root Mean Squared Logarithmic Error (RMSLE), and Coefficient of Determination (R 2 ) [81].
where N is the number of datasets, X m and X p are actual and predicted values, and X m and X p are the average of actual and predicted values, respectively.Ideally, the model should have an R 2 value of 1 and a MAE, MSE, RMSE, MSLE, RMSLE value of 0.

Multiple Linear Regression (MLR)
Table 4 presents the Pearson correlation matrix, showcasing the linear relationships between input and output parameters.This table assesses linear relationships between various parameters.The Pearson correlation matrix is a statistical instrument employed to assess the linear connection between variable pairs within a dataset.It encompasses correlation coefficients, which gauge the magnitude and orientation of the linear link between these variables.These matrix elements span from −1 to 1, furnishing valuable insights into the interrelationship among variables.When the correlation coefficient is positive, it signifies a direct positive linear association between the two variables.In other words, as one variable increases, the other also tends to increase.The closer the coefficient approaches +1, the more robust the positive relationship.Conversely, when the correlation coefficient is negative, it indicates an inverse linear relationship between the two variables.In this scenario, as one variable increases, the other typically decreases.The closer the coefficient nears −1, the more pronounced the negative relationship.When the correlation coefficient stands precisely at zero, it denotes the absence of a linear relationship between the variables.Changes in one variable do not provide predictive information about changes in the other.
The findings reveal that there are no input parameters that exhibit a strong linear relationship with the output parameter.The highest correlation value observed is 0.594.
The findings from Table 4 indicate that the MLR model is unlikely to yield strong predictions for the output parameter.However, the MLR model was thoroughly examined, and the best performing model was selected.Figure 5 illustrates the comparison between the predicted and actual values generated by the numerical model.Despite the efforts, the MLR model did not accurately predict the parameter q u , lacking sufficient precision in its predictions.The findings from Table 4 indicate that the MLR model is unlikely to yield strong predictions for the output parameter.However, the MLR model was thoroughly examined, and the best performing model was selected.Figure 5 illustrates the comparison between the predicted and actual values generated by the numerical model.Despite the efforts, the MLR model did not accurately predict the parameter qu, lacking sufficient precision in its predictions.Furthermore, the results reveal that the suggested MLR model performed better in predicting lower qu values compared to higher qu values.This observation suggests that, in cases of lower qu, the relationship between inputs and output parameters tends to exhibit closer adherence to linear behavior.
Table 5 summarizes the overall performance of the best multiple linear regression (MLR) model in predicting the ultimate bearing capacity of shallow foundations for both the training and testing databases.Furthermore, the results reveal that the suggested MLR model performed better in predicting lower q u values compared to higher q u values.This observation suggests that, in cases of lower q u , the relationship between inputs and output parameters tends to exhibit closer adherence to linear behavior.
Table 5   Similarly, in the testing database, the model performs well with an MAE of 32.260 kPa, MSE of 2026.062, and RMSE of 45.012.The MSLE is 0.116, and RMSLE is 0.341.The R 2 value for the testing database is 0.837, suggesting a good fit.Overall, the table demonstrates that the MLR model provides reasonably accurate predictions for the ultimate bearing capacity of shallow foundations in both the training and testing databases.While some discrepancies exist, the model generally performs well in capturing the relationships between the input variables and the target output.Although the linear model exhibits relative accuracy, the performance of the frame model continues to improve.

Genetic Programming (GP)
Upon applying Multiple Linear Regression (MLR) as the statistical model, a grey box AI model called Genetic Programming (GP) was utilized to forecast the bearing capacity of shallow foundations (q u ).Through a thorough analysis of various GP models, the most optimal model was identified.As depicted in Figure 6, the outcomes of the best GP model show a favorable alignment between actual and predicted values.These results demonstrate the remarkable accuracy of the GP model in forecasting q u (bearing capacity) values.

RMSLE
0.252 0.341 R 2 0.809 0.837 Similarly, in the testing database, the model performs well with an MAE of 32.260 kPa, MSE of 2026.062, and RMSE of 45.012.The MSLE is 0.116, and RMSLE is 0.341.The R 2 value for the testing database is 0.837, suggesting a good fit.Overall, the table demonstrates that the MLR model provides reasonably accurate predictions for the ultimate bearing capacity of shallow foundations in both the training and testing databases.While some discrepancies exist, the model generally performs well in capturing the relationships between the input variables and the target output.Although the linear model exhibits relative accuracy, the performance of the frame model continues to improve.

Genetic Programming (GP)
Upon applying Multiple Linear Regression (MLR) as the statistical model, a grey box AI model called Genetic Programming (GP) was utilized to forecast the bearing capacity of shallow foundations (qu).Through a thorough analysis of various GP models, the most optimal model was identified.As depicted in Figure 6, the outcomes of the best GP model show a favorable alignment between actual and predicted values.These results demonstrate the remarkable accuracy of the GP model in forecasting qu (bearing capacity) values.Furthermore, the high R 2 value of 0.975 in the testing database showcases the model's robustness in explaining the variance in unseen data, reaffirming its strong predictive ability.Overall, the results in Table 6 underscore the GP model's efficacy in accurately estimating the ultimate bearing capacity of shallow foundations, making it a valuable tool for practical applications in foundation engineering and design.
An important distinction between genetic programming (GP) and black box models lies in the GP model's ability as a grey box model to produce an equation as its outcome, which can be directly utilized by the reader.In this study, Equation ( 8) represents the GP model's output for predicting the bearing capacity of shallow foundations (q u ) values.The length of Equation ( 8) can be attributed to its incorporation of five distinct inputs: width of footing (B), depth of footing (D), footing geometry (L/B), unit weight of sand (γ d or γ ), and internal friction angle (φ).Notably, conventional methods lack the capability to generate an equation like Equation (8), considering all five influential inputs.Thus, this study marks the first successful attempt to formulate an equation that predicts q u based on these five crucial inputs.It is essential to mention that the parameters in Equation ( 8) are normalized values, and readers should normalize their input values based on Table 1 before applying them to Equation (8) for accurate predictions.

Classification and Regression Trees (CART)
After utilizing GP as a grey box AI model for predicting the bearing capacity of shallow foundations, this discussion was further extended to include another grey box AI model, namely Classification and Regression Trees (CART), to complete this group of grey box models.Following a comprehensive evaluation of multiple CART models, the most optimal model was identified.Figure 7 illustrates the results obtained from the best CART model, showcasing a remarkable agreement between the actual and predicted values.These findings highlight the CART model's exceptional accuracy and precision in forecasting bearing capacity of shallow foundations (q u ) values.The incorporation of both GP and CART models provides valuable insights into the predictive capabilities of grey box AI techniques for estimating bearing capacity, enabling more reliable and informed decision making in foundation engineering applications.model, showcasing a remarkable agreement between the actual and predicted values.These findings highlight the CART model's exceptional accuracy and precision in forecasting bearing capacity of shallow foundations (qu) values.The incorporation of both GP and CART models provides valuable insights into the predictive capabilities of grey box AI techniques for estimating bearing capacity, enabling more reliable and informed decision making in foundation engineering applications.In the testing dataset, the CART model exhibited a slightly higher MAE of 23.583 kPa, indicating a slightly lower predictive accuracy on unseen data compared to the training dataset.Nevertheless, the model's predictive ability extended well to the testing data, as evidenced from the high R 2 value of 0.921, suggesting its effectiveness in explaining the variance in the testing dataset.Although the MSE and RMSE values of 981.006 and 31.321,respectively, were higher in the testing dataset, the model still maintained good performance, as indicated by the relatively low MSLE of 0.031 and RMSLE of 0.177.Overall, the results in Table 7 validate the reliability of the best CART model in predicting the bearing capacity of shallow foundations and underscore its potential as a valuable tool for  In the testing dataset, the CART model exhibited a slightly higher MAE of 23.583 kPa, indicating a slightly lower predictive accuracy on unseen data compared to the training dataset.Nevertheless, the model's predictive ability extended well to the testing data, as evidenced from the high R 2 value of 0.921, suggesting its effectiveness in explaining the variance in the testing dataset.Although the MSE and RMSE values of 981.006 and 31.321,respectively, were higher in the testing dataset, the model still maintained good performance, as indicated by the relatively low MSLE of 0.031 and RMSLE of 0.177.Overall, the results in Table 7 validate the reliability of the best CART model in predicting the bearing capacity of shallow foundations and underscore its potential as a valuable tool for practical applications in foundation engineering, enabling more accurate and informed decision making.
The output of the CART model is presented as a decision tree, which effectively captures and represents the data intervals described in Figure 8.This decision tree serves as a graphical representation of the model's decision-making process.It illustrates a sequence of decisions made by the model to ultimately arrive at its prediction.To utilize the decision tree, the estimation process commences at the initial node and follows the branches to subsequent nodes, adhering to the rules written on the leaves (see Figure 8 for reference).This sequential process continues until the final node is reached, which corresponds to the model's ultimate prediction.The decision tree offers a straightforward and interpretable way to understand how the model analyzes the input data and makes informed predictions based on specific conditions and thresholds.It is important to notice that Figure 8 is based on the normalized values.
Tables 8 and 9 offer a comprehensive description of the CART (Classification and Regression Trees) model, providing valuable insights into its structure and predictive capabilities.Table 8 presents an overview of the CART's architecture, encompassing details such as the number of terminal nodes, split variables, values for each node, parent and son nodes, and the corresponding predicted values.On the other hand, Table 9 delves deeper into CART's intricacies, presenting the specific rules and predicted values for each terminal node.If D <= 0.28 and φ <= 0.68 and φ <= 0.39 and φ > 0.18 and D > 0.13 then q u = 0.2 in 8.3% of cases 68 0.17 If D <= 0.28 and φ <= 0.68 and φ <= 0.39 and φ > 0.18 and D <= 0.13 and D <= 0.03 then q u = 0.2 in 6.7% of cases 69 0.10 If D <= 0.28 and φ <= 0.68 and φ <= 0.39 and φ > 0.18 and D <= 0.13 and D > 0.03 then q u = 0.1 in 6.7% of cases By referring to these tables, readers can gain a better understanding of the influential variables and their relationships with the target variable.The CART model assists in identifying critical decision points and the potential outcomes associated with each decision, facilitating improved model accuracy and enhanced predictions.Utilizing the insights from CART can contribute to better decision making in various domains, such as foundation engineering or other fields where predictive modeling is applicable.These tables serve as valuable references, empowering readers to explore the model's inner workings and leverage its interpretability for informed and accurate predictions.

Genetic Algorithm-Emotional Neural Network GA-ENN
In the context of this research, a hybrid approach referred to as GA-ENN was employed to predict the ultimate bearing capacity of sallow foundation.Through an array of analyses, the researchers pinpointed the optimal GA-ENN model, which is presented and described in detail.Figure 9 illustrates the comparison between the predicted and real values of the ultimate bearing capacity for the shallow foundation.The findings demonstrate the high effectiveness of this hybrid approach in accurately predicting the output.In the context of this research, a hybrid approach referred to as GA-ENN was employed to predict the ultimate bearing capacity of sallow foundation.Through an array of analyses, the researchers pinpointed the optimal GA-ENN model, which is presented and described in detail.Figure 9 illustrates the comparison between the predicted and real values of the ultimate bearing capacity for the shallow foundation.The findings demonstrate the high effectiveness of this hybrid approach in accurately predicting the output.Table 10 presents a comprehensive evaluation of the overall performance of the best Genetic Algorithm-Emotional Neural Network (GA-ENN) model for predicting the ultimate bearing capacity of shallow foundations.The assessment was conducted on both the Table 10 presents a comprehensive evaluation of the overall performance of the best Genetic Algorithm-Emotional Neural Network (GA-ENN) model for predicting the ultimate bearing capacity of shallow foundations.The assessment was conducted on both the training and testing datasets, utilizing various performance metrics to gauge the model's predictive accuracy.In the training dataset, the GA-ENN model achieved a Mean Absolute Error (MAE) of 6.634, indicating that, on average, its predictions deviated by approximately 6.634 units from the actual values.The Mean Squared Error (MSE) and Root Mean Squared Error (RMSE) values of 72.503 and 8.515, respectively, signify that the model's predictions displayed relatively low variance from the actual values, indicating its precision in capturing the underlying patterns in the training data.The Mean Squared Logarithmic Error (MSLE) of 0.004 and the Root Mean Squared Logarithmic Error (RMSLE) of 0.061 further confirm the model's accurate handling of both small and large errors.Moreover, the high R-squared (R 2 ) value of 0.994 indicates that the GA-ENN model explains an impressive 99.4% of the variance in the training data, showcasing its robust fit to the dataset.
In the testing dataset, the GA-ENN model performed exceptionally well, with a lower MAE of 6.018 compared to the training dataset, suggesting an even higher predictive accuracy on unseen data.The MSE and RMSE values of 65.397 and 8.087, respectively, reinforce the model's efficacy in producing predictions with minimal variability in the testing data.The MSLE of 0.005 and RMSLE of 0.072 reflect the model's consistent performance in handling errors on the testing data.Additionally, the high R 2 value of 0.995 demonstrates the GA-ENN model's robustness in explaining an impressive 99.5% of the variance in the testing data, emphasizing its ability to generalize well to new and unseen data.

Comparison of Different Proposed Models
Table 11 provides a comprehensive comparison of the predictive performance of different models for estimating the ultimate bearing capacity of shallow foundations.The evaluation is conducted on both the training and testing databases, and various performance metrics are utilized to assess the accuracy and reliability of each model.For the training database, the Mean Absolute Error (MAE) values range from 33.938 for the Multiple Linear Regression (MLR) model to an impressive 6.634 for the Genetic Algorithm-Emotional Neural Network (GA-ENN) model.This indicates that the GA-ENN model exhibits the lowest average absolute deviation between its predictions and the actual values, showcasing its superior accuracy in predicting the bearing capacity of shallow foundations during training.The Mean Squared Error (MSE) and Root Mean Squared Error (RMSE) also demonstrate the same trend, where the GA-ENN model outperforms all other models, yielding the smallest errors.
Similarly, in the testing dataset, the GA-ENN model attains the lowest MAE (6.018), MSE (65.397), and RMSE (8.087) values, indicating its remarkable predictive accuracy on unseen data.Additionally, the GA-ENN model shows the lowest Mean Squared Logarithmic Error (MSLE) and Root Mean Squared Logarithmic Error (RMSLE) values, indicating its ability to accurately predict both small and large errors on the testing data.
Furthermore, the coefficient of determination (R 2 ) values for the GA-ENN model are the highest in both the training (0.994) and the testing (0.995) datasets, illustrating the model's excellent fit to the data and its capability to explain a significant proportion of the variance in the target variable.
In Figure 10, the outcomes of various statistical and AI models for predicting the bearing capacity of shallow foundations are displayed.Among these models, it is evident that the Genetic Programming (GP), Classification and Regression Trees (CART), and In Figure 10, the outcomes of various statistical and AI models for predicting the bearing capacity of shallow foundations are displayed.Among these models, it is evident that the Genetic Programming (GP), Classification and Regression Trees (CART), and Genetic Algorithm-Emotional Neural Network (GA-ENN) models have exhibited the most promising performance.

Comparison with Previous Studies
To delve deeper into the investigation, Table 12 displays the outcomes of this study alongside numerical models from the historical record, as well as the Gaussian Process Regression Approach introduced by Ahmad et al. [60].To facilitate a more comprehensive comparison, an additional parameter referred to as Nash-Sutcliffe Efficiency (NSE), as defined in Equation ( 9), was employed alongside the R 2 .The findings clearly highlight the superiority of the suggested models in contrast to the pre-existing historical models.In the equations, T is the number of data points, and Q t o and Q t m are the actual and predicted output of ith sample of the data, respectively; Q o is the averaged actual output of the data.

Sensitivity Analysis
In the domain of artificial intelligence methodologies, the evaluation of the importance of input parameters holds a pivotal role.This assessment follows a structured procedure: each individual input parameter is deliberately adjusted both upwards and downwards by 100%.Subsequently, the resulting discrepancies within the models are meticulously monitored.This thorough analysis functions as a mechanism for assessing the sensitivity of each model to specific parameters.Elevated error values indicate heightened model sensitivity to those distinct parameters, while lower error values suggest that the scrutinized parameter has a relatively modest impact on the overall model efficacy.This approach enables us to precisely identify the input parameters that wield substantial influence over the model's outcomes, facilitating the refinement of the model for superior performance.
To visually illustrate this analysis of parameter significance across diverse models for predicting q u , Figure 11 is presented.These visuals provide valuable insights into the models' responses to variations in various input parameters and aid in the identification of pivotal factors that shape the predictive proficiency of the models for each property.
predicted output of ith sample of the data, respectively;  is the averaged actual output of the data.

Sensitivity Analysis
In the domain of artificial intelligence methodologies, the evaluation of the importance of input parameters holds a pivotal role.This assessment follows a structured procedure: each individual input parameter is deliberately adjusted both upwards and downwards by 100%.Subsequently, the resulting discrepancies within the models are meticulously monitored.This thorough analysis functions as a mechanism for assessing the sensitivity of each model to specific parameters.Elevated error values indicate heightened model sensitivity to those distinct parameters, while lower error values suggest that the scrutinized parameter has a relatively modest impact on the overall model efficacy.This approach enables us to precisely identify the input parameters that wield substantial influence over the model's outcomes, facilitating the refinement of the model for superior performance.
To visually illustrate this analysis of parameter significance across diverse models for predicting qu, Figure 11 is presented.These visuals provide valuable insights into the models' responses to variations in various input parameters and aid in the identification of pivotal factors that shape the predictive proficiency of the models for each property.Table 13 presents a hierarchy of input parameter importance across various models employed for the anticipation of qu.Within this ranking, a rank of 1 corresponds to the utmost importance, signifying the paramount parameter, while a rank of 5 designates the least significance.
Analyzing the outcomes of Table 13, it becomes evident that the parameters foundation depth, internal friction angle of soil and foundation width emerge as the most pivotal Table 13 presents a hierarchy of input parameter importance across various models employed for the anticipation of qu.Within this ranking, a rank of 1 corresponds to the utmost importance, signifying the paramount parameter, while a rank of 5 designates the least significance.Analyzing the outcomes of Table 13, it becomes evident that the parameters foundation depth, internal friction angle of soil and foundation width emerge as the most pivotal elements in predicting ultimate bearing capacity of shallow foundations (q u ).These parameters are instrumental in determining the foundation's ability to withstand load and stress distribution.Conversely, the parameters unit weight of soil (γ d or γ ) and L/B exhibit minimal relevance in this predictive context.The unit weight of soil (γ d or γ ) primarily influences the overall density of the foundation materials, which might not be as critical a factor in the prediction of ultimate bearing capacity as compared to parameters directly related to the geometry and material properties of the foundation.Similarly, the parameter L/B, representing the ratio of length to width, showcases minimal importance in this prediction.This could be attributed to the fact that other parameters like foundation depth, internal friction angle, and foundation width have more direct and substantial impacts on the foundation's load-bearing capability.

Implications and Applications
This study presents significant implications for the field of geotechnical engineering by exploring the potential of soft computing techniques in predicting the ultimate bearing capacity (UBC) of shallow foundations on cohesionless soils.The inclusion of multiple linear regression (MLR), Genetic Programming (GP), Classification and Regression Trees (CART), and GA-ENN (Genetic Algorithm-Emotional Neuron Network) models in this research represents a novel approach, as they are used for the first time as grey box AI models in the literature for UBC prediction.The application of these advanced AI models expands the range of predictive tools available to engineers and practitioners, offering diverse options for accurate and reliable UBC estimation.
The use of input parameters such as the width of footing (B), depth of footing (D), footing geometry (L/B), unit weight of sand (γ d or γ ), and internal friction angle (f) further enhances the versatility and practicality of the developed AI models.Engineers can leverage these models to perform UBC predictions for a wide range of shallow foundation designs on cohesionless soils, optimizing foundation designs and ensuring structural stability.The comparison of the AI models with existing theoretical approaches demonstrates their superiority in UBC prediction, underscoring their potential to outperform traditional methods.Consequently, the adoption of AI models like GP, CART, and GA-ENN can lead to more accurate and efficient foundation design practices, resulting in cost-effective and safer construction projects.Moreover, the sensitivity analysis conducted in this study empowers engineers to gain a deeper understanding of the influence of individual input parameters on UBC predictions, facilitating the identification of critical factors that impact the foundation's load-bearing capacity.Overall, the successful integration of AI models in UBC prediction for shallow foundations on cohesionless soil contributes to the advancement of geotechnical engineering practices and promotes more reliable and informed decision making during the design and construction phases.

Geotechnical Challenges and Remediation Strategies
The prediction and assessment of the ultimate bearing capacity (UBC) for shallow foundations on cohesionless soils pose several geotechnical challenges that need to be addressed for reliable and safe foundation design.One of the primary challenges is the complex and heterogeneous nature of cohesionless soils, which can exhibit significant spatial variability in properties such as grain size distribution, void ratio, and shear strength.This variability can lead to unpredictable bearing capacity behavior, making it challenging to accurately estimate UBC using traditional methods.Moreover, the presence of soft spots, loose zones, or weak layers within the soil profile can further complicate the foundation response and load-carrying capacity.Dealing with these uncertainties and spatial variations is crucial to ensure the integrity and stability of shallow foundations.
To overcome these geotechnical challenges, the implementation of advanced soft computing techniques, such as Genetic Programming (GP), Classification and Regression Trees (CART), and Genetic Algorithm-Emotional Neuron Network (GA-ENN), as demonstrated in this study, offers promising remediation strategies.These AI models possess the capability to learn and adapt to complex patterns and non-linear relationships within the data, enabling more accurate and robust predictions of UBC for shallow foundations on cohesionless soils.By utilizing these AI models, engineers can account for the spatial variability and uncertainties in soil properties, leading to more reliable and safer foundation designs.
Another significant geotechnical challenge is the consideration of various influential parameters, such as the width of footing (B), depth of footing (D), footing geometry (L/B), unit weight of sand (γ d or γ ), and internal friction angle (f), when estimating UBC.The interplay of these parameters requires a comprehensive understanding of their individual and combined effects on the foundation's load-bearing capacity.Soft computing techniques, as used in this study, facilitate sensitivity analysis to assess the impact of each input parameter on the UBC prediction, aiding engineers in identifying critical factors and optimizing foundation design.
In addition to AI-based approaches, thorough site investigations, including detailed soil exploration and testing, play a vital role in addressing geotechnical challenges.The collection of high-quality data enables engineers to characterize the soil properties accurately and capture the spatial variations, providing a solid foundation for the AI models' training and validation.

Limitations and Future Research
Despite the promising results and contributions of this study in predicting the ultimate bearing capacity (UBC) of shallow foundations on cohesionless soils using advanced soft computing techniques, there are certain limitations that should be acknowledged.One limitation is the reliance on historical data for model development and validation.The accuracy and performance of the AI models heavily depend on the quality and representativeness of the available data.Therefore, future research should focus on expanding the dataset and including data from a broader range of soil conditions and foundation geometries to enhance the models' generalization and applicability to diverse engineering scenarios.
Another limitation lies in the assumption of certain soil behaviors and conditions.The AI models developed in this study may not capture certain complex geotechnical phenomena, such as soil liquefaction or soil-structure interaction effects, which can significantly influence the bearing capacity of shallow foundations.Incorporating additional factors and advanced modeling techniques to address these complexities can further improve the accuracy and reliability of UBC predictions in more challenging geotechnical conditions.
Furthermore, the present study only considers cohesionless soils, while, in practice, engineers encounter various types of soils with different characteristics.Extending the AI models to account for cohesive soils and investigating their performance in mixed soil conditions can enrich the predictive capabilities of the models and enable a more comprehensive assessment of foundation bearing capacity.

Algorithms 2023 , 26 Figure 1 .
Figure 1.Distribution of collected datasets and effect of (a) dimension of foundation and (b) soil properties on qu.

Figure 1 .
Figure 1.Distribution of collected datasets and effect of (a) dimension of foundation and (b) soil properties on q u .

Figure 2 .
Figure 2. Effect of foundation depth on the bearing capacity of the shallow foundation in the collected datasets.

Figure 3 .
Figure 3. Process of removing outliers using plotting boxplots for (a) initial database and (b) final database.

Figure 2 .
Figure 2. Effect of foundation depth on the bearing capacity of the shallow foundation in the collected datasets.

Figure 2 .
Figure 2. Effect of foundation depth on the bearing capacity of the shallow foundation in the collected datasets.

Figure 3 .
Figure 3. Process of removing outliers using plotting boxplots for (a) initial database and (b) final database.

Figure 3 .
Figure 3. Process of removing outliers using plotting boxplots for (a) initial database and (b) final database.

Figure 4 .
Figure 4. Histogram of output in used database.

Figure 4 .
Figure 4. Histogram of output in used database.

Figure 5 .
Figure 5. Results of proposed MLR based on (a) training database and (b) testing database.

Figure 5 .
Figure 5. Results of proposed MLR based on (a) training database and (b) testing database.
summarizes the overall performance of the best multiple linear regression (MLR) model in predicting the ultimate bearing capacity of shallow foundations for both the training and testing databases.Various performance metrics are employed to assess the accuracy and precision of the model's predictions.In the training database, the model exhibits an average Mean Absolute Error (MAE) of 33.938 kPa, indicating the average absolute difference between the predicted and actual values.The Mean Squared Error (MSE) is 2189.854,representing the average squared difference between predictions and actual values.The Root Mean Squared Error (RMSE) is 46.796, which indicates the average magnitude of the error.The MSLE (Mean Squared Logarithmic Error) is 0.063, and the RM-SLE (Root Mean Squared Logarithmic Error) is 0.252, showcasing the model's performance in handling logarithmic differences.The R 2 value, a measure of how well the model fits the data, is 0.809 for the training dataset, indicating a reasonably good fit.

Figure 6 .
Figure 6.Results of proposed GP based on (a) training database and (b) testing database.Figure 6. Results of proposed GP based on (a) training database and (b) testing database.

Figure 6 .
Figure 6.Results of proposed GP based on (a) training database and (b) testing database.Figure 6. Results of proposed GP based on (a) training database and (b) testing database.Table 6 provides a comprehensive assessment of the best GP model's overall performance in predicting the ultimate bearing capacity of shallow foundations.The evaluation was conducted on both the training and testing databases, employing various performance metrics.In the training database, the GP model demonstrated a reasonable performance with a Mean Absolute Error (MAE) of 19.677 kPa, indicating that, on average, the model's predictions were within approximately 19.677 units of the actual values.The model's predictions showed a relatively good variance from the actual values, as reflected in the Mean Squared Error (MSE) of 700.751 and the Root Mean Squared Error (RMSE) of 26.472.Additionally, the model exhibited remarkable accuracy in dealing with both small and large errors, as evidenced from the low Mean Squared Logarithmic Error (MSLE) of 0.018 and the Root Mean Squared Logarithmic Error (RMSLE) of 0.136.Moreover, the GP model displayed a strong overall fit to the training data, as indicated by the high R-squared (R 2 ) value of 0.939, which signifies the model's capability to explain a substantial portion of the variance in the training data.

Figure 7 .
Figure 7. Results of proposed CART based on (a) training database and (b) testing database.Table 7 provides a comprehensive assessment of the overall performance of the best Classification and Regression Trees (CART) model in predicting the ultimate bearing capacity of shallow foundations.The evaluation was conducted on both the training and testing datasets, utilizing various metrics to gauge the model's predictive accuracy.In the training dataset, the CART model displayed a Mean Absolute Error (MAE) of 18.388 kPa, indicating that, on average, its predictions deviated by approximately 18.388 units from the actual values.The model achieved a high R-squared (R 2 ) value of 0.951, signifying its ability to explain a significant portion of the variance in the training data, thereby affirming its robust fit to the dataset.Additionally, the CART model demonstrated precise handling of errors, evidenced from the low Mean Squared Logarithmic Error (MSLE) of 0.017 and the Root Mean Squared Logarithmic Error (RMSLE) of 0.129.In the testing dataset, the CART model exhibited a slightly higher MAE of 23.583 kPa, indicating a slightly lower predictive accuracy on unseen data compared to the training dataset.Nevertheless, the model's predictive ability extended well to the testing data, as evidenced from the high R 2 value of 0.921, suggesting its effectiveness in explaining the variance in the testing dataset.Although the MSE and RMSE values of 981.006 and 31.321,respectively, were higher in the testing dataset, the model still maintained good performance, as indicated by the relatively low MSLE of 0.031 and RMSLE of 0.177.Overall, the results in Table7validate the reliability of the best CART model in predicting the bearing capacity of shallow foundations and underscore its potential as a valuable tool for

Figure 7 .
Figure 7. Results of proposed CART based on (a) training database and (b) testing database.Table 7 provides a comprehensive assessment of the overall performance of the best Classification and Regression Trees (CART) model in predicting the ultimate bearing capacity of shallow foundations.The evaluation was conducted on both the training and testing datasets, utilizing various metrics to gauge the model's predictive accuracy.In the training dataset, the CART model displayed a Mean Absolute Error (MAE) of 18.388 kPa, indicating that, on average, its predictions deviated by approximately 18.388 units from the actual values.The model achieved a high R-squared (R 2 ) value of 0.951, signifying its ability to explain a significant portion of the variance in the training data, thereby affirming its robust fit to the dataset.Additionally, the CART model demonstrated precise handling of errors, evidenced from the low Mean Squared Logarithmic Error (MSLE) of 0.017 and the Root Mean Squared Logarithmic Error (RMSLE) of 0.129.

Algorithms 2023 , 26 Figure 8 .
Figure 8.The proposed tree for prediction of qu.

Figure 8 .
Figure 8.The proposed tree for prediction of q u .

Figure 9 .
Figure 9. Results of proposed GA-ENN based on (a) training database and (b) testing database.

Figure 9 .
Figure 9. Results of proposed GA-ENN based on (a) training database and (b) testing database.

Table 1 .
Statistical information of used database after removing outliers.

Table 2 .
Distribution of training database.

Table 3 .
Distribution of testing database.

Table 4 .
Pearson matrix for input and output parameters (darkness level represents the degree of strength in the linear relationship).

Table 5 .
Overall performance of the best MLR model to predict the ultimate bearing capacity of shallow foundations for both training and testing databases.

Table 6 .
Overall performance of the best GP model to predict the ultimate bearing capacity of shallow foundations for both training and testing databases.On the testing database, the GP model demonstrated even better predictive performance, as evidenced by the reduced MAE of 16.164 kPa compared to the training database.The model's predictions exhibited decreased variability from the actual values, with MSE and RMSE values of 313.386 and 17.703, respectively.The GP model's adeptness in handling errors was consistent, as indicated by the low MSLE of 0.012 and RMSLE of 0.111.

Table 7 .
Overall performance of the best CART model to predict the ultimate bearing capacity of shallow foundations for both training and testing databases.

Table 8 .
The best developed CART model.

Table 8 .
The best developed CART model.

Table 9 .
Additional information for the best CART model.

Table 10 .
Overall performance of the best GA-ENN model to predict the ultimate bearing capacity of shallow foundations for both training and testing databases.

Table 11 .
Results of all models predict ultimate bearing capacity of shallow foundations for both training and testing databases.
Genetic Algorithm-Emotional Neural Network (GA-ENN) models have exhibited the most promising performance.

Table 12 .
Comparison of the results of this study with literature models.

Table 12 .
Comparison of the results of this study with literature models.

Table 13 .
Ranking results of variable importance for the proposed mathematical models to predict q u .