On the Use of Neuro-Swarm System to Forecast the Pile Settlement

In civil engineering applications, piles (deep foundations) are pushed into the ground in order to perform as steady support of structures. As these type of foundations are able to carry a huge amount of load, they should be carefully designed in terms of their settlement. Therefore, the control and estimation of settlement is a significant issue in pilling design and construction. The objective of the present study is to introduce a modeling process of a hybrid intelligence system namely neural network optimized by particle swarm optimization (neuro-swarm) for estimation of pile settlement. To do that, properties results of several piles socketed into rock mass together with their settlements were considered as established databased to propose neuro-swarm model. Then, several sensitivity analyses were carried out to determine the most influential particle swarm optimization parameters for pile settlement prediction. Eventually, five neuro-swarm models were constructed to understand the behavior of this hybrid model on them in pile settlement prediction. As a result, according to results of five performance indices, dataset number 4 showed the highest prediction capacity among all five datasets. The coefficient of determination (R2) and system error values of (0.851 and 0.079) and (0.892 and 0.099) were obtained respectively for train and test stages of the best neuro-swarm model which reveal the capability level of this hybrid model in predicting pile settlement. The modeling process introduced in this study can be useful for the researchers who are interested to work on the same hybrid technique.


Introduction
Structural loads are carried by rock-socketed piles via their end bearing, or their shaft resistance, or a mixture of both. As stated by Carrubba [1], the application of drilled piles socketed into rock is among the best alternatives when bedrock is overlain by layers of loose soil at shallow depths. In such getting trapped in local minima. Therefore, using optimization algorithms (OAs) such as ICA and PSO show significant gain in order to dominate these restrictions and allow promotion of models predicted by adjusting bias and weight of ANNs. Recently, considerable capacity of these algorithms drew more attentions and researchers tried to combine them in order to optimize ANN models [26,[36][37][38]. In piling technology, several studies have been conducted to solve the problems by developing PSO-ANN models. For example, Armaghani et al. [7] conducted a study of prediction of pile bearing capacity using a combination of PSO and ANN models. They predicted pile bearing capacity through the use of the most important parameters on the pile capacity and showed that the prediction performance of the PSO-ANN model is higher than the performance prediction of ANN model. A research on the load-deformation behavior of axially loaded piles was conducted by Ismail et al. [39]. They developed a PSO-ANN technique to predict the behavior of single piles embedded in soil and subjected to an axial load. Pile load, pile diameter, pile modulus, soil stiffness and pile length were used as inputs whereas pile load-deformation was considered as system output. In another study, the PSO was used to optimize ANN weights and biases in predicting lateral deflection of piles by Khari et al. [40]. They successfully indicated that their developed hybrid model is able to provide a high level of accuracy. Apart from the examples of PSO-ANN implementation studies, PSO has been used for optimization purposes in many geotechnical applications (e.g., [41][42][43]).
In this study, the development of hybrid ANN-based model namely PSO-ANN (or neuro-swarm) with detailed modeling process is presented herein for estimating the settlement of rock-socketed piles. For comparison purposes, a pre-developed ANN model is also developed to predict the settlement of piles. Then, the performance prediction of these techniques is evaluated and the best model among them is selected. The rest of this study is organized in the following way. In Section 2, the background of ANN, PSO and PSO-ANN models is described. Then, after giving some descriptions about database and study area, development process and the used models will be given in detail. Eventually, their prediction capacities are evaluated to introduce the best predictive model for estimation of the pile settlement.

Artificial Neural Network (ANN)
ANN can be formed and expanded for data processing attributable to the structure of a human brain [44]. An ANN structure comprises three major parts, namely input, output and hidden layers. In each network layer, data is transmitted among layers by binding elements, such as neurons. Then, the weights created by the system, take control of this transmission for strengthening or weakening processes. An activation function (sometimes sigmoid type) needs to be used for the calculation of the output of each neuron in the layer [45,46]. The No. of inputs determines the No. of neurons available in every layer. Typically, to obtain the number of neurons, one can implement a complicated approach or a trial-and-error process [47,48]. The back propagation (BP) training algorithm has been applied more frequently-amongst all network training algorithms-in the field of engineering [49][50][51][52]. Furthermore, network creation and determination of corresponding weight constitute ANN modeling. The BP training algorithm is implemented to regulate network weights according to the minimization of the error value [21,53,54]. A comparison is made between the desired output values with the values reached each stage. The process calumniates once the desired values are obtained and the system error is diminished [55][56][57].

Particle Swarm Optimization (PSO)
Kennedy and Eberhart [58] first developed the PSO and afterwards it has been updated several times by other researchers (e.g., [59]). Compared to GA, PSO takes advantage of high learning speed and requires less memory. In PSO, particles search the best global (G best ) and the best personal (P best ) positions to reach the superior position [60]. On the other hand, a particle moves to find the best positions (P best and G best ) in each iteration of the system. The following relations are used to obtain the velocity and position of particles.
where X and V stand for the current position and velocity of particle, respectively. X new and V new represent new position and new velocity of particle, respectively. Moreover, C 1 and C 2 are two positive acceleration constants with recommended values of 2 for each of them and w is the inertia weight (in the range of 0.25-1). In addition, r 1 and r 2 denote a random number in a range of (0, 1). The complete or better explanations of PSO details is available in the previous related studies [25,26,28]. The PSO algorithm is shown in Figure 1.

Particle Swarm Optimization (PSO)
Kennedy and Eberhart [58] first developed the PSO and afterwards it has been updated several times by other researchers (e.g., [59]). Compared to GA, PSO takes advantage of high learning speed and requires less memory. In PSO, particles search the best global (Gbest) and the best personal (Pbest) positions to reach the superior position [60]. On the other hand, a particle moves to find the best positions (Pbest and Gbest) in each iteration of the system. The following relations are used to obtain the velocity and position of particles.
V new = w * V + C 1 . r 1 (P best − X) + C 2 . r 2 (G best − X) (1) where X and V stand for the current position and velocity of particle, respectively. X new and V new represent new position and new velocity of particle, respectively. Moreover, C 1 and C 2 are two positive acceleration constants with recommended values of 2 for each of them and w is the inertia weight (in the range of 0.25-1). In addition, r 1 and r 2 denote a random number in a range of (0, 1). The complete or better explanations of PSO details is available in the previous related studies [25,26,28]. The PSO algorithm is shown in Figure 1.

Hybrid Algorithm
Researchers tried to improve ANN's efficiency in solving engineering problems by mean of various optimization algorithms such as PSO. Nevertheless, some algorithms like BP that act as a learning algorithm for local search cannot provide acceptable solution for finding ANN optimum solution [61]. However, OAs show satisfactory results in term of regulate bias and weight in ANNs in order to advancement ANN's prediction. While ANNs show more convergence at local minimums, OAs perform better in global minimum. Therefore, by utilizing ANNs composed with OAs like PSO-ANN (neuro-swarm), one can get advantages of both systems capabilities that means in first step, PSO finds global minimum and then ANN uses their result to optimize solution in local minimum.
The learning procedure of the hybrid neuro-swarm model is displayed in Figure 2. This procedure starts with the initialization of a group of random particles, during which the positions of those particles that are representative of the ANN weights and biases are assigned randomly. After this stage, considering the initial weights and biases (i.e., the initial particle positions), the hybrid neuro-swarm is run and then, the system error is calculated between the predicted and actual values. The calculated error is reduced by changing the positions of particles in each iteration. To

Hybrid Algorithm
Researchers tried to improve ANN's efficiency in solving engineering problems by mean of various optimization algorithms such as PSO. Nevertheless, some algorithms like BP that act as a learning algorithm for local search cannot provide acceptable solution for finding ANN optimum solution [61]. However, OAs show satisfactory results in term of regulate bias and weight in ANNs in order to advancement ANN's prediction. While ANNs show more convergence at local minimums, OAs perform better in global minimum. Therefore, by utilizing ANNs composed with OAs like PSO-ANN (neuro-swarm), one can get advantages of both systems capabilities that means in first step, PSO finds global minimum and then ANN uses their result to optimize solution in local minimum.
The learning procedure of the hybrid neuro-swarm model is displayed in Figure 2. This procedure starts with the initialization of a group of random particles, during which the positions of those particles that are representative of the ANN weights and biases are assigned randomly. After this stage, considering the initial weights and biases (i.e., the initial particle positions), the hybrid neuro-swarm is run and then, the system error is calculated between the predicted and actual values. The calculated error is reduced by changing the positions of particles in each iteration. To update the velocity equation, values of → P best (i.e., the lowest error obtained by each particle until that moment) and → G best (i.e., the lowest error obtained by all particles until that moment) are utilized. As a result, a value is produced with which to adjust particle positions to the best solutions. Then, a new error is achieved Appl. Sci. 2020, 10, 1904 5 of 17 considering the updated positions. Afterwards, the mentioned process continues until the termination criteria are met (which is normally system error).
update the velocity equation, values of ⃗⃗⃗⃗⃗⃗⃗⃗⃗ (i.e., the lowest error obtained by each particle until that moment) and ⃗⃗⃗⃗⃗⃗⃗⃗⃗ (i.e., the lowest error obtained by all particles until that moment) are utilized. As a result, a value is produced with which to adjust particle positions to the best solutions. Then, a new error is achieved considering the updated positions. Afterwards, the mentioned process continues until the termination criteria are met (which is normally system error).

Figure 2.
The learning procedure of neuro-swarm model.

Case study and Data Source
The Klang Valley Mass Rapid Transit (KVMRT) project which was constructed and operated in Kuala Lumpur, Malaysia, was aimed to reduce the traffic crowding in this big city. During site investigation phase of this project, for supporting purposes of the KVMRT project, it was decided to construct many of large-diameter bored piles. Part of the above piles is implemented on different kinds of rocks such as granite and limestone. Our research focuses on outcomes of 96 rock-socketed piles (built in granite rock type). Granitic rock of Triassic era, dominating the undulant terrain, was noticed in the research zone. Subsurface data and materials were accomplished in the pile locations to examine the predominant geological situations. Based on our observations, underlying subsoil contours comprised remaining soils of various rocks (typically granite). In this project, bedrock depth was attained in a range of (0.7-14 m) below the ground level. The overload soil generally contains gravely silt or sandy or silty sand or gravely. Nonetheless, the phrases below summarize borelog-related data and field surveys; (a) a range of slightly weathered rock mass to highly weathered rock mass was observed during the investigations, (b) the minimum and maximum UCS values were obtained as 25 MPa and 68 MPa, respectively (the tests were carried out based on ISRM [62]), (c) till the depth of 16.5 m, a highly weathered residual soil was observed and the SPT N-values of the main soil (hard sandy silt) were obtained in a range of (4-167) blows/300 mm.
The first step for conducting a predictive model is to prepare an appropriate database. Indeed, identification of parameters with the highest influence on model output is considered as a precondition for a predictive intelligence system. As noted above, the results of 96 pile driving analyzer (PDA) tests conducted on rock-socketed piles were taken into consideration for developing hybrid model. The tests were carried out with the help of pile driving analyzer equipment

Case Study and Data Source
The Klang Valley Mass Rapid Transit (KVMRT) project which was constructed and operated in Kuala Lumpur, Malaysia, was aimed to reduce the traffic crowding in this big city. During site investigation phase of this project, for supporting purposes of the KVMRT project, it was decided to construct many of large-diameter bored piles. Part of the above piles is implemented on different kinds of rocks such as granite and limestone. Our research focuses on outcomes of 96 rock-socketed piles (built in granite rock type). Granitic rock of Triassic era, dominating the undulant terrain, was noticed in the research zone. Subsurface data and materials were accomplished in the pile locations to examine the predominant geological situations. Based on our observations, underlying subsoil contours comprised remaining soils of various rocks (typically granite). In this project, bedrock depth was attained in a range of (0.7-14 m) below the ground level. The overload soil generally contains gravely silt or sandy or silty sand or gravely. Nonetheless, the phrases below summarize borelog-related data and field surveys; (a) a range of slightly weathered rock mass to highly weathered rock mass was observed during the investigations, (b) the minimum and maximum UCS values were obtained as 25 MPa and 68 MPa, respectively (the tests were carried out based on ISRM [62]), (c) till the depth of 16.5 m, a highly weathered residual soil was observed and the SPT N-values of the main soil (hard sandy silt) were obtained in a range of (4-167) blows/300 mm.
The first step for conducting a predictive model is to prepare an appropriate database. Indeed, identification of parameters with the highest influence on model output is considered as a precondition for a predictive intelligence system. As noted above, the results of 96 pile driving analyzer (PDA) tests conducted on rock-socketed piles were taken into consideration for developing hybrid model.
The tests were carried out with the help of pile driving analyzer equipment manufactured by Pile Dynamic, Inc. (Cleveland, OH, USA). According to previous investigations, piling geometry factors play a very important role on pile settlement. In this study, relevant parameters i.e., L s /L r (length of pile in the soil layer/length of pile in the rock layer) and L p /D (total length of pile/pile diameter), were selected to be considered as inputs. Additionally, UCS parameter was also selected as a model input because of its importance in the part of pile socketed into the rock. Another input parameter taken into consideration was SPT N-value for representing the situation of soil layer. There is no need to point out that the pile load affects the pile settlement; hence, pile ultimate bearing capacity (Q u ) was fixed as another input to predict the pile settlement (S p ). Ranges of (3.9-31.6), (0.23-32.7), (3.7-166.8), , (12,300-43,000 kN) and (5-20 mm), were obtained for L p /D, L s /L r , SPT N-value, UCS, Q u and S p , respectively. To have a better understanding on the established database, input and output parameters for all 96 datasets are shown in Figure 3. In the next section, approaching the implement of neuro-swarm system in predicting pile settlement will be described.
Appl. Sci. 2020, 10, x FOR PEER REVIEW 6 of 18 manufactured by Pile Dynamic, Inc. According to previous investigations, piling geometry factors play a very important role on pile settlement. In this study, relevant parameters i.e., Ls/Lr (length of pile in the soil layer/length of pile in the rock layer) and Lp/D (total length of pile/pile diameter), were selected to be considered as inputs. Additionally, UCS parameter was also selected as a model input because of its importance in the part of pile socketed into the rock. Another input parameter taken into consideration was SPT N-value for representing the situation of soil layer. There is no need to point out that the pile load affects the pile settlement; hence, pile ultimate bearing capacity (Qu) was fixed as another input to predict the pile settlement (Sp). Ranges of ( Figure 3. In the next section, approaching the implement of neuro-swarm system in predicting pile settlement will be described.

Model Development
After preparing and establishing the database, settlement of the piles can be predicted in a form of Sp = f [Lp/D, Ls/Lr, SPT N-value, UCS and Qu]. Therefore, this part presents the modeling procedure of ANN and neuro-swarm models in estimating the settlement of rock-socketed piles. In the following, first, an ANN model is designed and an architecture of ANN is introduced for pile

Model Development
After preparing and establishing the database, settlement of the piles can be predicted in a form of Sp = f [Lp/D, Ls/Lr, SPT N-value, UCS and Qu]. Therefore, this part presents the modeling procedure of ANN and neuro-swarm models in estimating the settlement of rock-socketed piles. In the following, first, an ANN model is designed and an architecture of ANN is introduced for pile settlement prediction.
Then, the design stages of the neuro-swarm model will describe in order to show how to solve the problem of this study.

ANN Design
This section describes the implementation of the ANN model to predict pile settlement. Prior to start ANN modeling for prediction of pile settlement, all datasets regarding training or testing feature must be separated. Nelson and Illingworth [63] based on their studies suggested that (20%-30%) of whole datasets should be considered as testing datasets. Accordingly, we allocated 20% (19 datasets) of whole datasets to testing datasets. Since prosperous applying of Levenberg-Marquardt (LM) training algorithm has noticed in many researches [64,65], it was also used in this study in order to design ANN. As an undeniable fact related to number of hidden layer, according to several researchers [29,66,67], almost all problems can be approximated through the use of only one hidden layer. On the other hand, based on results of the study conducted by Hornik et al. [68], the maximum number of nodes in a hidden layer is 2 × N input + 1, where N input is the number of model inputs. Replacing N input = 5 in this equation indicates that the problem of pile settlement can be solved by a range of 1 to 11. Therefore, several ANN models with hidden node numbers in the mentioned range were constructed and the results of analyses are shown based on coefficient of determination (R 2 ) in Table 1. The formula of R 2 is presented in follows: where, x imeas and x ipred are the measured and predicted values, respectively. x represents the average measured values. According to results of Table 1, an ANN model with only three hidden nodes performs better compared to the others. This model with R 2 of 0.809 and 0.816 for train and test stages, respectively, shows an acceptable performance prediction in forecasting pile settlement proposing ANN model. Hence, an architecture of ANN with the size of 5 × 3 × 1 was found to offer more enhanced results. Note that, in the suggested ANN architecture, 5 is the number of input parameters, 3 is the number of hidden neurons and 1 is number of output parameter.

Neuro-Swarm Design
In this part, neuro-swarm modeling procedure to predict pile settlement is presented considering the suggested ANN architecture in previous section. It is worth mentioning that neuro-swarm models were constructed using 80% of the whole database as training datasets and 20% of the whole database as testing datasets. Approaching the implement of neuro-swarm model starts by determining the number of swarm or swarm size (SS). In order to select the best SS, values of 25, 50, 75, 100, 150, 200, 250, 300, 350 and 400 were employed and through a parametric study the best one among them was selected. Table 2 tabulates the parametric study's results. In conducting this parametric study, other effective parameters of PSO were considered constant as follows (C 1 = C 2 = 2), (max iteration = 100) and (w = 0.25). Ranking system used in Table 2, was utilized according to the study conducted by Zorlu et al. [69]. According to this method, each performance index (i.e., herein root mean square error, RMSE and R 2 ) is ordered in its class. with total ranking of 33 was chosen as the best model and 400 was selected as the best SS. Note that, the equation of RMSE is presented as follows: where, x imeas and x ipred are the measured and predicted values, respectively. N is the total No. of datasets. The second parametric study is related to determination of max iteration in predicting pile settlement. The same SS values of the previous stage were used with max iteration number of 500 as shown in Figure 4. The results of this parametric study were assessed based on RMSE values. According to Figure 4, the changes of RMSE values are very small for all SS after iteration number of 400. Hence, the max iteration number was considered as 400.
Appl. Sci. 2020, 10, x FOR PEER REVIEW 8 of 18 neuro-swarm model numbers of 1-10, with rankings of 3,4,9,6,7,8,8,5,10 and 9, respectively. Therefore, as a result, model number 10 with total ranking of 33 was chosen as the best model and 400 was selected as the best SS. Note that, the equation of RMSE is presented as follows: where, and are the measured and predicted values, respectively. N is the total No. of datasets.
The second parametric study is related to determination of max iteration in predicting pile settlement. The same SS values of the previous stage were used with max iteration number of 500 as shown in Figure 4. The results of this parametric study were assessed based on RMSE values. According to Figure 4, the changes of RMSE values are very small for all SS after iteration number of 400. Hence, the max iteration number was considered as 400.  The next stage of modeling, it seems that there is a need to conduct another parametric study to determine coefficients of velocity equation. Kennedy and Eberhart [58] mentioned that the highest performance prediction of neuro-swarm model can be obtained when = = 2 with total summation of 4. Therefore, another parametric study with various combinations of and The next stage of modeling, it seems that there is a need to conduct another parametric study to determine coefficients of velocity equation. Kennedy and Eberhart [58] mentioned that the highest performance prediction of neuro-swarm model can be obtained when C 1 = C 2 = 2 with total summation of 4. Therefore, another parametric study with various combinations of C 1 and C 2 (with total summation of 4) was considered and applied in order to choose the best C 1 and C 2 . Table 3 presents the results of these combinations based on R 2 and RMSE. According to this table and also application of simple ranking by Zorlu et al. [69], model number 3 with the combination of C 1 = 1.714 and C 2 = 2.286 outperforms the other implemented models. These C 1 and C 2 values will be used for the rest of neuro-swarm models. It is worth noting that in these models, SS = 400 and max iteration of 400 obtained from the previous stages, were used. Table 3. Neuro-swarm models with different coefficient of velocity equations.

Model
No. Selection of the appreciate value for inertia weight is the next modeling stage. This has been done by using the different values of w i.e., 0.25, 0.5, 0.75 and 1 to see their prediction performance in estimating pile settlement. In conducting this parametric study, the obtained values from the previous stages have been considered and utilized. Table 4 shows results of 4 neuro-swarm models with w = 0.25, w = 0.5, w = 0.75 and w = 1 based on their performance prediction together with rankings and total rankings of each model. As a result, the highest prediction performance can be achieved when inertia weight set as 1 (model number 4). Hence, w = 1 was chosen as the best inertia weight in modeling of neuro-swarm intelligence system. Table 4. Neuro-swarm models with different inertia weights.

Model
No. As the last modeling stage of neuro-swarm system, there is a need to divide the database into 5 different trains and tests sections to see their effects on system's results. The process has been suggested in the previous studies as well [19,70]. Five neuro-swarm models have been constructed to forecast settlement of the pile according to the architecture of ANN (5 × 3 × 1), SS = 400, max iteration = 400, C 1 = 1.714 and C 2 =2.286 and w = 1. Five time running is due to the fact that the system could use the randomly selected datasets, train them and test them to see their effects on the neuro-swarm model performance. The performance predictions of these hybrid models to forecast the pile settlement will be assessed in details later.

Results and Discussion
During ANN model development, it was found that a model with three hidden neurons provides the best performance prediction for estimation of pile settlement. The R 2 values of 0.809 and 0.816 for train and test stages were achieved respectively for the best ANN model. Then, in order to propose higher performance capacity in predicting pile settlement, the effects of the most important PSO parameters on the neuro-swarm system have been investigated. It was observed that by determining of each effective PSO parameter, the prediction performance of neuro-swarm system improves based on system error and R 2 . Finally, based on the obtained values of the most significant PSO parameters, five neuro-swarm models have been built to predict pile settlement. To select the best predictive neuro-swarm model, five performance indices including variance account for (VAF), R 2 , mean absolute error (MAE), RMSE and the a20-index have been used and applied. These performance indices have been widely utilized to assess model performance in the previous related works [23,[71][72][73][74][75][76][77][78][79][80][81]. The computation formulas of VAF, MAE and a20-index are presented as follows: where, x imeas and x ipred are the measured and predicted values, respectively. N is the total No. of datasets, and m20 is the No. of samples with values of rate measured/predicted value (range between 0.8-1.2). The values of 1, 100%, 0 and 0 are required for performance indices of R 2 , VAF, RMSE and MAE, respectively, when a perfect predictive model is of interest. It is important to mention that the a20-index with value of one should be obtained for a perfect predictive model. Table 5 represents the prediction performance of 5 neuro-swam models in estimating settlement of pile. Great performances of the training datasets imply the fruitfulness of the learning process. Moreover, more accurate testing datasets results reveal the well-generalized nature of the proposed models. It should be noted that the authors split data into the training (80%) and testing (20%) datasets in this study. This combination has been suggested and used by many scholars (e.g., [65,82,83]).
Since the best neuro-swarm model in Table 5 was barely recognizable because the results of 5 models are very close to each other, the ranking system proposed by Zorlu et al. [69] has been used to recognize the best model similar to the model development stages. In this table, the total rank refers to the ranking values obtained by each datasets (i.e., training and testing), while, the final rank is a summation value of total rank for the specific datasets (train and test). Therefore, the final rank values should be used for comparison purposes of the neuro-swarm models. The best results in predicting pile settlement are related to model number 4 with final rank of 32 and total rank values of 13 and 19, respectively for train and test. Results of (0.851, 0.079, 85.056%, 0.071 and 0.649) and (0.892, 0.099, 84.738%, 0.087 and 0.474) were obtained for the performance indices of R 2 , RMSE, VAF, MAE and a20-index, respectively. Generally, these results reveal that the neuro-swarm model is capable enough to predict pile settlement. Figures 5 and 6 depict the predicted pile settlement values along with their actual values for train and test sections of the best neuro-swarm model, respectively. These two figures show the ability of the neuro-swarm model in solving pile settlement with low level of system error. It is important to say that the same database has been applied by Armaghani et al. [3] to introduce GEP model for pile settlement estimation. In that study, they successfully indicated that GEP model is able to deliver higher capacity in comparison with the MLR model. The results of GEP model based on R 2 were 0.872 and 0.861, respectively for train and test stages. Considering results of the present study, it was found that the selected neuro-swarm model is able to perform better in testing or model evaluation section compared to GEP model by Armaghani et al. [3]. This shows that neuro-swarm model is a strong technique in part of model development. On the other hand, the evaluation in the present study is based on 5 performance indices compared to the study by Armaghani et al. where 3 performance indices have been used for system evaluation. Needless to say that evaluation process of predictive techniques with more details is always of interest and advantage. In conclusion of this part, the developed neuro-swarm model is a powerful hybrid intelligence system that can solve ANN shortcomings and it can be applied for similar purpose of this study in other fields of engineering.

Conclusions
To implement a hybrid ANN model namely neuro-swarm, results of 96 PDA tests together with properties of piles and earth were considered and used. In this respect, five parameters, including

Conclusions
To implement a hybrid ANN model namely neuro-swarm, results of 96 PDA tests together with properties of piles and earth were considered and used. In this respect, five parameters, including

Conclusions
To implement a hybrid ANN model namely neuro-swarm, results of 96 PDA tests together with properties of piles and earth were considered and used. In this respect, five parameters, including L s /L r , L p /D, UCS, SPT N-value and Q u were set as inputs to forecast S p . In fact, through developing neuro-swarm model, we intend to have the best relationship of S p = f (L s /L r , L p /D, UCS, SPT N-value and Q u ). According to the most effective PSO parameters, several parametric studies have been conducted to first select the best one among them and then improve the prediction performance of neuro-swarm model in predicting pile settlement. It was found that the prediction performance of neuro-swarm is improved by fixing the effective PSO parameters in each stage. At the last modeling stage, 5 neuro-swarm models were constructed to see effects of various combinations of training and testing datasets. Then, these 5 models in estimating pile settlement were evaluated using five performance indices i.e., R 2 , RMSE, VAF, MAE and a20-index. Eventually, a slightly higher prediction of performance was recorded by dataset number 4 among all 5 neuro-swarm models. Based on R 2 , and system RMSE, values of (0.851 and 0.079) and (0.892 and 0.099) were observed for train and test stages, respectively, which confirm the high strength of neuro-swarm model in predicting pile settlement. It should be noted that the results of a pre-developed ANN model in predicting pile settlement based on R 2 were obtained as 0.809 and 0.816 for training and testing datasets, respectively, which showed a lower performance capacity of ANN model compared to neuro-swarm predictive technique. Therefore, by developing the neuro-swarm model in predicting pile settlement, the performance capacity based on R 2 can be significantly improved i.e., from 0.809 to 0.851 for training datasets and from 0.816 to 0.892 for testing datasets. To conclude, the neuro-swarm model can be successfully applied in order to solve ANN shortcomings. Approaching of the implement of neuro-swarm technique presented in this study can be used by other researchers for solving problems in fields of engineering.