Investigation on the Potential to Integrate Different Artificial Intelligence Models with Metaheuristic Algorithms for Improving River Suspended Sediment Predictions

Suspended sediment load (SLL) prediction is a significant field in hydrology and hydraulic sciences, as sedimentation processes change the soil quality. Although the adaptive neuro fuzzy system (ANFIS) and multilayer feed-forward neural network (MFNN) have been widely used to simulate hydrological variables, improving the accuracy of the above models is an important issue for hydrologists. In this article, the ANFIS and MFNN models were improved by the bat algorithm (BA) and weed algorithm (WA). Thus, the current paper introduces improved ANFIS and MFNN models: ANFIS–BA, ANFIS–WA, MFNN–BA, and MFNN–WA. The models were validated by applying river discharge, rainfall, and monthly suspended sediment load (SSL) for the Atrek basin in Iran. In addition, seven input groups were used to predict monthly SSL. The best models were identified through root-mean-square error (RMSE), Nash–Sutcliff efficiency (NSE), standard deviation ratio (RSR), percent bias (PBIAS) indices, and uncertainty analysis. For the ANFIS–BA model, RMSE and RSR varied from 1.5 to 2.5 ton/d and from 5% to 25%, respectively. In addition, a variation range of NSE was between very good and good performance (0. 75 to 0.85 and 0.85 to 1). The uncertainty analysis showed that the ANFIS–BA had more reliable performance compared to other models. Thus, the ANFIS–BA model has high potential for predicting SSL.


Introduction
Through basin management, different strategies have been applied to decrease sediment volume. Sedimentation processes and soil erosion are important in the fields of hydrology and hydraulics [1]. Sediment load relies on rainfall and runoff, as these hydrological variables can change the sedimentation 1.
To improve ANFIS and MFNN model efficiency by applying new optimization algorithms. These algorithms are used to obtain the best ANFIS and MFNN structures and parameters; 2.
To predict monthly sediment load by applying improved ANFIS and MFNN models; 3.
To examine the uncertainty of the predictions; and 4.
To obtain a sediment map of the case study.

Literature Review
Guven and Kisi [13] applied a machine learning genetic programming (MLGP) model to obtain daily-suspended load. It was observed that the MLGP model outperformed the genetic expression programming (GEP) model. The prediction error, root-mean-square error (RMSE), obtained from the MLGP model was 175 ton/d, while for the GEP model, the RMSE was 231 ton/d.
Kisi et al. [14] compared the genetic programming (GP) model with the support vector machine (SVM) and ANN models in daily-suspended load prediction. The results indicated that the GP model outperformed the SVM and ANN models.
Chaing et al. [15] compared the SVM, ANN, and linear regression models to predict suspended sediment load. The results revealed that the SVM model outperformed the ANN and linear regression Appl. Sci. 2019, 9,4149 3 of 24 models. They applied 85% of the input data for training the soft computing models, and the remaining 15% of the data were applied for testing.
Another paper studied the capabilities of SVM and ANN models to predict daily-suspended load. The suspended load was modeled by using the inputs of rainfall and streamflow. A gamma test was used to select input data [16]. It was found that the SVR model with radial basis kernel function presented more accurate outputs than other models.
Another study investigated the performance of the ANN model in forecasting sediment movement by applying the assumptions of self-filtering sewer models [17]. It was found that the ANN model outperformed SVM and GP models. Velocity and shear stress were used as input data.
Rainfall, discharge, and antecedent sediment load were used as input to the ANN model [18]. Outputs from the ANN model were benchmarked against multiple linear regression (MLR). The reported ANN model could be effectively applied to decrease the frequency of high-priced operations for sediment evaluations.
Shiau and Chen [19] applied a probabilistic system for forecasting daily-suspended loads. The accuracy of the probabilistic model was better than experimental rating curve methods. Statistical characteristics such as median, mean, and mode of the obtained probability distribution were the model outputs.
Chen and Chau [20] applied a hybrid double-forward neural network (HDFNN) model to predict suspended load. The obtained peak value of suspended sediment load (SSL) showed that the HDFNN model was more suitable than the multilayer feed-forward neural network.
A staking method was used to forecast SSL [21]. To investigate the suggested method, two case studies from the USA were considered. Streamflow and suspended sediment concentration were model inputs. The estimated outputs illustrated that the suggested method performed better than genetic programming and regression models.
In a study in Turkey, the capabilities of SVM, ANN, and ANFIS models were studied to predict suspended load [22]. The simulation was made by applying two input variables, streamflow, and suspended sediment load. Results showed that the ANFIS model could give a lower RMSE than the other models.
To estimate SSL, a multilayer perceptron (MLP) neural network model was trained by different algorithms [23]. The results indicated that the Levenberg-Marquardt algorithm could reach fast convergence. The performance of a scaled conjugate gradient (SCG) algorithm had the next priority.
Hybrid tree models were used to forecast SSL [24]. Water temperature, water discharge, and electrical conductivity were model inputs. The Friedman test was applied to determine the best models. The results indicated that the hybrid tree models outperformed simple tree models. The random subspace error pruning (RSEP) model performed worse than other models.
Kisi and Yaseen [14] applied the evolutionary fuzzy (EF) method to forecast SSL. The prediction model was created by applying discharge data. The obtained results proved the high capability of the EF model for SSL prediction. They suggested that the model could be a useful tool for river engineering.
Another study investigated the ability of the ANN method for modeling sophisticated nonlinear SSL in the Himalayan area [25]. The research not only presented an understanding of the sedimentation process, but it also studied the impact of other hydrological variables on SSL.
A three-layer back propagation model was used to estimate SSL. Rainfall, water discharge, and sediment discharge were model inputs [4]. To investigate the ability of the model, error was computed by benchmarking the observed and simulated data. It was found that the model could present accurate outputs in the prediction of SSL.
In general, the literature review showed that soft computing models are widely used for SSL prediction. However, soft computing models can be improved by combining them with optimization algorithms. In the current study, two new optimization algorithms are used to improve the ANFIS and MFNN models. Bat algorithm (BA) and weed algorithm (WA) are widely used in a number of different issues, including water resource management, water quality, hydraulic structure design, mathematical problems, power generation, hydrological simulations, and flood control [26,27]. These algorithms are widely used because of their fast convergence, simple computation process, good balance between exploration and exploitation abilities, high flexibility, accurate results, and the high capability of local and global searches. To our knowledge, the hybrid ANFIS-BA, MFNN-BA, ANFIS-WA, and MFNN-WA models have not been previously used for SSL prediction.
The current study not only predicts monthly SSL using improved ANFIS and MFNN models, but it also presents comprehensive information about the uncertainty of the prediction models. In addition, a comprehensive evaluation is carried out to study the advantages and disadvantages of the applied soft computing models.

Case Study
The Atrek River in Iran is located in the southeastern district of the Caspian Sea. This basin is located geographically at 56 01 32 to 56 01 32 E and 36 • 56 32 to 38 • 14 42 N, as shown in Figure 1. mathematical problems, power generation, hydrological simulations, and flood control [26,27]. These algorithms are widely used because of their fast convergence, simple computation process, good balance between exploration and exploitation abilities, high flexibility, accurate results, and the high capability of local and global searches. To our knowledge, the hybrid ANFIS-BA, MFNN-BA, ANFIS-WA, and MFNN-WA models have not been previously used for SSL prediction. The current study not only predicts monthly SSL using improved ANFIS and MFNN models, but it also presents comprehensive information about the uncertainty of the prediction models. In addition, a comprehensive evaluation is carried out to study the advantages and disadvantages of the applied soft computing models.

Case Study
The Atrek River in Iran is located in the southeastern district of the Caspian Sea. This basin is  Atrek Basin. Figure 2 shows average monthly rainfall. The months of April (range of rainfall: 30-150 mm) and October (range of rainfall: 12-16 mm) have the highest and lowest rainfall uncertainty among all the seasons. Spring is the wettest season, averaging 98 mm. The minimum rainfall is 24 mm in the summer. In addition, the months of June and March have the highest and lowest temperature uncertainty ( Figure 3). As observed from Figures 2 and 3, the temperature and rainfall have high variations. This shows a complex relationship between sediment load and hydrological variables. The measured discharges are shown in Figure 4. The Spring season has the most discharge variability.  This shows a complex relationship between sediment load and hydrological variables. The measured discharges are shown in Figure 4. The Spring season has the most discharge variability. The months of March and July have the highest and lowest discharge. Importantly, significant variability in the temporal and spatial distribution of temperature, rainfall, and discharge presented here has significant effects on SSL prediction. Thus, the study of temporal and spatial variability of SSL is important to enhance knowledge of the sedimentation process. The discharge variations are considered as inputs, and their effects are evaluated on the outputs or sediment load. The months of March and July have the highest and lowest discharge. Importantly, significant variability in the temporal and spatial distribution of temperature, rainfall, and discharge presented here has significant effects on SSL prediction. Thus, the study of temporal and spatial variability of SSL is important to enhance knowledge of the sedimentation process. The discharge variations are considered as inputs, and their effects are evaluated on the outputs or sediment load.

Data and Parameters Applied in Suspended Sediment Load (SSL) Prediction
The information applied here to forecast SSL in the Atrek basin contains monthly SSL, monthly discharge, and precipitation during 1997-2017. The data were obtained from the Iran water resource management organization. Seven groups of input data were applied for improved ANFIS and MFNN

Data and Parameters Applied in Suspended Sediment Load (SSL) Prediction
The information applied here to forecast SSL in the Atrek basin contains monthly SSL, monthly discharge, and precipitation during 1997-2017. The data were obtained from the Iran water resource management organization. Seven groups of input data were applied for improved ANFIS and MFNN models. A lag time of 3 to 9 months was considered for SSL prediction. The best inputs were obtained by principal component analysis (PCA). In the case of hydrological predictions, accurate knowledge of effective predictors is important for decision makers [28]. PCA is an identification method that applies orthogonal transformation to turn a collection of measurements of conceivably associated parameters into a collection of values of linearly unassociated parameters named principal components. However, there are min (m −1 , z) components when m observations and z variables are defined for data. When a component has the largest variance, the considered component is the most effective component. PCA is widely applied to determine the effective inputs for estimation of hydrological variables. Therefore, the proposed structures for the model inputs-outputs pattern were obtained from PCA, as shown in Table 1. The proposed models have been developed using Matlab 9.5 software version R2018b (MathWorks, USA) with a system 17 processor, SSD, and 64 GB RAM. On the other hand, the PCA computations have been carried out using SPSS 2018 software version 25.0.0.0 (IBM Corporation, USA). All the information and the data used in this study have been collected from water resource management authorities in Iran.

Adaptive Neuro Fuzzy System (ANFIS) Method
The structure of the ANFIS model is shown in Figure 5. There are five layers to this system [29]: 1.
The first layer calculates the membership degree. Each node produces a membership degree. The fuzzy sets apply membership functions [29].
where x and y are outputs, A i and B i are linguistic labels, and µ Ai and µ Bi−2 are the degree of membership function for A i and B i , respectively.

2.
The output of the second layer (fire strengths) is computed based on the computed membership degrees. In fact, membership functions of the previous layer are compounded together to generate the firing strengths.
where O 2i is the output of this layer called fire strength.

3.
The firing strengths are normalized in this level. The contribution of the firing strengths is computed by the constant nodes: Appl. Sci. 2019, 9, 4149 8 of 24 4. In this layer, consequent parameters are used to determine the proportion of the ith rule to the overall outcomes: where p i , q i , and r i are consequent parameters.

5.
This layer uses summation of input signals to obtain the overall output.
A previous study showed that a Gaussian membership function led to accurate outputs [30]. Thus, the current study uses this membership function [29]: where c i and σ i are the parameters for the membership function.

Feed-Forward Neural Network (FNN)
The MFNN was used for the current study. This method is widely used for predicting different hydrological variables. This method generally includes three layers between inputs and outputs [31]. The first layer includes Z inputs (z = 1, 2, 3, …, Z). The second layer is known as the hidden layer and includes Q inputs (q = 1, 2, 3, …, Q). The weight parameters (wzk, z = 1, 2, 3, …, Z, k = 1, 2, …, K) and (wqk, q = 1, 2, 3, …, Q, k = 1, 2, …, K) connect the input layer to the hidden layer and the hidden layer to the output layer, respectively. Each input neuron to the hidden layer (tq) generates an output (vq). dk and yq are the input neuron and total output of the output layer.
where f(.) determines the transfer function. This function is used to investigate the connections

Feed-Forward Neural Network (FNN)
The MFNN was used for the current study. This method is widely used for predicting different hydrological variables. This method generally includes three layers between inputs and outputs [31]. The first layer includes Z inputs (z = 1, 2, 3, . . . , Z). The second layer is known as the hidden layer and includes Q inputs (q = 1, 2, 3, . . . , Q). The weight parameters (w zk , z = 1, 2, 3, . . . , Z, k = 1, 2, . . . , K) and (w qk , q = 1, 2, 3, . . . , Q, k = 1, 2, . . . , K) connect the input layer to the hidden layer and the hidden layer to the output layer, respectively. Each input neuron to the hidden layer (t q ) generates an output (v q ). d k and y q are the input neuron and total output of the output layer.
where f (.) determines the transfer function. This function is used to investigate the connections between the inputs and outputs.

ANFIS and Multilayer FNN (MFNN) Models and Optimization Algorithms
While the ANFIS model has a good ability to obtain the output from the inputs as a soft computing model, a long and complex computation process is required to find the best values of the consequent and MF (membership function) parameters. The MFNN model has a few unknown components: number of neurons in the hidden layer weights, biases, and activation functions (either per layer or per neuron).
Thus, the optimization algorithms are used for computing ANFIS and MFNN parameters. A comprehensive explanation is presented for each algorithm based on the following parts: 1.
Bat algorithm (BA): BA is an optimization algorithm that acts based on echolocation characteristics. Each of the flying bats has a random velocity and flies at random positions. When it is searching for prey, its loudness, frequency, and pulsation rates are widely varied. A local random walk is used to increase the search ability [26]. The velocity and position components are used to obtain the optimal solutions. Each position is a candidate solution. Thus, the best position is the best solution while the algorithm should avoid trapping in the local optimums ( Figure 6). In fact, bats generate sounds that are returned from the surroundings. They can differentiate an obstacle from prey based on returned frequencies [26]. The velocity, frequency, and position are defined as: where f l is frequency, f min is minimum frequency, f max is maximum frequency, v l (t) is velocity, x l (t) is position, and x * is the best position. Equation (9) defines the local search applying a random walk [30]: where A t is loudness and ε is a random value. The pulsation and loudness rate are varied during computational levels. If a bat finds its prey, the pulsation (r l ) and loudness rate have an increasing and decreasing trend, respectively. The quality solutions are identified by the objective function computation ( Figure 6).

2.
Weed algorithm (WA): Weeds attempt to find the best growth position. Among all the characteristics of this optimization algorithm, its easy structure, high accuracy, and few random parameters make it useful for practice in optimization problems.
WA has different levels as follows [27]: • Initialization: The initial population is distributed randomly in a d dimensional problem space. The locations of weeds are considered as the decision-making components. • Reproduction: The weeds generate a particular number of seeds. The number of seeds varies from S min (minimum number of seeds) to S max (maximum number of seeds). The number of seeds is S max if a weed has the best objective function value. Although the quality of some weeds is not good, the reproduction process allows them to have a chance again for continuation of life. This issue is important because some of them may have important information. If they do not have another chance, important information is eliminated from the algorithm cycle.

•
Competitive level: The combination of weeds produces the next weed generation. If the population exceeds a threshold, the weeds with low quality are eliminated compared to weeds with high quality.

•
Termination level: The algorithm finishes when the number of algorithm levels of iteration (Iter) reaches the maximum number of iterations (iter-max), which has been proposed to be 1000 iterations. (Figure 7).   1. Percent bias: 3. Nash-Sutcliff efficiency

Sensitivity Analysis for Optimization Algorithms
To apply the optimization algorithms to the ANFIS and MFNN models, accurate adjustment of The following indices are used to evaluate different models [31,32]:

Standard deviation of RMSE observations
3. Nash-Sutcliff efficiency 4. RMSE (root-mean-square error) Y obs i is observed data, Y sim i is simulated data, N is the number of data, and Y mean is the average value of the data.

Sensitivity Analysis for Optimization Algorithms
To apply the optimization algorithms to the ANFIS and MFNN models, accurate adjustment of random parameters must be obtained a priori. As in the evolutionary algorithms, the BA and WA are fully sensitive to initialize random parameters. In fact, the random parameters change the accuracy of optimization algorithms. Thus, it is necessary to find the optimal value of random parameters so that the objective function values (RMSE in this study) converges to the lowest value.
Hence, to obtain better efficiency of the BA and WA, it is necessary to set the random parameters. As the WA and BA parameters were various, to create confidence that a suitable adjustment of random parameters was used, different experiments were used for generating the sensitivity analysis. The value of desired parameters was gradually changed, and the analysis was accomplished. The default values of other random parameters were constant when the user varied one parameter. The algorithm variations were examined based on the mean of the objective function value (i.e., RMSE). One sample of implanted sensitivity analysis to obtain proper adjustment of parameters systematically as the consecutive levels of needed analysis is shown in Table 2. In fact, Table 2 is a sample to show how to obtain the optimal value of random parameters. First, a population size, MaxF, MinF, MaxA, and MinA (1) were selected, and an objective function was computed for this combination. Then, MaxF, MinF, and MaxA with next value of MinA (3) were applied to compute the objective function value for the next combination. When all of the available MaxF, MinF, and MaxA with different values of MinF were generated, and the objective function value for each combination was generated, Max f was changed to the next value (0.5). Then, the available combination was generated for the previous population size (10), MaxF (0.5), the previous MinF (0.1), and the previous MaxA (3) with MinA (1). Then, all available combinations for SP:10, MaxF (0.5), MinF (0.1), MaxA (3), and Min A from 1 to 4 were generated, and the objective function value was computed. This process was similarly repeated for the other population sizes and parameter values. Each combination had the lowest value for the objective function, and the best value of random parameters can be seen in Table 3. In fact, the parameters in Table 3 are the optimal values of random parameters that generated the lowest value of the objective function.
It should be mentioned that the best value of the parameters occurred at the lowest objective function value. RMSE or objective function had small error index values and were suitable for the simulations. Finally, Table 3 reports the best values of BA and WA parameters based on the above process.

Performance Index Analysis
The ANFIS-BA, ANFIS-WA, MFNN-BA, and MFNN-WA models were employed to predict monthly SSL. Figure 9 was generated based on the following performance indices: RMSE, Nash-Sutcliff efficiency (NSE), standard deviation ratio (RSR), and percent bias (PBIAS). In order to present the performances of these models, monthly SSL data were predicted utilizing the proposed models, and the monthly error values were computed based on the suggested performance indicator. Recall that the SSL predictions were achieved based not only on different model types but also on different input patterns as a model structure. Figure 9 was generated in this paper in order to simplify the comparison process and to present comprehensive information about the performance of each model. Since seven different model input combinations were considered, there were seven different error patterns for each model. The gradient charts were produced based on the lowest and highest error values of the attained errors, and then a classification for the error domain was considered. Finally, variations of error values were shown by varying the intensity of colors. In fact, Figure 9 was generated based on the application of each input combination and the computation of each index for each predicated SSL output. The vertical axis shows the number of input combinations: 1. SSL (suspended sediment load) (lag time t-3: three months ago, number: 1); 2. Q (discharge), SSQ (lag time t-3: three months ago, number: 2); 3. SSQ, R (rainfall) (lag time t-3: three months ago, number: 3); In fact, Figure 9 was generated based on the application of each input combination and the computation of each index for each predicated SSL output. The vertical axis shows the number of input combinations:
The results for the models were compared with the observed data, and the statistical results were reported.
1. RMSE (Figure 10) The range of RMSE values varied from 1.5 to 5.5 ton/d. The ANFIS-BA had the best results since RMSE varied from 1.5 to 3.5 to a large extent (Figure 10). For the seventh combination, SSQ, Q, and R (time lag: t-9), it can be understood that during testing and training levels, the ANFIS-BA resulted in minimal RMSE. For the first combination, SSQ (time lag: t-3), it can be understood that during testing and training levels, the ANFIS-BA had the weakest performance among the ANFIS-BA models with different combinations. For the ANFIS-WA, the range of RMSE values varied from 2.5 to 4.5 ton/d. After ANFIS-BA, with different combinations, ANFIS-WA had the next priority for predicting SSL. For the MFNN-WA, it had the weakest performance since RMSE varied from 3.5 to 5.5. It should be mentioned that during the testing level, the MFNN-WA resulted in maximal RMSE (5.5). In all models, the seventh and first combination had the best and weakest performances, respectively. 5.5. It should be mentioned that during the testing level, the MFNN-WA resulted in maximal RMSE (5.5). In all models, the seventh and first combination had the best and weakest performances, respectively. 2. RSR ( Figure 11) RSR range varied from 0.05 to 0.25 ( Figure 11). The small values showed the best performance of the models. For the ANFIS-BA, the RSR range varied from 0 to 0.60. Thus, it had the best 2. RSR ( Figure 11) RSR range varied from 0.05 to 0.25 ( Figure 11). The small values showed the best performance of the models. For the ANFIS-BA, the RSR range varied from 0 to 0.60. Thus, it had the best performance among the models. After the ANFIS-BA, the ANFIS-WA and MFNN-BA had the best and weakest performances, respectively. The fifth, sixth, and seventh combinations had the same components, but the lag time was different for those models. It can be seen (Figure 9) that increasing the lag time improved the results for all models. As can be observed, the MFNN-BA model had better performance than the MFNN-WA model, since most of the results has smaller RSRs than that of the MFNN-WA model. 3. NSE and PBIAS (Figures 12 and 13) There are general efficiency ratings for NSE and PBIAS as follows [20]: It can be seen that the performance of ANFIS-BA was very good and good. In fact, NSE and PBIAS were separately computed for each month and then classified based on these efficiency ratings. To a large extent, the MFNN-WA model had unsatisfactory results. However, the MFNN-BA had a better performance than the other MFNN-WA models because the good and satisfactory ratings for It can be seen that the performance of ANFIS-BA was very good and good. In fact, NSE and PBIAS were separately computed for each month and then classified based on these efficiency ratings. To a large extent, the MFNN-WA model had unsatisfactory results. However, the MFNN-BA had a better performance than the other MFNN-WA models because the good and satisfactory ratings for the MFNN-BA model were more than those of the MFNN-WA model. Generally, the training level had a better performance than the testing level (Figures 12 and 13).
Previous studies showed the application of the ANFIS model with other optimization algorithms [15]. For example, a genetic algorithm was used to improve the ANFIS model, and SSL was simulated. In fact, the ANFIS parameters were considered as the initial population of chromosomes [33]. Another study used the ANFIS model with particle swarm optimization (ANFIS-PSO) [34]. In fact, the ANFIS parameters were considered as the particle positions. Then, the velocity and position were updated for each level. The results of the current paper were compared to the ANFIS-PSO and ANFIS-GA models developed in previous studies, as seen in Figure 14. The Taylor diagram shows better performance of the ANFIS-BA model compared to the ANFIS-PSO and ANFIS-GA models. The Taylor diagram is based on standard deviation, correlation coefficient, and normalized RMSE. The ANFIS-BA model was closer to the observed or reference point compared to the other models. Thus, it had better performance than the other models.  Previous studies showed the application of the ANFIS model with other optimization algorithms [15]. For example, a genetic algorithm was used to improve the ANFIS model, and SSL was simulated. In fact, the ANFIS parameters were considered as the initial population of chromosomes [33]. Another study used the ANFIS model with particle swarm optimization (ANFIS-PSO) [34]. In fact, the ANFIS parameters were considered as the particle positions. Then, the velocity and position were updated for each level. The results of the current paper were compared to the ANFIS-PSO and ANFIS-GA models developed in previous studies, as seen in Figure 14. The Taylor diagram shows better performance of the ANFIS-BA model compared to the ANFIS-PSO and ANFIS-GA models. The Taylor diagram is based on standard deviation, correlation coefficient, and normalized RMSE. The ANFIS-BA model was closer to the observed or reference point compared to the other models. Thus, it had better performance than the other models. Previous studies showed the application of the ANFIS model with other optimization algorithms [15]. For example, a genetic algorithm was used to improve the ANFIS model, and SSL was simulated. In fact, the ANFIS parameters were considered as the initial population of chromosomes [33]. Another study used the ANFIS model with particle swarm optimization (ANFIS-PSO) [34]. In fact, the ANFIS parameters were considered as the particle positions. Then, the velocity and position were updated for each level. The results of the current paper were compared to the ANFIS-PSO and ANFIS-GA models developed in previous studies, as seen in Figure 14. The Taylor diagram shows better performance of the ANFIS-BA model compared to the ANFIS-PSO and ANFIS-GA models. The Taylor diagram is based on standard deviation, correlation coefficient, and normalized RMSE. The ANFIS-BA model was closer to the observed or reference point compared to the other models. Thus, it had better performance than the other models.

Discussion of Results
From the previous investigations, it can be noticed that the soft computing models are in the classification of box or semi box models. Thus, this results in complexity in the uncertainty of the study. In soft computing models, input data are applied to minimize the model's errors in the training (calibration) stage. In fact, the model's errors are dependent on the system's input data. Thus, it is important to examine the model's efficiency to determine the soft computing model's uncertainty generated by variations in initial data. In this article, a specified percentage of inputs was used to calibrate the soft computing models. The number of 2000 iterations was considered appropriate to carry out the input sampling cycle. For each random instance, 2000 model parameters were generated from calibrated patterns. In this method, uncertainty of all input data (e.g., precipitation, discharge, and SSL) were considered in the evaluation of model performances. Two important indices quantified the uncertainty values. The p index, percentage of measured data bracketed by 95% (95 PPU) uncertainty, was applied to measure the uncertainty values. First, the cumulative distribution was generated for the final outputs. The 95 PPU was computed at the lower (2.5%) and upper (97.5%) bounds of the obtained distribution. A p value of 1 showed that the model had a good performance. Average thickness of the 95 PPU was used to determine the r index. This index was computed by dividing the average thickness by the standard deviation of the measured data.

p index:
The extent of p indices varied from 0.6 to 0.90. As can be seen in Figure 11, the p values of 0.8 and 0.9 covered a large extent of the ANFIS-BA model. As observed in Figure 15, all models had the lowest p values for the first and second combination. The MFNN models had more uncertainty because the p values decreased with MFNN models. The MFNN-WA models had the worst performance because the p value varied from 0.6 to 0.7 to a large extent in these models.

r indices
The average thickness of 95 PPU for the ANFIS-BA model was less than that of the other models because the r value varied from 0.15 to 0.20. A large extent of the MFNN-WA model had r values of 0.25 and 0.30 and thus had more uncertainty compared to the ANFIS-BA models. Generally, the previous results and uncertainty results showed that the ANFIS-BA and ANFIS-WA models had the best performances among all models.
Although different studies show that regression and ANN models have successful performances [35], this study showed that the ANFIS models were better than the ANN models. In addition, the results were different when different optimization algorithms were used to improve the ANFIS and ANN models. This study showed that dependence on the performance of the ANFIS and ANN models was high for SLL prediction. The general results indicated that the soft computing models with BA had better results than the prediction models with WA. The BA could accurately find the ANFIS and MFNN parameters. Thus, the selection of simple and accurate algorithms is important for soft computing models. The magnified maps were generated by an inverse distance weight (IDW) interpolation ( Figure 16). Seven combinations were used for the soft computing models because the previous results showed that the combinations mentioned had the best performance among all models. The kappa index was used to evaluate the agreement between the simulated and observed maps. The improved ANFIS and MFNN models had better performances than the standalone ANFIS and MFNN models. In addition, the ANFIS-BA model generated more accurate results (kappa: 0.90) compared to the other models used. Although different studies show that regression and ANN models have successful performances [35], this study showed that the ANFIS models were better than the ANN models. In addition, the results were different when different optimization algorithms were used to improve the ANFIS and ANN models. This study showed that dependence on the performance of the ANFIS and ANN models was high for SLL prediction. The general results indicated that the soft computing models with BA had better results than the prediction models with WA. The BA could accurately find the ANFIS and MFNN parameters. Thus, the selection of simple and accurate algorithms is important for soft computing models. The magnified maps were generated by an inverse distance weight (IDW) interpolation ( Figure 16). Seven combinations were used for the soft computing models because the previous results showed that the combinations mentioned had the best performance among all models. The kappa index was used to evaluate the agreement between the simulated and observed maps. The improved ANFIS and MFNN models had better performances than the standalone ANFIS and MFNN models. In addition, the ANFIS-BA model generated more accurate results (kappa: 0.90) compared to the other models used.

Conclusions
In this paper, the efficiencies of the soft computing models ANFIS-BA, ANFIS-WA, MFNN-BA, and MFNN-WA were examined for SSL prediction for the Atrek basin in Iran. The results were compared with single ANFIS and MFNN models and indicated that the ANFIS-BA and ANFIS-WA models had good performances for SSL prediction, being the best and the second-best ranked models, respectively. The ANFIS-BA model outperformed the other models based on statistical indices and uncertainty analysis. The findings of this article show good performances of BA and WA as optimization algorithms to enhance the efficiency of the ANFIS model for SSL prediction. Regarding the outputs, future investigations can consider the performance of multiobjective algorithms for improving soft computing models. In addition, the current version of the BA and WA can be modified to have a better performance during the simulation process. The applied models had uncertainty in

Conclusions
In this paper, the efficiencies of the soft computing models ANFIS-BA, ANFIS-WA, MFNN-BA, and MFNN-WA were examined for SSL prediction for the Atrek basin in Iran. The results were compared with single ANFIS and MFNN models and indicated that the ANFIS-BA and ANFIS-WA models had good performances for SSL prediction, being the best and the second-best ranked models, respectively. The ANFIS-BA model outperformed the other models based on statistical indices and uncertainty analysis. The findings of this article show good performances of BA and WA as optimization algorithms to enhance the efficiency of the ANFIS model for SSL prediction. Regarding the outputs, future investigations can consider the performance of multiobjective algorithms for improving soft computing models. In addition, the current version of the BA and WA can be modified to have a better performance during the simulation process. The applied models had uncertainty in the simulation, and uncertainty computations showed that the ANFIS-BA had the best results with a lower uncertainty value. However, the evaluation of models can be completed when the models are evaluated in climate change conditions. In addition, the results of the models can be more comprehensive if the results are compared to the outputs of empirical models.