Experimental and Artiﬁcial Neural Network (ANN) Modeling of Instream Vegetation Hydrodynamic Resistance

: This study examines the impact of upstream structures on the bulk drag coefﬁcient of vegetation through experimental means, which has not been previously conducted. An embankment model was placed upstream of the vegetation, both with and without a moat/depression. The results showed that the presence of an upstream structure reduced the bulk drag coefﬁcient of vegetation as the structure shared the drag. When only the embankment was placed upstream, a maximum decrease of 11% in the bulk drag coefﬁcient was observed. However, when both the embankment and moat models were placed upstream, a 20% decrease in the bulk drag coefﬁcient was observed. Regression models and artiﬁcial neural network (ANN) models were developed to predict the bulk drag coefﬁcient based on the variables affecting it. Five ANN models with different training functions were compared to ﬁnd the best possible training function, with performance indicators such as coefﬁcient of determination ( R 2 ), root mean square error ( RMSE ), Nash–Sutcliffe efﬁciency ( NSE ), sum of square error ( SSE ), mean absolute error ( MAE ), and Taylor’s diagrams used to evaluate the model performance. The ANN model with nine neurons in each hidden layer performed the best, achieving the highest R 2 and NSE values and the lowest RMSE , SSE, and MAE values. Finally, the comparison between the regression model and the ANN model showed that the best ANN model outperformed the regression models, achieving R 2 values of 0.99 and 0.98 for the training and validation subsets, respectively.


Introduction
The vegetation along the bank of a river can affect many physical processes of flow in an open channel [1,2].The presence of vegetation may induce resistance [3,4], reduce flow [5,6], modify velocity profiles [7], and decrease flow energy [8,9].The effect of vegetation on flow hydraulics is mainly determined by the resistance induced by the vegetation [10,11], as experimental and numerical investigations by various researchers have demonstrated [12,13].The drag coefficient is commonly used as a quantifying parameter to represent the drag force induced by the vegetation.Likewise, researchers usually consider experimental studies focusing on the effect of vegetation in open-channel flows, employing rigid cylinders made of wood or steel to achieve geometrical similarity [14][15][16].
Researchers have defined natural vegetation varying in density in different ways in the past.For instance, some studies defined the density of vegetation by the ratio of the spacing between the two cylinders in the cross-stream direction of the channel to the diameter of cylinders [8,17], and others calculated it by the formula λ = π/(2 √ 3) d ˆ2/s, where λ is the density of vegetation, d is the diameter of the cylinder, and s is the center-to-center spacing between two cylinders [3,15].Previously, drag force has been represented by the drag coefficient of isolated cylinders [15,18,19] or bulk drag coefficient of vegetation patch rather than by isolated cylinders [20][21][22].
The computation of vegetation resistance is challenging due to the variation in density and geometric characteristics of rigid natural vegetation in the flood plain.For modeling purposes, rigid cylinders enable an accurate representation of the geometric characteristics of natural vegetation [23].The relationships between the drag coefficient of vegetation and other related parameters, i.e., the density of vegetation, vegetation pattern, and depthaveraged flow velocity, were analyzed theoretically in the past [24].The drag force on isolated rigid vegetation models placed in an open-channel flume has been analyzed, finding a decreasing trend of the drag coefficient with the square of the flow velocity [25].Furthermore, the effects of the Reynolds number have been studied, finding a direct proportionality with the drag coefficient [26].Wu [27] suggested that, based on flume experiments, the drag coefficient increased with the increase in the relative water depth under different densities of vegetation.Previously, the drag coefficient was calculated by considering force equilibrium, and it was found that the submergence ratio of vegetation greatly affects the vegetation elements' hydrodynamic drag [28].A similar approach is adopted in the current study.
Composite defense structures have been recently introduced as resilient means to prevent the loss of human life and property due to catastrophic floods and tsunamis.For example, the composite/hybrid defense system comprising of embankment and vegetation [29], embankment, moat, and vegetation [8,30], and backward-facing steps with vegetation [31] have been investigated previously.Therefore, there is an immense need to examine the effect of an upstream structure on the drag coefficient of vegetation, which is currently missing in the literature.Moreover, the regression models utilized to develop mathematical equations in estimating hydraulic variables, such as drag coefficient, in vegetated channels are found to either over-or underestimate the hydraulic variables [32].Nevertheless, in the modern world, soft-computing techniques such as artificial neural networks (ANN) are gaining popularity among researchers because of their ability to model complex nonlinear interactions between input and output parameters [32].ANN models have been widely used in the fields of hydraulics, hydrology, and water-resource engineering to predict physical variables, e.g., water levels, velocity, discharge coefficient, the drag coefficient of vegetated flow, and hydraulic jump [33][34][35].
The objective of the current study is to investigate the effect of an upstream structure on the bulk drag coefficient of vegetation and to develop regression and artificial neural network (ANN) models to predict the bulk drag coefficient.The paper also aims to compare the performance of the ANN models with different training functions and to compare the ANN models with the regression models.The study seeks to determine the optimal upstream structure to decrease the bulk drag coefficient of vegetation and to evaluate the performance of the developed models using various performance indicators such as R 2 , RMSE, NSE, SSE, MAE, and Taylor's diagrams.
In the current study, the drag coefficient of rigid vegetation was determined by experimentally computing the bulk drag coefficient of rigid emergent vegetation in the laboratory flume.Then, regression models were developed based on the variables affecting the drag coefficient to predict C BD .Experimentally, the drag force (F D ) exerted by the flow on rigid vegetation was calculated by calculating the force of the water on the upstream (F 1 ) and downstream (F 2 ) faces of the emergent rigid vegetation patch and inside the vegetation patch (F 3 ).The resultant drag force was further used to calculate the bulk drag coefficient (C BD ).The effect of an upstream defense structure on C BD was examined against the composite defense structure comprising an embankment and vegetation (EV) and the combination of an embankment, moat/depression, and vegetation (EMV).

Flow Resistance of Vegetation
In order to estimate flow resistance for a steady subcritical flow through an emergent vegetation patch, the summation of all the forces on the upstream (F 1 ), downstream (F 2 ), and inside the vegetation patch (F 3 ) can be found.In a two-dimensional flow, the forces F 1 and F 2 are the hydrodynamic forces acting on the upstream and downstream faces of vegetation, respectively, while F 3 is the product of the mass of water flowing per second and change in velocity, which is the vegetation resistance force acting in the opposite direction to F 1 in the control volume section.Figure 1a,b show an example of this process.Authors Liu et al. [36] used a similar method to calculate the drag force acting on rigid vegetation under subcritical conditions.Equations ( 1) and ( 2) were used to express the force-balance equation along the x direction in their study, neglecting the side wall and bottom effect, as previous studies have [36]: and where F D = Net resistive force caused by the number of stems (n) in staggered strip 1 (SS1 in Figure 1c).The first two rows make the staggered arrangement (strip 1), and the other two are behind the first strip [37].Hence, maximum force and corresponding velocity are applied on the first strip of the vegetation arrangement, and the vegetation in other strips has a lower velocity and drag force, as also observed in previous studies [38].Therefore, the number of vegetation elements (n) in strip 1 is multiplied by F 1 , F 2 , and F 3 (Equation ( 2)), γ = specific volume of water (0.001 m 3 /kg), ρ = density of water (1000 kg/m 3 ), q = specific discharge (m 2 /s), i.e., the discharge per unit width, h 1 and h 2 are the water depths on the upstream and downstream side of vegetation, respectively, yi is the water depth inside the vegetation patch of the ith row, and V 1 and V 2 are the velocities on the upstream and downstream side of the vegetation patch, respectively.The drag force in the stream-wise direction for the vegetation stem can be calculated by Equation (3), which was also used previously [12,14].
Replacing the drag force in Equation (3) with that from Equation (2), Equation ( 4) is obtained, for calculating C BD as shown: where V v is the average pore velocity is close to the actual approach velocity, calculated from the measured flow discharges and λ, i.e.; s 2 , where Q = flow discharge, B = channel width, h = flow depth, s = center-to-center distance between the vegetation stems (Figure 1c), λ = average solid fraction defined as the fraction of the stem-occupied bed area, which represents the density of vegetation [3,15], and A f = the frontal area of each row of vegetation.Equation ( 4) is used to calculate C BD for all the cases presented in Table 1.

Dimensional Analysis
In this study, some basic physical quantities affecting the bulk drag coefficient were determined as (Equation ( 5)) where h o = the depth of flow without any model placed in the channel, V o = the depth average velocity without any model placed in the channel, Wv = the width of the vegetation patch, Re = V v d/υ is the Reynolds number, where V v is the average flow velocity through the vegetation, d is the diameter of each cylinder used to simulate the vegetation, and υ is the kinematic viscosity of water (1.004 × 10 −6 m 2 /s), which is the ratio of the viscosity of water to the density of water, and λ = density of vegetation.

Flow Conditions
The paper utilized Equation ( 4) to calculate the bulk drag coefficient (C BD ) for different flood defense systems.The experimental setup involved two steps.In the first step, the flume was run without any model to record the initial conditions of flow depth (h o ) and depth-averaged velocity (V o ).In the second step, the models were placed in the flume, and different cases were tested to calculate C BD .These cases included embankment with vegetation of density λ1 (EVλ1), embankment with vegetation of density λ2 (EVλ2), embankment and moat with vegetation of density λ1 (EMVλ1), and embankment and moat with vegetation of density λ2 (EMVλ2).Moreover, the flow structure of OV, EV, and EMV are shown in Figures 1a-c and 2a,b, and the experimental setup is shown in Figure 3a-c.The study considered two densities of vegetation, i.e., 0.025 (OVλ1) and 0.062 (OVλ2).The Froude numbers considered were 0.40, 0.44, 0.50, 0.57, 0.60, 0.63, and 0.65, with corresponding depths of flow of 4.5 cm, 5.3 cm, 6.8 cm, 7.1 cm, 7.7 cm, 8.2 cm, and 8.5 cm, respectively.The paper measured flow (Q) with an electromagnetic flow meter attached to the channel, and depth-average velocity (V) was calculated by the relation Q = AV.The depth was measured using a rail-mounted adjustable point gauge with the least count of 1 mm.The bed slope of the channel was considered negligible, and the physical modeling scale of 1:100 was used for all the cases mentioned in Table 1.

Vegetation Conditions
Previously, the drag coefficient was calculated by placing flood defense models in an open channel flume of various dimensions (length × width) e.g., 15 m × 0 [40] and 10 m × 0.41 m [41].The rectangular glass flume used in the current study was 10 m long, 0.5 m deep, and 0.31 m wide.The rigid vegetation was modeled as steel rods embedded in a wooden bed, which was placed inside the flume at a right angle to the flow direction, covering almost the full width of the channel.Two different values of vegetation density (λ) were used to examine their effects, with densities of 0.025 and 0.062, and the spacing between vegetation elements in the cross-stream direction of the flume was 1.88 cm and 1.25 cm, respectively.The diameter of vegetation used for all cases is 0.3 cm.Previous studies have used open-channel flumes of various dimensions to calculate the drag coefficient by placing flood defense models.The width of vegetation (W v ) was calculated by considering the thickness of vegetation dn = 180 No.cm, where dn = 2 , which is the cumulative diameter of trees and is defined as the product of the diameter of trees and the number of trees in a rectangle with a frontage of unit length along the bank of the river and depth equal to the width of vegetation (W v ); the same formula has been used in the past [37].The selected vegetation pattern, Eucalyptus trees in a staggered arrangement, was based on their average height and diameter in the ecological zone of Punjab province, Pakistan [42], and the use of steel rods as a model is consistent with the 1:100 scale used in the study.Previous research has shown that the drag force behavior is different in grid and staggered vegetation patterns, with some studies finding that staggered arrangements result in higher drag forces and greater effectiveness than grid arrangements [43,44].Therefore, the staggered arrangement was chosen for the current study.
Froude numbers considered were 0.40, 0.44, 0.50, 0.57, 0.60, 0.63, and 0.65, with corresponding depths of flow of 4.5 cm, 5.3 cm, 6.8 cm, 7.1 cm, 7.7 cm, 8.2 cm, and 8.5 cm, respectively.The paper measured flow (Q) with an electromagnetic flow meter attached to the channel, and depth-average velocity (V) was calculated by the relation Q = AV.The depth was measured using a rail-mounted adjustable point gauge with the least count of 1 mm.The bed slope of the channel was considered negligible, and the physical modeling scale of 1:100 was used for all the cases mentioned in Table 1.

L dv Embankment
Critical depth

Frontal Area of Trees (A f )
To calculate the frontal area of trees, the current study followed the approach used by Liu et al. [36], which involves considering the diameter of stems, the frontal area of stems, and the number of stems.The frontal area is defined as the sum of all the areas covered by the water depth in front of each row of vegetation patches.The equation used to calculate the frontal area of trees is: where n j and n k are the number of vegetation elements in odd and even rows, h j and h k are the depths of flow in front of odd and even rows, respectively, and d is the diameter (0.3 cm) which is kept constant for all the cases mentioned in Table 1.The water depth A f and h 2 were measured by a rail-mounted point gauge, and the flow depths in front of each row inside the vegetation patch were calculated by interpolation.Due to the staggered arrangement of the vegetation, the number of vegetation elements in even rows (n k ) is n j − 1.

Artificial Neural Network (ANN)
Artificial neural networks (ANNs) are a type of artificial intelligence that simulates the function of human neurons [45].ANNs have become increasingly popular for hydrologic modeling and solving problems in engineering and other applied sciences [46].In this study, five ANN models were tested to determine the best model for predicting C BD , with the input and output values presented in Table 2.While some previous studies have used only training and validation sections for processing ANN models [47,48], others have included training, testing, and validation sections [49,50].For this study, 67% of the experimentally obtained data were used for validation, while the remaining 33% were used for training the ANN models.The results of the best model are discussed in the parametric study section below.The multilayer perceptron (MLP) is a feed-forward network with layers of neurons that include input, hidden, and output layers.Neurons in the input layer buffer input signals to allocate to neurons in the hidden layer.The flow chart of the feed-forward ANN model is shown in Figure 4.

Model Performance Evaluation Criteria
The performance of the developed ANN models was tested by statistical error measures, including the coefficient of determination (R 2 ), root mean square error (RMSE), the sum of square error (SSE), the Nash-Sutcliffe model efficiency (NSE), and the mean absolute error (MAE).The formulae of R 2 , RMSE, SSE, NSE, and MAE are given in Equations ( 7)- (11).
The literature indicates that the value of R 2 ranges from 0 to 1, where the value 0 implies no correlation and a value of 1 suggests that the model can explain all of the observed variance [48].

Model Performance Evaluation Criteria
The performance of the developed ANN models was tested by statistical error measures, including the coefficient of determination (R 2 ), root mean square error (RMSE), the sum of square error (SSE), the Nash-Sutcliffe model efficiency (NSE), and the mean absolute error (MAE).The formulae of R 2 , RMSE, SSE, NSE, and MAE are given in Equations ( 7)- (11).
The literature indicates that the value of R 2 ranges from 0 to 1, where the value 0 implies no correlation and a value of 1 suggests that the model can explain all of the observed variance [48].
The root mean square error, RMSE, evaluates how closely the predictions match experimental observations; values may range from 0 (perfect fit) to +∞ (no fit) based on the relative range of the data [48].
The sum of squares error (SSE) is a statistical metric used to identify the dispersion of data and how well the data can fit the model.The value of SSE should be a minimum [51].
The Nash-Sutcliffe efficiency, NSE, measures the model's ability to predict variables different from the mean and gives the proportion of the initial variance accounted for by the model, where NSE ranges from 1 (perfect fit) to −∞.Values less than zero indicate that the observation means would be a better predictor than the model [48].An NSE value ranging from 0.75 to 1.0 can be categorized as 'very good' [52].
The mean absolute error, MAE, measures the difference between observed and modeled results [48], where C i and C i represent the experimental C BD and computed C BD , C, (by the ANN) C represents the average experimental C BD and average computed C BD, and n = no. of observations.

Experimental Results of Bulk Drag Coefficient (C BD )
The experimental bulk drag coefficient was calculated using Equation ( 4).In the current study, two variables, i.e., density and Reynold's number, were considered as previ-ously described.Based on these two variables, Equation (12) was developed by regression analysis to calculate C BD within the selected range of λ and R d for OVλ1 and OVλ2.
The results of Equation ( 4) show that the calculated average values of C BD for OVλ1 and OVλ2 were 1.41 and 1.61, respectively, whereas the computed average values of C BD for OVλ1 and OVλ2 were 1.45 and 1.65, respectively.The relation between the calculated values (by Equation ( 4)) and computed values (by Equation ( 12)) of C BD is shown in Figure 5a.The R 2 values for OVλ1 and OVλ2 were 0.80 and 0.87, respectively, as shown in Figure 5b.Similarly, the drag coefficient of vegetation (CBD) was calculated using Equation (4) for the composite defense system in the sequence of the embankment, moat/depression, and the vegetation of variable densities.The calculated average values of CBD for EMVλ1 and EMVλ2 were 1.16 and 1.29, respectively, based on these two variables, i.e., density and Reynolds number.Finally, Equation ( 14) was developed by regression analysis to compute the bulk drag coefficient for the EMVλ1 and EMVλ2 cases within the selected range of variables.The computed average values of CBD for EMVλ1 and EMVλ2 were 1.16 and 1.30, respectively.
The graphical representation between the calculated values (Equation ( 4)) and the computed values (Equation ( 14)) showed a close relationship, as the R 2 values were 0.96 and 0.95 for EMVλ1 and EMVλ2, respectively, as shown in Figure 5b The bulk drag coefficient of vegetation (C BD ) usually represents the resistance to flow.C BD can be varied in the case of any structure on the upstream or downstream side of the vegetation.Equation ( 13) was developed by means of regression analysis to calculate the bulk drag coefficient for EVλ1 and EVλ2 within the selected range of λ and R d .
By using Equation ( 4), C BD was calculated against the composite defense system's different values of λ and R d .Two cases of composite defense systems was tested, i.e., embankment with vegetation of density 0.025 (EVλ1) and embankment with vegetation of density 0.062 (EVλ2).It was found that the calculated value of C BD was decreased by increasing Reynold's number and increased by increasing λ, as shown in Figure 5a.As a result, the calculated average values of C BD for EVλ1 and EVλ2 were 1.26 and 1.46, respectively.The drag coefficient values for EVλ1 and EVλ2 were also calculated using Equation (4).
The comparison between the calculated values (Equation ( 4)) and the computed values (Equation ( 13)) of C BD showed a close relationship, as the value of R 2 was 0.97 in both cases, as shown in Figure 5b.The computed average values of C BD for EVλ1 and EVλ2 were 1.41 and 1.65, respectively.
Similarly, the drag coefficient of vegetation (C BD ) was calculated using Equation ( 4) for the composite defense system in the sequence of the embankment, moat/depression, and the vegetation of variable densities.The calculated average values of C BD for EMVλ1 and EMVλ2 were 1.16 and 1.29, respectively, based on these two variables, i.e., density and Reynolds number.Finally, Equation ( 14) was developed by regression analysis to compute the bulk drag coefficient for the EMVλ1 and EMVλ2 cases within the selected range of variables.The computed average values of C BD for EMVλ1 and EMVλ2 were 1.16 and 1.30, respectively.
The graphical representation between the calculated values (Equation ( 4)) and the computed values (Equation ( 14)) showed a close relationship, as the R 2 values were 0.96 and 0.95 for EMVλ1 and EMVλ2, respectively, as shown in Figure 5b.A summary of C BD values calculated (Equation ( 4)) and computed (Equations ( 12)-( 14)) is presented in Table 3.

Computed Results of Bulk Drag Coefficient (C BD ) by ANN Model
To predict C BD , five different ANN models were tested to optimize the best model.Each ANN model contained two layers, which were kept constant for all models.However, the number of neurons in each layer varied.The result shows that the performance of the ANN9 model, i.e., with nine neurons in each layer, was the best among all models.As discussed previously, the performance of models was judged by various performance indicators.Therefore, the value of R 2 and NSE of the ANN9 model (for testing and validation) was the highest, showing the best performance, as presented in Table 4 and Figure 6a,b.Each ANN model contained two layers, which were kept constant for all models.However, the number of neurons in each layer varied.The result shows that the performance of the model, i.e., with nine neurons in each layer, was the best among all models.
As discussed previously, the performance of models was judged by various performance indicators.Therefore, the value of R 2 and NSE of the ANN9 model (for testing and validation) was the highest, showing the best performance, as presented in Table 4 and  Moreover, the values of SSE, RMSE, and MAE of the ANN9 model (for testing and validation) were the lowest among all models, as shown in Table 4 and presented in Figure 6c-e.The relationship between experimental and computed values (from the ANN9 model) is shown in Figure 6f, showing a good relationship as the value of R 2 was 0.99, Moreover, the values of SSE, RMSE, and MAE of the ANN9 model (for testing and validation) were the lowest among all models, as shown in Table 4 and presented in Figure 6c-e.The relationship between experimental and computed values (from the ANN9 model) is shown in Figure 6f, showing a good relationship as the value of R 2 was 0.99, which is close to unity.In Figure 6a-e, "T" and "V" represent testing and validation, respectively.Previously, the performance of various prediction models was also tested by various authors by constructing Taylor's diagram [53,54], which was also adopted in the current study.Taylor's diagram shows the relationship between standard deviation, correlation coefficient (R 2 ), and root mean square error (RMSE), as shown in Figure 7a,b.It was observed that the standard deviation and RMSE values of ANN9 were performed at least as well as or better than the other ANN models, as well as the regression model (RM) for both training and validation sections.On the other hand, the values of R 2 of all the ANN models were in the range of 0.970-0.999,which is higher than 0.920 for the regression model.Therefore, the ANN models, including ANN9, had a better overall performance compared to the regression model, as they had higher R 2 values and lower RMSE and standard deviation values.

Prediction Results of CBD against Various Parameters
The predicted values of the bulk drag coefficient against all variables are shown in Figure 8 (generated by the software).The prediction results show that CBD increased by increasing the variables Δh/ho and λ; however, the reverse relation is observed for ΔV/Vo, Af, Re, and Wv.Maximum variation in CBD is observed against Δh/ho and minimum against Wv.Hence, the variables ΔV/Vo, Af, Re, and Wv should decrease the bulk drag coefficient.The drag coefficient of vegetation may lead to tree breakage.Implementing this study's analysis framework and results in hydraulic engineering practice can help the planning and management divisions avoid tree breakage and efficiently slow the flow during extreme hydrologic events and floods.

Prediction Results of C BD against Various Parameters
The predicted values of the bulk drag coefficient against all variables are shown in Figure 8 (generated by the software).The prediction results show that C BD increased by increasing the variables ∆h/ho and λ; however, the reverse relation is observed for ∆V/V o , A f , R e , and W v .Maximum variation in C BD is observed against ∆h/h o and minimum against Wv.Hence, the variables ∆V/V o , A f , R e , and W v should decrease the bulk drag coefficient.The drag coefficient of vegetation may lead to tree breakage.Implementing this study's analysis framework and results in hydraulic engineering practice can help the planning and management divisions avoid tree breakage and efficiently slow the flow during extreme hydrologic events and floods.
Af, Re, and Wv.Maximum variation in CBD is observed against Δh/ho and minimum against Wv.Hence, the variables ΔV/Vo, Af, Re, and Wv should decrease the bulk drag coefficient.The drag coefficient of vegetation may lead to tree breakage.Implementing this study's analysis framework and results in hydraulic engineering practice can help the planning and management divisions avoid tree breakage and efficiently slow the flow during extreme hydrologic events and floods.

Discussion
In previous studies, drag force has been investigated in terms of drag coefficient, using various methods such as the energy-slope method [1], water-surface-slope method [39], and strain gauges [55].Researchers have experimentally calculated the drag coefficient of vegetation with variable densities (λ) and various configuration patterns.For instance, Ishikawa [39] and James [1] used a staggered pattern of vegetation with densities ranging from 0.00314 to 0.0126 [39] and 0.0035 to 0.0314, respectively.Meanwhile, Tanino and Nepf [14] and Coscarella [56] considered random vegetation patterns with densities ranging from 0.15 to 0.35 and 0.022 to 0.038, respectively.
The study conducted experiments to calculate the bulk drag coefficient (C BD ) under subcritical steady-flow conditions for two cases of only vegetation (OV), i.e., OVλ1 and OVλ2 (where λ1 = 0.025 and λ2 = 0.062), placed in a staggered pattern.The calculated values of C BD showed a direct relation with density (λ) and an inverse relation with Reynolds number (R d ), consistent with previous studies [3,5], which also found a similar relationship between drag coefficient and either Froude number [39] or Reynolds number [57].Various methods have been explored in previous studies to calculate drag force in terms of drag coefficient, with calculated ranges of bulk drag coefficient against different variables of 1.22-1.39[22] The study also applied the same method to calculate C BD for a composite defense system comprising four parts, i.e., EVλ1, EVλ2, EMVλ1, and EMVλ2, respectively.The calculated results showed that the combination of an upstream structure reduces the drag coefficient of vegetation because the drag coefficient is shared by both the embankment and vegetation in the composite structure.By placing the embankment upstream of vegetation, the average C BD for EVλ1 and EVλ2 decreased by 10.6% and 11%, respectively.
In a similar vein, the C BD was reduced by 17.73% and 20% for the EMVλ1 and EMVλ2 cases, respectively, when compared to only vegetation.Like previous researchers, we employed regression analysis to develop equations for computing the C BD , taking into account variables such as λ and R d .The comparison between the computed and calculated results indicates a high level of agreement, as evidenced by an R 2 value close to unity.The present study investigated the impact of upstream structures, which has been studied previously.Contemporary engineering practices and research can employ artificial neural networks to forecast various hydraulic parameters.
Previous studies have employed ANN models and gene-expression programming (GEP) to predict the flow resistance of flexible vegetation in laboratory channels and compared the results with regression models.The ANN model was found to perform better than GEP and regression analysis [61].In the present study, similar results were obtained, where the performance of the ANN model was superior to that of the regression models.Five ANN models were evaluated with different hidden layers and varying numbers of neurons, and the best-performing model was found to be the one with two hidden layers and nine neurons (ANN9) in each layer.The performance of the ANN models was assessed using five performance indicators, and ANN9 was found to have the highest R 2 and NSE values and the lowest RMSE, SSE, and MAE values for both training and validation.The comparison between regression models and the ANN9 model showed that the R 2 value for ANN9 was 0.99 and 0.98 for training and validation, respectively, while the maximum R 2 for regression analysis was 0.97 for both EVλ1 and EVλ2.The regression model performance for EMVλ1 and EMVλ2 was also satisfactory with R 2 values of 0.96 and 0.95, respectively.The regression model performance for OVλ1 and OVλ2 was acceptable, with R 2 values of 0.80 and 0.87, respectively.Nonetheless, the ANN9 model was found to be the best among all the models.
To evaluate the performance of the ANN models and compare them with regression models, Taylor's diagram was constructed, which has been previously adopted by other researchers.The diagram confirms that the ANN9 model outperformed both other ANN and regression models.The parametric study conducted shows that C BD increases with variables such as ∆h/h o and λ, while variables such as ∆V/V o , A f , R e , and W v decrease the bulk drag coefficient, with the minimum variation observed for W v .The study highlights the potential risk of tree breakage due to the drag coefficient of vegetation, and planning and management divisions should consider the parametric study results to prevent tree breakage during high floods.It is important to note that the experimental setup used in the study, with an embankment model and moat/depression models upstream of the vegetation, may not be representative of all natural environments where vegetation and upstream structures exist.Therefore, the findings of the study may have limited applicability to other environments.However, the study can serve as a baseline to analyze drag force on vegetation in composite defense systems that incorporate hard and soft solutions.As such, the effect of upstream structures should be considered in a composite defense system, and future research should also investigate the effect of downstream structures on the drag coefficient of vegetation.

Conclusions
Based on the results of the bulk drag coefficient through vegetation and the effect of an upstream structure, the following conclusions were drawn: a.
The calculated C BD for OVλ1 and OVλ2 showed a direct relationship with vegetation density (λ) and an inverse relationship with Reynolds number (R d ).The calculated ranges of C BD for OVλ1 and OVλ2 were When compared to the prediction of C BD , the ANN9 model outperformed the regression models tested using Taylor's diagram.
The results of the current study provide insights into the performance comparison of different models, including ANN models and regression models, in predicting a certain variable.This information can be used by researchers to select the appropriate model for their own research based on the performance criteria of the model.The current study highlights the superiority of ANN models, particularly the ANN9 model, over the regression model in terms of standard deviation, correlation coefficient (R 2 ), and root mean square

Figure 1 .
Figure 1.(a,b) offer distinct conceptual illustrations of the theoretical model for deriving the bulk vegetation drag coefficient, CBD; (c) is the theoretical arrangement of vegetation in strips.

Figure 2 .
Figure 2. Illustration of free-water surface characteristic profiles for the cases of: (a) embankment and vegetation (EV) model; (b) embankment, moat, and vegetation (EMV) model.

Figure 3 .
Figure 3. Illustration of the test section of the different experiments assessed herein: (a) only vegetation (OV); (b) embankment and vegetation (EV) model; (c) embankment, moat, and vegetation (EMV) model.

Figure 4 .
Figure 4. Depiction of the architecture of the feed-forward ANN model.

Figure 4 .
Figure 4. Depiction of the architecture of the feed-forward ANN model.

Figure 5 .
Figure 5. (a) Experimental C BD vs. Re and (b) relation between experimental and computed C BD .

Figure 7 .
Figure 7. Taylor's diagram showing the relationship between standard deviation, correlation coefficient, and root mean square error (represented with blue lines) for CBD; (a) Training and (b) Validation.

Figure 7 .
Figure 7. Taylor's diagram showing the relationship between standard deviation, correlation coefficient, and root mean square error (represented with blue lines) for C BD ; (a) Training and (b) Validation.

Figure 8 .
Figure 8. Predicted values of the bulk drag coefficient against variables; (a) ∆h/h o , (b) ∆V/V o , (c) A f , (d) R e , (e) W v and (f) λ.

Table 2 .
Presentation of ANN model inputs and outputs.

Table 3 .
Presentation of the hydraulic parameters for the conducted experiments.

Table 4 .
Performance evaluation of the assessed ANN models.

Table 4 .
Performance evaluation of the assessed ANN models.
1.24-1.84and 1.40-2.18,respectively, with average C BD values of 1.41 and 1.61, respectively.b.The average C BD values for EVλ1 and EVλ2, which represent composite defense cases, were decreased by 10.6% and 11% compared to OVλ1 and OVλ2, respectively.As a result, the calculated average C BD values for EVλ1 and EVλ2 were 1.26 and 1.41, respectively.c.The average C BD values for EMVλ1 and EMVλ2, which also represent composite defense cases, were decreased by 17.73% and 20% compared to OVλ1 and OVλ2, respectively.The calculated average C BD values for EMVλ1 and EMVλ2 were 1.16 and 1.29, respectively.d.The ANN9 model provided the best performance among the five ANN models, with the highest R 2 and NSE values and the lowest RMSE, SSE, and MAE values.e.