1. Introduction
In river engineering, the Manning’s coefficient
plays a vital role in the computation of flood discharge, velocity distribution, designing structures, calculating energy losses [
1,
2], and other hydraulic parameters. Accurate estimation of Manning’s roughness
plays an important factor in flood conveyance estimation. Manning’s
not only represented bed roughness, but also signifies the resistance to flow. Various equations are used for getting the discharge in simple channels using Manning’s
, Chezy’s
and Darcy-Weisbach’s
[
3], but those equations are not adequate well, while predicting discharge in meandering compound channels. This study is focused on developing a machine learning based data-driven models for predicting Manning’s
. There are various elements that affect the resistance coefficient, such as the bed roughness, slope of the bed, geometry of the channel, and other parameters of a river.
Generally, the Manning’s formula is used for calculating the discharge in open channel flow. A significant amount of research has been carried out for selecting roughness coefficients for discharge calculation in an open channel. Cowan [
4] considered the irregularity of the surface geometry of the channel, obstructions, and sinuosity to propose a model for predicting Manning’s roughness coefficient in meandering channels. The Soil Conservation Service (SCS) had developed an equation for estimating Manning’s
for meandering channels by considering the sinuosity of the channel [
5]. Limerinos [
6] established a method for estimating the roughness coefficient by considering the various hydraulic parameters. Formulation for Manning’s roughness showing the gradient effect in channels having longitudinal slopes greater than 0.002 was derived by Jarrett [
7]. Due to variation in flow depth, geometry and slope, the distinction of roughness coefficient in a straight channel as compared to a meandering channel were discussed by Arcement and Schneider [
8]. Yen [
9] proposed Manning’s
for simple uniform flows taking account for a geometric measure, such as unevenness of the edge. Further, SCS method was linearized by James [
10] for a meandering channel, named the Linearized SCS (LSCS) and recommended the Manning’s roughness value for the different sinuosity channels. Shiono et al. [
11] developed a model to predict discharge considering the Manning’s
n in the case of longitudinal slope and the meander effect of the channel. Jena [
12] proposed an equation to calculate the
by considering the effect of channel width, longitudinal channel slope, and the flow depth of the channel. The variations of wetted area, wetted perimeter, and the velocity data on Manning’s roughness coefficient by considering the flow simulation and sediment transport in irrigated channels was investigated by Mailapalli et al. [
13]. Khatua et al. [
14,
15,
16] formulated a mathematical equation for roughness coefficients by varying the sinuosity and geometry of the meandering compound channel. Xia et al. [
17] and Barati [
18,
19,
20] carried out the experiments for predicting discharge by taking care of the effect of bed roughness. Dash et al. [
21] modeled the Manning’s roughness coefficient by considering the aspect ratio, viscosity, slope of the bed, and sinuosity. Pradhan et al. [
22] proposed an empirical formulation of predicting Manning’s
by dimensional analysis for the compound meandering channel, which affects relative depth, width ratio, longitudinal channel slope, and sinuosity.
In the recent years, ML techniques have been successfully implemented for predicting complex phenomena in hydrology and hydraulics. Zhang et al. [
23] used the concept of diversity in the Group Method of Data Handling (GMDH) method for the prediction of time series to improve the noise-immunity capability. Mrugalski [
24] proposed a method for error estimation based on the dynamic GMDH and multiple inputs, as well as output neurons. Najafzadeh and Lim [
25] optimized the neuro-fuzzy GMDH model by the particle swarm optimization process to forecast the localized scour.
Vapnik and Lerner [
26] extended the generalized statistical learning theory and introduced a term
representing the intensive loss function for measuring the risk intensity in the support vector machine algorithm (SVM). More details on SVM can be found from many publications [
27,
28,
29,
30]. Dibike et al. [
31] appraised SVM for rainfall-runoff modeling and revealed the significance of SVM in the field of civil engineering. Pal and Goel [
32] also used the SVM technique to determine the discharge and end-depth ratio for circular and semi-circular channels. Further, Han et al. [
33] applied SVM methodology for flood forecasting. Genc et al. [
34] used Machine learning (ML) algorithms, i.e., SVM, ANN, and k-nearest neighbor (k-NN) to predict the velocity in small streams. Samui et al. [
35] compared that the multivariate adaptive regression splines (MARS) methodology with ANN and FEM model and perceived that the MARS gives best outcomes for the uplift capacity of the suction caisson. Samui and Kurup [
36] used MARS and LSSVM for the prediction of the consolidation ratio of clay deposits. Samui [
37] predicted the elastic modulus of rock using MARS and show the performances of the MARS model better than the ANN.
However, collecting the velocity, discharge data during high flows in rivers, especially during unsteady, non-uniform and high flows are very dangerous and difficult task. Under these circumstances, the Machine learning (ML) models are highly helpful as the alternative approach for predicting Manning’s and the discharge in hydraulics engineering.
2. Experimental Setup
Experimental investigations are carried out in meandering compound channels with different sinuosity at the hydraulics-engineering laboratory under the department of civil engineering, National Institute of Technology Rourkela (NITR), India. The meandering compound channels are cast out of perspex sheets inside the flume of 10 m long, 1.7 m wide, and 0.25 m deep. The channel surfaces consist of the roughness coefficient as 0.01. Three types of experimental channels are set up for the present research works, as shown in
Figure 1,
Figure 2 and
Figure 3. In the first and second type of meandering compound channels (NITR Type-I and NITR Type-II), the main channel is meandering with sinuosity 1.37 and 1.035, respectively, and is flanked by straight floodplain on both sides. The third one (NITR Type-III) is a doubly meandering compound channel, where both the main channel and floodplain levee are meandering with different sinuosity. Details of the experimental setup of these compound channels shown in
Table 1.
The bed slope of the channel is recorded as 0.001 for all the three types of channel. A movable bridge is placed across the flume with the facility of traversing on both the directions over the flume for taking measurements at different locations in the channel section. A measuring tank is situated at the downstream end of the flume. Water from the measuring tank is collected in a sump, which again feedbacks to the overhead tank with the help of a centrifugal pump. A closure valve is fitted at the downstream volumetric tank. A schematic illustration of the experimental setup of the compound meandering channel is shown in
Figure 4. For a better understanding of the experimental channel setup, the sample photograph of NITR Type I, Type II, and Type III meandering compound channels are shown in
Figure 5.
The measuring equipment involves point gauge having least count of 0.1 mm for measuring the flow depth in the channels. A Pitot tube coupled with manometer is used for measurement of pressure difference from which the point velocities of flow in the compound channels are computed. All of the readings are taken at the predefined grid points, as shown in
Figure 6, at the bend apex, Section B and a geometric crossover of the three types of channels (
Figure 4).
The Manning’s for the perspex sheet channel boundary is experimentally determined as 0.01. For this, the in-bank flow in the meandering main channel is collected in the volumetric tank located at its end. From the time-rise data of the water level in the volumetric tank, the discharge rate is computed. By knowing the channel cross-sectional area and the wetted perimeter, Manning’s is computed for four consecutive depths of flow, and the mean value of is computed as 0.01 [(0.0088 + 0.0096 + 0.0104 + 0.012)/4], which is found to be close to its value given various handbooks for the type of bed materials.
Traditional flow formulae are used for the estimation of discharge are Manning’s equation. From the observed discharge results, we can calculate Manning’s coefficient
by the back calculation. After getting discharge, mean velocity
can be computed by dividing area to the discharge. Then, from Manning’s equation, corresponding
can be evaluated, which is given in the below equation:
where
is indicated the mean velocity for the compound section of the compound channel,
the bed slope of the compound channel, and
the hydraulic radius of the compound channel. It is known that Manning’s
increases slightly with the depth of flow in the channel representing a lumped response to all of the flow resistances. Since the basic purpose of the present work is to model the Manning’s
for overbank flow, the variation of observed
value for the compound channel is shown in figure later in the manuscript.
Besides the present work, reported data of other researchers are also used to calibrate models for predicting Manning’s
value. Other experimental works from which data is used in the present work are the FCF-B; the investigational studies done in the UK, Flood Channel Facility (FCF) during 1990–1992 on large-gauge meandering channels (Phase B). The data are acquired from the Birmingham university website (
http://www.birmingham.ac.uk/) and also collected from other articles [
10,
38,
39]. Other than FCF-B, the authors have also used data from Toebes-Sooky [
40], Kar [
41], Das [
42], Kiely [
43], Patra-Kar [
44], Khatua [
45], Mohanty [
46], and Pradhan [
22] to predict the roughness coefficient, as given in
Table 2. All of the experimental channel data used in the present work consists of similar hydraulic nature having smooth meandering compound channels.
Factors Affecting Roughness Coefficient
Studies [
8,
11,
15,
16,
21,
22,
47,
48] have indicated that Manning’s
not only represent the resistance to the flow in the channel, but it represents the lumped response of all the hydraulic and geometric parameters influencing the flow in the channel. Therefore, in the current study, the author considers the influencing non-dimensional factors, such as the width ratio of the channel
, depth ratio or relative depth
, sinuosity
, longitudinal channel slope
, and meander belt width ratio
as the inputs to model for predicting the Manning’s
value. A regular channel retains diversity in flow and surface conditions that need to be incorporated to develop a suitable model for predicting the roughness coefficient at various flood conditions. It is important to incorporate all of these variables to predict a suitable roughness coefficient at all flood conditions.
For the present work, the Group Method of Data Handling neural network (GMDH-NN) algorithm is used to predict the roughness coefficient, which is compared with machine learning methods, such as the multivariate adaptive regression spline (MARS) and the support vector machine regression (SVR) models, for developing a best alternative for modelling Manning’s value for compound channels.
4. Development of Predictive Models
The core objective of the study is to explore the practicality of the GMDH-NN methodologies in the prediction of the Manning’s . Thus, a foremost task is to determine the relevant testing and training data subset to construct a predictive model and to evaluate its performance. The dataset that was used in this study is achieved by small scaled model experimental setup, performed at NIT Rourkela, India, and some collected data of previous researchers that has similar geometrical conditions.
Here, in the ML modeling, the normalized value of independent variables is taken as input variables, such as
,
,
,
, and
. The output of the GMDH, MARS, and SVR models is the normalized
value. Researchers have used different data division between testing and training data and generally, it varies with problems. There is no particular rule for the divisional process of training and testing data. In this study, about 75% of data are designated for training, and another 25% of data are used for testing the proposed model. The training and testing data have been chosen randomly form the original dataset. The data is scaled between 0 and 1. It has been done by using the following equation:
where
= any finite data,
= minimum value of the data,
= maximum value of the data, and
= normalized value of the data. This paper shows a comparative study of suggested models for prediction of
in the meandering compound channels with the observed data.
GMDH algorithm was established while using GMDH Shell software [
49] to find non-linear relationships between the inputs and output variables. The algorithm is characterized as a set of neurons in which various pairs of the neurons are related through a quadratic polynomial of GMDH-NN network. The data was trained while using the quadratic neural function and consequently resulting new neurons in the subsequent layer [
8].
In the training of each layer, the neurons are generated based on all the possible combinations of input variables. Then, these neurons are screened automatically based on their ability to predict the target variable. Only those neurons having good prediction powers (meets the criterion) are fed forward for the training of the next layer, and the rest are discarded [
64]. The formulas that were obtained from the GMDH model for predicting the Manning’s
are given by following equations.
where
In the MARS model, 19 basis functions are used initially by forwarding step out of which three basic functions are deleted by the backward step process. Finally, an optimum model for prediction of
has restricted, given rise to 16 number of basis functions, whose constructions are brief as
and
in
Table 3. In MARS, RSS, and GCV criteria are also performed to known the importance of the predictors by using Equations (9) and (10), respectively. Before predicting the value of
, the performance ranking test of independent variables is performed by RSS and GCV criteria. Following RSS and GCV value, the independent variables are ranked consequently as
,
,
,
, and
. The analysis demonstrates that the longitudinal bed slope
is more sensible for predicting
.
The model for predicting Manning’s
is quantified by a linear arrangement of the constant 0.38 and the basis functions are presented in
Table 3 superimposing with their respective coefficients that were achieved by models. The optimum model for the prediction of
is from the result of Equation (26).
where
denotes the Manning’s
at floodplain,
is the basic function, and
the coefficient.
While using the Equation (26) for evaluation of Manning’s
, we need the series of
and
values. For the present case, the
value can be computed by substituting
,
,
,
,
values in the column (1),
Table 3. By multiplying the corresponding terms, we get the final value of MARS model that can be used for calculating Manning’s
, which can be subsequently used to get a discharge in the compound channels.
For NITR Type I channel at flow depth of 0.17 m, the values of , , , , and are 5.96, 0.3, 0.7, 0.001, and 1.37, respectively, which gives as −0.3688 giving rise to values as 0.0112 and as 0.04969 m3/s for this depth of flow as against the observed value of as 0.0107 and as 0.05216 m3/s.
For SVR,
is the penalty to the error that causes the generalization ability of the model. The large value
assigns higher penalties to errors so that the regression is trained to minimize error with lower generalization, whereas a small
assigns fewer penalties to errors, which allows for the minimization of margin with errors, thus, higher generalization ability. The higher value
gives, the fewer support vectors, which leads to a decrease the final prediction performance [
65]. If
is too small, many support vectors are selected, which leads to the risk of overfitting. A large value of
indicates a stronger smoothing of the Gaussian kernel. The SVR model is trained using the Radial Basis Function (RBF) with 36 numbers of support vectors. The best result is obtained with regularization parameter (C = 84), insensitive loss function (epsilon = 0.1), and kernel parameter
.
The developed SVM gives the following equation by putting
,
and
in Equation (16) for prediction of
.
where
be the Lagrange multipliers.
6. Application of the Model to River Data
The developed model should be adequate for its practical application, imitating the natural phenomena. The most effective hydraulic experimental solutions may not necessarily be the most globally acceptable solutions. Therefore, the proposed model is applied to river data to know its appropriateness. The Baitarani River is chosen for the application of the developed model for calculation of discharge involving meandering compound channel reach, for 7.5 m depth of flow in the year 1985 flood and 8.63 m depth of flow in the year 1975, which is located between longitude
and latitude
at Anandapur, Odisha, India.
Figure 11 shows the plan and cross-sectional view of River Baitarani [
44]. At Anandapur site, the river has developed a good meandering plan form with a drainage area of 8570 km
2. The average bank full depth
, the width of the main river
, a top width of the main river
, sinuosity
, Meander Belt width
, and amplitude
of the river is scaled as 5.4 m, 210 m, 230 m, 1.334, 108 m, and 425 m, respectively. The surface condition for the main channel is sandy, whereas the floodplain has grass vegetation. The average longitudinal slope of the channel bed is 0.0011 for both the floodplain and main river. The river data is collected from Government of India Central Water Commission, Eastern Rivers Division, India.
While using the model equations that are provided by GMDH (Equations (20 through 25)), MARS (Equation (26)), and SVR (Equation (27)) for the evaluation of Manning’s
, we need the values of input parameters. For the river Baitarani the values of
,
,
,
,
are given in
Table 6. By putting the corresponding values in the equations (Equations (20), (26) and (27)), we get the final value of Manning’s
by three models, which can subsequently use to get a discharge in the compound channels. For 7.5 m depth of flow in the river Baitarani, the Manning’s
is calculated by GMDH, MARS, SVR model as 0.0239, 0.0205, and 0.025, respectively. Similarly, for 8.63 m flow depth, the Manning’s
is calculated by GMDH, MARS, SVR model as 0.0242, 0.0201, and 0.0255, respectively. The area of the river is measured as 1995 m
2, 2485 m
2 and perimeter as 460 m, 510 m for two different flood flow depth. Subsequently, the values of discharge are estimated by substituting
value in Manning’s general equation. Discharge results that are based on the application of GMDH-NN, MARS, and SVR models are given in
Table 6, which indicates the adequacy of the GMDH-NN model.
The percentage of mean error for prediction of
by various models and their standard deviation is shown in
Figure 12. The authors also accomplished the analysis of inaccuracy in terms of the root-mean-square error (
RMSE), mean absolute error (
MAE) and mean absolute percentage error (
MAPE) as described in
Table 7 for developed models. It is observed that the GMDH-NN model gives the minimum mean percentage error. It is found that the least values of respective
MAE,
RMSE, and
MAPE as 0.00105, 0.00114, and 4.904 for GMDH-NN model. The results give a clear suggestion of the efficiency of the GMDH-NN model and its use in practical applications.