Rainfall-Runoff Modeling Using Artificial NeuralNetwork—A Case Study of Purna Sub-Catchment of Upper Tapi Basin, India "2279

The present study examines the rainfall-runoff-based model development by using artificial neural networks (ANNs) models in the Yerli sub-catchment of the upper Tapi basin for a period of 36 years, i.e., from 1981 to 2016. The created ANN models were capable of establishing the correlation between input and output data sets. The rainfall and runoff models that were built have been calibrated and validated. For predicting runoff, Feed-Forward Back Propagation Neural Network (FFBPNN) and Cascade Forward Back Propagation Neural Network (CFBPNN) models are used. To evaluate the efficacy of the model, various measures such as mean square error (MSE), root mean square error (RMSE), and coefficient of correlation (R) are employed. With MSE, RMSE, and R values of 0.4982, 0.7056, and 0.96213, respectively, FFBPNN outperforms two networks with model architectures of 6-4-1 and Transig transfer function. Additionally, in this study, the Levenberg– Marquardt (LM), Bayesian Regularization (BR), and Conjugate Gradient Scaled (CGS) algorithms are used to train the ANN rainfall-runoff models. The results show that LM creates the most accurate model. It performs better than BR and CGS. The best model is the LM-trained method with 30 neurons, which has MSE values of 0.7279, RMSE values of 0.8531, and R values of 0.95057. It is concluded that the constructed neural network model was capable of quite accurately predicting runoff for the Yerli sub-catchment.


Introduction
Hydrologists have been attempting to understand the translation of rainfall to runoff for many years to estimate streamflow for objectives including water supply, flood control, irrigation, drainage, water quality, power production, recreation, and fish and wildlife propagation [1]. Rainfall-runoff modeling is one of the most prominent hydrological models used to examine the relationship between rainfall and runoff generated by various watershed physical factors [2]. In the real world, all physical catchment features influence rainfall-runoff; hence, generalizing all physical catchment characteristics is a difficult process. It is difficult to depict such a large range of values in a lumped hydrological model since the parameter values must be averaged for each watershed [3].
In the past, academics and hydrologists have presented different ways for effectively forecasting runoff by building several models of rainfall-runoff (RR) [4]. The process of rainfall-runoff is highly nonlinear and incredibly complex and is still poorly understood [5]. Furthermore, several rainfall-runoff models require a substantial amount of data, which are used for calibration and validation time scale, making them computationally intensive and, thus, unpopular [6]. Machine learning techniques are becoming more prevalent due to their ease of use, simplicity, and efficiency [7]. Machine learning techniques are a good

Study Area and Data Collection
The current study area comprises a portion of the Upper Tapi Basin known as the Purna sub-catchment ( Figure 1). The area lies between Maharashtra and Madhya Pradesh, between latitudes of 20 • 09 N to 22 • 03 N and longitudes of 75 • 56 E to 78 • 17 E. It has subtropical to temperate climatic conditions. The mean annual precipitation in the chosen area varies from 833 to 990 mm. Table 1 reveals the sources of data for this study. intensive and, thus, unpopular [6]. Machine learning techniques are becoming more prev alent due to their ease of use, simplicity, and efficiency [7]. Machine learning techniques are a good option when there are minimal data and the process is complex [8]. In the context of estimating issues, artificial neural networks (ANNs) are a subclass of machine learning that have received significant attention [9]. ANNs are data-processing systems that mimic the capabilities of the human brain [10]. ANNs were first developed in the 1940s and come in a wide variety [11]. ANN models are also known as black-box models [12]. The application of ANNs in the creation of models results in trustworthy and versa tile learning ability, which makes ANNs promising for forecasting [13]. ANN models have been extremely prevalent in the domains of hydrology, water resources, and watershed management in recent decades [14]. The ANN contains three layers, an input layer, a hid den layer, and an output layer [15]. The weight of communication is the relationship be tween the neurons in the consecutive layers [16]. In the given study, the input layer con sists of six types of data, namely (rainfall, minimum temperature, maximum temperature surface pressure, specific humidity, and wind speed). The hidden layer consists of layers with two different sets of number of neurons 10 and 20, respectively. The output laye comprises predicted runoff.
The objectives of the present study are as follows: (i) to develop a rainfall-runoff model for Upper Tapi using an Artificial Neural Networks Technique, (ii) compare ANN rainfall-runoff models developed using NNTOOL with different neural network types i.e., FFBPNN and CFBPNN, and (iii) to compare ANN rainfall-runoff models trained us ing LM, BR, and SCG algorithms.

Study Area and Data Collection
The current study area comprises a portion of the Upper Tapi Basin known as the Purna sub-catchment ( Figure 1). The area lies between Maharashtra and Madhya Pradesh between latitudes of 20°09′ N to 22°03′ N and longitudes of 75°56′ E to 78°17′ E. It has subtropical to temperate climatic conditions. The mean annual precipitation in the chosen area varies from 833 to 990 mm. Table 1 reveals the sources of data for this study.

Methodology
The theoretical aspects and research methodology used in the current study to identify the best neural network model to perform the rainfall and runoff modeling for the Yerli subcatchment have been discussed in this section.

Methodology
The theoretical aspects and research methodology used in the current study to identify the best neural network model to perform the rainfall and runoff modeling for the Yerli sub-catchment have been discussed in this section. Figures 2 and 3 depict the methodological flowchart of NNTOOL and NNSTART respectively.

Methodology
The theoretical aspects and research methodology used in the current study to identify the best neural network model to perform the rainfall and runoff modeling for the Yerli sub-catchment have been discussed in this section. Figures 2 and 3 depict the methodological flowchart of NNTOOL and NNSTART respectively.    Output: Desired predicted output is obtained after fixing the regression plot.

Model Evaluation Criteria
The findings of the ANN model applied in this study were evaluated by means of: • Mean Square Error (MSE): • Root Mean Square Error (RMSE): • Regression Coefficient (R): Using Regression Plot between predicted and observed runoff.
where Q p is the value of predicted runoff; Q o is the value of observed runoff;Q(i) is the n estimated runoff value; and Q(i) is the n observed runoff value.

Feed Forward Back Propagation Neural Network (FFBPNN)
FFBPN, while considering 6-2-1, 6-3-1, and 6-4-1 architectures, the transig function provides the best value for performance. The most effective model architecture for the Transig function is 6-4-1, which has a value of MSE 0.4982, the value of RMSE 0.7056, and the value of R 0.96213. Table S1 contains the inclusive outcomes. However, in comparison to other transfer functions, the transig transfer function with architecture 6-4-1 yielded better results in the current study. Figure 4 depicts the best regression plot. The multilayer FFBPNN and CFBPNN algorithms with Levenberg-Marquardt (LM are utilized to optimize the learning approach in this study. Two different models wer developed, i.e., (FFBPNN) and (CFBPNN) with three different architectures (6-2-1, 6-3and 6-4-1) using several combinations of transfer functions, i.e., (transig, logsig, and purelin) along with two sets of neurons, 10 and 20, and then compared for their capability to estimate the flow for the period 1981-2016.

Feed Forward Back Propagation Neural Network (FFBPNN)
FFBPN, while considering 6-2-1, 6-3-1, and 6-4-1 architectures, the transig function provides the best value for performance. The most effective model architecture for th Transig function is 6-4-1, which has a value of MSE 0.4982, the value of RMSE 0.7056, and the value of R 0.96213. Table S1 contains the inclusive outcomes. However, in comparison to other transfer functions, the transig transfer function with architecture 6-4-1 yielded better results in the current study. Figure 4 depicts the best regression plot.

Cascade Forward Back Propagation Neural Network (CFBPNN)
Similarly, for CFBPNN, while considering 6-2-1, 6-3-1, and 6-4-1 architectures, th transig function provides the best value for performance. The most effective model archi tecture for the transig function is 6-4-1, which has MSE values of 0.8813, the value of RMSE 0.9387, and the value of R 0.96096. Table S2 contains the inclusive outcomes [17]. However in comparison to other transfer functions comparison to other, the transig transfer func tion with architecture 6-4-1 yields better results. Figure 5 depicts the best regression plot

Cascade Forward Back Propagation Neural Network (CFBPNN)
Similarly, for CFBPNN, while considering 6-2-1, 6-3-1, and 6-4-1 architectures, the transig function provides the best value for performance. The most effective model architecture for the transig function is 6-4-1, which has MSE values of 0.8813, the value of RMSE 0.9387, and the value of R 0.96096. Table S2 contains the inclusive outcomes [17]. However, in comparison to other transfer functions comparison to other, the transig transfer function with architecture 6-4-1 yields better results. Figure 5 depicts the best regression plot.

NNSTART
In this study, three different algorithms, namely Levenberg-Marquardt (trainlm) Bayesian Regularization (trainbr), and Scaled Conjugate Gradient (trainscg), were used for model development. Table S3 shows the Yerli station results for the ANN trained by LM, BR, and CGS. The study compares ANN models that were trained with LM, BR, and CGS. LM-trained algorithm with 30 neurons is the best model with an MSE value o 0.7279, an RMSE value of 0.8531, and an R value of 0.95057. Figure 6 shows the best re gression plot for the LM algorithm with 30 neurons.

NNSTART
In this study, three different algorithms, namely Levenberg-Marquardt (trainlm), Bayesian Regularization (trainbr), and Scaled Conjugate Gradient (trainscg), were used for model development. Table S3 shows the Yerli station results for the ANN trained by LM, BR, and CGS. The study compares ANN models that were trained with LM, BR, and CGS. LM-trained algorithm with 30 neurons is the best model with an MSE value of 0.7279, an RMSE value of 0.8531, and an R value of 0.95057. Figure 6 shows the best regression plot for the LM algorithm with 30 neurons.

NNSTART
In this study, three different algorithms, namely Levenberg-Marquardt (trainlm) Bayesian Regularization (trainbr), and Scaled Conjugate Gradient (trainscg), were used for model development. Table S3 shows the Yerli station results for the ANN trained by LM, BR, and CGS. The study compares ANN models that were trained with LM, BR, and CGS. LM-trained algorithm with 30 neurons is the best model with an MSE value o 0.7279, an RMSE value of 0.8531, and an R value of 0.95057. Figure 6 shows the best re gression plot for the LM algorithm with 30 neurons.

Conclusions
This study described how ANN models are used to estimate yearly runoff for the Yerli sub-catchment of the upper Tapi basin. Runoff estimation was undertaken using NNTOOL and NNSTART. Adopting NNTOOL, two different models were developed, i.e., FFBPNN and CFBPNN networks, using several combinations of input data and then comparing their capability of flow estimation for the period 1981-2016. For estimating runoff using NNTOOL, two NNs are used, with the values of MSE, RMSE, and R calculated.