Pan Evaporation Estimation in Uttarakhand and Uttar Pradesh States, India: Validity of an Integrative Data Intelligence Model

: Appropriate input selection for the estimation matrix is essential when modeling non-linear progression. In this study, the feasibility of the Gamma test (GT) was investigated to extract the optimal input combination as the primary modeling step for estimating monthly pan evaporation (EP m ). A new artiﬁcial intelligent (AI) model called the co-active neuro-fuzzy inference system (CANFIS) was developed for monthly EP m estimation at Pantnagar station (located in Uttarakhand State) and Nagina station (located in Uttar Pradesh State), India. The proposed AI model was trained and tested using di ﬀ erent percentages of data points in scenarios one to four. The estimates yielded by the CANFIS model were validated against several well-established predictive AI (multilayer perceptron neural network (MLPNN) and multiple linear regression (MLR)) and empirical (Penman model (PM)) models. Multiple statistical metrics (normalized root mean square error (NRMSE), Nash–Sutcli ﬀ e e ﬃ ciency (NSE), Pearson correlation coe ﬃ cient (PCC), Willmott index (WI), and relative error (RE)) and graphical interpretation (time variation plot, scatter plot, relative error plot, and Taylor diagram) were performed for the modeling evaluation. The results of appraisal showed that the CANFIS-1 model with six input variables provided better NRMSE (0.1364, 0.0904, 0.0947, and 0.0898), NSE (0.9439, 0.9736, 0.9703, and 0.9799), PCC (0.9790, 0.9872, 0.9877, and 0.9922), and WI (0.9860, 0.9934, 0.9927, and 0.9949) values for Pantnagar station, and NRMSE (0.1543, 0.1719, 0.2067, and 0.1356), NSE (0.9150, 0.8962, 0.8382, and 0.9453), PCC (0.9643, 0.9649, 0.9473, and 0.9762), and WI (0.9794, 0.9761, 0.9632, and 0.9853) values for Nagina stations in all applied modeling scenarios for estimating the monthly EP m . This study also conﬁrmed the supremacy of the proposed integrated GT-CANFIS model under four di ﬀ erent scenarios in estimating monthly EP m . The results of the current application demonstrated a reliable modeling methodology for water resource management and sustainability.

network embedded with particle swarm optimization (PSO-ANN), genetic algorithm (GA-ANN), and Stephens-Stewart (SS) models. The results of the modeling comparison demonstrated that the ELM, PSO-ANN, and GA-ANN models provided better estimates than the SS model. Furthermore, several advanced models have been developed for modeling multiple scales of the pan evaporation process across the globe [28,[34][35][36][37][38][39][40][41][42]. In accordance with the literature, the exploration of new versions of AI models is still an ongoing research area. Therefore, the current research investigated the potential of a newly explored AI model (i.e., the co-active neuro-fuzzy inference system (CANFIS)), as a widespread approximator for any non-linear occupation. The foremost excellence of the CANFIS model is evidenced by pattern-dependent coefficients (weights) among the consequent and premise layers [43]. Although there have been several studies focused on establishing a robust and reliable predictive model, few have focused on improving the models' accuracy. The current research can contribute to prediction performance improvement based on the incorporation of the best selection of input variables. The implementation of the CANFIS model was advanced by integrating a non-linear input selection approach called the Gamma test (GT), in order to identify the input attributes correlating to the targeted predicted variable.
To date, few studies have investigated the appropriate input variables for constructing an integrative AI predictive model for evaporation estimation. Therefore, to solve this problem, a non-linear modeling and analysis tool, i.e., GT proposed by Stefánsson et al. [44], was used to identify the appropriate input variables. Recently, a number of applications on GT's capacity have been produced in diverse hydrological fields [35,[45][46][47][48][49].
As per the authors' information, preceding studies have not stated the utility of an integrated GT approach with the CANFIS model for estimating the monthly pan evaporation process in two different climates. The potential of the proposed model was validated against MLPNN, MLR, and Penman model (PM) models in four different scenarios based on several statistical indicators and graphical inspection. The predictive models developed were examined for historical climatological data collected from Pantnagar and Nagina stations, located in India.

Case Study and Data Description
Two different meteorological stations (i.e., Pantnagar and Nagina) located in Uttarakhand and Uttar Pradesh states, respectively, were used to build the modeling procedure. Figure 1 demonstrates the coordinates of the study stations (Pantnagar: 79 • 38 0" E longitude and 29 • 0 0" N latitude with 243.8 m above mean sea level (MSL), and Nagina: 78 • 25 58.8" E longitude and 29 • 26 34.8" N latitude with 282 m above MSL). The climatic information including the maximum and minimum air temperatures (T max and T min ), wind speed (WS), relative humidity at 7:00 a.m. (RH-1) and 2:00 p.m. (RH-2), bright sunshine hours (SSH), and monthly pan evaporation (EP m ) were obtained from observatories located at the Pantnagar Crop Research Centre (PCRC), Uttarakhand, and Rice Research Station Nagina, Bijnor district in Uttar Pradesh State, India. Figure 2a,b illustrates the climatic parameters measured from January 2009 to December 2016 (eight year period) at both stations using a box and whisker plot, which presents statistics on the minimum value, first quartile, median, third quartile, and maximum value for climatic parameters (reading from lower to upper values). Additionally, Table 1 reports the brief statistical properties of climatic variables intended for both stations. The statistical characteristics reveal the platykurtic (−) and leptokurtic (+) nature of the climatic variables at both stations. Table 2 shows the correlations between the monthly EP m and other climatic variables for both stations. It can be observed from Table 2 that all of the six variables (T min , T max , RH-1, RH-2, WS, and SSH) have significant correlations with the EP m at a 5% significance level.
If the values of Γ, A, standard error (SE), and V ratio approach the minimum, it indicates the goodness of input variables [47,56]. The V ratio is computed using the following expression: where, σ 2 (y) is the variance of the output y and Γ is the gamma function. In this research, the CANFIS, MLPNN, and MLR techniques were employed for designating the optimal group of input variables (decided through the lowest value of A, SE, V ratio , and Γ), to estimate the monthly EP m at Pantnagar and Nagina stations.

Co-Active Neuro-Fuzzy Inference System (CANFIS) Model
The CANFIS model is a hybrid AI model that was introduced by Jang et al. [57]. The hierarchy network of the CANFIS structure is composed of five layers: (i) fuzzification layer (categorization of inputs by applying certain membership functions), (ii) fuzzy rule layer (multiplication concept is applied), (iii) normalization layer (normalize the output of the previous layer using the activation function), (iv) defuzzification layer (de-fuzzified the output of the previous layer by applying the learning algorithm), and (v) summation layer (obtain final results as a crisp output). Figure 3 demonstrates the working architecture of the CANFIS model. More details on the working function of each layer are provided in [8,43].  In this research, through supervised learning, the network of the CANFIS model was designed with a trial and error procedure using two Gaussian and Bell membership functions, the hyperbolic tangent activation function (with a range from -1 to 1), the Sugeno fuzzy inference system, and the delta-bar-delta (DBD) learning algorithm. The CANFIS model was trained and terminated after 1000 iterations over the 0.001 threshold level in NeuroSolutions 5.0 software produced by the NeuroDimension, Inc., Gainesville, Florida, USA.

Multilayer Perceptron Neural Network (MLPNN) Model
The main concept of the MLPNN model was introduced by Haykin [58]. MLPNN consists of parallel processing elements called neurons. Figure 4 illustrates the structure of the feed-forward MLPNN model, which consists of three layers-an input layer (i), hidden layer (j), and output layer (k)-with interconnected weights between the layers (Wij and Wjk). The appropriate weights are adjusted to minimize the error between the observed and predicted output through back propagation encountered from right to left, as depicted in Figure 4. In this research, through supervised learning, the network of the CANFIS model was designed with a trial and error procedure using two Gaussian and Bell membership functions, the hyperbolic tangent activation function (with a range from −1 to 1), the Sugeno fuzzy inference system, and the delta-bar-delta (DBD) learning algorithm. The CANFIS model was trained and terminated after 1000 iterations over the 0.001 threshold level in NeuroSolutions 5.0 software produced by the NeuroDimension, Inc., Gainesville, FL, USA.

Multilayer Perceptron Neural Network (MLPNN) Model
The main concept of the MLPNN model was introduced by Haykin [58]. MLPNN consists of parallel processing elements called neurons. Figure 4 illustrates the structure of the feed-forward MLPNN model, which consists of three layers-an input layer (i), hidden layer (j), and output layer (k)-with interconnected weights between the layers (W ij and W jk ). The appropriate weights are adjusted to minimize the error between the observed and predicted output through back propagation encountered from right to left, as depicted in Figure 4.  The structure of the MLNN model was designed through the knowledge process by a trial and error procedure in this study. The network was made up of three (input, hidden, and output) layers with a single output (i.e., EPm). Data normalization was conducted with the hyperbolic-tangent (tanh) activation function (ranges from -1 to 1). As recommended by previous studies, the maximum number of neurons contained in the hidden layer was projected through the 2n + 1 idea, where n is the number of input variables [59]. Training of the MLPNN model was terminated over a 0.001 threshold value after 1000 epochs in NeuroSolutions 5.0 software. The framework of the MLPNN model designed in this study was built to incorporate all of the six input climate variables collected.

Multiple Linear Regression (MLR) Model
For a general comparison and validation, following several established research studies in the literature, the MLR model was employed. Conceptually, MLR is a statistical approach that uses the collinearity among the targeted variables and the independent variables [60]. Mathematically, the regression equation of MLR is written as where, Y is the target variable; X1, X2, ....... Xn are the independent variables; and 0 , 1 , 2 ……… are the regression coefficients.

Penman Model (PM)
Penman [61] developed an empirical model for computing the rate of evaporation using climatic parameters, expressed as where EP is the rate of evaporation (mm/month), represents the net radiation (MJ/m 2 /month), ∆ is the slope of the saturation vapor pressure-air temperature curve (kPa/ o C), indicates the psychrometric constant (kPa/ o C), and is the aerodynamic function (mm/month) and computed using Equation (5): The structure of the MLNN model was designed through the knowledge process by a trial and error procedure in this study. The network was made up of three (input, hidden, and output) layers with a single output (i.e., EP m ). Data normalization was conducted with the hyperbolic-tangent (tanh) activation function (ranges from −1 to 1). As recommended by previous studies, the maximum number of neurons contained in the hidden layer was projected through the 2n + 1 idea, where n is the number of input variables [59]. Training of the MLPNN model was terminated over a 0.001 threshold value after 1000 epochs in NeuroSolutions 5.0 software. The framework of the MLPNN model designed in this study was built to incorporate all of the six input climate variables collected.

Multiple Linear Regression (MLR) Model
For a general comparison and validation, following several established research studies in the literature, the MLR model was employed. Conceptually, MLR is a statistical approach that uses the collinearity among the targeted variables and the independent variables [60]. Mathematically, the regression equation of MLR is written as where, Y is the target variable; X 1 , X 2 , . . . . . . X n are the independent variables; and β 0 , β 1 , β 2 . . . . . . . . . β k are the regression coefficients.

Penman Model (PM)
Penman [61] developed an empirical model for computing the rate of evaporation using climatic parameters, expressed as where EP is the rate of evaporation (mm/month), R n represents the net radiation (MJ/m 2 /month), ∆ is the slope of the saturation vapor pressure-air temperature curve (kPa/ • C), γ indicates the psychrometric constant (kPa/ • C), and E a is the aerodynamic function (mm/month) and computed using Equation (5): where, e s is the saturation vapor pressure (kPa), e a is the actual vapor pressure, and f (u) represents an empirical or theoretically-derived aerodynamic wind function, which can be calculated using Equation (6): where, u s is the wind speed (m/s) at a 2 m elevation, and a w and b w are empirical coefficients. Penman [62] suggested 0.5 and 0.537 values for a w and b w for open water bodies. In this study, the R n , ∆, γ, e s and e a parameters were calculated by adopting the procedure provided by Allen et al. [63] in the Food and Agriculture Organization of manual 56 (FAO-56).

Modeling Scenarios
In this research, the monthly pan evaporation at Pantnagar and Nagina stations was estimated using the CANFIS, MLPNN, and MLR techniques for four different scenarios. The total available climatic datasets from January 2009-December 2016 (period of eight years) of both stations were split into four scenarios, with different percentages of training (calibration) and testing (validation) datasets. The details of the training and testing data percentages in the four scenarios are as follows: Scenario-1 contains 25% data for training (January 2009 to December 2010) and 75% data for testing (January 2011 to December 2016). II.
Scenario-4 contains 75% data for training (January 2011 to December 2016) and 25% data for testing (January 2009 to December 2010).
The first three modeling scenarios, data was divided into training and testing phases continuously. Whereas, the fourth modeling scenario testing phase data was selected somewhere in the middle of the data span in order to test the applicability of the applied predictive models for the reverse prediction mechanism. Figure 5 illustrates the percentage of training and testing datasets in the four different scenarios used in this study for monthly EP m estimation at the two study locations.
IV. Scenario-4 contains 75% data for training (January 2011 to December 2016) and 25% data for testing (January 2009 to December 2010). The first three modeling scenarios, data was divided into training and testing phases continuously. Whereas, the fourth modeling scenario testing phase data was selected somewhere in the middle of the data span in order to test the applicability of the applied predictive models for the reverse prediction mechanism. Figure 5 illustrates the percentage of training and testing datasets in the four different scenarios used in this study for monthly EPm estimation at the two study locations.

Performance Appraisal Indicators
The outcomes of the applied models, i.e., CANFIS, MLPNN, MLR, and PM models, were assessed for four different scenarios using the normalized root mean square error (NRMSE), Nash-Sutcliffe efficiency (NSE), Pearson correlation coefficient (PCC), Willmott index (WI), and relative error (RE), and graphical interpretation employing a time variation plot, scatter plot, and Taylor diagram [64]. The performance indicators are described as follows: I. Normalized root mean square error [65][66][67]:
Relative error [66,73]: where, Y est and X obs are the estimated and observed monthly EP m values for the ith dataset, respectively; X obs and Y est are the mean observed and estimated monthly EP m values for ith dataset, respectively; and N is the number of observations.

Optimal Input Variable Selection Using GT
The capacity of AI models relies on several modeling adjustments. The selection of appropriate input parameters is one of the essential prior steps for the learning process of AI implementation. Hence, the current research investigated the potential of various input combinations that anticipate several climatological parameters which have a positive influence on the EP m at the Pantnagar and Nagina meteorological stations (Table 3). To complete this, the feasibility of the GT approach was adopted to identify the related input combinations that are crucial to building predictive models. The statistical results of GT are reported in Table 4 for both stations. Based on the GT results in Table 4, and with a fixed mask example (111111), the minimum values of Γ = 0.0017, A = 0.0665, SE = 0.0013, and V ratio = 0.0070 were obtained for Pantnagar station (Figure 6a), while the minimum values of Γ = 0.0112, A = 0.0395, SE = 0.0024, and V ratio = 0.0448 were obtained for Nagina station (Figure 6b). The mask demonstrated the incorporation of the six input variables for estimating the EP m . Hence, the following input variables were utilized for EP m estimation (i.e., T max , T min , RH-1, RH-2, WS, and SSH) at Pantnagar and Nagina stations, respectively. It is worth mentioning that including the WS variable as an input parameter provides a better score compared to the SSH (compare the Gamma scores of T max , RH-2, WS and T max , RH-2, SSH; or scores of T max , WS and T max , SSH in Table 4), according to the Gamma test, at both stations. This is in direct agreement with the correlations between WS or SSH and EP m given in Table 2.
√ is used to indicate the input variable for different combinations.

Estimation of EPm under Different Scenarios at Pantnagar Station
The combination of input variables (Tmax, Tmin, RH-1, RH-2, WS, and SSH) was used for training and testing the applied methods (CANFIS, MLPNN, and MLR) under four different scenarios based on the performance evaluation indicators (NRMSE, NSE, PCC, and WI). The values of NRMSE, NSE, PCC, and WI in the four different scenarios in the testing phase are summarized in Table 5, which indicate that in scenario-1, the NRMSE (mm/month) = 0.1364, 0.1404, and 0.1402; NSE = 0.9439, 0.9406, and 0.9408; PCC = 0.9790, 0.9751, and 0.9758; and WI = 0.9860, 0.9857, and 0.9851 for the CANFIS-1, MLPNN-1 (with the structure of 6 inputs-9 processing elements-1 output), and MLR-1 models, respectively. In scenario-2 (Table 5) Table 5 clearly shows, the CANFIS-1 models out-performed the other models in all four scenarios, followed by the MLPNN-1 models. Therefore, the CANFIS-1 models followed the best statistical criteria (i.e., maximum rate of WI, PCC, and NSE, and minimum rate of NRMSE) for the testing period and were selected as the best among the three models. Likewise, the performance of PM was found to be the worst in all scenarios for monthly EPm estimation at the Pantnagar station.

Estimation of EP m under Different Scenarios at Pantnagar Station
The combination of input variables (T max , T min , RH-1, RH-2, WS, and SSH) was used for training and testing the applied methods (CANFIS, MLPNN, and MLR) under four different scenarios based on the performance evaluation indicators (NRMSE, NSE, PCC, and WI). The values of NRMSE, NSE, PCC, and WI in the four different scenarios in the testing phase are summarized in Table 5, which indicate that in scenario-1, the NRMSE (mm/month) = 0.1364, 0.1404, and 0.1402; NSE = 0.9439, 0.9406, and 0.9408; PCC = 0.9790, 0.9751, and 0.9758; and WI = 0.9860, 0.9857, and 0.9851 for the CANFIS-1, MLPNN-1 (with the structure of 6 inputs-9 processing elements-1 output), and MLR-1 models, respectively. In scenario-2 (Table 5) Table 5 clearly shows, the CANFIS-1 models out-performed the other models in all four scenarios, followed by the MLPNN-1 models. Therefore, the CANFIS-1 models followed the best statistical criteria (i.e., maximum rate of WI, PCC, and NSE, and minimum rate of NRMSE) for the testing period and were selected as the best among the three models. Likewise, the performance of PM was found to be the worst in all scenarios for monthly EP m estimation at the Pantnagar station.  The percentages of relative errors (obtained using Equation (11)) between the estimated and observed EP m values of the CANFIS-1, MLPNN-1, MLR-1, and PM models in the four different scenarios over the testing period are illustrated in Figure 8a-d. As reported in Figure 8a-d, the RE percentage was limited to between +40 and −20 for the first scenario. The highest RE% was experienced using the Penman model (PM). However, the minimum relative error percentage was obtained for the CANFIS model, and it was limited to ±10, with some observations reaching ±20. Greater spreading of the RE for all prescribed models seemed to occur at the peak value of monthly EP m . It can also be observed from Figure 8, together with Figure 7, that the models' accuracy varies over diverse scenarios and this suggests the use of various scenarios in evaluating the potential of AI models, as also discussed by Kisi and Heddam [32]. The percentages of relative errors (obtained using Equation (11)) between the estimated and observed EPm values of the CANFIS-1, MLPNN-1, MLR-1, and PM models in the four different scenarios over the testing period are illustrated in Figure 8a-d. As reported in Figure 8a-d, the RE percentage was limited to between +40 and -20 for the first scenario. The highest RE% was experienced using the Penman model (PM). However, the minimum relative error percentage was obtained for the CANFIS model, and it was limited to 10, with some observations reaching 20. Greater spreading of the RE for all prescribed models seemed to occur at the peak value of monthly EPm. It can also be observed from Figure 8, together with Figure 7, that the models' accuracy varies over diverse scenarios and this suggests the use of various scenarios in evaluating the potential of AI models, as also discussed by Kisi and Heddam [32].      Table 6 presents the values of performance evaluation indicators (NRMSE, NSE, PCC, and WI) for the selected input combination (i.e., Tmax, Tmin, RH-1, RH-2, WS, and SSH) during the testing period under four different scenarios. Nagina station in scenario-1 (Table 6) achieved NRMSE values ranging from 0.1543 to 0.1866 and NSE/PCC (WI) values ranging from 0.9150 to 0.8758/0.9643 to 0.9437 (0.9794 to 0.9698) for the CANFIS-1, MLPNN-1 (6-10-1), and MLR-1 models, respectively. In scenario-2 (Table 6), the NRMSE values ranged from 0.1719 to 0.2299 and NSE/PCC (WI) values ranged from 0.8962 to 0.8144/0.9649 to 0.9346 (0.9761 to 0.9579) for the CANFIS-1, MLPNN-1 (6-10-1), and MLR-1 models, respectively. In the case of scenario-3 (Table 6) Table 6 presents the values of performance evaluation indicators (NRMSE, NSE, PCC, and WI) for the selected input combination (i.e., T max , T min , RH-1, RH-2, WS, and SSH) during the testing period under four different scenarios. Nagina station in scenario-1 (Table 6) achieved NRMSE values ranging from 0.1543 to 0.1866 and NSE/PCC (WI) values ranging from 0.9150 to 0.8758/0.9643 to 0.9437 (0.9794 to 0.9698) for the CANFIS-1, MLPNN-1 (6-10-1), and MLR-1 models, respectively. In scenario-2 (Table 6), the NRMSE values ranged from 0.1719 to 0.2299 and NSE/PCC (WI) values ranged from 0.8962 to 0.8144/0.9649 to 0.9346 (0.9761 to 0.9579) for the CANFIS-1, MLPNN-1 (6-10-1), and MLR-1 models, respectively. In the case of scenario-3 (Table 6) MLPNN-1 (6-10-1), and MLR-1 models, respectively. A comparison of the CANFIS, MLPNN, and MLR models against the PM for all scenarios exposed the better performance of the CANFIS model, followed by the MLPNN and MLR models. As Table 6 clearly indicates, the CANFIS-1 models out-performed the other models in all four scenarios, followed by the MLPNN-1 models. Therefore, CANFIS-1 followed the best statistical criteria (i.e., minimum values = NRMSE, and maximum values = NSE, PCC, and WI) during the testing period and was selected as the best of the three models. Hence, the models' estimation accuracies show variations over the four scenarios and increasing the training data length generally improves the models' exactness in estimation of the monthly EP m , as also discussed by Kisi and Heddam [32].  Figure 10a-d illustrates the time variation and scatter plots of observed and estimated monthly EP m values obtained by the CANFIS-1, MLPNN-1, MLR-1, and Penman models under scenarios one to four during the testing period at Nagina station. In scenario-1, the R 2 is 0.9299, 0.9201, 0.8905, and 0.2879; scenario-2, the R 2 is 0.9311, 0.8999, 0.8735, and 0.2863; scenario-3, the R 2 is 0.8973, 0.8550, 0.8188, and 0.2944; and scenario-4, the R 2 is 0.9529, 0.9498, 0.9343, and 0.3911 for the CANFIS-1, MLPNN-1, MLR-1, and PM models during the testing phase, respectively. In addition, the scenarios one, two, and three demonstrated that the regression line (RL) is above the 1:1 line for all of the methods and this means that the CANFIS-1, MLPNN-1, and MLR-1 models under these scenarios slightly over-predict, while in scenario-4, they under-predict, the monthly EP m values at Nagina station. Moreover, scenarios one to four verified that the Penman model highly over-estimates the EP m values. Therefore, the use of various data splitting scenarios is recommended when testing data-driven methods in the estimation of EP m . The percentage of RE between the estimated and observed monthly EPm values for all applied predictive models for Nagina station is displayed in Figure 11a-d for the four different scenarios over the testing period. Figure 11a-d shows that the RE percentage was between +25 and -50 for the first scenario. The maximum RE% was experienced for the Penman model. Apparently, the best performance based on this metric was attained for the CANFIS model and for the fourth scenario. The relative error percentage was limited to 15 for about 80% of the testing phase of modeling. Conversely, the rest of the data observations ranged by 20%. Therefore, extreme spreading of the RE for the peak value of the monthly EPm was detected for all prescribed models. The percentage of RE between the estimated and observed monthly EP m values for all applied predictive models for Nagina station is displayed in Figure 11a-d for the four different scenarios over the testing period. Figure 11a-d shows that the RE percentage was between +25 and −50 for the first scenario. The maximum RE% was experienced for the Penman model. Apparently, the best performance based on this metric was attained for the CANFIS model and for the fourth scenario. The relative error percentage was limited to ±15 for about 80% of the testing phase of modeling. Conversely, the rest of the data observations ranged by ±20%. Therefore, extreme spreading of the RE for the peak value of the monthly EP m was detected for all prescribed models.    is closer to the observed values of monthly EPm compared to the other applied models. Therefore, the CANFIS-1 model with Tmax, Tmin, RH-1, RH-2, WS, and SSH climatic variables at Nagina station can be employed for monthly EPm estimation.

Comparison and Discussion
In this research, the monthly EPm was estimated at Pantnagar and Nagina stations under four different scenarios by employing the CANFIS, MLPNN, and MLR models in conjunction with GT, which revealed the appropriate input variable combination for this task. The results produced by the CANFIS model were validated against the MLPNN and MLR models using several statistical metrics and visual interpretations. The CANFIS model optimized with the Takagi-Sugeno-Kang (TSK) fuzzy inference system, hyperbolic-tangent (tanh) activation function, delta bar delta algorithm, and Gaussian membership functions demonstrated a substantial predictability performance of the modeled evaporation process. This was observed for all of the investigated modeling scenarios.
Machine learning models behave based on the supplied dataset. Hence, in this research the applicability of the applied predictive models in the reverse back mode was tested. The models were

Comparison and Discussion
In this research, the monthly EP m was estimated at Pantnagar and Nagina stations under four different scenarios by employing the CANFIS, MLPNN, and MLR models in conjunction with GT, which revealed the appropriate input variable combination for this task. The results produced by the CANFIS model were validated against the MLPNN and MLR models using several statistical metrics and visual interpretations. The CANFIS model optimized with the Takagi-Sugeno-Kang (TSK) fuzzy inference system, hyperbolic-tangent (tanh) activation function, delta bar delta algorithm, and Gaussian membership functions demonstrated a substantial predictability performance of the modeled evaporation process. This was observed for all of the investigated modeling scenarios.
Machine learning models behave based on the supplied dataset. Hence, in this research the applicability of the applied predictive models in the reverse back mode was tested. The models were trained using a recent historical data set trend and were tested to predict the evaporation at an earlier span period. In this way, more informative visualization can be attained on the applicability of the predictive for predicting the evaporation in the reverse time period. This is because, from a logical aspect: arranging the data and consequently training and then testing, could be by chance to obtain good results. Hence, machine learning models should be investigated by training the models using any dataset, and the testing phase can be randomly selected at any span period desired to be examined. The research findings evidenced the capacity of the proposed AI predictive model for the reverse back mode scenario.
To corroborate the findings of the research, a comparison was analyzed with respect to the NRMSE of the MLPNN-1, MLR-1, and Penman models in terms of the prediction accuracy, which reduced from 85.8% to 2.8%, 90.8% to 1.7%, 90.5% to 4.6%, and 90.2% to 12.0% in scenarios one to four, respectively, with the CANFIS-1 model at Pantnagar station. Similarly, the prediction accuracy concerning the NRMSE of MLPNN-1, MLR-1, and Penman models reduced from 88.5% to 14.9%, 87.7% to 9.5%, 85.1% to 9.4%, and 85.5% to 8.2% with the CANFIS-1 model in scenarios one to four, respectively, at Nagina station. Therefore, the applied modeling approach can be employed to build a straightforward reliable AI system, which is crucial for the sustainable planning and management of water resources.
In accordance with the attained estimation accuracy, the data span used was eight years of weather information, which consisted of the minimum and maximum air temperature, morning and afternoon relative humidity, wind speed, and sunshine hours. Overall, the investigated eight-year monthly-scale climatic information was sufficient for constructing the predictive models. It can also be noted that the co-active neuro-fuzzy inference system introduced performed remarkably for mimicking the actual trend of the evaporation process. This was due to the major capability of simulating the uncertainty problem where the CANFIS model is characterized. The monthly EP m process is usually highly associated with major uncertainties of climatic variability. Therefore, proposing the CANFIS model was an excellent idea for addressing this complex climate process. For this region, it was evidenced that the incorporation of all related available hydrological information was highly crucial for modeling the monthly EP m . This was due to the high stochasticity of the monthly EP m , which is very much influenced by different climate attributes. Decadal research on pan evaporation estimation has shown the wide applications of AI models. Through validation against literature studies [28,[74][75][76][77][78][79], the present implementation of the CANFIS model confirmed its predictability capacity using statistical metrics.
The current research could be further extended with other modeling scenarios by incorporating the most significant correlated climate variables. For instance, the integration of metaheuristic optimization algorithms [80] can advance the prediction process to abstract the essential correlated attributes. Conceptually, this is a factual implementation with the possibility of eliminating unnecessary climate variables and obtaining an acceptable prediction accuracy with less input variability. This is highly beneficial for catchments located in developing countries where a lack of climatological data availability exists.

Conclusions
The evaporation process in nature is characterized by highly non-linear and stochastic phenomena. In the current research, the first phase was established to extract the input variables relating to the monthly EP m using the Gamma test. The second phase of modeling was employed to estimate the value of monthly EP m using different AI models, including CANFIS and MLPNN at Pantnagar station (located in Uttarakhand State) and Nagina station (located in Uttar Pradesh State), India. Owing to the fact that different span data can influence the efficiency of the applied AI models, several scenarios were investigated, applying different training and testing data set spans. For validation purposes, the predictability potential of CANFIS and MLPNN were evaluated against the classical multiple linear regression and traditional Penman model. The modeling procedure was inspected using multiple statistical metrics (i.e., normalized root mean square error, Nash-Sutcliffe efficiency, Pearson correlation coefficient, Willmott index, and relative error). In addition, graphical inspections were performed, including a time variation plot, scatter plot, relative error percentage graph, and Taylor diagram. The findings of the research evidenced the capacity of the CANFIS-1 model for estimating monthly-scale pan evaporation incorporating all of the weather information, including the minimum and maximum air temperature, morning and afternoon relative humidity, wind speed, and sunshine hours. The superiority of the CANFIS-1 model was observed for all examined scenarios and at both Pantnagar and Nagina stations. Additionally, the performance of the traditional Penman model was found to be the worst in all scenarios for estimating the monthly EP m at both stations. Overall, the result of the current research demonstrated the feasibility of the CANFIS model as a newly developed data-intelligent approach for simulating pan evaporation in the Indian region, where it can be applied for several water resource engineering applications.
In future studies, the different percentages of training and testing datasets should be considered for monthly pan evaporation estimation using advanced hybrid metaheuristic models (i.e., the grey wolf optimizer, particle swarm optimizer, multi verse optimizer, whale optimization algorithm, and ant lion optimizer), and compared with empirical and semi-empirical climate-based models (Stephens-Stewart, Griffiths, Christiansen, Priestley-Taylor, and Jensen-Burman-Allen) in different climates. This research focuses on the specific area of Pantnagar and Nagina stations, so the results of the current research cannot be used to generalize about the ability of applied AI models for different climatic regions. Therefore, it is also suggested that, in future studies, the generalization of applied AI models should be examined by considering multiple stations in different environmental conditions.