A Novel Methodology for Prediction Urban Water Demand by Wavelet Denoising and Adaptive Neuro-Fuzzy Inference System Approach

: Accurate and reliable urban water demand prediction is imperative for providing the basis to design, operate, and manage water system, especially under the scarcity of the natural water resources. A new methodology combining discrete wavelet transform (DWT) with an adaptive neuro-fuzzy inference system (ANFIS) is proposed to predict monthly urban water demand based on several intervals of historical water consumption. This ANFIS model is evaluated against a hybrid crow search algorithm and artificial neural network (CSA-ANN), since these methods have been successfully used recently to tackle a range of engineering optimization problems. The study outcomes reveal that 1) data preprocessing is essential for denoising raw time series and choosing the model inputs to render the highest model performance; 2) both methodologies, ANFIS and CSA-ANN, are statistically equivalent and capable of accurately predicting monthly urban water demand with high accuracy based on several statistical metric measures such as coefficient of efficiency (0.974, 0.971, respectively). This study could help policymakers to manage extensions of urban water system in response to the increasing demand with low risk related to a decision.


Introduction
The availability of urban freshwater resources in future is likely to decrease in many cities in the world because of several issues such as climate change (drought), socioeconomic changes (rapid urbanization, economic growth), and water pollution [1,2]. These issues lead to an imbalance between water delivered and needed, which adversely impacts on the sustainable management of urban water resources. Traditionally, water demand overestimation has had a role on the depletion of freshwater resources, which are already under stress due to climate factors [3][4][5]. Therefore, it is essential to adopt methodological approaches that can accurately predict urban water demand to assist decision makers to ascertain whether expanding existing infrastructures would be more convenient than building new facilities [6,7]. Given that the pattern of urban water consumption varies with time based on several factors, including climate (e.g. temperature), demographic (e.g., number of population), socioeconomic (e.g., water price), and strategic (e.g., initiatives of water conservation) factors [6,7], a precise water demand prediction is essential to guarantee the continuous delivery of water to consumers, meeting the expected standards of a reliable urban water system such as quantity, quality, and pressure [8]. Melbourne is a city of nearly 5 million inhabitants, making it one of the biggest cities in Australia, whose municipal water system is affected by climate (drought) and socioeconomic (economic growth) factors, as well as several water policies and initiatives [9].
In the past few decades, different methods and approaches for predicting urban water demand have been reviewed by House-Peters and Chang [10], Donkor et al. [11], Ghalehkhondabi et al. [12], and de Souza Groppo et al. [13]. The literature review reveals that artificial intelligence (AI) models are superior relative to the conventional models of regression and time series such as those in Mouatadid and Adamowski [14], Toth et al. [15] and Guo et al. [16]. Some of the AI methods used to predict water demand are support vector machines (SVMs) [17], adaptive neuro fuzzy inference system [18], random forests [19], and artificial neural network (ANN) [9]. A number of studies recommend applying hybrid models to improve prediction accuracy [20][21][22][23].
Among these methods, ANN is deemed to be a preferred choice for the prediction of water demand due to its ability to handle large amounts of nonlinear data in a robust way and its capabilities to deal with noise [24]. One example is Bai et al. [25], who proved that ANN models are effective in the short-, medium-, and long-term urban water demand prediction. In addition, Ghalehkhondabi et al. [12] and de Souza Groppo et al. [13] pointed out that there was still further work required and AI methods should be further exploited to improve water demand prediction, for example, exploring different ANN architectures and metaheuristics.
There is no global technique that can outperform all the models in all research areas, it is needful to test each case separately, evaluating the performance of each technique or the combination of techniques [13]. Different types of metaheuristics algorithms were applied to find all the hyperparameters of machine learning models (also called automated machine learning) [26,27]. Recently, various metaheuristics algorithms were applied successfully in the field of hydrology. These algorithms include, but are not limited to, particle swarm optimization (PSO) [28], gravitational search algorithm (GSA) [21], genetic algorithms (GA) [29], and Bayesian optimization [27]. Also, the combined technique of metaheuristic algorithm and machine learning model outperformed the same single model of machine learning, such as [23,30].
Although different methods of automated machine learning was used recently, there is still room for enhancement concerning urban water demand prediction [13], for example, Candelieri and Archetti [27] mentioned that the results in this study reveal a significant improvement in prediction accuracy concerning previous studies [26,31]. Also, the Candelieri and Archetti [27] intend to use the additional prediction technique in other application domains. These studies increase the motivation for researchers to examine different new methodologies, which deliver scientific insight to the decision makers.
From the application area viewpoint, another important consideration is that the majority of the literature available in this area focuses on short-term water demand prediction, with only a few studies focusing on medium-to long-term. In other words, the literature on medium-term monthly water demand prediction based on several historical intervals of water consumption is limited. Recently, different researchers have applied previous water consumption as a single input in their short-term prediction models. Their models revealed reasonably precise predictions [16,[32][33][34]. Also, it has been reported in the literature that the adaptive neuro-fuzzy inference system (ANFIS) technique has limited application in the field of prediction of urban water demand [12,13,35].
Most of the urban water demand prediction models in the literature have used data-driven approaches, which led to the improvement of the accuracy of results [31], such as [36][37][38][39]. Recently, different techniques for data preprocessing have been applied and summarized by Eggimann et al. [40], showing that it has an important role in the implementation of the prediction models. Also, data preprocessing has been successfully applied in various fields, for example, urban water demand [38], irrigation water prediction [41], and estimation of relative humidity [36].
Shah et al. [42] stated that anticipating urban water demand has been an active field of study for several decades. Also, it becomes increasingly critical due to the scarcity of freshwater resources and an increase in water consumption resulting from socioeconomic and climate factors. Accordingly, real uncertainty still remains for managers of water companies about the ability of the existing water system to deal with this rapid increase in water demand.
The aim of this study is to develop a new methodology that is able to accurately predict mediumterm urban water demand utilizing previous water consumption data, which means considering the variability of climatic, demographic, and socioeconomic factors. In order to achieve this, the following objectives will be performed: 1. To apply data preprocessing techniques to denoise water consumption time series and select best model input scenario. 2. To evaluate the performance of an adaptive neuro fuzzy inference system (ANFIS) to predict mid-term municipal water demand based on several time intervals of water consumption. 3. To apply a hybrid crow search algorithm and artificial neural network (CSA-ANN) to evaluate the results of the ANFIS model. 4. To increase the predicting range and reduce the uncertainty of outcomes for urban water demands by testing different hyperparameters, such as the various types and orders of the wavelet denoising technique and different kinds and numbers of membership functions of the ANFIS technique.
To the best of the authors' knowledge, this is the first time that this novel methodology has been employed to simulate medium-term municipal water demand depending on several intervals of water consumption.

Study Area and Data Set
A catchment zone in Australia sited in Melbourne City has been used to evolve models of municipal water demand. City West Water (CWW) is one of three retail water utilities that serve the area, which includes the Melbourne's CBD and western suburbs that contain some of the largest industrial customer base in Victoria, and also the fastest population growth corridors in Australia. CWW delivers clean water to more than one million capita over more than 700 km 2 as a service area. It supplies around 100 billion liters per year of clean water to both residential and nonresidential customers (418,000 residential properties, 41,000 nonresidential customers) [43]. CWW purchases water wholesale from Melbourne Water, which is generally harvested from protected catchments in the mountains [44]. Historical monthly data of municipal water consumption (in megaliter, ML) over 15 years (20012015) for the area being served by City West Water was used to build and assess models of water demand based on several time intervals of water consumption. Figure 1

Methodology
The prediction of monthly municipal water demand based on time intervals of water consumption, proposed in this study, includes the following five steps ( Figure 2): (I) data preprocessing, (II) hybrid crow search algorithm and artificial neural network (CSA-ANN), (III) an adaptive neuro fuzzy inference system (ANFIS), (IV) data division, and (V) model performance criteria. Detailed descriptions of these steps are explained below:

Data Preprocessing
Data preprocessing is vital to ensure that all independent factors receive the same attention during the training stage, and it commonly speeds up the training process as well. It can be divided into three parts, that is, data normalization, data cleaning, and selection of the best model input [7].

Normalization
Normalization is a technique used to treat or reduce the impact of outliers. This approach modifies the shape of the time series to a more nearly normal distribution (i.e., outliers cases tails in the time series) [45]. In this research, natural logarithm was applied for normalizing the time series and to reduce the multicollinearity between independent variables (model input) to avoid incorrect conclusions [21], by using SPSS 24 statistics package.

Data Cleaning
As outliers and noise can have adverse effects on any model [45], this study applied the boxwhisker method to detect the outliers that lie outside the period ± 1.5 IQR (IQR = third quartile (Q3)first quartile (Q1)) [46], to treat them by using SPSS 24 statistics package, and the wavelet transform technique to denoise the time series.

Wavelet Transform
The wavelet transform is an efficient time-frequency analysis technique. In general, a mother wavelet is suitably scaled and shifted along the original time series, which enables the representation of time series in time frequency domain instantaneously and thus is suitable for analyzing both stationary and nonstationary time series. They are divided into continuous wavelet transform (CWT) and discrete wavelet transforms (DWT). CWT is suitable for representing a time series in timefrequency domain, while DWT is useful for denoising and compressing time series, which makes it very relevant in the context of hydrology applications [47]. For a DWT of a time series x, the transformation is given as in the following formula in Equation (1) [48]: where Ψ(n) is the mother wavelet, while m and k are the scaling and shifting indices, respectively. This technique has been applied in various areas such as simulation of irrigation water [41], prediction of relative humidity [36], and water demand simulation [30].
One of the main challenges in the application of DWT is the choice of the kind of mother wavelet; hence, this study tested five types of wavelets using the MATLAB toolbox, including Symlets (sym), Coiflets (coif), Discrete Meyer Wavelet (dmey), Daubechies (db) and Haar, to increase the confidence of the method and decrease the uncertainty of results.

Identifying of Explanatory Factors
The selection of suitable predictors is considered one of the essential steps in the design of the structure of the prediction model [36]. This step assists to enhance the performance of the model by selecting the most relevant explanatory factors, that is, those that have a stronger relationship with water consumption [37]. Tabachnick and Fidell [45] stated that stepwise regression technique has been regularly used to select the optimal subset of independent variables (IVs) that better predict the dependent variable (DV), while also eliminating redundant IVs that adversely impact the model performance (i.e., show a p-value >0.05).
In this study, a stepwise regression was applied to select the optimum scenario of time intervals of water consumption that decreases the loss of information and avoids the presence of redundant intervals, which may adversely affect the training process.

Artificial Neural Networks (ANNs)
There are several artificial neural networks (ANNs) architectures and methodologies, applied in many different ways in the literature. For example, the feed-forward multilayer perceptron architecture (FF-MLP) has been frequently used to solve existing problems in the area of hydrology [14,36]. In the study of Bayatvarkeshi et al. [36], the ANN was trained by a Levenberg-Marquardt (LM) backpropagation algorithm due to its ability to estimate successfully any independent/dependent map. The proposed ANN architecture consists of four layers of neurons: an input layer that has the independents variables, output layer that contains dependent variable (target), and two hidden layers to deal with complex nonlinearity of water time series, as in Perea et al. [29]. A metaheuristic algorithm was combined with ANN to select the optimum number of neurons in the hidden layers and the best learning rate coefficient to get optimal input/output mapping and avoid over-and underestimation.

Crow Search Algorithm (CSA)
Various optimization approaches can be applied to locate the optimal values of a system's factors, under various situations [49]. In particular, metaheuristic algorithms have shown to be robust in the presence of multimodal and nonlinear problems. Currently, the tendency in the literature is the application of nature-inspired metaheuristic algorithms for solving various problems, and results show that these algorithms are very efficient [50].
The crow search algorithm (CSA) is a metaheuristic technique that is inspired in the behavior in a group of crows, who store their excess food for when it is needed. CSA is an optimization and computational iterative search technique proposed by Askarzadeh [51]. The principles of CSA technique are as follows: crows are living in the form of a swarm, they are memorizing the location of their hiding places, they are following each other for doing thievery, and they are protecting their catches from being stolen via a probability. Additional details can be found in Askarzadeh [51].
CSA has been applied to tackle a range of engineering optimization problems such as economic environmental dispatch [52], energy problems optimization [53], constrained engineering solutions [51], and selection of optimal size of conductor in radial distribution networks [54].

Combined Crow Search Algorithm-Based Artificial Neural Network
In the ANN model, before implementing the training, testing, and validation stages, it is significant to determine two parameters: the learning rate coefficient and the number of neurons hidden [21]. These parameters are in charge to map the relationship between dependent and independents variables in the ANN prediction model with minimum error [55]. However, the selection of these parameters on the basis of trial and error technique may not lead to optimal solutions and it is time-consuming. Hence, the exhaustive conventional trial and error processes should be avoided when locating these parameters [56]. Hence, the ANN model was integrated by crow search algorithm (CSA-ANN) to determine the optimal parameters of ANN model to avoid under-or overfitting the model, leading to a reduction in uncertainty for water utilities. Also, the hybrid technique of metaheuristic algorithm and machine learning model outperformed the same single model of machine learning, such as in [23,30].
Five swarm sizes, 10, 20, 30, 40, and 50, and 100 iterations, were used to locate the swarm that could get the minimum fitness function value. Additionally, the initial parameters of flight length and awareness probability were 2 and 0.1, respectively.

Adaptive Neuro Fuzzy Inference System (ANFIS)
Recently, ANFIS technique has widely acquired attention by researchers because of its capability to simulate nonlinear time series in different fields of study such as wireless sensor network [57], river stage modelling [58], and assessment of seismic-induced landslide [59]. Also, it has been reported in the literature by Ghalehkhondabi et al. [12], de Souza Groppo et al. [13] and Rahim, Nguyen, Stewart, Giurco, and Blumenstein [35] that the ANFIS technique has limited application in the field of prediction of urban water demand.
ANFIS is a type of artificial neural network (ANN) that is hybridized with a fuzzy inference system (FIS). In this hybrid algorithm, that is, backpropagation (BP) and least squares estimation, ANN evolves suitable if-then rules and membership functions (mfs) for FIS from the given inputoutput data pairs. Since ANFIS combines both ANN and FIS principles, it has the capability to capture the advantages of both models in a single framework that has learning potential to approximate nonlinear functions. Computationally, the first-order Takagi-Sugeno system is compact and effective, so it is considered to build the ANFIS technique. In addition, the performance of ANFIS technique can be improved by choosing the appropriate type and number of membership functions [57]. Accordingly, for each input, 3, 5, and 7 mfs were considered for learning and testing the ANFIS technique. Additionally, eight kinds of mfs, namely, the bell-shaped (gbell), trapezoidal (trap), pishaped curve (pi), Gaussian curve (gauss), difference of two sigmoid (dsig), triangular (tri), two-sided Gaussian curve (gauss2), and product of two sigmoid (psig) membership functions were tested. Twenty-four scenarios of simulation for predicting urban water demand were conducted, each of them using up to 1000 epochs.
The ANFIS technique has five layers that are structured as follows: membership, rules, normalization, function, and output. The x and y represent the input, (A1, A2) and (B1, B2) represent the linguistic variables, respectively, as presented in Figure 3. Layer one: each node includes adaptive nodes as presented in Equations (2) and (3): where µAi(x) and µBi(y) present the mfs of the suggested node. Layer two: Equation (4) shows products of the corresponding degrees gained from the Layer one.
where Wi refers to the product of each node. Layer three: the output of layer two will be normalized based on Equation (5) and considered as the nodes of the present layer.
where is the normalized firing strength. Layer four: a node function is used to link each node, as indicated in Equation (6): where pi, qi, and ri are the node parameters. In layer four, the parameters are deemed as the result parameters. Layer five: the output node, which is single, calculates the overall output via summing all incoming signals. See Equation (7).

Data Division
Data division is an important issue that needs to be addressed in the prediction model. It divides the time series data into three sections: training, testing, and validation. These sections must have the same pattern due to the inability of prediction model to extrapolate outside the range of data used to train the model [21]. In this study, data were separated randomly to training, testing, and validation subsets with 70%, 15%, and 15%, respectively. Data were employed to train, test, and validate the ANFIS and CSA-ANN techniques, by using MATLAB toolbox, to construct a relation between water demand (target) and water consumption interval time (model input).

Model Performances Criteria
It is vital to choose the suitable criteria for a particular application because there are no global performance criteria [37]. Accordingly, several statistical criteria were used to evaluate the performance of the models that were categorized into absolute, relative, and dimensionless errors. The absolute error includes mean absolute error (MAE) (Equation (8)). The mean absolute relative error (MARE) (Equation (9)) belongs to relative errors. The coefficient of efficiency (CE) (Equation (10)) refers to dimensionless error. Additionally, Taylor diagram, which displays pattern statistics for preparing a visual comprehension of performance by plotting several points on a polar plot for two sets or more of modelling outcomes, was used in the present study for comparison between the modelling outcomes [60]. All these criteria and tests are proper to examine the linear and nonlinear relationship between observed and predicted municipal water demand.
where = predicted water demand, = observed water consumption, = mean of observed water demand, and N= data size.

Data Preprocessing Analysis
Tabachnick and Fidell [45] advised that after locating the outliers, the first option is to transform the outliers to decrease their impact and to make the distribution normal or near to the normal, and the second option is to modify the score(s) for the rest of the outliers that still exist after the transformation technique. Hence, data were normalized and cleaned from outliers. After that, the DWT method was used to denoise the water consumption time series. First of all, sym, coif, and db wavelets were applied separately in different orders to select the best order for each one based on the correlation coefficient between the water consumption data and some previous monthly consumption. The results show that the best order is four for coif4 and five for sym5 and db5. The five kinds of wavelets were considered to denoise water time series and select the best one, as represented in Figure 4. This figure indicates that generally all types of wavelets enhance the correlation coefficients values with different limits over all time intervals compared with raw time series data. Also, sym5 offers the best scenario by yielding highest R compared with the other kinds of wavelets. Three shapes of box plots for raw, clean, and denoise data are presented in Figure 5. The figure illustrates that raw data has two outliers after normalization, and there is no big difference compared with the clean data shape. All shapes approximately have the same median and upper and lower extremes. While, the shape of denoise data is less than those of the other two shapes (raw data and clean data). Five intervals of historical water consumption were chosen, based on their effective correlation coefficients, to examine and select the best scenario of model input. Stepwise regression model was applied to identify the significant intervals of monthly historical water consumption to be as model input for water demand prediction based on a p-value. The results show that intervals (T-3 and T-5) are redundant and intervals (T-1, T-2, and T-4) offer the best scenario of model input. Accordingly, the length of the time series decreases from 180 to 175 because of choosing five previous monthly data as model input.

Preparation and Configuration of the Techniques
After data was subjected to preprocessing techniques, the data were divided into three sets, namely training, testing, and validation sets. Table 1 shows a comparison of training, testing, and validation sets based on several statistical indicators of water consumption for these sets, including maximum (Wmax), minimum (Wmin), mean (Wmean), standard deviation (Wstd), and sample size for each set (N). The results reveal that all sets generally have the same pattern.  The ANFIS and CSA-ANN model need to configure before being used to simulate water demand. Consequently, for the ANFIS technique, the error (RMSE) for the validation stage is estimated for different numbers and kinds of membership functions for the ANFIS technique to select the best one, as set out in Table 2. What stands out in this table is the best water demand error occurs when three mfs are selected for each input, and the error is worst when the number of mfs increases to five and seven. Additionally, the two-sided Gaussian curve (gauss2) type is better than the other membership function types, according to the RMSE value (0.0139) ML. Hence, considering the appropriate number and type of membership function is essential to ensure accurate water demand prediction, applying various numbers and types of mfs can increase the confidence level and decrease the uncertainty. The hybrid CSA-ANN algorithm was run five times based on the swarm sizes, which were 10, 20, 30, 40, and 50, to choose the best learning rate coefficient (LR) and an optimum number of hidden neurons (N1 and N2). Figure 6 shows the convergence rates between swarms 10 and 20 and between swarms 30 and 40 were very comparable, while swarm 50 showed less RMSE (equal to 0.008496) after a minimum number of iterations (50 iterations). Considering the swarm 50 that has better performance, the design factors of the ANN model are LR = 0.4180, N1 and N2 are 4 and 13 neurons, respectively. Accordingly, the ANN model will run several times to select the optimum weights' network that makes the model accurately generalize the new data. The performance of the ANN model (stand-alone) was examined to identify the impact of employing CSA algorithm to ANN model. Hence, extensively scenarios of trial and error technique were applied to locate the parameters of the ANN model (LR, N1, and N2) that offer the best accuracy of prediction. Accordingly, the results reveal that the values of LR, N1, and N2 are 0.5, 5, and 11, respectively.

Evaluating and Comparing the Performance of the Techniques
In order to be able to assess and compare the performance of the developed techniques, a number of statistical measures were calculated (see section 3.5 for more details). The MAE, CE, and MARE of both approaches can be seen in Table 3. The results of both techniques showed good simulation level of water time series based on the scale of error according to Dawson et al. [61]. Also, the results of ANFIS and CSA-ANN techniques are near each other, while the results of combined CSA-ANN are more accurate than the results of ANN (stand-alone) based on CE and have less error based on MAE and MARE. Accordingly, these outcomes give support and validation to combined models.  Figure 7 shows the observed and simulated water time series data by ANFIS and CSA-ANN models. It can be seen that the simulated data catch up the pattern (trend + periodicity) of observed data along the time series with good matching as well based on the scale of the plot. Moreover, Figure 8 displays Taylor diagram for ANFIS and CSA-ANN prediction models. This diagram offers a graphical summary for the agreement between measured and simulated patterns, considering standard deviation (SD), a root-mean-square difference (RMSD), and correlation coefficient (R). In Figure 8, the blue azimuthal line, grey arc, and green contour line represent the values of R, SD, and RMSD for observed (reference) pattern, respectively. The diagram shows that both ANFIS and CSA-ANN models yielded high R and low SD and RMSD relative to reference point, which refers to the observed pattern. The one-way analysis of variance (ANOVA) statistical technique was used to consider whether the two models (ANFIS and CSA-ANN) are statistically equivalent or one better than the other one. In this study, ANOVA was used to compare the variance (variability in observed and predicted scores) among observed data, ANFIS and CSA-ANN models in the validation stage (N = 26 for each  [62], Levene's test was used to examine the homogeneity of variances of the studied groups. The value of significance value (Sig.) for Levene's test is 0.968 > 0.05, meaning the homogeneity of variance assumption was not violated. Also, Table 4 presents the results of the ANOVA test. The main interesting thing that emerges from this table is the value of Sig. that equal to 0.991 > 0.05 that means a failure to reject the null hypothesis, which states that the groups' means are equal. Overall, we can conclude from this that ANFIS and CSA-ANN techniques are statistically equivalent and they are able to accurately simulate the municipal water demand.  (1) preprocessing data methods have a valuable contribution to denoise water consumption time series and to select the best model input scenario. (2) ANFIS and CSA-ANN techniques are statistically equivalent and reliable tools for monthly municipal water demand prediction based on lags time of water consumption. (3) The main improvement issues of this study are the optimization of (I) wavelet denoising method by using various types with different orders. (II) ANFIS technique via different kinds and numbers of membership functions that affect the precision and generalization of the suggested predictive methodology. (III) CSA-ANN algorithm with five swarms to select the best one that offers the optimum factors of ANN model and increase then the accuracy of prediction.
The proposed methodology of prediction of monthly urban water demand considered all the variables that impact water consumption, including climatic, demographic, socioeconomic, and strategic factors, by using previous water consumption as a model input. Consequently, therefore, this study provides a valuable scientific insight to help City West Water utility in Melbourne City for proper management of the existing freshwater resources, estimating the risk associated to a decision and planning extensions in response to the growing demand.

Conclusions
Prediction of water needed is a crucial element in operating, managing, and planning the urban water system. Although medium-term prediction is quite useful to manage dams for the cities such as Melbourne City that are depending on water harvesting, few studies consider medium-term, as showing in the reviewing articles. This research examines the application of new methodology (combined techniques): two hybrid intelligent techniques, ANFIS and CSA-ANN, combined with data preprocessing were employed to simulate monthly urban water demand depending on historical intervals of water consumption. Based on the literature review, this is the first time that the models CSA-ANN and ANFIS were subjected to new optimization techniques and data analytics to yield a reliable method for estimating medium-term water demand. Historical monthly data of water consumption for the CWW Company in Melbourne City over 15 years (2001-2015) were used to build and assess the models of prediction. Depended on the outcomes, it can be concluded that 1) DWT and stepwise regression are significant approaches to denoise raw time series and select best scenario of model input, respectively. 2) ANFIS and CSA-ANN are statistically equivalent models that are capable to accurately predict municipal water demand based on several statistical and graphical tests. Generally, the presented methodology could highlight the significance of hybrid intelligent models that, combined with robust data preprocessing techniques, can be applied to simulate monthly water demand depending on optimum time intervals. Additionally, this methodology could be an initial ground for future similar applications.
The potential future studies will involve additional testing of different hybrid techniques, especially updated metaheuristic algorithms that have revealed successful application in simulation, nonlinear time series data, as well as the application of further types of data preprocessing to improve the quality of data and choose the best scenario of independent variables. Also, future studies will test these methodologies for different terms, including long term and short term (daily data). Moreover, these models can apply to the data of the other two utilities in Melbourne City to compare the results of studies that give scientific insight to the main water company, which is Melbourne Water Company.