You are currently viewing a new version of our website. To view the old version click .
Water
  • Editor’s Choice
  • Article
  • Open Access

27 June 2024

Enhancing Water Management: A Comparative Analysis of Time Series Prediction Models for Distributed Water Flow in Supply Networks

and
1
Polytechnic Institute of Portalegre, 7300-555 Portalegre, Portugal
2
FCC Aqualia Portugal, 1990-514 Lisboa, Portugal
3
VALORIZA—Research Center for Endogenous Resource Valorization, Polytechnic Institute of Portalegre, 7300-555 Portalegre, Portugal
*
Authors to whom correspondence should be addressed.
This article belongs to the Topic Application of Smart Technologies in Water Resources Management

Abstract

Water scarcity poses a significant challenge to social integration and economic development, necessitating efficient water management strategies. This study compares time series forecasting models, both classical, Holt–Winters and ARIMA, and modern, LSTM and Prophet, to determine the most accurate model for predicting water flow in public supply networks. Data from four rural Portuguese locations were used, with preprocessing ensuring quality and uniformity. Performance metrics were evaluated for both medium-term (10 days) and long-term (3 months) forecasts. Results indicate that classical models like Holt–Winters and ARIMA perform better for medium-term predictions, while modern models, particularly LSTM, excel in long-term forecasts by effectively capturing seasonal patterns. Future research should integrate additional variables and explore hybrid models to enhance forecasting accuracy.

1. Introduction

Water scarcity represents a significant constraint regarding social integration and economic development. While agriculture accounts for 80% of water usage, domestic demand is also rising due to population growth, lifestyle changes, and global warming. These factors are decreasing water availability and increasing demand, necessitating better planning for water reserves, recycling, and efficient use, as well as real-time monitoring systems [1].
Enhancing water distribution efficiency requires pro-active management based on consumption forecasting, rather than reactive management based on current consumption. Accurate forecasting can reduce operating costs by approximately 18% [2], underscoring the importance of selecting the right model for specific dataset characteristics.
There is a considerable body of research work that deals with time series forecasting for water demand [3,4,5,6]. However, most of these studies use time series data from urban areas or, if focusing on rural areas, they correspond to a large number of users within the water distribution systems [7]. Moreover, many authors direct their forecasts towards short-term periods (hourly, daily, or time steps less than a week). On the other hand, forecasting for the medium term (weekly to monthly) or long term (longer than a month) has not yet been fully explored. Medium- and long-term demand forecasting are important for planning and design of water supply systems, managing water resources, and maintenance activities [8].
This work focuses primarily on monitoring of water flow rates in distribution networks in rural areas characterized by low populational density and water scarcity. It aims to forecast the medium- and long-term water demand in rural areas using both classical and modern forecasting methods. Among the classical models, two statistical approaches were considered: the simpler Holt–Winters models and the more advanced AutoRegressive Integrated Moving Average (ARIMA). The modern methods used are the Long-Short-Term Memory (LSTM), a type of deep learning neural network that has gained wide popularity in time series prediction, and Prophet, a recent regression forecasting model that has been successfully used in several time series forecasting tasks, but not yet explored enough in water demand forecasting.
This paper is organized as follows: Section 2—Selected Models and Related Research provides an overview of the research field and of the characteristics of the models used; Section 3—Materials and Methods describes the datasets, the pre-processing and analysis steps, and the model parameter definitions; and Section 4—Results presents the forecasting results for the different time series. The discussion of the results and the conclusions are presented in Section 5—Discussion and Section 6—Conclusions, respectively.

3. Materials and Methods

This section details the methodologies applied for collecting, preprocessing, and analyzing time series data of water flow in rural Portuguese supply sectors.

3.1. Data Acquisition and Preprocessing

This study uses real water flow measurements from four supply sectors located in rural, low-density populated areas in Portugal: Janeiro de Cima (638 inhabitants), Aldeia de Joanes (1168 inhabitants), Degolados (866 inhabitants), and Alcáçova (4798 inhabitants). Data, which were collected from monitoring systems and stored in databases, were made available by the company responsible for the water supply. Data spanning October 2021 to December 2022 were used for training and testing the predictive models.
Preprocessing involved several steps to ensure data quality and uniformity: concatenating files into a single DataFrame, retaining only the datetime and water flow rate columns and converting data types. Equidistance of the time series data was ensured by interpolating missing values and correcting timestamp anomalies. Data recorded at 5 min intervals were resampled to an hourly frequency by averaging the cleaned and processed data that were exported for the analysis.

3.2. Data Description

The final datasets contained hourly water flow records for each location, structured uniformly for model application. Each dataset provided a comprehensive representation of water flow dynamics, essential for effective time series forecasting.
The plots presented in Figure 1 illustrate that each dataset has its own particular characteristics. In Alcáçovas, a significant increase in the average was observed, peaking in August, followed by a sharp decline. This pattern corresponds to a rupture in the distribution network, which was initially undetectable on the surface. Aldeia de Joanes, Janeiro de Cima, and Degolados exhibit high variability, especially during the summer months. These locations are small villages with few permanent residents, but they experience a significant increase in population during the summer, a time when temperatures are also notably high. The effect of the summer is less pronounced in Degolados.
Figure 1. Time series, represented as the plot of the water flow (L/s) over time for the four locations studied: (a) Alcáçova, (b) Aldeia de Joanes, (c) Degolados, and (d) Janeiro de Cima.

3.3. Data Analysis

Data analysis procedures were implemented to prepare and ensure that the time series data meet the assumptions for the forecasting models used. Analyses of the datasets were performed regarding stationarity, autocorrelation, decomposition, and normality distribution assessment. To verify stationarity, the Augmented Dickey–Fuller (ADF) test, the Kwiatkowski–Phillips–Schmidt–Shin (KPSS) test, and the Philips–Perron (PP) test were employed. The ADF and PP tests check for the presence of a unit root, suggesting non-stationarity, while the KPSS test checks for stationarity. Differentiation was applied to transform non-stationary series into stationary ones by computing differences between consecutive observations. The appropriate order of differencing was determined iteratively, followed by re-evaluation using stationarity tests. The autocorrelation function (ACF) and partial autocorrelation function (PACF) plots were used to identify patterns and determine the order of the autoregressive components in the ARIMA model. Time series decomposition was performed to separate the series into trend, seasonality, and residual components using an additive model. To determine if the data followed a normal distribution, the Shapiro–Wilk and Anderson–Darling tests were conducted. These tests assess whether the data deviate significantly from a normal distribution, which is essential for many statistical modeling techniques. In cases where normality was not observed, logarithmic and cubic root transformations were applied to the series, and the normality of the transformed series was reassessed.
For model training and testing, each dataset was divided into a training set consisting of the first 12 months of the time series and a test set including the last 3 months. For the LSTM model, validation data were also necessary: the first 9 months were used for training, the next 3 months for validation, and the final 3 months for testing.

3.4. Parametrization of the Models

This section describes how the four chosen forecasting models were applied to data.
The Holt–Winters method, which applies exponential smoothing to level, trend, and seasonal components, was preferred in its additive form due to the consistent nature of seasonal variations in the data.
The parameters of the ARIMA model were determined using autocorrelation and partial autocorrelation functions and the observed seasonality, aiming to minimize the Akaike Information Criterion (AIC). Residual correlation was also analyzed to finalize parameter selection.
The LSTM models employed in this study included both univariate and multivariate configurations. After optimization, the chosen architectures are represented in Table 1. The input layer is always a matrix of shape (n × 24, k), where n is the number of days considered (n = 10 for medium-term forecasting; n = 30 for long-term forecasting) and k is the number of features considered in each model. The output layer is always a vector of size n × 24 that contains the hourly prediction of the n following days.
Table 1. Architecture for the univariate and the multivariate LSTM networks. n stands for the number of days in the two different time-horizon forecasts considered: 10 days for medium term and 30 days for the long-term forecast.
The univariate model received as input the single time series input, specifically the normalized water flow data. The multivariate model received as input both the normalized water flow time series and four additional variables representing the time information codified by Fourier series decomposition. In both cases, the activation function for the LSTM layers was the rectified linear unit, the optimization algorithm used was the Adam optimizer, and the loss function was the Mean Squared Error.
The parameters for the Prophet model were selected to handle seasonality, trend changes, and holiday effects. The growth parameter was set to “linear”. Yearly and weekly seasonalities were enabled by default, while daily seasonality was added if significant daily patterns were detected. Custom holiday effects were included to account for impactful days, specified based on context and expertise. The seasonality order was adjusted to capture complex seasonal patterns, and the model detected changepoints to reflect significant trend shifts.

3.5. Evaluation Metrics

Model performance was assessed using three key metrics: Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and Coefficient of Determination (R2). MAE (3) provides a direct measure of the average absolute error predictions but is not sensitive to outliers. RMSE (4) captures the standard deviation of prediction errors, emphasizing larger errors due to its quadratic nature. R2 (5) indicates the goodness of fit, as it measures the proportion of variance in the observed data explained by the model. In the majority of cases, it is a very reliable indicator for model performance.
The equations of the metrics are as follows:
M A E = i = 1 N y i y i ^ N
R M S E = i = 1 N y i y i ^ 2 N
R 2 = 1 i = 1 N y i y i ^ 2 i = 1 N y i y ¯ 2
where N is the number of observations; y i ^ is the predicted value for observation i ; y i is the measured value of observation i ; and y ¯ is the mean of the observed values.

3.6. Software and Equipment

The computational tasks were performed on a MacBook Pro equipped with a 2.6 GHz Intel Core i7 processor, 16 GB RAM, and AMD Radeon Pro 5300 M GPU Key software that included the Anaconda Navigator for managing Python environments and dependencies; Visual Studio Code as the integrated development environment (IDE) for code development; and Python libraries, including Pandas for data manipulation, statsmodels for statistical modeling, scikit-learn for evaluation metrics, and TensorFlow/Keras [27,28] for the LSTM implementation.

4. Results

This section presents the findings from the pre-treatment and analysis phases, specifically the outcomes of the stationarity and normality tests. It then presents the performance of the various forecasting models applied to the datasets.

4.1. Stationarity and Normality

For the four datasets, the application of numerical tests for stationarity revealed mixed results: the KPSS test indicated that the series were non-stationary, while the ADF and PP tests suggested that the series were stationary. Considering that the KPSS test can exhibit issues with very large samples, which is the case here, and that the other tests indicated stationarity, along with graphical observations, it was concluded that the series exhibited weak stationarity. Additionally, differentiation did not improve the stationarity of the series.
Both the Shapiro–Wilk and Anderson–Darling tests used to assess normality rejected the null hypothesis of normality across the four raw data series, indicating non-normality. Subsequent transformations, such as the logarithmic and cubic root, did not achieve normality for any of the datasets. These results pointed to the necessity of using robust models that do not rely on the assumption of normality for accurate forecasting.

4.2. Performance of Forecasting Models

In this section, the summary of the performance metrics of the models obtained across the four locations is presented. The results presented reflect the best-case scenario for each of the models and locations, according to the descriptions provided.

4.2.1. Holt–Winters Models

Table 2 summarizes the model performance metrics for medium-term (10 days) and long-term (3 months) predictions for the four locations, evaluated both on training and testing datasets, for the Holt–Winters model. The results presented reflect the best-case scenario for each of the models and locations, which were obtained with the logarithmic transformation for Alcáçova, Aldeia de Joanes, and Janeiro de Cima, and the original data for Degolados.
Table 2. Performance metrics of the Holt–Winters model applied to water flow time series from the four locations studied, for the transformations that resulted in best-case scenario: logarithmic transformation for Alcáçova, Aldeia de Joanes, and Janeiro de Cima, and original data for Degolados.
The Holt–Winters model showed varying levels of effectiveness across different locations, generally performing better in Degolados and Alcáçova for both medium-term and long-term predictions, while struggling significantly with long-term predictions in Aldeia de Joanes and Janeiro de Cima. Of special note are the results for Aldeia de Joanes, where the best R2 value achieved was only 0.404. This is probably due to the high variability of the dataset, as noted above, which the model was unable to capture.
For illustration purposes, Figure 2 presents the zoom in the transition region between training and testing plots superimposed with the predicted time series for (a) Alcáçova and (b) Janeiro de Cima.
Figure 2. Time series amplified over the transition between training and testing regions superimposed with the Holt–Winters model’s forecast for (a) Alcáçova and (b) Janeiro de Cima.

4.2.2. ARIMA Models

Table 3 summarizes the ARIMA model performance metrics for medium-term (10 days) and long-term (3 months) predictions for the four locations, evaluated both on training and testing datasets. The results presented for each case correspond to the models whose parameterization resulted in the lowest AIC and the least residual correlation.
Table 3. Performance metrics of the ARIMA model applied to water flow time series across the four locations studied.
The ARIMA model demonstrated strong performance in Alcáçova for both short- and long-term predictions, with high R2 values and relatively low error metrics. Degolados and Janeiro de Cima also showed good results for short-term predictions, but with a noticeable drop in the R2 value for long-term predictions. As in the previous model, Aldeia de Joanes, while included in the summary, faced challenges with its predictions due to irregularities in its time series data, affecting the model’s ability to provide accurate forecasts.
Figure 3 presents the time series plots zoomed in the transition region between training and testing superimposed with the ARIMA forecasting for (a) Aldeia de Joanes and (b) Degolados.
Figure 3. Original time series zoomed over the transition between training and testing regions superimposed with the ARIMA model’s forecast for (a) Aldeia de Joanes and (b) Degolados.

4.2.3. LSTM Models

Table 4 and Table 5 summarize the LSTM model performance metrics for medium-term (10 days) and long-term (3 months) predictions for the four locations, evaluated both on training and testing datasets, for the univariate and multivariate model, respectively.
Table 4. Performance metrics of the LSTM model applied to water flow time series from the four locations studied, for the univariate model with standardized training data.
Table 5. Performance metrics of the LSTM model applied to water flow time series from the four locations studied, for the approach that resulted in best-case scenario: multivariable model with standardized training data.
The multivariate LSTM models generally show an improvement in performance over the univariate models. This is evident in the slightly lower training MAE and RMSE values and slightly higher training R2 values for most locations. Testing results also show improvement in multivariate models, though the improvement varies by location. The multivariate models generally have higher R2 values during training and testing, indicating better model performance.
Overall, the LSTM model demonstrated strong performance in Alcáçova and Degolados for both medium- and long-term predictions, with high R2 values and relatively low error metrics. The model showed worse performance for Aldeia de Joanes, struggling with accuracy, as well as for Janeiro de Cima, particularly in the long-term predictions. The LSTM model was particularly effective in capturing the annual seasonality of the water flow data, which is crucial for accurate long-term forecasting. However, the Aldeia de Joanes dataset presents a high level of variability in the data used as the test period, and its behavior differs significantly from the same period the previous year. Both challenges proved too demanding for the LSTM model to adequately address, leading to limitations in its predictive capabilities for this dataset.
Figure 4 presents the Alcáçova time series plots zoomed in the transition region between training and testing superimposed with the (a) univariate LSTM and (b) multivariate LSTM forecasting models.
Figure 4. Original Alcáçova’s time series zoomed over the transition between training and testing regions superimposed with the LSTM model’s forecast for the Alcáçova (a) univariate model and (b) multivariate model.

4.2.4. Prophet Models

Table 6 presents Prophet model performance metrics for medium- (10 days) and long-term (3 months) predictions for the four locations.
Table 6. Performance metrics of the Prophet model applied to water flow time series from the four locations studied.
The Prophet model showed good performance in Alcáçova for both medium- and long-term predictions, with high R2 values and relatively low error metrics. Degolados also showed favorable results, especially for medium-term predictions. Janeiro de Cima exhibited moderate performance, with a noticeable drop in the R2 value for testing data in the long-term predictions. As in the previous cases, Aldeia de Joanes struggled with both short- and long-term predictions, showing low R2 values, as did Janeiro de Cima for the long-term forecasting. The Prophet model was particularly effective in capturing seasonal trends in Alcácova and Degolados. However, the particular characteristics of the test data for the other two datasets proved too demanding for the model, similar to what happened with the previous models.
Figure 5 presents the time series plots zoomed in the transition region between training and testing superimposed with the Prophet model’s forecast for a) Degolados and b) Janeiro de Cima.
Figure 5. Original time series zoomed over the transition between training and testing regions superimposed with the Prophet model’s forecast for (a) Degolados and (b) Janeiro de Cima.

4.2.5. Summary of Performance Models

Table 7 and Table 8 present the metrics of the best-performance models across the four locations, for medium-term and long-term forecasting, respectively.
Table 7. Summary of the best-performing models for medium-term forecasting across the four locations.
Table 8. Summary of the best-performing models for long-term forecasting across the four locations.
According to the results presented in Table 7, the models that perform better for medium-term forecasting are the statistical models. Among them, the ARIMA model allowed better short-term forecasting in three of the datasets. Despite the issues of autocorrelation evident in these types of time series, as well as the deviation of their distribution from a normal distribution, the ARIMA model presented the best values for the considered metrics in the short term. It should also be noted that the best forecast achieved for Aldeia de Joanes has quite a low R2 value, as pointed out previously.
For long-term forecasts, Table 8 does not allow for a robust conclusion about a model that systematically leads to better results. In fact, for two of the datasets, Aldeia de Joanes and Janeiro de Cima, the long-term forecasts obtained present low R2 values. To achieve a more conclusive analysis, the average performance metrics of the long-term models across the locations were determined (Table 9). All model results for locations with a Coefficient of Determination lower than 0.50 were disregarded, which meant excluding the results for Aldeia de Joanes and Janeiro de Cima.
Table 9. Average Metrics of Long-Term Forecast Models.
The results presented in Table 9 show that the best average results were obtained by the LSTM model, with an approximate Mean Absolute Error of 0.19 L/s, a Root Mean Squared Error of 0.25 L/s, and a Coefficient of Determination of 0.76. Therefore, the LSTM model stands out as the model that achieves better results for long-term water flow forecasting, followed by the Prophet model. The LSTM model, compared to the Prophet model, also allows for greater control over tuning its parameters. The Prophet model offers fewer adjustment options, which may limit its flexibility and adaptability to specific data characteristics. Both modern models outperformed the classical models (Holt–Winters and ARIMA) in terms of accuracy and reliability for the long-term forecasting.

5. Discussion

The results obtained in this work show that, overall, the ability to forecast accurately varies significantly by location. Alcáçova and Degolados showed the most reliable forecasting results, benefiting from clear temporal patterns that models could effectively capture. In contrast, Aldeia de Joanes and Janeiro de Cima presented more challenges, particularly for long-term forecasts, likely due to higher data variability and weak seasonal patterns. These findings emphasize the importance of tailoring forecasting models to the specific characteristics of each location’s data to achieve optimal results.
Considering Holt–Winters and ARIMA as classical models and LSTM and Prophet as modern models, the results indicate that the classical models generally perform better for medium-term forecasting. Conversely, the modern models tend to yield better results for long-term predictions.
According to the study referenced in [10], which compares the Holt–Winters model with the ARIMA model, it was expected that ARIMA would significantly outperform the Holt–Winters model. While this was observed to some extent in the medium term, the difference was not substantial. Surprisingly, in the long term, the Holt–Winters model performed better than ARIMA. This underperformance of the ARIMA model can be attributed to certain characteristics of the data series, such as poor stationarity, high autocorrelation, and a distribution far from normal. These factors likely impacted its effectiveness.
Contrary to the findings in [15], which suggest that neural network-based models perform well only for short-term horizons (predicting the next value), our results show that the LSTM model tested in this study provided better long-term forecasts. This model was evaluated in a manner that mimicked its application in a production environment, predicting value by value and compiling the results. When comparing our findings with the existing literature, where LSTM models often show extraordinary results under controlled laboratory conditions (e.g., [29]), we concluded that our results are more realistic and reflective of actual performance.
Regarding the ability to capture seasonality, all four models adjusted well to daily seasonality. However, only the LSTM and Prophet models managed to capture annual seasonality effectively. The Holt–Winters and ARIMA models did not account for patterns from the same period in the previous year, being more influenced by recent data leading up to the forecast period. This implies that these models lose accuracy when series exhibit strong seasonal patterns. This limitation of the ARIMA model was also noted in [22] during a comparison with the Prophet model.
On the other hand, the LSTM and Prophet models showed strong tendencies to adhere to annual seasonal patterns. This contradicts the implications in [22], which suggested that the LSTM model might not handle annual seasonality well. The Prophet model demonstrated a good fit for both daily and annual seasonality. However, it exhibited a significant limitation in the amplitude of its forecast curves. This limitation was particularly noticeable for series with high variability in maximum flow values during summer (Aldeia de Joanes and Janeiro de Cima). This raises concerns about the Prophet model’s accuracy if the forecast period includes the summer, as it might significantly under-predict the actual values. This is of special concern in datasets corresponding to low-population-density areas where the population increases significantly during summer, such as in the cases of Aldeia de Joanes and Janeiro de Cima.
The LSTM model proved to be the best at respecting annual seasonality, making predictions based on what happened in the same periods of the previous year. It only encountered issues during the transition period between training and testing, making it difficult to achieve good medium-term forecasts.
The Prophet model showed performance comparable to the LSTM model regarding annual seasonality. However, its limitation in the amplitude of predictions prevented it from achieving better results.
For the ARIMA and Holt–Winters models, the inability to implement annual seasonality renders them accurate for medium-term forecasting but imprecise for long-term forecasting. In a water distribution network, if the number of breaks continues to increase, these models will be unable to alert us as they do not consider the need to make predictions based on similar periods in previous years.
This work has some limitations. A primary constraint was the quality and availability of historical data, as incomplete, noisy, and high-variability data can lead to inaccuracies. Most notably, for one of the datasets used in this work (Aldeia de Joanes), none of the models provided satisfactory results for either medium-term or long-term forecasts. Similarly, for another dataset (Janeiro de Cima), none of the models used for long-term forecasting provided satisfactory results. Each forecasting model, including ARIMA and Holt–Winters, has inherent limitations; for example, ARIMA struggles with non-stationary data and pronounced seasonal patterns, particularly for long-term predictions. The complexity of selecting optimal parameters and the significant computational resources required for training advanced models like LSTM further constrained this study. Additionally, the findings and models are tailored to the specific dataset used, raising concerns about their generalizability to other datasets without further adjustments. The effectiveness of multivariate models is also heavily dependent on the careful selection and preprocessing of input features, where suboptimal choices can degrade model performance. Furthermore, this study does not incorporate additional external variables (e.g., climatic factors) that could potentially enhance forecasting accuracy, especially for modern models like LSTM. These limitations underscore the challenges in developing accurate and robust forecasting models, highlighting the need for meticulous data handling, model tuning, and validation. Further research could focus on the investigation of hybrid models for improvement in a water demand forecast. Hybrid models combining classical and deep learning approaches have been studied for water demand forecasting problems. For instance, combining ARIMA with a type of neural network called the General Regression Neural Network improved daily water forecasting in Saudi Arabia [30]. Another study found that a hybrid model with Holt–Winters, ANN, and SARIMA performed best for industrial water consumption forecasting [25]. More recently, advanced hybrid methods using different machine learning approaches, such as Convolutional Neural Networks, LSTM, and LSTM with attention mechanisms, have been exploited for water demand forecasting and have been shown to lead to very promising results [23].

6. Conclusions

In conclusion, this study underscores the significant importance of selecting appropriate time series forecasting models for efficient water management in distributed supply networks. By comparing classical models such as Holt–Winters and ARIMA with modern approaches like LSTM and Prophet, we demonstrated that the choice of model greatly impacts the accuracy and reliability of both medium-term and long-term forecasts. The results reveal that while classical models perform well for medium-term predictions, modern models, particularly LSTM, excel in long-term forecasting due to their ability to capture complex seasonal patterns. This insight is vital for water distribution networks, which require precise and reliable predictions to optimize resource allocation and minimize operational costs. Future research should focus on integrating additional external variables and exploring hybrid models to further enhance forecasting accuracy.

Author Contributions

Conceptualization, C.P. and M.V.M.; methodology, C.P. and M.V.M.; software, C.P.; validation, M.V.M.; resources, C.P.; data curation, C.P.; writing—original draft preparation, C.P.; writing—review and editing, M.V.M.; visualization, C.P.; supervision, M.V.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by national funds through the Fundação para a Ciência e Tecnologia. I.P. (Portuguese Foundation for Science and Technology) by the project UIDB/05064/2020 (VALORIZA—Research Centre for Endogenous Resource Valorization).

Data Availability Statement

Additional information is available from the authors upon reasonable request.

Acknowledgments

The authors express their sincere gratitude to FCC Aqualia Portugal, for granting permission to use the water flow data from the four supply sectors under study.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Tzanakakis, V.A.; Paranychianakis, N.V.; Angelakis, A.N. Water Supply and Water Scarcity. Water 2020, 12, 2347. [Google Scholar] [CrossRef]
  2. Antunes, A.; Andrade-Campos, A.; Sardinha-Lourenço, A.; Oliveira, M.S. Short-Term Water Demand Forecasting Using Machine Learning Techniques. J. Hydroinform. 2018, 20, 1343–1366. [Google Scholar] [CrossRef]
  3. Parmezan, A.R.S.; Souza, V.M.A.; Batista, G.E.A.P.A. Evaluation of Statistical and Machine Learning Models for Time Series Prediction: Identifying the State-of-the-Art and the Best Conditions for the Use of Each Model. Inf. Sci. 2019, 484, 302–337. [Google Scholar] [CrossRef]
  4. Niknam, A.; Zare, H.K.; Hosseininasab, H.; Mostafaeipour, A.; Herrera, M. A Critical Review of Short-Term Water Demand Forecasting Tools—What Method Should I Use? Sustainability 2022, 14, 5412. [Google Scholar] [CrossRef]
  5. Rahim, M.S.; Nguyen, K.A.; Stewart, R.A.; Giurco, D.; Blumenstein, M. Machine Learning and Data Analytic Techniques in Digitalwater Metering: A Review. Water 2020, 12, 294. [Google Scholar] [CrossRef]
  6. Tealab, A. Time Series Forecasting Using Artificial Neural Networks Methodologies: A Systematic Review. Futur. Comput. Inform. J. 2018, 3, 334–340. [Google Scholar] [CrossRef]
  7. Liu, G.; Savic, D.; Fu, G. Short-Term Water Demand Forecasting Using Data-Centric Machine Learning Approaches. J. Hydroinform. 2023, 25, 895–911. [Google Scholar] [CrossRef]
  8. Dhawan, P.; Dalla Torre, D.; Zanfei, A.; Menapace, A.; Larcher, M.; Righetti, M. Assessment of ERA5-Land Data in Medium-Term Drinking Water Demand Modelling with Deep Learning. Water 2023, 15, 1495. [Google Scholar] [CrossRef]
  9. Panda, M. Application of ARIMA and Holt-Winters Forecasting Model to Predict the Spreading of COVID-19 for India and Its States. medRxiv 2020. [Google Scholar] [CrossRef]
  10. Herrera, M.; Torgo, L.; Izquierdo, J.; Pérez-García, R. Predictive Models for Forecasting Hourly Urban Water Demand. J. Hydrol. 2010, 387, 141–150. [Google Scholar] [CrossRef]
  11. Abdel-Nasser, M.; Mahmoud, K. Accurate Photovoltaic Power Forecasting Models Using Deep LSTM-RNN. Neural Comput. Appl. 2019, 31, 2727–2740. [Google Scholar] [CrossRef]
  12. Liu, Y. Novel Volatility Forecasting Using Deep Learning–Long Short Term Memory Recurrent Neural Networks. Expert Syst. Appl. 2019, 132, 99–109. [Google Scholar] [CrossRef]
  13. Niknam, A.; Zare, H.K.; Hosseininasab, H.; Mostafaeipour, A. Developing an LSTM Model to Forecast the Monthly Water Consumption According to the Effects of the Climatic Factors in Yazd, Iran. J. Eng. Res. 2023, 11, 100028. [Google Scholar] [CrossRef]
  14. Fu, G.; Jin, Y.; Yuan, Z.; Butler, D. The Role of Deep Learning in Urban Water Management: A critical review. Water Res. 2022, 223, 118973. [Google Scholar] [CrossRef] [PubMed]
  15. Makridakis, S.; Spiliotis, E.; Assimakopoulos, V. Statistical and Machine Learning Forecasting Methods: Concerns and Ways Forward. PLoS ONE 2018, 13, e0194889. [Google Scholar] [CrossRef] [PubMed]
  16. Lim, B.; Zohren, S. Time-Series Forecasting with Deep Learning: A Survey. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2021, 379, 20200209. [Google Scholar] [CrossRef] [PubMed]
  17. Kühnert, C.; Gonuguntla, N.M.; Krieg, H.; Nowak, D.; Thomas, J.A. Application of LSTM Networks for Water Demand Prediction in Optimal Pump Control. Water 2021, 13, 644. [Google Scholar] [CrossRef]
  18. Siami-Namini, S.; Tavakoli, N.; Siami Namin, A. A Comparison of ARIMA and LSTM in Forecasting Time Series. In Proceedings of the 17th IEEE International Conference on Machine Learning and Applications, ICMLA 2018, Orlando, FL, USA, 17–20 December 2018; Institute of Electrical and Electronics Engineers Inc.: Sakheer, Bahrain, 2019; pp. 1394–1401. [Google Scholar] [CrossRef]
  19. Taylor, S.J.; Letham, B. Forecasting at Scale. PeerJ Prepr. 2017, 5, e3190v2. [Google Scholar] [CrossRef]
  20. Ning, Y.; Kazemi, H.; Tahmasebi, P. A Comparative Machine Learning Study for Time Series Oil Production Forecasting: ARIMA, LSTM, and Prophet. Comput. Geosci. 2022, 164, 105126. [Google Scholar] [CrossRef]
  21. Chaturvedi, S.; Rajasekar, E.; Natarajan, S.; McCullen, N. A Comparative Assessment of SARIMA, LSTM RNN and Fb Prophet Models to Forecast Total and Peak Monthly Energy Demand for India. Energy Policy 2022, 168, 113097. [Google Scholar] [CrossRef]
  22. Duarte, D.; Faerman, J. Comparison of Time Series Prediction of Healthcare Emergency Department Indicators with ARIMA and Prophet; Academy and Industry Research Collaboration Center (AIRCC): Chennai, India, 2019; pp. 123–133. [Google Scholar]
  23. Zhou, S.; Guo, S.; Du, B.; Huang, S.; Guo, J. A Hybrid Framework for Multivariate Time Series Forecasting of Daily Urban Water Demand Using Attention-Based Convolutional Neural Network and Long Short-Term Memory Network. Sustainability 2022, 14, 11086. [Google Scholar] [CrossRef]
  24. Boudhaouia, A.; Wira, P. A Real-Time Data Analysis Platform for Short-Term Water Consumption Forecasting with Machine Learning. Forecasting 2021, 3, 682–694. [Google Scholar] [CrossRef]
  25. Ticherahine, A.; Boudhaouia, A.; Wira, P.; Makhlouf, A. Time Series Forecasting of Hourly Water Consumption with Combinations of Deterministic and Learning Models in the Context of a Tertiary Building. In Proceedings of the 2020 International Conference on Decision Aid Sciences and Application, DASA 2020, Online, 8–9 November 2020; Institute of Electrical and Electronics Engineers Inc.: Sakheer, Bahrain, 2020; pp. 116–121. [Google Scholar]
  26. Al-Zahrani, M.A.; Abo-Monasar, A. Urban Residential Water Demand Prediction Based on Artificial Neural Networks and Time Series Models. Water Resour. Manag. 2015, 29, 3651–3662. [Google Scholar] [CrossRef]
  27. Abadi, M.; Agarwal, A.; Barham, P.; Brevdo, E.; Chen, Z.; Citro, C.; Corrado, G.S.; Davis, A.; Dean, J.; Devin, M.; et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. In Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), Savannah, GA, USA, 2–4 November 2016; pp. 265–283. [Google Scholar]
  28. Chollet, F.; Watson, M.; Bursztein, E.; Zhu, Q.S.; Jin, H.K. Keras. Available online: https://keras.io/getting_started/faq/ (accessed on 23 May 2024).
  29. Lazzeri, F. Machine Learning for Time Series Forecasting with Python; Wiley: Hoboken, NJ, USA, 2021; ISBN 10-1119682363. [Google Scholar]
  30. Alnsour, M.A.; Ijam, A.Z. Specifying a Cascade Water Demand Forecasting Model Using Time-Series Analysis: A Case of Jordan. Sustain. Water Resour. Manag. 2023, 9, 37. [Google Scholar] [CrossRef] [PubMed]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.