Comparative Evaluation of Water Level Forecasting Using IoT Sensor Data: Hydrodynamic Model SWMM vs. Machine Learning Models Based on NARX Framework

Frisk, Fredrik; Johansson, Ola

doi:10.3390/w16192776

Open AccessArticle

Comparative Evaluation of Water Level Forecasting Using IoT Sensor Data: Hydrodynamic Model SWMM vs. Machine Learning Models Based on NARX Framework

by

Fredrik Frisk

^1,*

and

Ola Johansson

²

¹

Department of Computer Science, Kristianstad University, 291 88 Kristianstad, Sweden

²

Department of Urban Studies, Malmö University, 205 06 Malmö, Sweden

^*

Author to whom correspondence should be addressed.

Water 2024, 16(19), 2776; https://doi.org/10.3390/w16192776

Submission received: 26 August 2024 / Revised: 23 September 2024 / Accepted: 25 September 2024 / Published: 29 September 2024

(This article belongs to the Section Urban Water Management)

Download

Browse Figures

Versions Notes

Abstract

This study evaluates the accuracy of water level forecasting using two approaches: the hydrodynamic model SWMM and machine learning (ML) models based on the Nonlinear Autoregressive with Exogenous Inputs (NARX) framework. SWMM offers a physically based modeling approach, while NARX is a data-driven method. Both models use real-time precipitation data, with their predictions compared against measurements from a network of IoT sensors in a stormwater management system. The results demonstrate that while both models provide effective forecasts, NARX models exhibit higher accuracy, with improved Nash–Sutcliffe Efficiency (NSE) coefficients and 33–37% lower mean absolute error (MAE) compared to SWMM. Despite these advantages, NARX models may struggle with limited data on extreme flooding events, where they could face accuracy challenges. Enhancements in SWMM modeling and calibration could reduce the performance gap, but the development of SWMM models requires substantial expertise and resources. In contrast, NARX models are generally more resource-efficient. Future research should focus on integrating both approaches by leveraging SWMM simulations to generate synthetic data, particularly for extreme weather events, to enhance the robustness of NARX and other ML models in real-world flood prediction scenarios.

Keywords:

Internet of Things; machine learning; stormwater management; water level forecasting; wireless sensor networks

Graphical Abstract

1. Introduction

Urban flood control is essential due to the significant damage that flood events can cause to infrastructure, property, and human lives. Accurate prediction of these events is crucial, especially in low-lying or coastal areas, as it offers opportunities to mitigate associated risks [1]. Numerical modeling techniques are vital tools for forecasting and managing flood risks effectively [2].

1.1. Forecasting of Stormwater Levels

The Stormwater Management Model (SWMM) and machine learning (ML) are two prominent approaches for predicting flooding. SWMM [3] offers a comprehensive platform for simulating urban hydrology incorporating intricate representations of various processes such as rainfall–runoff, drainage systems, and pollutant transport. In contrast, ML utilizes data-driven algorithms to analyze patterns, potentially providing advantages in model building simplicity, computational speed, and adaptability.

SWMM’s strength lies in its detailed simulation of hydrological processes within urban environments, leveraging extensive data on land use, topography, and hydraulic structures. By providing a comprehensive representation of the urban hydrological cycle, SWMM enables the evaluation of various flood scenarios and flood risk management strategies, making it widely used in hydrology and flood-related studies [4,5,6,7].

Originally developed by the United States Environmental Protection Agency (EPA) in 1971, SWMM has undergone several iterations, with the latest major release, SWMM 5, being redeveloped from scratch and released as open-source software in the mid-2000s. SWMM 5 introduced a more robust and flexible modeling environment, facilitating integration with other software and adaptation to a wide range of applications, such as PYSWMM [8], which enables interaction with SWMM via a Python interface.

SWMM is increasingly used to assess the impacts of climate change [9] on urban drainage systems, simulating various climate scenarios to understand how rising temperatures and changing precipitation patterns could affect urban water systems and guide adaptations to enhance resilience. It also aids in evaluating flood risks and developing flood mitigation strategies, supporting real-time decision-making in stormwater management. However, a significant limitation of SWMM models is their high degree of specificity, which restricts their reusability across different contexts. Furthermore, these models typically necessitate the calibration of numerous parameters, which can be a complex and resource-intensive process [10,11,12].

Machine learning (ML) has emerged as a promising alternative to SWMM for flood prediction. Unlike SWMM, which relies on explicit hydrological modeling, ML is a data-driven approach that requires high-quality data and has merits in computational cost and speed [13]. ML algorithms can automatically identify complex patterns and relationships in hydrological data, enabling more flexible and adaptive flood prediction models.

Additionally, ML techniques are often more scalable and computationally efficient, allowing for real-time or near-real-time flood forecasting in urban environments. The results in [14] indicate that the model achieved a higher degree of accuracy and incurred lower computational costs compared to a hydrological model (SWMM).

However, the effectiveness of ML for flood prediction depends on the quality and quantity of available data, as well as the selection and optimization of appropriate ML algorithms. Inadequate or biased data can result in inaccurate predictions and restrict the applicability of ML models. However, using IoT sensor data as inputs to an ML model has been shown to increase the model accuracy [15], and SWMM has been used to augment existing data in [16].

In an extensive literature review [17], the efficacy of various prediction methods for both long-term and short-term floods was assessed through a qualitative analysis focusing on robustness, accuracy, effectiveness, and computational speed, thus providing a comprehensive understanding of the different techniques employed for flood predictions.

The specific challenge of short-term forecasting of flooding in stormwater systems has been addressed using multiple models, including Recurrent Neural Networks (RNN) and non-linear ARX models [18], as well as linear regression, support vector machine and others [19] and deep reinforced learning [20]. The study [21] proposes an event-based decision support algorithm for real-time flood forecasting driven by non-linear machine learning models.

1.2. Kristianstad Municipality IoT-Platform

Recognizing the pivotal role of data in managing the risk of flooding, Kristianstad municipality in Southern Sweden, home to the nation’s lowest point with an elevation at −2.32 m below the sea level, has implemented a comprehensive network of IoT sensors. Developed in collaboration with Kristianstad University and supported by Vinnova [22], this IoT network comprises three types of sensors: (1) weather sensors strategically positioned to monitor precipitation levels across key urban locations, (2) water level sensors installed at critical points within the stormwater system, as well as in nearby streams, and lakes to monitor water depth, and (3) sensors designed to measure groundwater levels.

In total, 653 units have been deployed in and around the city of Kristianstad. The geographical distribution of these sensors is depicted in Figure 1. Notably, in the vicinity of the village of Degeberga, a total of 49 sensors have been installed.

All sensors are equipped with wireless communication capability via an LoRaWAN [23] network, which is built and maintained by the municipality. The sensors transmit data to an IoT portal, from which the image in Figure 1 is derived. Most sensors are situated in the city center, focusing on surface channels and underground drainage systems. Additionally, sensors are deployed in surrounding villages and along waterways, including the small river that traverses the city. The deployed sensors serve various purposes, including monitoring water levels in stormwater pipes (101) and wastewater pipes (286), as well as groundwater levels (54), water levels in streams, lakes, and at sea (83), rain gauges (38), among others. The deployment of the IoT sensor network started in 2021.

2. Materials and Methods

This paper compares the accuracy of water level predictions between a hydrological model developed using SWMM and NARX models utilizing data from the IoT platform.

Degeberga, a village located 25 km south of Kristianstad, was selected as the evaluation site for the study for several reasons. Firstly, Degeberga was one of the initial villages to implement a comprehensive, updated SWMM model. Secondly, its geographical position, being distant from both the seashore and large bodies of water, significantly simplifies the SWMM model and the subsequent analysis of the stormwater system, as it experiences minimal or no influence from outfalls to the sea or water streams. Lastly, the stormwater pipe system in Degeberga is relatively small and easily comprehensible, making it an ideal case study for this comparative analysis.

2.1. IoT-Water Level Sensors

The red and blue dots in Figure 2 show the location of the deployed sensors that are measuring the water level in the stormwater pipes. The blue dots show the location of the sensors that have been included in this study. The red dots indicate the location of the sensors that have been excluded from the analysis due to incomplete or unreliable readings. The procedure for the exclusion is explained below. The blue star shows the position of the local rain gauge sensor.

The AxSensor [24] is the device most utilized for measurements in the stormwater system and is also the predominant gauge used in Degeberga. It comprises a measuring tube that is positioned at the flow bed. An acoustic pulse is emitted by the sensor within the tube towards the liquid surface and compared to a reference point within the tube. An example of a deployed sensor is displayed in Figure 3.

The water level is measured approximately every 10 min, indicating a sample time of 10 min, with one important exception. The sensor unit is configured to enter sleep mode if the water level falls below 20 mm. Consequently, if the water level is under 20 mm, the sensor unit will not transmit any data via the LoRaWAN network. Data transmission will resume once the measured water level exceeds 20 mm. The precision of the sensor unit is ±5 mm, as specified by the manufacturer. Additionally, there is a rain gauge [25] installed in Degeberga, which measures precipitation approximately every 10 min with a resolution of 0.2 mm.

2.2. Machine Learning Model and Training Set

The water levels in the stormwater management system were predicted using a NARX approach, which has successfully forecasted water levels in numerous instances (see [26,27,28,29]). The NARX model (1) consists of three components:

An autoregressive part that utilizes actual water levels up to time t − p, where p in this study is either one hour or 24 h.
An exogenous part that incorporates actual precipitation up to t − 10 min, including interaction terms.
A logarithmic transformation to address issues such as non-linearity, heteroscedasticity, and non-normality of the residuals.

Parameter estimation for the model was performed using the least squares optimization technique in scikit-learn [30] on a training dataset consisting of sensor data from 1 September 2023 to 31 January 2024.

y_{t} = e x p (μ + \sum_{k = 1}^{m} ϕ_{k} y_{t - m - p} + \sum_{i = 1}^{n} β_{i} x_{i} + \sum_{i = 1}^{n} \sum_{j = i + 1}^{n} β_{i j} x_{i} x_{j}) + ε

(1)

where:

$y_{t}$ is the water level at time t.
$μ$ is the intercept term.
$ϕ_{1}, ϕ_{2}, \dots, ϕ_{m}$ are the autoregressive coefficients.
$y_{t - 1 - p}, y_{t - 2 - p}, \dots, y_{t - m - p}$ are lagged values of the water level up to a time t − p.
$β_{1}, β_{2}, \dots, β_{n}$ are the coefficients associated with the exogenous variables $x_{1}, x_{2}, \dots, x_{n}$ $β_{12}, β_{13}, \dots, β_{n m}$ are the coefficients associated with the interaction terms between $x_{1}, x_{2}, \dots, x_{n}$
$ε$ is the error term

Preceding the modeling phase, the sensor data underwent preprocessing by recalculating water levels at regular 10 min intervals, commencing at 00:00:00, 00:10:00, and so forth. This process utilizes linear interpolation between recorded measurements to accommodate variations in reporting times from IoT sensors and to address occasionally missing data points. Additionally, any values below 20 mm were adjusted to 20 mm to align with the sensor’s threshold value.

In the context of forecasting, a direct methodology was employed to project water levels for two distinct timeframes: one hour and 24 h. The direct methodology involves training the model to make specific predictions for each of these time horizons directly rather than iteratively predicting one step ahead or relying on intermediate forecasts. This approach helps in addressing the unique challenges and uncertainties associated with shorter and longer forecasting periods. For the one-hour forecast, the model captures the immediate impacts of recent rainfall and water level changes, providing valuable information for near-term decision-making. The 24 h forecast, on the other hand, accounts for the cumulative effects over a longer period, including potential lag effects in the stormwater system’s response to precipitation events. By employing this dual timeframe forecasting approach, the methodology ensures that both short-term and long-term water level predictions are accurately assessed, offering a comprehensive understanding of the system’s behavior under varying conditions.

2.3. Test Set and Evaluation Metrics

The model was validated with an independent test dataset from 1 February 2024 to 24 February 2024, utilizing the evaluation metrics mentioned below. To compare the performance of the SWMM and NARX models in forecasting water levels, two evaluation metrics were employed: mean absolute error (MAE) and Nash–Sutcliffe Efficiency (NSE). MAE quantifies the average magnitude of the errors between the predicted and observed values, providing an intuitive measure of forecast accuracy. It reflects how close the model predictions are to the actual measurements on average. NSE, on the other hand, is a widely used statistical metric in hydrological modeling that assesses the predictive power of models by comparing the variance of the residuals to the variance of the observed data. An NSE value of 1 signifies perfect predictive accuracy, while an NSE value of 0 indicates that the model is as accurate as the mean of the observed data. Using these metrics allows for a comprehensive evaluation and direct comparison of the SWMM and NARX models, highlighting their respective strengths and limitations in representing water level dynamics within the stormwater management system.

2.4. SWMM

The municipality maintains the stormwater system using dpWater [31], a mapping software that documents all details of the municipality’s piping systems. This includes depths measured in RH2000 [32] (the official national Swedish height system), gradients, placements, nomenclatures, materials, and construction years.

The stormwater model in dpWater is exported to SWMM for simulations of precipitation and water level forecasting and to simulate reconstruction and expansion of the existing stormwater system. Degeberga village, located in Kristianstad municipality, is the first area to be fully modeled in SWMM. Figure 2 provides an overview of the village extracted from the SWMM model. The stormwater system consists of several separate pipe networks, each having outfalls discharging into different local water streams. The three water level sensors selected for this study are associated with different, unconnected subsystems.

The SWMM simulation of the water levels was conducted using the PYSWMM [8] software, enabling a more straightforward comparison of the results from the NARX model and the SWMM model within the same Python environment. The input for the SWMM simulation model was actual precipitation data sourced from the IoT portal, and the output was compared to the observed water levels from the sensors.

Despite meticulous efforts in model development, the SWMM simulation output exhibited significant discrepancies compared to the observed water levels, with substantial differences in both the mean and amplitude, as indicated by the high MAE in Table 1. Attempts were made to calibrate the SWMM model to historical data, but this proved challenging due to the occasional presence of spurious water flows. It is suspected that groundwater may be leaking into the system, creating additional water flows. This phenomenon is particularly evident when there is water running in the stormwater system despite the absence of rain. These issues made it difficult to achieve a well-calibrated SWMM model, thereby impacting its performance compared to the NARX model.

Despite this, the SWMM output demonstrated a moderate to high correlation with the sensor readings, as shown in Table 2. To address the discrepancies and facilitate a meaningful comparison between the methods, a linear transformation (2) was applied to adjust the simulated SWMM values.

d_{a d j} = α + β \cdot d_{S W M M} + ε

(2)

where d_adj is the adjusted water level and d_SWMM is the water level from the SWMM simulation. The two parameters, α and β, are chosen to minimize the error term ε for the training period.

2.5. Data Acquisition Issues

The large-scale deployment of sensors in Kristianstad municipality faced numerous obstacles and unforeseen challenges that compromised data integrity and continuity of data collection. One significant issue was measurement drift, a common problem where sensor accuracy degrades over time, necessitating frequent recalibration to maintain data reliability. This issue was exacerbated by deficiencies in the early versions of the AxSensors, particularly in certain plastic components, and software problems that failed to adequately account for the effects of ambient temperature and humidity on the units, leading to data fluctuations.

In addition, low battery levels resulted in intermittent data losses. There is an inherent conflict between maintaining high sampling rates and minimizing energy consumption in battery-powered sensors. To enhance system reliability and extend battery life, experiments were conducted with various sampling frequencies, as well as the implementation of a deadband, where water levels below a certain threshold were not reported by the sensor.

Communication issues were also prevalent, with signal interference and range limitations frequently disrupting data transmission. Moreover, the physical security of the equipment emerged as a concern, with multiple incidents of vandalism recorded at various sites. While placing antennas high above the ground can address signal issues, safeguarding the sensors from vandalism often requires hidden or underground placement, creating a challenging conflict between optimizing signal strength and ensuring security.

These challenges highlight the complexity and dynamic nature of managing large-scale sensor networks and emphasize the importance of comprehensive planning and continuous monitoring to effectively mitigate such issues. Over time, multiple software updates were necessary, each causing temporary disruptions in data collection. However, through iterative improvements and the refinement of both hardware and software, the stability and accuracy of the sensor network gradually improved, ultimately paving the way for more reliable data collection and the long-term sustainability of the monitoring system.

Figure 4 presents measurements from a sensor, illustrating the challenges discussed above, such as shifts due to software and hardware updates and measurement drift over time. Due to these persistent challenges, the scope of this study was narrowed to three independent subsystems of the Degeberga stormwater management system, where continuous and reliable data were available. This focus allowed for a more reliable comparison of forecasting methodologies within a controlled and stable environment.

3. Results

Water levels simulated in SWMM demonstrate moderate to high correlations with the observed water levels, as depicted in Table 1. However, despite efforts to correct the model, the simulation output is not calibrated to the sensor readings, leading to disparities in both the mean and amplitude, as evidenced by the high MAE in Table 2 and negative NSE values in Table 3. The linear adjustment of the simulated SWMM improves the situation and significantly reduces the MAE, increasing NSE while maintaining the same correlation, see Table 1, Table 2 and Table 3.

However, despite the efforts to correct the SWMM model, both NARX models, with 1 h and 24 h forecasting horizons, outperform SWMM in terms of accuracy. The NARX models exhibit superior correlation coefficients, lower MAE values, and higher NSE values. Notably, the 24 h NARX model achieves an MAE reduction of 33–37% compared to the SWMM model, as shown in Table 2. Among the NARX models, the 1 h forecast surpasses the 24 h forecast in overall accuracy.

In Figure 5, a comparison between the actual water level and the two predictions is shown for one of the analyzed sensor nodes. The actual water level and rainfall data reported by the sensors are depicted in blue, while the adjusted SWMM simulation values are represented in orange. Additionally, the 24 h forecast produced by the NARX model is illustrated in green, spanning a three-day duration at the beginning of February.

4. Discussion

Comparing the two methodologies reveals that both approaches can effectively forecast water levels within the stormwater management system. However, constructing SWMM models demands significantly more technical expertise than developing a NARX model. Despite the diligent efforts of the municipality’s engineers, the simulation output of SWMM still requires an adjustment to align with sensor readings. Nevertheless, even after recalibration, the SWMM model performs less effectively than the NARX models.

While NARX models appear superior in this investigation, it’s important to recognize that the SWMM model does include potential system defects like groundwater infiltration into the stormwater network or other leaks. Moreover, although ML models excel at detecting patterns within the available data, they may struggle to extrapolate values beyond the observed range. The dataset’s lack of major flooding incidents further restricts the models’ ability to accurately predict significant flooding events resulting from prolonged heavy rainfall. This absence of major flood event data is a significant limitation, emphasizing the need to address this gap for better assessment of models’ effectiveness during extreme weather events.

To mitigate this limitation, future efforts should focus on expanding the dataset to include extreme event simulations. This can be completed by systematically incorporating historical data from past flood events or by creating high-fidelity simulated events using hydrological models. Integrating such data would enhance the models’ predictive capabilities under varied conditions, making forecasts more reliable during extreme weather scenarios.

In this context, SWMM models can be invaluable for data augmentation, supplementing the existing dataset with simulated water levels for intense rainfall events. This would create a more comprehensive training dataset for all types of ML algorithms, potentially improving their predictive capabilities. However, ensuring the representativeness and realism of the augmented data, as well as reconciling differences between simulated and real-world data, are practical challenges that need to be addressed. Future research should focus on methodologies for seamlessly integrating synthetic and observed datasets to enhance ML models’ robustness.

In this study, actual rainfall data, collected by a rain gauge at 10 min intervals, was used as input for both the SWMM simulation and the NARX model. In practical applications, however, rainfall forecasts would be sourced from meteorological services, which are likely to be less accurate than direct measurements and typically available on an hourly basis. Consequently, this could result in less accurate water level forecasts. Nonetheless, the performance differences between the two models are expected to remain consistent.

Despite the promising results, the study also highlights some of the challenges with obtaining reliable data on water levels in a stormwater management system, e.g., unwanted shift and drift in measurement levels due to battery usage, sensor software updates and vandalism, and missing values due to LoRaWAN communication issues.

Yet, these challenges also present opportunities for improvement. By implementing more robust sensor technologies, refining data collection protocols, upgrading wireless communication systems, and integrating SWMM simulations with sensor data to inform ML models, there is potential for an even more effective approach to forecasting the risk of flooding events.

5. Conclusions

This study demonstrates that both NARX models and SWMM can effectively forecast water levels within a stormwater management system. Constructing SWMM models, however, requires significantly more technical expertise than developing NARX. Despite extensive efforts in model development, the simulation output from SWMM still necessitates further adjustments to align with the observed water levels. Even after recalibration, the accuracy of the SWMM models does not achieve the same levels as that of the NARX models.

While NARX models demonstrate superior performance in this investigation, with higher NSE and lower MAE, it is crucial to recognize their limitations. Statistical time-series methods excel at identifying patterns within data but may struggle to extrapolate values beyond the observed range. The dataset used in this study, which lacks major flooding incidents, may result in NARX models that fail to accurately predict significant flooding resulting from prolonged heavy rainfall.

In this context, SWMM models could be valuable for data augmentation, supplementing the existing dataset with simulated water levels for intense rainfall events. This augmentation would create a more comprehensive training dataset for all types of ML algorithms, potentially enhancing their predictive capabilities.

The study also highlights several challenges associated with obtaining reliable data on water levels in a stormwater management system. Issues such as unwanted shifts and drifts in measurement levels due to battery usage, sensor software updates, vandalism, and missing values caused by LoRaWAN communication issues were noted. Despite these challenges, they present opportunities for improvement. By implementing more robust sensor technologies, refining data collection protocols, upgrading wireless communication systems, and integrating SWMM simulations with sensor data to inform ML models, there is potential for an even more effective approach to forecasting the risk of flooding events.

In conclusion, while both SWMM and NARX models offer effective tools for forecasting stormwater levels, NARX models demonstrate superior accuracy and are less resource-intensive to develop and maintain. However, to fully leverage the strengths of both models, future research should prioritize enhancing data reliability and integrating the two approaches. Expanding datasets to include extreme weather events, generating synthetic data through SWMM simulations, and validating predictions with real-world flood scenarios will significantly improve the robustness and predictive power of NARX models. Additionally, addressing challenges in IoT sensor data collection, such as measurement drift, communication issues, and vandalism, through the adoption of more reliable sensor technologies and improved maintenance protocols will be crucial. These improvements not only enhance forecasting accuracy but also provide practical benefits for stormwater management system operators relying on IoT sensor networks. Finally, future research could explore different ML models beyond NARX to predict water levels, offering potential improvements in forecasting capabilities.

Author Contributions

Both authors contributed equally to this work. F.F. and O.J. were involved in the conceptualization, methodology, formal analysis, investigation, data curation, and writing—original draft preparation. Both authors also contributed to the writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data supporting the findings of this study are owned by Kristianstad Municipality. Due to privacy and confidentiality concerns, access to the data is restricted and not publicly available.

Acknowledgments

The authors would like to thank Kristianstad Municipality for granting us access to the municipality’s IoT portal and all sensor data. Additionally, we would like to thank Magnus Lindgren for his discussions regarding sensor evaluation, providing pictures, and engaging in fruitful conversations. Finally, we thank Marcus Vidal for providing SWMM models and enhancing them throughout the course of this work.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Rosenzweig, B.R.; Cantis, P.H.; Kim, Y.; Cohn, A.; Grove, K.; Brock, J.; Yesuf, J.; Mistry, P.; Welty, C.; McPhearson, T.; et al. The Value of Urban Flood Modeling. Earth’s Future 2021, 9, e2020EF001739. [Google Scholar] [CrossRef]
Luo, P.; Luo, M.; Li, F.; Qi, X.; Huo, A.; Wang, Z.; He, B.; Takara, K.; Nover, D.; Wang, Y. Urban flood numerical simulation: Research, methods and future perspectives. Environ. Model. Softw. 2022, 156, 105478. [Google Scholar] [CrossRef]
Huber, W.C.; Dickinson, R.E.; Barnwell, T.O., Jr.; Branch, A. Storm Water Management Model, Version 4: User’s Manual; U.S. Environmental Protection Agency, Environmental Research Laboratory: Athens, GA, USA, 1988.
Sonavane, N.; Rangari, V.A.; Waikar, M.L.; Patil, M. Urban storm-water modeling using EPA SWMM—A case study of Pune city. In Proceedings of the 2020 IEEE Bangalore Humanitarian Technology Conference (B-HTC), Vijiyapur, India, 8–10 October 2020. [Google Scholar]
Yano, K.A.V.; Cabaluna, M.A.D.; Rempis, M.B.; Sales, A.I.S.; Beren, Q.Z.P.; Poso, F.D.; Vergel, J.M.B. Effect of Rainwater Gardens as Flood Mitigation using Storm Water Management Model. In Proceedings of the 2022 IEEE 14th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM), Boracay, Philippines, 1–4 December 2022; pp. 1–5. [Google Scholar]
Calot, J.B.; Galang, J.B.R.; Lagrata, J.P.R.; Marcelino, R.A.T.; Buenconsejo, M.V.; Poso, F.D.; Escarieses, L.L.E. Sustainable Drainage System: Low Impact Development Practices to Minimize the Storm Water Runoff. In Proceedings of the 2022 IEEE 14th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM), Boracay, Philippines, 1–4 December 2022; pp. 1–5. [Google Scholar]
Lee, J.; Chung, G.; Park, H.; Park, I. Evaluation of the Structure of Urban Stormwater Pipe Network Using Drainage Density. Water 2018, 10, 1444. [Google Scholar] [CrossRef]
Open Water Analytics. PySWMM. Version 2.0.1, Open Water Analytics. Available online: https://www.pyswmm.org (accessed on 20 May 2024).
Hassan, W.H.; Nile, B.K.; Kadhim, Z.K. Effect of climate change on the flooding of storm water networks under extreme rainfall events using SWMM simulations: A case study. Model. Earth Syst. Environ. 2024, 10, 4129–4161. [Google Scholar] [CrossRef]
Szeląg, B.; Majerek, D.; Eusebi, A.L.; Kiczko, A.; de Paola, F.; McGarity, A.; Wałek, G.; Fatone, F. Tool for fast assessment of stormwater flood volumes for urban catchment: A machine learning approach. J. Environ. Manag. 2024, 355, 120214. [Google Scholar] [CrossRef] [PubMed]
Kourtis, I.M.; Kopsiaftis, G.; Bellos, V.; Tsihrintzis, V.A. Calibration and validation of SWMM model in two urban catchments in Athens, Greece. In Proceedings of the International Conference on Environmental Science and Technology (CEST 2017), Rhodes, Greece, 31 August–2 September 2017. [Google Scholar]
Ma, B.; Wu, Z.; Hu, C.; Wang, H.; Xu, H.; Yan, D.; Soomro, S.-E. Process-oriented SWMM real-time correction and urban flood dynamic simulation. J. Hydrol. 2022, 605, 127269. [Google Scholar] [CrossRef]
Guo, K.; Guan, M.; Yu, D. Urban surface water flood modelling—A comprehensive review of current models and future challenges. Hydrol. Earth Syst. Sci. 2021, 25, 2843–2860. [Google Scholar] [CrossRef]
Brito, L.A.V.; Meneguette, R.I.; De Grande, R.E.; Ranieri, C.M.; Ueyama, J. FLORAS: Urban flash-flood prediction using a multivariate model. Appl. Intell. 2022, 53, 16107–16125. [Google Scholar] [CrossRef]
Yang, S.-N.; Chang, L.-C. Regional Inundation Forecasting Using Machine Learning Techniques with the Internet of Things. Water 2020, 12, 1578. [Google Scholar] [CrossRef]
Kim, H.I.; Han, K.Y. Urban Flood Prediction Using Deep Neural Network with Data Augmentation. Water 2020, 12, 899. [Google Scholar] [CrossRef]
Mosavi, A.; Ozturk, P.; Chau, K.-W. Flood Prediction Using Machine Learning Models: Literature Review. Water 2018, 10, 1536. [Google Scholar] [CrossRef]
Chang, F.-J.; Chen, P.-A.; Lu, Y.-R.; Huang, E.; Chang, K.-Y. Real-time multi-step-ahead water level forecasting by recurrent neural networks for urban flood control. J. Hydrol. 2014, 517, 836–846. [Google Scholar] [CrossRef]
Dai, W.; Tang, Y.; Zhang, Z.; Cai, Z. Ensemble learning technology for coastal flood forecasting in internet-of-things-enabled smart city. Int. J. Comput. Intell. Syst. 2021, 14, 166. [Google Scholar] [CrossRef]
Wang, Y.; Chen, X.; Wang, L.; Min, G. Effective IoT-Facilitated Storm Surge Flood Modeling Based on Deep Reinforcement Learning. IEEE Internet Things J. 2020, 7, 6338–6347. [Google Scholar] [CrossRef]
Piadeh, F.; Behzadian, K.; Chen, A.S.; Campos, L.C.; Rizzuto, J.P.; Kapelan, Z. Event-based decision support algorithm for real-time flood forecasting in urban drainage systems using machine learning modelling. Environ. Model. Softw. 2023, 167, 105772. [Google Scholar] [CrossRef]
Vinnova. Smart Real-Time Monitored VA System for Measured Overflow with Directly Connected Data Analysis. Available online: https://www.vinnova.se/en/p/smart-real-time-monitored-va-system-for-measured-overflow-with-directly-connected-data-analysis/ (accessed on 20 May 2024).
Alliance, L. LoRaWAN^® 1.0.4 Specification. 2018. Available online: https://resources.lora-alliance.org/technical-specifications/ts001-1-0-4-lorawan-l2-1-0-4-specification (accessed on 20 May 2024).
AXSensor. AXSensor Sewage Monitoring. AXSensor Sewage Monitoring 03. 2024. Available online: https://www.axsensor.com/wp-content/uploads/2021/06/AXsensor_Sewage_monitoring_03_EN.pdf (accessed on 20 May 2024).
MJK. Rain Gauge Sensor. MJK Professional Regnmätare. 2024. Available online: https://mjk.se/wp-content/uploads/2023/04/d_regn_prof.pdf (accessed on 21 May 2024).
Jiang, F.; Dong, Z.; Wang, Z.; Zhu, Y.; Liu, M.; Luo, Y.; Zhang, T. Flood forecasting using an improved NARX network based on wavelet analysis coupled with uncertainty analysis by Monte Carlo simulations: A case study of Taihu Basin, China. J. Water Clim. Chang. 2021, 12, 2674–2696. [Google Scholar] [CrossRef]
Renteria-Mena, J.B.; Plaza, D.; Giraldo, E. Multivariable NARX Based Neural Networks Models for Short-Term Water Level Forecasting. Eng. Proc. 2023, 39, 60. [Google Scholar] [CrossRef]
Ruslan, F.A.; Samad, A.M.; Zain, Z.M.; Adnan, R. Flood water level modeling and prediction using NARX neural network: Case study at Kelang river. In Proceedings of the 2014 IEEE 10th International Colloquium on Signal Processing and its Applications, Kuala Lumpur, Malaysia, 7–9 March 2014. [Google Scholar]
Sahagun, M.A.M.; Cruz, J.C.D.; Garcia, R.G. Nonlinear Autoregressive with Exogenous InputsNeural Network for Water Level Prediction. In Proceedings of the 2018 IEEE 10th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM), Baguio City, Philippines, 29 November–2 December 2018. [Google Scholar]
Buitinck, L.; Louppe, G.; Blondel, M.; Pedregosa, F.; Mueller, A.; Grisel, O.; Niculae, V.; Prettenhofer, P.; Gramfort, A.; Grobler, J.; et al. API design for machine learning software: Experiences from the scikit-learn project. arXiv 2013, arXiv:1309.0238. [Google Scholar]
Digpro. dpWater. 2024. Available online: https://www.digpro.com/products/dpWater (accessed on 20 May 2024).
Lantmäteriet. RH2000: Swedish Height System. Available online: https://www.lantmateriet.se/en/geodata/gps-geodesi-och-swepos/reference-systems/height-systems/swedish-height-systems/RH-2000/ (accessed on 20 May 2024).

Figure 1. The image illustrates the spatial distribution of sensors within Kristianstad municipality. The graph is sourced from the IoT portal of Kristianstad municipality, showcasing the current deployment of 653 sensor units. Permission to publish has been obtained from Kristianstad municipality.

Figure 2. The picture displays the SWMM model of the Degeberga village in Kristianstad municipality. The red and blue dots indicate the location of water level sensors, with the blue dots representing the sensors whose data are analyzed in this study. The blue star marks the position of the rain gauge sensor. The village’s stormwater system consists of ten separate pipe networks, six of which have stormwater level sensors installed. Permission to publish has been obtained from Kristianstad municipality.

Figure 3. The image shows a deployed water level sensor unit. Permission to publish has been obtained from Kristianstad municipality.

Figure 4. The picture shows data from the sensor unit DNB5644, located in the village of Degeberga within Kristianstad municipality. The abrupt drops in the recorded water levels are attributed to different software and hardware updates. Permission for publication has been granted by Kristianstad municipality.

Figure 5. The picture shows the sensor unit DNB5644 from the village of Degeberga in Kristianstad municipality. The sudden drops are due to software/hardware updates. This sensor is considered unreliable before the training period due to these issues. During July, the water level was very high and not correlated with the amount of precipitation. Permission to publish is acquired from Kristianstad municipality.

Table 1. MAE comparison.

Water Level [mm]	DNB5135	DNB5207	DNB5644
SWMM	16.289	18.531	17.278
SWMM, adjusted	2.214	0.544	2.370
NARX, 1 h forecast	1.223	0.309	1.404
NARX, 24 h forecast	1.414	0.362	1.494

Table 2. Correlation comparison.

Water Level [mm]	DNB5135	DNB5207	DNB5644
SWMM	0.628	0.571	0.811
SWMM, adjusted	0.628	0.571	0.811
NARX, 1 h forecast	0.854	0.616	0.868
NARX, 24 h forecast	0.831	0.570	0.844

Table 3. NSE comparison.

Water Level [mm]	DNB5135	DNB5207	DNB5644
SWMM	<0.0%	<0.0%	<0.0%
SWMM, adjusted	16.0%	<0.0%	61.7%
NARX, 1 h forecast	67.1%	27.1%	73.6%
NARX, 24 h forecast	60.3%	9.2%	69.4%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Frisk, F.; Johansson, O. Comparative Evaluation of Water Level Forecasting Using IoT Sensor Data: Hydrodynamic Model SWMM vs. Machine Learning Models Based on NARX Framework. Water 2024, 16, 2776. https://doi.org/10.3390/w16192776

AMA Style

Frisk F, Johansson O. Comparative Evaluation of Water Level Forecasting Using IoT Sensor Data: Hydrodynamic Model SWMM vs. Machine Learning Models Based on NARX Framework. Water. 2024; 16(19):2776. https://doi.org/10.3390/w16192776

Chicago/Turabian Style

Frisk, Fredrik, and Ola Johansson. 2024. "Comparative Evaluation of Water Level Forecasting Using IoT Sensor Data: Hydrodynamic Model SWMM vs. Machine Learning Models Based on NARX Framework" Water 16, no. 19: 2776. https://doi.org/10.3390/w16192776

APA Style

Frisk, F., & Johansson, O. (2024). Comparative Evaluation of Water Level Forecasting Using IoT Sensor Data: Hydrodynamic Model SWMM vs. Machine Learning Models Based on NARX Framework. Water, 16(19), 2776. https://doi.org/10.3390/w16192776

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Comparative Evaluation of Water Level Forecasting Using IoT Sensor Data: Hydrodynamic Model SWMM vs. Machine Learning Models Based on NARX Framework

Abstract

1. Introduction

1.1. Forecasting of Stormwater Levels

1.2. Kristianstad Municipality IoT-Platform

2. Materials and Methods

2.1. IoT-Water Level Sensors

2.2. Machine Learning Model and Training Set

2.3. Test Set and Evaluation Metrics

2.4. SWMM

2.5. Data Acquisition Issues

3. Results

4. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI