Leak and Burst Detection in Water Distribution Network Using Logic- and Machine Learning-Based Approaches
Abstract
:1. Introduction
2. Case Study Pumping Water System for the Detection of Leaks and Bursts
2.1. Software- and Hardware-Based Methods for the Detection of Leaks and Bursts in the Water Pipeline
2.1.1. Logic-Based (If and Else If Conditions) Algorithm Design for the Detection of Leaks and Bursts in Water Pipelines Using Flow, Pressure, and Pump Speed Data
- Task 1 (Steps 1–5): Collect SCADA data for flow in the pumping main, pressure, and pump speed.
- Task 2 (Steps 6–8): Estimate variation in pressure ∆P, ∆S, and ∆Q while the pump is running at its normal speed for the assessment of threshold values for leaks and burst conditions. Any data point that is more than three times the standard deviation is likely a burst.
- Task 3 (Step 9): Estimate pump speed variation during pump start to full speed and then decrease in pump speed during pump shutdown condition (identify pump start and shutdown stages).
- Task 4 (Steps 10–12): The identification of leak and burst conditions and notifying the operator of any leak and burst condition.
- Task 5 (Steps 13–17): Estimate the appropriate leak location along the pressure water main
2.1.1.1. Burst location Identification in a Pumping Main (Case Study System)
Alert Generation
2.1.2. Machine Learning-Based Method for the Detection of Leaks and Bursts in Water Pipelines
- Step 1: Collect the data from the existing SCADA for pressure sensors, flow meters, and pump speeds in the time interval of 1 min for the 20,000 data points.
- Step 2: Preprocess the data for algorithm conditions by data cleaning and analysing the SCADA data.
- Step 3: The prepared data will be used for the machine learning algorithms.
- Step 4: Different machine learning models will be used for training and testing.
- Step 5: The selection of the highest-performing machine learning model with the machine learning parametric indices (anomaly F1 score and ROC curve—explained below).
- Step 6: Implementing the selected model for the detection of leaks and bursts.
- Step 7: Alert generation for the leaks and bursts in the pipeline.
3. Results
3.1. Logic-Based Approach for Leak and Burst Detection
3.1.1. Estimation of Delta Pressure (∆P) and Delta Discharge (∆Q) for Case Study System
3.1.2. Estimation of Friction Factor in Pipe and Pipe Roughness
3.1.3. Categories in the Detection of Leaks and Burst
- Minor leak: Instances where the percentage threshold falls below 15 (15% pressure drop from Paverage) are classified as minor leakage, indicating relatively minor disturbances in the system.
- Major leak: When the percentage thresholds range between 15% and 20%, the severity escalates to major leakage, signifying a more significant impact on the pipeline’s integrity and functionality.
- Burst: Any leakage surpassing the threshold of 20% is categorised as a burst, representing a critical burst in the pipeline system that requires immediate attention and intervention to prevent further damage or disruptions.
3.1.4. Leak and Burst Identification
3.2. Machine Learning Approach
3.2.1. Selection of Suitable Machine Learning Techniques
- KMeans: Used for clustering data into two clusters based on normalised and standardised features.
- Isolation Forest: Effective for anomaly detection with the ability to handle high-dimensional data.
- Local Outlier Factor: Computes the local deviation of the density of a sample with respect to its neighbors.
- DBSCAN: Good for identifying clusters of varying shapes and sizes and isolating points that do not belong to any cluster.
- One-Class SVM: Suitable for novelty detection (identifying anomalies) by mapping data to a higher-dimensional space.
- Data Preprocessing: Normalisation and standardisation were performed on both training and test data to ensure consistency in feature scaling.
- Evaluation: Models were evaluated using F1 score as a metric, considering both normal and anomaly classes.
3.2.2. Web-Based Platform for Machine Learning Outcome
4. Conclusions
5. Future Recommendations
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Mitchell Plumbing and Gas. The Difference between Leaking and Burst Pipes. 2021. Available online: https://www.mitchellplumbinggas.com/blog-post/the-difference-between-leaking-and-burst-pipes (accessed on 12 December 2023).
- Sharma, A.; Marney, D. The Application and Utility of ‘Smarts’ for Monitoring Water and its Infrastructure. Water J. Aust. Water Assoc. 2012, 39, 86–92. [Google Scholar]
- Rapid Service Plumbing. Burst Pipes Sydney. Available online: https://rapidserviceplumbing.com.au/services/burst-pipes-sydney/ (accessed on 4 January 2024).
- Water Leak Detection. How To Detect Burst Pipes. 2020. Available online: https://www.waterleakdetection.net.au/how-to-detect-burst-pipes/ (accessed on 4 January 2024).
- Joseph, K.; Sharma, A.K.; van Staden, R.; Wasantha, P.L.P.; Cotton, J.; Small, S. Application of Software and Hardware-Based Technologies in Leaks and Burst Detection in Water Pipe Networks: A Literature Review. Water 2023, 15, 2046. [Google Scholar] [CrossRef]
- Berardi, L.; Giustolisi, O. Calibration of Design Models for Leakage Management of Water Distribution Networks. Water Resour. Manag. 2021, 35, 2537–2551. [Google Scholar] [CrossRef]
- Joseph, K.; Sharma, A.K.; Van Staden, R. Development of an Intelligent Urban Water Network System. Water 2022, 14, 1320. [Google Scholar] [CrossRef]
- Campos, J.A.; Jiménez-Bello, M.A.; Alzamora, F.M. Real-time energy soptimisation of irrigation scheduling by parallel multi-objective genetic algorithms. Agric. Water Manag. 2020, 227, 105857. [Google Scholar] [CrossRef]
- Fereidooni, Z.; Tahayori, H.; Bahadori-Jahromi, A. A hybrid model-based method for leak detection in large scale water distribution networks. J. Ambient. Intell. Humaniz. Comput. 2021, 12, 1613–1629. [Google Scholar] [CrossRef]
- Gorenstein, A.; Kalech, M.; Hanusch, D.F.; Hassid, S. Pipe fault prediction for water transmission mains. J. Water 2020, 12, 2861. [Google Scholar] [CrossRef]
- Swamee, P.K. Improving design guidelines for class-I circular sedimentation tanks. Urban Water J. 2004, 1, 309–314. [Google Scholar] [CrossRef]
- Ester, M.; Kriegel, H.P.; Sander, J.; Xu, X. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, Portland, OR, USA, 2–4 August 1996. [Google Scholar]
- MacQueen, J. Some methods for classification and analysis of multivariate observations. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability; University of California Press; Berkeley, CA, USA, 1967.
- Schölkopf, B.; Platt, J.C.; Shawe-Taylor, J.; Smola, A.J.; Williamson, R.C. Estimating the support of a high-dimensional distribution. Neural Comput. 2001, 13, 1443–1471. [Google Scholar] [CrossRef] [PubMed]
- Liu, F.T.; Ting, K.M.; Zhou, Z.-H. Isolation forest. In Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, Pisa, Italy, 15–19 December 2008. [Google Scholar]
- Breunig, M.M.; Kriegel, H.P.; Ng, R.T.; Sander, J. LOF: Identifying density-based local outliers. In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, TX, USA, 16–18 May 2000. [Google Scholar]
- He, H.; Bai, Y.; Garcia, E.A.; Li, S. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. In Proceedings of the 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), Hong Kong, China, 1–6 June 2008. [Google Scholar]
- Hastie, T.; Tibshirani, R.; Friedman, J.H.; Friedman, J.H. The Elements of Statistical Learning: Data Mining, Inference, and Prediction; Springer: New York, NY, USA, 2009; Volume 2. [Google Scholar]
- James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning; Springer: New York, NY, USA, 2013; Volume 112. [Google Scholar]
- Fawcett, T. An Introduction to ROC Analysis. J. Pattern Recognit. Lett. 2006, 27, 861–874. [Google Scholar] [CrossRef]
- Hand, D.J.; Till, R.J. A simple generalisation of the area under the ROC curve for multiple class classification problems. J. Mach. Learn. 2001, 45, 171–186. [Google Scholar] [CrossRef]
- Lewis, R.E. Determining Outliers Using Standard Deviation. Study.com. 2024. Available online: https://study.com/skill/learn/determining-outliers-using-standard-deviation-explanation.html#:~:text=deviation%20of%201.3.-,Step%202%3A%20Determine%20if%20any%20results%20are%20greater%20than%20%2B%2F%2D,3.9)%20or%203.5%20to%2011.3 (accessed on 25 April 2024).
Date and Time (Per Min) | Pressure (mts) | Flow (LPS) | Pump Speed (RPM) |
---|---|---|---|
01/05/2022 08:05 | 77.46 | 252.67 | 1500 |
01/05/2022 08:06 | 77.46 | 253 | 1500 |
01/05/2022 08:07 | 77.46 | 252.83 | 1500 |
01/05/2022 08:08 | 77.46 | 252.67 | 1500 |
01/05/2022 08:09 | 77.2 | 252.5 | 1500 |
01/05/2022 08:10 | 77.2 | 252.33 | 1500 |
01/05/2022 08:11 | 77.2 | 252.17 | 1500 |
Type of Leakage | Date | Time | Pressure (m) | Flow (lps) | Pump Speed (rpm) | Max. Pressure (m) | Pressure Percentage Change | Leak Location |
---|---|---|---|---|---|---|---|---|
Minor leakage | 1 May 2022 | 16:08:00 | 75 | 259.67 | 1500 | 79.04 | 5.11 | |
1 May 2022 | 16:09:00 | 75 | 260 | 1500 | 79.04 | 5.11 | ||
1 May 2022 | 16:10:00 | 75 | 260.33 | 1500 | 79.04 | 5.11 | ||
1 May 2022 | 16:11:00 | 75 | 260.67 | 1500 | 79.04 | 5.11 | ||
1 May 2022 | 16:12:00 | 75 | 261 | 1500 | 79.04 | 5.11 | ||
Burst | 6 May 2022 | 15:05:00 | 61.18 | 245.83 | 1500 | 76.66 | 20.19 | 5237 |
6 May 2022 | 15:06:00 | 61.18 | 246 | 1500 | 76.66 | 20.19 | 5235.52 | |
6 May 2022 | 15:07:00 | 61.18 | 245.83 | 1500 | 76.66 | 20.19 | 5237 | |
6 May 2022 | 15:08:00 | 60.97 | 245.67 | 1500 | 76.66 | 20.47 | 5220.5 | |
Major leakage | 11 May 2022 | 17:55:00 | 65 | 245.67 | 1500 | 77.33 | 15.94 | |
11 May 2022 | 17:56:00 | 65 | 245.33 | 1500 | 77.33 | 15.94 | ||
11 May 2022 | 17:57:00 | 65 | 245 | 1500 | 77.33 | 15.94 | ||
11 May 2022 | 17:58:00 | 65 | 244.67 | 1500 | 77.33 | 15.94 | ||
11 May 2022 | 17:59:00 | 65 | 244.33 | 1500 | 77.33 | 15.94 | ||
11 May 2022 | 18:00:00 | 69 | 244 | 1500 | 77.33 | 10.77 |
Algorithm Type | Classifiers | Hyper-Parameters | Python Library |
---|---|---|---|
Unsupervised machine learning algorithms | KMEANS |
| from sklearn.cluster import KMeans |
DBSCAN |
| from sklearn.cluster import DBSCAN | |
Isolation Forest |
| from sklearn.ensemble import IsolationForest | |
Local Outlier Factor |
| from sklearn.neighbors import LocalOutlierFactor | |
One-class SVM |
| from sklearn.svm import OneClassSVM |
Anomaly F1 Score | ROC-AUC | |||||||
---|---|---|---|---|---|---|---|---|
Machine Learning Models | Imbalanced Data | Balanced ROS | Balanced SMOTE | Balanced ADASYN | Imbalanced Data | Balanced ROS | Balanced SMOTE | Balanced ADASYN |
DBSCAN | 0.08 | 0.00 | 0.02 | 0.11 | 0.58 | 0.45 | 0.46 | 0.49 |
ISOLATION FOREST | 0.13 | 0.65 | 0.64 | 0.64 | 0.61 | 0.65 | 0.64 | 0.64 |
K-MEANS | 0.01 | 0.04 | 0.04 | 0.14 | 0.49 | 0.47 | 0.47 | 0.53 |
Local Outlier Factor | 0.12 | 0.00 | 0.67 | 0.69 | 0.77 | 0.41 | 0.71 | 0.72 |
One-class SVM | 0.13 | 0.07 | 0.07 | 0.05 | 0.61 | 0.49 | 0.49 | 0.48 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Joseph, K.; Shetty, J.; Sharma, A.K.; van Staden, R.; Wasantha, P.L.P.; Small, S.; Bennett, N. Leak and Burst Detection in Water Distribution Network Using Logic- and Machine Learning-Based Approaches. Water 2024, 16, 1935. https://doi.org/10.3390/w16141935
Joseph K, Shetty J, Sharma AK, van Staden R, Wasantha PLP, Small S, Bennett N. Leak and Burst Detection in Water Distribution Network Using Logic- and Machine Learning-Based Approaches. Water. 2024; 16(14):1935. https://doi.org/10.3390/w16141935
Chicago/Turabian StyleJoseph, Kiran, Jyoti Shetty, Ashok K. Sharma, Rudi van Staden, P. L. P. Wasantha, Sharna Small, and Nathan Bennett. 2024. "Leak and Burst Detection in Water Distribution Network Using Logic- and Machine Learning-Based Approaches" Water 16, no. 14: 1935. https://doi.org/10.3390/w16141935
APA StyleJoseph, K., Shetty, J., Sharma, A. K., van Staden, R., Wasantha, P. L. P., Small, S., & Bennett, N. (2024). Leak and Burst Detection in Water Distribution Network Using Logic- and Machine Learning-Based Approaches. Water, 16(14), 1935. https://doi.org/10.3390/w16141935