Next Article in Journal
Short Term Prediction Model of Environmental Parameters in Typical Solar Greenhouse Based on Deep Learning Neural Network
Next Article in Special Issue
Development of a Multi-Objective Optimal Design Approach for Combined Water Systems
Previous Article in Journal
Binary and Multi-Class Malware Threads Classification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Development of an Evaluation Method for Deriving the Water Loss Reduction Factors of Water Distribution Systems: A Case Study in Korean Small and Medium Cities

1
Department of Civil and Infrastructure Engineering, Gyeongsang National University, Jinju 52725, Republic of Korea
2
K-water Research Institute, K-water, Daejeon 34045, Republic of Korea
3
Department of Civil Engineering, The University of Suwon, Hwaseong-si 18323, Republic of Korea
4
Department of Civil and Environmental Engineering, Hannam University, Daejeon 34430, Republic of Korea
*
Author to whom correspondence should be addressed.
Appl. Sci. 2022, 12(24), 12530; https://doi.org/10.3390/app122412530
Submission received: 13 November 2022 / Revised: 5 December 2022 / Accepted: 5 December 2022 / Published: 7 December 2022

Abstract

:
This study introduces a method that can evaluate the efficiency of leakage management practices and devises a formula to set leakage management goals. To develop the evaluation method for deriving leakage reduction factors, real data from small- and medium-sized cities in South Korea were collected. With the data collected, four leakage management factors (or activities) that could improve revenue water ratio or reduce leakage ratio were identified. With the leakage management factors, correlation analysis was carried out to identify the relationship between independent and dependent variables and within independent variables. Once the relationships were identified, standardization of the data using T-score conversion was carried out to scale all data with different units into similar ranges. Finally, the efficiency of leakage management actions was determined by the formulation of leakage using various data analysis approaches using multiple linear regression analysis and deep neural networks. As a result, pipe replacement was determined as an essential activity to decrease the leakage ratio or increase the revenue water ratio. In addition, annual water loss management actions of the small cities were more actively performed. Furthermore, the performance of data analysis using DNN is more appropriate in data classification, considering the characteristics of time series rather than independent data analysis. Through comparison of the above data classification approaches, the increase or decrease in the leakage ratio/revenue water ratio by the water loss management activity of local water distribution systems can be used to construct a more effective model for classification considering both local and temporal characteristics.

1. Introduction

The water distribution system (WDS) is one of the major infrastructures and supplies water to customers with sufficient pressure and quality. However, recent climate change has increased the frequency of drought events, and water availability has dropped significantly. Therefore, the need for efficient water supply and distribution using WDSs has increased. In most cases, transmission mains are regularly maintained, but distribution mains and service pipes are relatively difficult to maintain. Due to this issue, the rate of water loss is gradually increasing. In addition, as water is recognized as an economic product, water loss leads to economic loss. Therefore, recently, many studies have investigated the reduction of water loss, water loss management, and the factors influencing water loss.
The South Korea Water utility uses the revenue water ratio as a performance indicator for WDSs. If the target of the revenue water ratio in the planning stage is achieved in terms of operation, it is judged that the target of the project has been achieved. Therefore, the revenue water ratio is used as an indicator to evaluate the efficiency of business performance and operational management.
Since 2004, in small and medium cities in South Korea, 22 cities have begun evaluating the project planning and operation process to solve the aging problem and improve the operational efficiency of WDSs. In addition, in this regard, the revenue water ratio was used to evaluate each project. The revenue water ratio is an indicator used to evaluate the performance of waterworks projects, and because there are no other quantitative indicators other than the revenue water ratio, there is no choice but to use the revenue water ratio as a future project goal.
The revenue water ratio is a factor that evaluates the performance of WDSs, and in order to improve their performance, the revenue water ratio should be improved. The representative approach to improve the revenue water ratio is to manage effective water systems and reduce ineffective ones (i.e., leakage). As mentioned earlier, the revenue water ratio is the ratio of the total water supply to the effective water quantity; to increase the revenue water ratio, there are ways to increase the quantity of effective water systems or decrease the quantity of ineffective ones.
First, one effective water management approach to improve the revenue water ratio is to accurately manage the amount produced by the water treatment plant’s so-called supply management. For efficient supply management, it is necessary to determine the exact production water volume through regular inspection and replacement of the flow meter, and there is also a method to efficiently increase the supply amount by improving the supply system.
Second, as an effective water management method, an accurate fee should be charged through an accurate meter reading. This method involves regularly checking and replacing old water meters in order to obtain more accurate meter readings, and to reduce reading errors through accurate meter reading by meter-reading personnel. Such accurate meter reading is a way to reduce the amount of water loss by preventing the illegal use of water.
The third approach to improve the revenue water ratio is to manage ineffective water quantity, that is, to reduce water loss. To reduce water loss, leakage reduction activities such as pipe replacement, meter replacement, repair, leak detection, and pressure reduction valve installation are required. However, because the quantitative efficiency of the improvement of the revenue water ratio for each leakage reduction activity cannot be evaluated, it is insufficient as a basis for calculating the project cost for various waterworks projects. In addition, it is difficult to reflect on reality when establishing a strategy to improve the revenue water ratio for the project or when calculating a leak management goal [1,2,3].
For these reasons, recently, studies have been conducted related to the leakage reduction framework, evaluation of the water loss reduction approach, and evaluation of the factors influencing water loss. Zyoud et al. [4] showed the application of multi-criteria decision analysis (MCDA) based on participatory interaction approaches to select the most appropriate solutions to reduce and manage water losses in water supply systems and make the best consensus decisions in a complex environment. Ndunguru and Hoko [5] evaluated the non-revenue water situation in Harare water in some areas of Zimbabwe and assessed the contribution of water leakages to the water losses based on the monitoring data and SANFLOW model to determine average real losses. According to these studies, evaluating the effect of water loss reduction and the process of evaluating and determining the amount of leakage in the WDS is performed with four different methods: the top-down approach (water balancing), water and wastewater balancing, component analysis of leakage (the background and bursts estimates method), and the bottom-up approach (the minimum night flow method) [6,7,8]. Moreover, related to the evaluation of leakage reduction, data envelopment analysis (DEA) was performed to assess the effect of water loss reduction activity (e.g., pipe replacement, meter replacement, water loss repair, leakage detection, etc.) [9].
Although the above studies were conducted based on various approaches to reduce leakages, these methods are not practical approaches for several reasons: (a) lack of consideration for the leakage characteristics of real-world systems, (b) insufficient evaluation of leakage reduction methods, and (c) absence of an accurate and systemized method evaluating economic feasibility.
Therefore, in this study, the revenue water ratio improvement factors (investment and output indexes) were determined using real water leakage reduction data. Data were obtained from waterworks projects performed at small- and medium-sized cities in South Korea from 2004 to 2018. This study used statistical methods (i.e., multiple regression analysis; MRA) that have been widely applied to analyze water performance efficiency, along with artificial intelligence techniques (i.e., deep neural network; DNN). The revenue water ratio improvement factor derived through this study is an empirical methodology for evaluating the efficiency of leakage management and can be used to estimate leakage management goals in the future.

2. Materials and Methodology

In this study, evaluation of the efficiency of leakage management and the leakage management model formulation were derived using data related to revenue water ratio improvements according to the leakage of small- and medium-sized WDSs. The evaluation was performed by MRA analysis and DNN, which followed the procedure shown in Figure 1. There are three steps in developing the approach for the evaluation of the water loss reduction factors, which are (1) data collection, (2) pre-processing of data, and (3) evaluation of water loss management efficiency and determination of the formulation of water loss.

2.1. Collection Data in Study Area

In the first step, field data from the study area were collected, and data mining was performed based on the information characteristic (e.g., revenue water ratio, leakage management activity, variation of the revenue water ratio according to the leakage management activity, leakage status, and the spatiotemporal characteristics of leakage). This study considered the variation of the revenue water ratio according to the leakage management activity as a data-mining standard and then, among the various leakage management activities, the factors that can improve the revenue water ratio were selected.
Leakage can be categorized into burst or background leakage, according to characteristics such as actual quantity of leakage and type of leakage generation. Background leakage is leakage that occurs at the pipe connection or joint, and the amount of leakage is too small to be recognized. Burst leakage is divided into reported and unreported burst leakage. Generally, the reported burst leakage is a case in which leakage can be confirmed by exposure to the surface or by a significant amount greater than normal conditions in terms of supply, whereas unreported burst leakage is leakage that is difficult to detect [10,11].
The International Water Association (IWA) Water Losses Task Force is divided into seven categories, using the concepts of unavoidable annual real loss, current annual real losses, and potentially recoverable real losses, to analyze the change in the amount of water leakage according to the leakage management methods (i.e., pipe replacement, meter replacement, pressure reduction valve installation, sensor installation for leakage monitoring, leakage repair, leakage detection, and block system establishment). Among these, pressure reduction valve installation, block system establishment, and sensor installation for leakage monitoring were excluded in this study because it is difficult to quantify their effects, and these operations are irregular.
Therefore, considering the leakage management factors suggested by the IWA, pipe replacement (PR), meter replacement (MR), leakage repair (LR), and leakage detection (LD) were selected as the water loss management factors to estimate revenue water ratio increase (or reduction of leakage ratio) following those activities. This study applied six raw data elements: four leakage management factors (PR, MR, LR, LD), the revenue water ratio, and the leakage ratio reduction. The variance of the revenue water ratio and the leakage ratio reduction according to the four leakage management factors were then calculated according to those six raw data elements. The data were collected every year for 10 years, comparing the maintenance effect with the leakage management activity. Therefore, based on the characteristics of the applied data, this study performed two types of data mining based on the revenue water ratio or the leakage ratio and time-series characteristics.

2.2. Pre-Processing for Data Analysis

In step 2, the collected data were pre-processed to evaluate the efficiency of leakage management and to determine the water loss formula. First, four leakage management factors (PR, MR, LR, LD) were classified into independent and dependent variables based on the revenue water ratio increase and the leakage ratio reduction. The correlations of variables (i.e., independent and dependent variables) were then evaluated. Finally, the collected data were standardized using standardization scores (e.g., T-score converting) to alleviate the unit and scale differences between factors.

2.2.1. Determination of Independent and Dependent Variables

The first part of step 2 is the categorization of leakage management factors, revenue water ratio, and leakage ratio into independent and dependent variables. The four leakage management factors (PR, MR, LR, LD) determined from step 1 will directly or indirectly affect changes in the revenue water ratio or the leakage ratio. Therefore, the leakage management factors were determined as independent variables, and the revenue water ratio increase and the leakage ratio reduction were determined as dependent variables.

2.2.2. Correlation Analysis between Independent Variables

The correlations were analyzed to prevent multicollinearity between the four influencing factors selected to derive dependent and independent variables. The higher the covariance, the higher the correlation, and when the covariance is completely identical, the correlation becomes 1. The correlation coefficient indicates the degree of such correlation. The Pearson correlation coefficient [12] is used for normally distributed quantitative variables, and Kendall’s tau correlation [13] is used to measure the relationship between sequences when they are not normally distributed, or when the order of categories is not specified. The correlation coefficient ranges from −1 (completely negative relationship) to 1 (completely positive relationship); 0 indicates a non-linear relationship, and the correlation between independent variables that does not cause a problem of multicollinearity is conservatively 0.7, as judged below.

2.2.3. Correlation Analysis between Independent and Dependent Variables

In order to minimize the problem of collinearity between independent variables, there should be no correlation between independent variables. However, typically, the correlation between dependent and independent variables should be high. For this reason, it is necessary to select variables that include significant independent variables and remove independent variables without statistical significance through correlation analysis of each dependent–independent variable combination to evaluate the leakage management effectiveness and determination of formulation. Therefore, according to each data classification group, it is excluded from the independent variable, and the target calculation formula is derived.

2.2.4. Data Standardization through T-Score Conversion

Data standardization is a technique for determining the relative position of each variable when it is difficult to evaluate in the same standard because the units and ranges of the acquired data are different. In this study, T-score conversion was applied, taking into account the characteristics of the data. In order to compare the effects on the increase in the flowrate and the decrease in the leakage rate, it was converted into a T-score that was converted into an average of 50 and a standard deviation of 10 and applied. This can be calculated as in Equation (1).
T s c o r e = 10 X μ σ + 50
where X : variables, μ : average of variable X 1 ,   X 2 , X N , σ : standard deviation of variable X 1 ,   X 2 , X N .

2.3. Evaluation of Leakage Management Efficiency and Determination of Formulation of Leakage

In this study, the efficiency of leakage management actions was evaluated, and the formulation of leakage was determined using various data analysis approaches, such as MRA analysis and DNN model.

2.3.1. Multiple Regression Analysis

Multiple Regression Analysis is a type of regression analysis that estimates the relationship between variables using a statistical method. In regression analysis, there is an independent variable that is a cause and a dependent variable that is an effect. In this case, multiple regression analysis is a method of performing analysis on a regression model in which there is one dependent variable and two or more independent variables [14].
In this study, the increase in the flow rate and the decrease in the leakage rate according to the data classification criteria were classified according to four input indicators (PR, MR, LR, LD), and the target formulation was derived according to the increase in the flow rate and the decrease in the leakage rate for the derived input indicators.

2.3.2. Deep Neural Network

Another technique applied in this study to evaluate the efficiency of leakage management is DNN [15] (Figure 2). DNN is a technique composed of two or more hidden layers of an Artificial Neural Network (ANN) [16], and DNN have the advantage of learning complex data with fewer neurons compared to ANN. In addition, it is suitable for predicting non-linear data, and the performance can be adjusted by adjusting the number of neurons and the number of hidden layers, which are parameters of the technique. However, DNN has a high possibility of overfitting, and because this problem is directly related to the performance of the model, it is necessary to adjust parameters and select an activation function to prevent overfitting.
In each hidden layer, a function is applied to the value that enters the neuron and is then transmitted. The function used at this time is called the activation function, and the Rectified Linear Unit (Relu) function is applied to each neuron as the activation function. Equation (2) is the expression defining the Relu function.
f x = max 0 ,   x
Figure 3 is the graph form of the Relu function. If the input value is less than 0, it is output as 0, and if it is greater than 0, the input value is output as it is. The Relu function learns faster than other sigmoid and tanh functions and does not have a gradient loss problem. For the above reasons, the Relu function was applied in each layer when constructing the model.
When training the model, an error function that calculates the error between the output value and the target value of the model is specified. The Mean Squared Error (MSE) function was used as the error function when constructing the learning model. The MSE function is the mean square error between the predicted value and the actual value. In addition, an optimization function is essential when training a model. The optimization function is a function that finds the parameter that minimizes the value of the loss function. The Adaptive Moment Estimation (Adam) function was applied to the per-rate model. Adam is an algorithm that combines the strengths of RMSProp and Momentum, which are used as optimization functions. Like Momentum, it stores the exponential average of the slope, and similarly to RMSProp, it stores the exponential average of the square of the device. In this study, when constructing a learning model, one input layer, five hidden layers, and one output layer were stacked, and the number of neurons in each hidden layer was composed of 64, 128, 256, and 512.

3. Application and Results

3.1. Study Area and Data

In this study, data on the statistics of waterworks between 2005 and 2018 were used to initially select the study areas. Among the 163 local small and medium cities across South Korea, 22 study areas were selected to ensure there were no missing data or errors in terms of collecting data and that the leakage reduction effect was clear according to the activity of leakage management.
To be defined as small and medium cities, cities had to be of typical type, not too big or too small. In Korea, city size is generally defined by population. The population of local small and medium cities is at least 50,000, and the maximum range of population is 1 million. The selected cities’ data were accumulated for up to 14 years. For leakage management factors, PR, LR, and LD were divided by the total pipe length, and MR was used as the value divided by the water supply population in order to offset the difference depending on the size of WDS. In addition, only leakage management factors with either a revenue water ratio less than 80% or a leakage ratio higher than 10% were applied. Examples of data are shown in Table 1.

3.2. Pre-Processing for Data Analysis

The collected data show the change in the revenue water ratio and the leakage ratio according to the activity of leakage management (PR, LR, LD, MR). To evaluate the leakage reduction factors, these data were pre-processed. Table 2 shows the correlation between independent variables. Because the correlation depends on the data categorization, in this study, to compare the correlation by data classification criteria, revenue water ratio increase has been divided into four sections: (1) less than 60%, (2) 60–70%, (3) 70–80%, (4) higher than 80%, and four sections of leakage ratio: (1) less than 10%, (2) 10–20%, (3) 20–30%, (4) higher than 30%.
As a result, PR–LR (0.832), PR–LD (0.854), and LD–LR (0.866) of leakage ratio less than 10% showed a high correlation of more than 0.7. Therefore, the leakage ratio of less than 10% data was excluded when generating the prediction equation. In addition, the data with a revenue water ratio of less than 60% showed a high correlation for LD–PR (0.71), but in other data show correlation of less than 0.7.
Moreover, in order to minimize the problem of collinearity between independent variables, there should be no correlation between independent variables. However, the correlation between the dependent and independent variables should be high. Therefore, this study performed an analysis of correlation between the independent and the dependent variable, and Table 3 shows the results.
Table 3 shows the results of analysis of correlation components between dependent–independent variables according to various classification criteria. Only data of the leakage ratio of less than 10% showed high correlation between dependent–independent variables, and low correlation in other cases. In other words, it was confirmed that, even if old pipes and meters were replaced and leakage detection and repair were performed in some areas, the tendency of a large increase in the revenue water ratio or a decrease in the leakage ratio was small. For this reason, in order to derive a leakage management formulation according to the data analysis group, it is necessary to include significant independent variables and remove statistically insignificant independent variables. Therefore, the significance of independent variables was considered when deriving the leakage management formulation according to the data analysis group.
In this study, because the dependent and independent variables used different units, data standardization should be performed for correlation analysis and data analysis. Therefore, in this study, T-score was used, which was converted to an average of 50 and a standard deviation of 10.
Table 4 shows the results of converting factors that have a correlation of 0.5 or more to a T-score in the correlation analysis between the dependent and independent variables to obtain the formulation of revenue water ratio of the leakage ratio of less than 10% data. According to the decrease in the leakage ratio, the variable values of PR converted to T-scores were 42.28–91.54, MR, 37.56–86.53; LR, 37.62–91.85; and LD, 40.16–89.58. T-score values of the independent variables were converted to a relatively similar range.

3.3. Evaluation of Water Loss Management Efficiency and Determination of Formulation of Water Loss

In this study, the increase in the flow rate and the decrease in the leakage rate according to the data classification criteria (higher than 10% of leakage ratio, less than 80% of the revenue water ratio) were analyzed according to four input indices (PR, MR, LR, LM). Through this, a leakage management model formulation was derived according to the increase in the revenue water ratio and the decrease in the leakage ratio for input indicators. In order to derive a reliable leakage management model formulation, MRA and DNN were used, and Table 5 shows the result of deriving the leakage management model formulation according to the data classification criteria.
As a result of the leakage management model formulation, the coefficient of determination (R) of the eight classification criteria was distributed from 0.32 to 0.82, and an average value of 0.55 for MRA and 0.61 for DNN. Among the eight groups, the highest coefficient of determination in both analysis through MRA and DNN is Group 1, with a leakage ratio of less than 10%. In the case of Group 1, it depends on PR, MR, LD, etc., to reduce the leakage ratio, and it is confirmed that PR has the greatest influence on reducing the leakage ratio. In addition, if the leakage ratio is less than 10%, the revenue water ratio is high. In addition, it was confirmed that PR among the four leakage reduction activities was essential for reducing the leakage ratio and increasing the revenue water ratio in all classification criteria, except for Group 6, where the revenue water ratio was 60–70%. According to Group 8 in contrast to Group 1, LD was shown to be the most significant factor to increase the revenue water ratio. In the case of Group 8, the revenue water ratio is already high, and because PR, MR, and LR are judged to be in progress, the revenue water ratio is expected to decrease if no leakage reduction activities are carried out. This is because the model formula was derived as a negative number (−0.47), as in Group 1. The average value for the coefficient of determination on each data classification criteria of MRA and DNN were 0.55 and 0.61. In all groups, except for Groups 1 and 2, the DNN model of the coefficient of determination was high. Through the above results, it can be judged that DNN is a suitable model for the formulation of water loss management.
However, the conditions required for performing each model are different, and the results can also be derived differently depending on data processing, such as model parameters and training data mining. As a result of comparison of the models applied in this study, it was found that the application of an artificial intelligence algorithm similar to DNN rather than MRA is more appropriate when there is a large variance, such as water loss management data, among various artificial intelligence algorithms. Therefore, in order to derive better results, the analysis result can be improved through sensitivity analysis of the algorithm parameters and various training data classifications.
The multiple correlation coefficient of MRA analysis is the Pearson correlation coefficient for the relationship between the dependent variable according to the independent variable and the predicted value through the regression model. The average value for the coefficient of determination is 0.55, which is not a high correlation, and it would not be the best data classification for reflecting the characteristics of the data. This result means that it cannot be evaluated that the observed values of the sample are well clustered around the regression line, and that there is a difference between the predicted values and the observed values. Moreover, because the data classification was based on the leakage ratio and the revenue water ratio, it can meet the goal to improve the efficiency of leakage management in the future, but the model formulation predicts the increase in the revenue water ratio and the decrease in the leakage ratio It shows some error with the predicted value.
The prediction values of the model formulation show differences in accuracy depending on the classification of the data, even if the same data are used [17]. In this study, the leakage ratio and the revenue water ratio were classified according to the water loss management activity, which means that each variable is independent of each other in the data classification process. However, because the water loss management activities such as pipe and meter replacement reflect the time-sequential characteristics of the revenue water ratio and the leakage ratio after recovery, it is necessary to consider the spatiotemporal characteristics when classifying the data.
Therefore, in this study, when classifying the data, the increase in the revenue water ratio and the decrease in the leakage ratio affect the water loss management activity of the immediately preceding year. In this study, the time series and regional characteristics were considered, and a formulation was derived. Table 6 shows the data classified based on spatial and temporal characteristics.
As shown in Table 6, there are a total of 22 target areas classified based on spatial and temporal characteristics, and the model formulation was derived for a total of 11 target areas, except for small- and medium-sized cities with less than four datasets. The results are shown in Table 7.
The multiple correlation coefficient of the model formulation considering spatiotemporal characteristics is 0.87 on average (maximum: 0.99, minimum: 0.75), which shows a higher correlation than the data classified based on the revenue water ratio and the leakage ratio in both analysis approaches (i.e., MRA, DNN). This means that the observations of the sample are well clustered around the regression line, and the regression model predicts the dependent variable well. In particular, BH and JE showed a coefficient of determination of almost 1.00. This means that there are no missing values in the data; these two cities are smaller cities relatively, and the annual water loss management actions were more actively performed than in other areas. Accordingly, it is judged that the improvement of the revenue water ratio and the leakage ratio is larger than others. Moreover, in the case of NJ, the gap in the coefficient of determination presented the largest between MRA and DNN. This is because NJ has a distinct time-series characteristic compared to other cities. Therefore, it is thought that in the case of NJ, data analysis considering the characteristics of time series is more appropriate than independent data analysis.
Through the comparison of the above data classification methods, the increase or decrease in the leakage ratio/revenue water ratio by the water loss management activity of local water distribution systems can construct a more effective model for classification, considering both local and temporal characteristics.
However, this study could not be performed due to the lack of accumulated data, but when the data are classified considering both the current state of the water distribution systems (e.g., the revenue water ratio, the leakage ratio) and spatial and temporal characteristics, it is believed that a more reliable predictive model can be developed.

4. Summary and Conclusions

This study developed an evaluation method for deriving the leakage reduction factors of WDSs in the case of Korean small and medium cities. To develop the evaluation method for deriving leakage reduction factors, water loss management efficiency evaluation and formulation of water loss were derived based on improving the revenue water ratio data according to the water loss activity of WDSs in local small and medium cities. For this goal, this study performed three steps. First, data from the study area were collected, and the significant factors selected based on the information characteristic (e.g., revenue water ratio, water loss management activity, variation of the revenue water ratio according to the leakage management activity, leakage status, spatiotemporal characteristics). Secondly, the collected data underwent pre-processing for the selected factors. In the data pre-processing, independent and dependent variables were determined, and correlation analyses were performed between these independent and dependent variables. Through T-score conversion, the data were then standardized to be matched up with the units of various factors. Finally, the efficiency of leakage management actions was determined by the formulation of leakage using various data analysis approaches using MRA analysis and DNN. To apply the proposed approach, the revenue water ratio and the leakage ratio variation data according to the leakage management activity (PR, LR, LD, MR) for 22 local small and medium cities in South Korea were used.
Among the four leakage reduction activities, PR was highlighted as an essential activity for decreasing the leakage ratio and increasing the revenue water ratio in most of the data classifications, excluding Group 6. In addition, in relatively well-managed WDSs (i.e., the leakage ratio is less than 10% and the revenue water ratio is higher than 80%), the LD was presented as an effective activity to improve WDS network conditions. However, the first analysis did not consider the temporal correlation of each water loss management activity; pipe and meter replacement reflected the time-sequential characteristics of the revenue water ratio and the leakage ratio after recovery. It is necessary to consider the spatiotemporal characteristics when classifying the data. Therefore, in the second analysis, the data were classified by considering the characteristics of time and location, and the effect of water loss management activity was analyzed.
According to the second analysis results, the annual water loss management action of small cities (e.g., BH and JE) was more actively performed. It is thought that this is because a city with a small water supply population is more likely to be more efficient than other regions in distributing support as it is more likely to improve leakage according to the water loss management activity. Lastly, regarding the various analyses in this study, in most of the classification groups, depending on the revenue water ration and the leakage ration, the DNN model of the coefficient of determination was higher than MRA. Moreover, in the second analysis, the performance of data analysis using DNN is more appropriate in the data classification considering the characteristics of time series compared with independent data analysis. Therefore, it can be judged that DNN is a suitable model for the formulation of water loss management. Through the comparison of the above data classification approaches, the increase or decrease in the leakage ratio/revenue water ratio by the water loss management activity of local WDSs can be used to construct a more effective model for classification, considering both local and temporal characteristics.
The results of this study are expected to be highly useful when analyzing leakage management efficiency and determining the leakage management goals of WDSs with similar size and characteristics in the future. In particular, it can be used as a basis for determining project goals and budgets at the planning stage, and it can also be used for prioritization and evaluation of each leakage reduction activity to achieve project goals. In addition, it will be possible to derive a water leakage reduction plan with optimal efficiency by evaluating the current pipeline conditions according to the leakage ratio/revenue water ratio, selecting the pipeline maintenance target, and reducing the water leakage through pressure control. Finally, it will be possible to prioritize pipeline maintenance in order to increase leakage reduction efficiency by calculating the amount of leakage reduction for each pipeline.

Author Contributions

Conceptualization, Y.H.C. and T.C.; data curation, T.C.; methodology, D.G.Y. and Y.H.C.; supervision, S.L.; writing—original draft, Y.H.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Research Foundation of Korea (NRF) (NRF-2021R1G1A1003295).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are not publicly available.

Acknowledgments

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government.

Conflicts of Interest

There is no conflict of interest, and the source of funding, including the grant number for this paper, have been declared.

References

  1. Criminisi, A.; Fontanazza, C.M.; Freni, G.; Loggia, G.L. Evaluation of the apparent losses caused by water meter under-registration in intermittent water supply. Water Sci. Technol. 2009, 60, 2373–2382. [Google Scholar] [CrossRef] [PubMed]
  2. De Marchis, M.; Fontanazza, C.M.; Freni, G.; La Loggia, G.; Napoli, E.; Notaro, V. A model of the filling process of an intermittent distribution network. Urban Water J. 2010, 7, 321–333. [Google Scholar] [CrossRef]
  3. Wang, R.; Wang, Z.; Wang, X.; Yang, H.; Sun, J. Pipe burst risk state assessment and classification based on water hammer analysis for water supply networks. J. Water Resour. Plan. Manag. 2014, 140, 04014005. [Google Scholar] [CrossRef]
  4. Zyoud, S.H.; Shaheen, H.; Samhan, S.; Rabi, A.; Al-Wadi, F.; Fuchs-Hanusch, D. Utilizing analytic hierarchy process (AHP) for decision making in water loss management of intermittent water supply systems. J. Water Sanit. Hyg. Dev. 2016, 6, 534–546. [Google Scholar] [CrossRef] [Green Version]
  5. Ndunguru, M.G.; Hoko, Z. Assessment of water loss in Harare, Zimbabwe. J. Water Sanit. Hyg. Dev. 2016, 6, 519–533. [Google Scholar] [CrossRef]
  6. Kayaalp, F.; Zengin, A.; Kara, R.; Zavrak, S. Leakage detection and localization on water transportation pipelines: A multi-label classification approach. Neural Comput. Appl. 2017, 28, 2905–2914. [Google Scholar] [CrossRef]
  7. Taha, A.W.; Sharma, S.; Lupoja, R.; Fadhl, A.N.; Haidera, M.; Kennedy, M. Assessment of water losses in distribution networks: Methods, applications, uncertainties, and implications in intermittent supply. Resour. Conserv. Recycl. 2020, 152, 104515. [Google Scholar]
  8. Negharchi, S.M.; Shafaghat, R. Leakage estimation in water networks based on the BABE and MNF analyses: A case study in Gavankola village, Iran. Water Supply 2020, 20, 2296–2310. [Google Scholar] [CrossRef]
  9. Choi, T.; Kang, K.; Koo, J. Efficiency evaluation of leakage management using data envelopment analysis. J.-Am. Water Work. Assoc. 2015, 107, E1–E11. [Google Scholar] [CrossRef]
  10. Lambert, A. Accounting for losses: The bursts and background concept. Water Environ. J. 1994, 8, 205–214. [Google Scholar] [CrossRef]
  11. Boztaş, F.; Özdemir, Ö.; Durmuşçelebi, F.M.; Firat, M.A.H.M.U.T. Analyzing the effect of the unreported leakages in service connections of water distribution networks on non-revenue water. Int. J. Environ. Sci. Technol. 2019, 16, 4393–4406. [Google Scholar] [CrossRef]
  12. Benesty, J.; Chen, J.; Huang, Y.; Cohen, I. Pearson correlation coefficient. In Noise Reduction in Speech Processing; Springer: Berlin/Heidelberg, Germany, 2019; pp. 1–4. [Google Scholar]
  13. Abdi, H. The Kendall rank correlation coefficient. Encycl. Meas. Stat. 2007, 2, 508–510. [Google Scholar]
  14. Mark, J.; Goldberg, M.A. Multiple regression analysis and mass assessment: A review of the issues. Apprais. J. 1988, 56, 89–109. [Google Scholar]
  15. Krizhevsky, I.A.; Sutskever, G.E. Hinton. ImageNet classification with deep convolutional neural networks. In Proceedings of the Advances in Neural Information Processing Systems 25 (NIPS’2012), Lake Tahoe, NV, USA, 3–6 December 2012. [Google Scholar]
  16. Wang, S.C. Artificial neural network. In Interdisciplinary Computing in Java Programming; Springer: Boston, MA, USA, 2003; pp. 81–100. [Google Scholar]
  17. Congalton, R.G. A review of assessing the accuracy of classifications of remotely sensed data. Remote Sens. Environ. 1991, 37, 35–46. [Google Scholar] [CrossRef]
Figure 1. Procedure of this study.
Figure 1. Procedure of this study.
Applsci 12 12530 g001
Figure 2. Configuration of Deep Neural Network Model.
Figure 2. Configuration of Deep Neural Network Model.
Applsci 12 12530 g002
Figure 3. Formulation of Relu function.
Figure 3. Formulation of Relu function.
Applsci 12 12530 g003
Table 1. Example of data from the application area.
Table 1. Example of data from the application area.
Study AreaRevenue Water RatioIncrease Revenue Water RatioLeakage RatioDecrease Leakage RatioPipe
Replacement
(PR)
Meter Replacement
(MR)
Leakage Repair
(LR)
Leakage Detection
(LD)
%%%%km/km × MillionNo./Per. × 1000No./km × 1000No./km × 1000
TY 201147.16.740.5−5.728.77640.9581.7711.261
HP 201149.3−0.928.50.10.00030.5990.3590.186
HP 201050.25.428.6−0.219.158141.3900.6450.250
BH 201452.2−15.243.1−15.50.22937.6630.1070.000
NS 200554.1−2.636.30.211.59435.9422.8390.516
JD 201454.89.040.79.722.27089.2470.4890.249
BH 201557.65.431.611.59.971114.2190.4550.213
SC 200758.010.936.0−0.424.31024.8890.9200.456
GS 201158.310.636.63.214.60159.7040.4640.630
WD 201559.38.740.78.77.40814.4820.2980.160
TY 201259.612.535.94.635.70527.6961.9340.756
HP 201259.810.535.7−7.20.00030.4730.3460.147
CS 201859.80.137.03.30.000227.1280.6440.319
GR 200860.69.235.08.48.05332.3651.1370.272
HP 201362.52.728.67.110.3471.4680.4570.269
JE 200662.79.732.08.016.09419.8530.8420.072
DY 200964.12.514.10.413.58467.8770.7620.593
YC 200764.27.127.9−4.240.30455.0742.1000.100
NS 200664.510.429.46.945.21710.6044.3220.180
BH 201664.97.325.95.729.73052.1470.5050.207
SC 200865.27.228.97.115.39021.7290.8020.318
GS 201266.27.929.07.641.11236.2740.6590.342
GJ 200966.47.627.49.418.4915.6852.8461.097
JD 201566.711.928.512.230.74831.0100.3150.293
JE 200767.75.015.116.932.8858.0710.9510.292
GJ 201067.81.426.90.519.8467.2811.7170.763
WD 201667.98.627.213.540.43030.4110.3360.258
GeS 200969.5−1.125.1−1.032.92023.7540.9050.237
NS 200769.85.327.51.9125.81810.4322.6040.074
Table 2. Results of correlation analysis between independent variables.
Table 2. Results of correlation analysis between independent variables.
Standard of
Categorization
PRMRLRLD
All dataLeakage ratio of higher than 20%PR1---
MR−0.1031--
LR0.401−0.1451-
LD0.006−0.0440.3601
Revenue water ratio of less than 70%PR1---
MR−0.2681--
LR0.455−0.2731-
LD−0.079−0.0860.3181
The leakage ratioLess than 10% PR1---
MR0.4611--
LR0.8320.4221-
LD0.8540.4570.8661
10–20%PR1---
MR0.0211--
LR0.416−0.0021-
LD0.2720.0640.6491
20–30%PR1---
MR0.0251--
LR0.420−0.1681-
LD−0.118−0.1760.2431
Higher than 30%PR1---
MR−0.3161--
LR0.497−0.2131-
LD0.690−0.0710.6161
The revenue water ratioLess than 60%PR1---
MR−0.1911--
LR0.525−0.1951-
LD0.710−0.1570.6641
60–70%PR1---
MR−0.1361--
LR0.386−0.3371-
LD−0.373−0.1650.1511
70–80%PR1---
MR0.0261--
LR0.399−0.0211-
LD0.1130.0230.4311
Higher than 80%PR1---
MR0.1061--
LR0.1860.0031-
LD0.240−0.0650.6711
Table 3. Results of correlation analysis between independent and dependent variables.
Table 3. Results of correlation analysis between independent and dependent variables.
Standard of
Categorization
Pipe Replacement
(PR)
Meter Replacement
(MR)
Leakage Repair
(LR)
Leakage Detection
(LD)
km/km × MillionNo./Per. × 1000No./km × 1000No./km × 1000
All dataLeakage ratio of higher than 20%0.112−0.0130.0640.059
Revenue water ratio of less than 70%0.131−0.074−0.026−0.034
The leakage ratioLess than 10%0.7760.5950.6880.695
10–20%0.582−0.0600.2840.166
20–30%0.097−0.2130.0940.225
Higher than 30%0.2160.1990.003−0.119
The revenue water ratioLess than 60%0.569−0.1110.0710.374
60–70%0.0230.0820.136−0.227
70–80%0.463−0.1900.2870.197
Higher than 80%0.507−0.1030.1070.248
Table 4. Results of converting to T-scores.
Table 4. Results of converting to T-scores.
Study AreaLeakage Ratio Reduction (%)Pipe Replacement
(PR)
Meter Replacement
(MR)
Leakage Repair
(LR)
Leakage Detection
(LD)
T-Score for PRT-Score for MRT-Score for LRT-Score for LD
NS 20120.506.4515.960.400.1261.7451.3859.5457.62
GJ 2013−1.200.0021.010.190.0142.2856.6146.6141.32
GJ20170.800.283.630.160.0243.1438.6544.6142.20
PJ 20120.603.209.870.380.1251.9445.1057.8356.98
DDC 2015−2.702.6613.010.170.0650.3248.3345.1247.41
GJ 20180.500.3328.020.160.0443.2763.8444.9345.69
YJ 2010−1.601.2610.880.320.1146.0746.1454.7354.95
PJ 20130.402.459.390.330.0449.6844.6054.8545.05
YJ 20110.200.9215.670.280.0345.0751.0952.0744.09
PJ 20140.402.198.990.340.1248.9044.1855.6756.45
GJ 20121.900.2818.490.180.0243.1254.0045.6542.77
PJ 20150.407.3510.220.420.1264.4545.4560.6757.07
DDC 2017−0.403.3624.760.120.0552.4360.4842.4346.31
YJ 20130.000.0015.110.170.0142.2850.5145.4040.16
YJ 20121.300.003.260.210.0142.2838.2647.9840.18
DDC 20161.801.1532.230.040.0245.7568.1937.6241.57
YJ 2009−0.901.479.030.250.1046.7244.2349.9754.14
NJ 20162.700.1822.490.090.0942.8358.1340.7353.12
PJ 2017−0.504.118.800.220.1354.6943.9948.5458.75
EC 200820.4016.3349.980.940.3491.5486.5391.8689.58
YJ 2015−0.300.156.210.140.0342.7341.3243.3443.38
YJ 20140.700.148.250.140.0142.7143.4243.3241.22
PJ 20161.303.8715.030.310.1553.9450.4253.6561.41
DDC 20146.006.982.570.170.0763.3237.5645.5249.15
DDC 20182.102.1710.660.210.1048.8345.9147.7953.94
PJ 20181.803.2314.540.410.1152.0349.9259.7855.95
YJ 2018−0.501.5415.930.170.0246.9251.3545.4642.69
YJ 2017−0.200.7611.520.110.0144.5646.8041.7940.84
YJ 20163.401.388.420.120.0546.4643.6042.5346.02
Table 5. Results of deriving leakage management model formulation.
Table 5. Results of deriving leakage management model formulation.
Standard of
Categorization
MRADNN
Leakage Management Model FormulationUsed VariablesR
X1X2X3X4
Leakage ratioLess than 10% (Group 1)YG1 = 0.22X1 + 0.12X2 + 0.04X3 − 3.17OOO 0.820.78
10–20% (Group 2)YG2 = 0.14X1 + 1.06X3 + 0.24O O 0.590.56
20–30% (Group 3)YG3 = 0.03X1 + 7.01X4 + 1.71O O0.320.57
Higher than 30% (Group 4)YG4 = 0.52X1 + 0.06X2 + 1.47X3 − 3.41OOO 0.580.66
Revenue water ratioLess than 60% (Group 5)YG5 = 0.42X1 + 3.75X4 + 1.14O O0.640.71
60–70% (Group 6)YG6 = 0.02X2 + 0.94X3 + 6.99 OO 0.340.45
70–80% (Group 7)YG7 = 0.08X1 + 0.85X3 + 4.34X4 + 1.53O OO0.530.55
Higher than 80% (Group 8)YG8 = 0.1X1 + 4.35X4 − 0.47O O0.550.58
Table 6. Example of data classifying based on spatial and temporal characteristics.
Table 6. Example of data classifying based on spatial and temporal characteristics.
Study AreaRevenue Water RationIncrease Revenue Water RationLeakage RatioDecrease Leakage RatioPipe
Replacement
(PR)
Meter Replacement
(MR)
Leakage
Repair
(LR)
Leakage Detection
(LD)
%%%%km/km × MillionNo./Per. × 1000No./km × 1000No./km × 1000
GeJ 200966.47.627.49.418.495.682.851.10
GeJ 201067.81.426.90.519.857.281.720.76
GeJ 201172.54.722.14.84.5710.230.550.42
GeJ 201275.63.119.42.732.3212.440.780.51
GeJ 201374.8−0.819.20.220.308.910.880.40
GeJ 201480.45.614.74.526.435.800.650.33
GeJ 201580.50.114.70.014.049.160.680.37
GeJ 201680.0−0.514.8−0.16.4711.030.540.18
GeJ 201780.30.315.1−0.31.436.860.670.17
GeJ 201875.9−4.419.5−4.41.737.520.710.21
GR 200860.69.235.08.48.0532.371.140.27
GR 200972.111.512.422.664.1726.880.830.21
GR 201072.30.221.2−8.82.4515.600.600.06
GR 201176.74.418.72.52.6234.670.740.15
GR 201278.61.916.62.13.2533.510.860.22
GR 201380.01.415.01.63.4410.950.800.46
GR 201480.00.015.5−0.57.7014.620.680.39
GR 201580.30.315.20.32.3449.380.460.28
GR 201680.70.414.80.42.3128.570.540.19
GR 201778.2−2.514.9−0.11.0550.590.370.14
GR 201875.6−2.614.90.01.4851.910.260.13
···
GS 201158.310.636.63.214.6059.700.460.63
GS 201266.27.929.07.641.1136.270.660.34
GS 201373.06.822.56.521.5335.110.460.25
GS 201480.07.015.07.550.5111.340.330.18
GS 201578.1−1.917.2−2.214.2318.690.320.20
GS 201680.52.414.92.36.9922.070.280.18
GS 201780.3−0.215.0−0.10.7724.290.290.16
GS 201879.3−1.015.9−0.92.8825.730.220.12
Table 7. Results of deriving leakage management model formulation considering spatial and temporal characteristics.
Table 7. Results of deriving leakage management model formulation considering spatial and temporal characteristics.
Standard of
Categorization
MRADNN
Leakage Management Model FormulationR
GeJYGeJ = −0.34X2 − 4.59X3 + 19.28X4 + 1.680.750.76
GRYGR = 0.35X1+ 0.24X2 + 11.05X3 + 14.07X4 − 18.620.980.82
GSYGS = 0.35X1 + 0.30X2 − 2.22X3 − 20.35X4 − 6.370.910.92
NJYNJ = 0.12X1 − 0.11X2 − 7.57X3 + 16.26X4 + 1.530.770.84
DDCYDDC = −0.06X1 − 0.10X2 − 72.42X3 − 13.54X4 + 5.500.850.86
BHYBH = −0.14X1 + 0.03X2 − 10.64X3 + 13.99X4 − 15.410.990.99
SCYSC = −0.40X1 − 0.18X2 + 30.92X3 − 21.09X4 − 2.510.760.85
SSYSS = 0.22X1 + 0.03X2 − 3.59X3 + 26.57X4 − 1.130.90.86
JEYJE = −0.19X1 + 0.01X2 + 22.64X3 + 14.56X4 − 6.190.990.99
TYYTY = 0.22X1 − 0.38X2 + 1.47X3 − 1.90X4 + 5.060.820.87
HPYHP = 0.50X1 − 0.09X2 + 1.67X3 + 35.71X4 − 6.390.870.91
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Choi, Y.H.; Choi, T.; Yoo, D.G.; Lee, S. Development of an Evaluation Method for Deriving the Water Loss Reduction Factors of Water Distribution Systems: A Case Study in Korean Small and Medium Cities. Appl. Sci. 2022, 12, 12530. https://doi.org/10.3390/app122412530

AMA Style

Choi YH, Choi T, Yoo DG, Lee S. Development of an Evaluation Method for Deriving the Water Loss Reduction Factors of Water Distribution Systems: A Case Study in Korean Small and Medium Cities. Applied Sciences. 2022; 12(24):12530. https://doi.org/10.3390/app122412530

Chicago/Turabian Style

Choi, Young Hwan, Taeho Choi, Do Guen Yoo, and Seungyub Lee. 2022. "Development of an Evaluation Method for Deriving the Water Loss Reduction Factors of Water Distribution Systems: A Case Study in Korean Small and Medium Cities" Applied Sciences 12, no. 24: 12530. https://doi.org/10.3390/app122412530

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop