Next Article in Journal
Three-Dimensional Unsteady Aerodynamic Analysis of a Rigid-Framed Delta Kite Applied to Airborne Wind Energy
Next Article in Special Issue
Logistic Regression with Wave Preprocessing to Solve Inverse Problem in Industrial Tomography for Technological Process Control
Previous Article in Journal
Numerical Investigations on the Damage Behaviour of a Reconstructed Anode for Solid Oxide Fuel Cell Application
Previous Article in Special Issue
Dynamics of Electromechanical Systems Containing Long Elastic Couplings and Safety of Their Operation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Application of Electrical Tomography Imaging Using Machine Learning Methods for the Monitoring of Flood Embankments Leaks

by
Tomasz Rymarczyk
1,2,*,
Krzysztof Król
1,2,
Edward Kozłowski
3,
Tomasz Wołowiec
4,
Marta Cholewa-Wiktor
3 and
Piotr Bednarczuk
2
1
Research and Development Center, Netrix S.A., 20-704 Lublin, Poland
2
Faculty of Transport and Computer Science, University of Economics and Innovation, 20-209 Lublin, Poland
3
Faculty of Management, Lublin University of Technology, 20-618 Lublin, Poland
4
Faculty of Administration and Social Sciences, University of Economics and Innovation, 20-209 Lublin, Poland
*
Author to whom correspondence should be addressed.
Energies 2021, 14(23), 8081; https://doi.org/10.3390/en14238081
Submission received: 2 November 2021 / Revised: 22 November 2021 / Accepted: 30 November 2021 / Published: 2 December 2021

Abstract

:
This paper presents an application for the monitoring of leaks in flood embankments by reconstructing images in electrical tomography using logistic regression machine learning methods with elastic net regularisation, PCA and wave preprocessing. The main advantage of this solution is to obtain a more accurate spatial conductivity distribution inside the studied object. The described method assumes a learning system consisting of multiple equations working in parallel, where each equation creates a single point in the output image. This enables the efficient reconstruction of spatial images. The research focused on preparing, developing, and comparing algorithms and models for data analysis and reconstruction using a proprietary electrical tomography solution. A reliable measurement solution with sensors and machine learning methods makes it possible to analyse damage and leaks, leading to effective information and the eventual prevention of risks. The applied methods enable the improved resolution of the reconstructed images and the possibility to obtain them in real-time, which is their distinguishing feature compared to other methods. The use of electrical tomography in combination with specific methods for image reconstruction allows for an accurate spatial assessment of leaks and damage to dikes.

1. Introduction

Floods cause enormous damage, and are the cause of many tragedies. Therefore, the monitoring of flood defences is important for the safety of people and the environment. However, the methods used to identify cracks and seepage in dikes visually are insufficient. A dedicated monitoring system should be able to test dikes for seepage.
There are several methods which are used to analyze flood embankments. Table 1 shows the main advantages and disadvantages of each method. Geophysical methods are used in environmental studies to image objects [1]. They are characterized by their non-invasive character and the possibility of rapid imaging in soil. The GPR radar method is based on the waves passing through the object and allowing the analysis of their propagation through the analysis of the reflected signal, by which changes in the structure of the investigated medium can be observed [2]. Another method is thermography, which uses the phenomenon of the non-contact detection of electromagnetic radiation in the mid-infrared band. By analizing thermal images at different time intervals, changes in flood weights can be detected [3]. Another type of method is based on the analysis of the propagation of the generated electric current in the studied medium. This solution determines changes in the physical parameters of the tested object (resistivity). Finally, electrical resistivity tomography (ERT) [4,5,6] is used to study shafts. The resistivity tomography method consists of placing the measuring electrodes along the profile line at equal distances. Resistivity tomography is a technique which is often used for the imaging of geological soil properties. The electrical properties of soil are strongly influenced by such hydrological properties of rocks and soils as porosity and water saturation. Diffuse optical sensing [7,8,9,10,11,12], where optical fibres are placed at the bottom of the embankment, is also used for the monitoring of dikes. This use of sensors allows the reconstruction of temperature and strain profiles. However, it should be noted that distinguishing seepage in the obtained temperature data from terrain anomalies within earth embankments is still a difficult task. It should also be mentioned that fibre optic sensors require physical contact with the seepage material. Therefore, proper installation can become an invasive and challenging issue, especially when using this technology to monitor existing levees. However, fibre optics can be a very efficient and cost-effective solution for the monitoring of long stretches of new embankments where fibre optics can be easily installed within the embankment.
The Electrical Impedance Tomography (EIT) method enables the analysis of cracks and seepage [13] by taking measurements reconstructing them appropriately in the form of an image [14,15,16,17]. The monitoring of dikes, levees, dams, and floodwalls is a key process affecting the maintenance and reliability of these structures due to catastrophic effects and high possible damage costs [18]. The continuous and multi-level control of the technical condition of embankments is costly. Therefore, there is a great need to replace traditional observation methods with cyber-physical systems, in which impedance tomography (EIT) plays a key role [18,19,20,21]. The disadvantage of the commonly used solutions is the point measurement [22]. The most common cause of damage to shafts is water that penetrates through seepage and leaks. With the EIT implemented in cyber-physical monitoring systems, it is possible to obtain a cross-sectional view of the dike interior [23,24]. The image quality and resolution are sufficient to detect water-penetrated areas [25]. Early information about hazards helps to prevent severe damage. Despite significant advances in dike design and construction, unexpected events may occur. Therefore, designing an economical and reliable device to monitor the condition of dikes or flood-control reservoirs is crucial to prevent the devastating effects of flooding.
In tomographic systems, the image quality and its parameters—model geometry, spatial resolution, number of sensors, discretization, and reconstruction method—are important factors. Different deterministic and machine learning methods are used to solve inverse problems depending on the tomographic technique used. The goal of these methods is to obtain the expected solution quickly and with a good image reconstruction quality. Electrical impedance tomography has a relatively low image resolution. However, in the developed methods, the resolution is sufficient for seepage analysis or shaft condition monitoring.
The motivation for the research was to develop a distributed system to analyze dikes by spatially analyzing their condition in real-time. The main objective was to develop a comprehensive solution consisting of a measurement device, measurement electrodes, algorithms to reconstruct images from measurement data, and a cyber-physical system to monitor the dike condition. The solution based on electrical impedance tomography was chosen because of the possibility of the area analysis of the tested object using measurements on the near-surface of the tested object, the method of data acquisition, and the possibility of the analysis of changes in the values of electrical parameters. Besides this, electrical impedance tomography is quite sensitive to cracks and seepage, which undoubtedly makes it effective in shaft monitoring.
The authors’ main contribution is the development of a solution based on machine learning algorithms that work individually for each pixel of a tomographic image. The primary goal was to increase the resolution of the image reconstruction an-d enhance the system’s ability to identify different kinds of anomalies within flood embankments. The system architecture is shown in Figure 1. The data are collected from the tomographic equipment and then sent to the analysis software system.
The subject of the application of logistic regression with an elastic net regularisation [26], principal components analysis (PCA) [27] and wavelet preprocessing [28] in electrical impedance tomography [29,30] has been undertaken to verify and test the extent to which this approach can be helpful in the study of EIT dikes. For this purpose, numerical models and real models with different variations of electrode placement were prepared. Furthermore, the measurements from the tomograph were used to reconstruct the image of the object’s interior by collecting and analysing the data using machine learning methods.
The paper is divided into five sections, with an introduction. The architecture of the designed system is presented in Section 2. It also describes the measurement device, laboratory measurements, numerical models and machine learning algorithms. The work results obtained by reconstructing the corresponding images for the synthetic and measurement data are presented in Section 3. Section 4 discusses the results obtained, while Section 5 summarises the research carried out.

2. Materials and Methods

The following section presents electrical tomography’s structure and measurements, elements, apparatus, image reconstruction models from simulated and real measurements, and mathematical algorithms. In addition, the data collection methods and measurement methods are presented. The data are used in the next step to solve progressive and inverse problems. In the following subsections, several algorithms that were used in this work are described. These are wavelet-preprocessing, elastic net, and PCA algorithms. The above algorithms were used for tomographic imaging, and finally, they were compared. The above methods were used to obtain rapid and efficient solutions to build models on the reconstructed areas. The work was performed using tomographic equipment designed and manufactured at Netrix SA Research and Development Center. Additionally, Matlab, Python in R language and Eidors packages and tools were used [31].

2.1. Electrical Tomography

Electrical impedance tomography [32,33], generally called electrical tomography (ET) [34,35], includes some image reconstruction techniques. The methods available in the literature are electrical resistance tomography (ERT) [36,37], electrical capacitance tomography (ECT) [38,39,40,41] and electrical impedance tomography. ECT is based on the reconstruction of the electrical permeability ε, whereas EIT is based on the reconstruction of the conductivity γ. It is worth noting that electrical tomography has a low image resolution. This is caused by the limited number of current flow measurements through the test piece and the low sensitivity of the measured voltages, which depends on the conductivity change inside the test piece. In EIT, the current or voltage is introduced into an object through a set of sensors attached to its surface. The voltage drop values are measured on the other sensors. The properties of the test object are reconstructed by solving the inverse problem. Using the electrical properties of the materials, the interior of the test object can be mapped. Impedance tomography uses the conductivity of objects; by connecting a current or voltage source, a current flow or voltage distribution is forced. The current or voltage is observed to flow at the edge of the test area. The data collected in this way are used to build images using reconstruction algorithms. The image thus prepared is called a tomogram.

2.2. Measurement System

The system collects measurement data from the electrodes through a tomographic device that allows the measurement of up to 32 channels. The entire infrastructure is centrally managed for data collection and measurement recording. The application enables measurement collection, data management and device monitoring.
A finite element model and a real measurement model were prepared to test the effectiveness of different algorithms for the analysis of the procedures in tomography. In turn, electrical impedance tomography was used for the analysis.
The measurements were performed with a hybrid tomograph that bases its data controller and measurement structure on two stm32 controllers from the F4 family. Thanks to this solution, the measuring processor does not generate complex graphical interfaces; rather, it only performs the necessary operations in the tomographic process. This further significantly improves the solution performance. Figure 2 shows the block diagram of the built tomograph. The device is shown in Figure 3 as measurement block, a control and communication system, an inside view, and a measurement panel.
The excitation signal source was built on a direct digital synthesis (DDS) signal generator coupled with a current-generation system. The signal prepared in this way is then sent to the inputs of the multiplexer system, where—depending on the sequence of current measurements—it is transmitted to the selected output. The measurement controller performs the control of the currently measured/selected item. The measuring block consists of a set of active filters minimising the influence of the 50 Hz components on the measurement accuracy. The prepared signal travels to the digitally controlled gain control system, and to the Analog-to-Digital Converter (ADC). Each correct measurement of the matrix is sent to the control system, which, depending on the selected configuration, performs the process of recording or sending data to the server. Thanks to the integrated measuring circuit (capacitance/impedance), the device can be used to perform both EIT (impedance) and ECT (capacitance) measurements. The device is designed to work with sets of 8, 16 and 32 electrodes. The measurement takes place sequentially, depending on the selected mode. Two modes of EIT measurement are available (polar Ground GND—measurement against the negative electrode; polar NB—differential measurement). In the case of ECT measurements, only one mode is available: interelectrode measurement. In addition to mode selection, the instrument allows the user to choose the results’ averaging level and select the gain parameters. A 2.4-inch graphic touch screen is used to operate the instrument, select the functions and start the measurement process. The instrument enables the storage of the data on a portable memory carrier (pen drive), and allows the user to stream it via a serial port or the Ethernet.
The experiments were conducted on two measurement models. The first model (Figure 4) uses an array of 16 measurement electrodes equally spaced along the profile line, and a measurement array with 32 electrodes evenly spaced in two rows. In contrast, the second model consists of two multi-sensor arrays with 32 measuring electrodes (Figure 5). The EIT’s task is to generate an electrical current with specific parameters for each pair of sensors, read the voltage between each electrode, and send the data to the output. The flood embankment models were designed to represent the key parameters of the existing facility as accurately as possible.

2.3. Algorithms and Methods

Many deterministic, statistical, or computational intelligence methods solve optimisation tasks [42,43,44,45]. In this paper, the authors concentrate on the use of machine learning. In addition to the real models, the imaging domain had to be prepared as a composite finite-element mesh model. In this case, a model was created for each element, representing the relationship between the value of the signal coming from the electrodes and the conductivity. If a different conductivity comes out of the measurements than that in the background measurements, there is a water concentration in the area. With the above information and measured values, the location of the flooded embankment can be determined. Therefore, a model is defined for each finite element. This allows the calculation of the probability of conductivity that differs from the background conductivity for each finite element. Based on this, the resolution in the imaging domain was determined, and thus, the mesh was reconstructed. The resolution of the imaging domain closely depends on the classification level, and it is the classification level that determines the occurrence of conductivity which is different from the background conductivity. By performing ROC analysis [46,47], each finite element’s classification level was determined, which should be understood here as the limiting probability of percolating conductivity.

2.3.1. Logistic Regression

Knowing the model and the background measurement, we need to answer the question of which finite elements in the area belong to the background and which belong to the inclusions. If the answer to this question is already known, the reconstruction in the EIT [48] can be created. For this purpose, a logistic regression model is defined for each finite element. One defines the probability space as ( Ω , , P ) and defines a random variable Y with a discrete distribution for a finite element where Y : Ω { 0 ,   1 } . Then, it can be concluded that, in the presented approach, while the finite element belongs to the inclusions (increased humidity), it is assumed that the realisation of the random variable Y is one; otherwise, it is zero. Based on the X m   t h e signal from the sensors, it is determined whether the element belongs to the inclusions. For this purpose, one needs to define a classifier f : m   { 0 ,   1 } ). A classifier in the present work was logistic regression.
The logistic regression model is used to estimate the binomial (or polynomial) distribution of the response variable Y based on the realisations of the input variables X m ; that is, we determine P(Y = y|X), where y { 0 , 1 } , when P(Y = y|X) is the probability of success and P(Y = 0|X) is the probability of failure. We should mention the odds, which are the ratio of success to failure, and are calculated as follows:
θ ( X ) = P ( Y = 1 | X ) 1 P ( Y = 1 | X ) ,
The main aim of logistic regression [49,50,51] consists in the estimation of the success probability P ( Y = 1 | X ) based on the observation of vector of input variable X . Because P ( Y = 1 | X ) ( 0 ,   1 ) , from Equation (1) we know that the odds θ ( X ) ( 0 , ) , but ln θ ( X ) ( , ) . In logistic regression, the linear relationship between the logit and the vector of input variable X is analysed as follows:
ln θ ( X ) = ln ( p ( β 0 , β , X ) 1 p ( β 0 , β , X ) ) = β 0 + X T β + ε ,
where ε is a random variable with normal distribution N ( 0 , σ 2 ) , and β 0 and β = ( β 1 , β 2 , , β m ) m denote the model parameters. From (2) we know the success probability:
P ( Y = 1 | X ) = d e f p ( β 0 , β , X ) = e β 0 + X T β 1 + e β 0 + X T β ,
In order to determine the unknown parameters β 0 and β in (2), we use the Maximum Likelihood Estimation (MLE) method (i.e., we estimate the parameters by the method of maximum likelihood). Let D = { ( X , Y ) : X n × m ,   Y { 0 ,   1 } n } be a dataset:
Y = [ y 1 y 2 y n ] , X = [ x 11 x 12 x 1 m x 11 x 12 x 1 m x n 1 x n 2 x n m ] = [ x ( 1 ) x ( 2 ) x ( n ) ] ,
where y i { 0 ,   1 } and x ( i ) m denote the realisations of the response and input variables for the i−th observation, 1 ≤ in.
In order to estimate the parameters β 0 and β in logistic regression (2), we solve the following equation:
max β 0 , β L ( β 0 , β , Y , X ) ,
where the objective function L(β0, β, Y, X) is a likelihood defined as
L ( β 0 , β , Y , X ) = i = 1 N ( p ( β 0 , β , x ( i ) ) y i ( 1 p ( β 0 , β , x ( i ) ) ) 1 y i ) ,
Instead of task (5), we solve the auxiliary task:
max β 0 , β ln L ( β 0 , β , Y , X ) ,
where the logarithm of the objective function (7) is equal to
ln L ( β 0 , β , Y , X ) = i = 1 N ( y i ( β 0 + x ( i ) T β ) ln ( 1 + e β 0 + x ( i ) T β ) )
The Newton–Raphson algorithm was used to solve the auxiliary task (5) and to estimate the parameters β 0 and β .

2.3.2. Receiver Operating Characteristic (ROC)

The viewing area consists of k finite elements. A logistic regression model (3) is determined for each finite element. The obtained signal X m from the electrodes is used to estimate the probability of inclusion P j ( Y = 1 | X ) for j− finite element, 1 ≤ jk. The reconstruction of the viewing area is defined as a sequence { z ^ j } 1 j k , where for cutting level l ( 0 ,   1 ) .
For the proper recognition of the visual area, coefficients describing the quality of the area recognition must be provided. For this purpose, it is assumed that a finite element that does not belong to the inclusions is described as a negative case (N); observing the problem at hand, this should be interpreted as a ground. On the other hand, if a finite element belongs to the inclusions and is in the inclusion area, this is taken as the positive case (P).
The confusion matrix is determined in the following way: TP (True Positive) denotes the finite elements that correctly belong to the inclusion area, TN (True Neg-ative) denotes the number of finite elements that are properly recognized as belonging to the background, FP (False Positive) denotes the number of finite elements belonging to the background that are recognized as having belonged to the inclusion area (false alarm), and FN (False Negative) denotes the number of finite elements belonging to the inclusion area but recognized as background (Table 2).
For each finite element, from the readings X, we determine the inclusion probability P ( Y = 1 | X ) according to Equation (3) and assume for the level l ( 0 ,   1 ) .
The quality of image reconstruction (finding inclusions in the field of view) based on a classifier by logistic regression (3) is estimated by ratios [46,47]. The based ratios (coefficients) are determined as follows:
A c c u r a c y = T P + T N T P + T N + F P + F N ,
T r u e   P o s i t i v e   R a t e = S e n s i v i t y = T P T P + F N ,
S p e c i f i c i t y = 1 F a l s e P o s i t i v e R a t e = T N T N + F P ,
P o s i t i v e P r e d i c t i v e V a l u e = T P T P + F P ,
N e g a t i v e P r e d i c t i v e V a l u e = T N T N + F N ,
D e t e c t i o n R a t e = T P T P + T N + F P + F N ,
The accuracy represents the part of the visual area that the model correctly recognises. Note that this is only one possible measure that represents the correctness of the recognition.

2.3.3. Elastic Net

Signals derived from EIT tomography electrodes can be collinear signals. Because such collinear variables may appear, the Elasticnet method for logistic regression was applied [26]. A possible regularisation method is an elastic net, defined as a linear mixture of a Selection Operator (LASSO), Least Absolute Shrinkage and ridge regression, which is called Tikhonov regularisation [52,53]. Instead of the solution of classical task (5), which is connected with an estimation of unknown parameters in the logistic regression model, we optimise the objective function (6) whilst taking the penalty into account. In order to do this, we need to solve the following equation:
max β { i = 1 N ( y i x ( i ) β ln ( 1 + e x ( i ) β ) ) λ P α ( β ) }
where λ > 0 denotes the regularisation ratio, and 0 α 1 , and P α denotes the penalty, as follows:
P α ( β ) = α | | β | | L 1 + 1 α 2 | | β | | L 2 = j = 1 p ( α | β j | + 1 α 2 β j 2 )
The penalty P α ( β ) is the combination of the linear norm of the parameter β in L 1 and L 2 spaces. For α = 0 we have ridge regression, whereas for α = 1 we have LASSO regularisation.

2.3.4. Principal Components Analysis

Principal Component Analysis involves identifying factors (components) present in a dataset X, [49,54]. This contains n observations for m variables. The aim of PCA is to rotate a coordinate solution in such a way as to maximise the variability of the first coordinate, followed by the variability of the next coordinate, and so on. The coordinates of the new system are called the charges of the generated principal components. In the new space, most of the variability is explained by the initial factors. PCA is frequently applied to reduce size of a dataset of statistics by discarding recent factors [50].
The decomposition of an X matrix is by Singular Value Decomposition. Thus, any real matrix X may be represented as
X = U D V T + ε X
where U R n × m is the left orthogonal matrix of left singular vectors, D R m × m is the diagonal matrix containing the singular values, and V R m × m is the right orthogonal vector matrix (matrix of right singular vectors). The orthogonal matrix V satisfies the property VT = V−1. The decomposition of a given matrix X may be shown as
X = T P T + ε X
where T = UD denotes coordinates, and V = P denotes charges. A coordinate matrix T is determined by the multiplication of both sides (in a new space) of the equation X = T P T by P, obtaining
X P = T P T P = T
According to the expansion (13), we find the weights that are used in the linear combination in the column V of the matrix to form the new dimensions. This formula expresses the variance of the coordinates in the new space:
λ i = d i 2 n 1
where d i and 0 i k are the singular values from the diagonal of matrix D. The part of the variance explained by the i-th principal component is λ i j = 1 m λ j , while the amount of variance that is explained by k principal components is j = 1 k λ i j = 1 m λ j for 1 ≤ km.
Note that in EIT, for each case, the electrode readout including polarisation and projection angles x ( j ) R m (according to (13)) corresponds to the value of x ( j ) P in the new space. The logistic regression of the form (2) is determined for individual finite elements, except that we analyse the linear dependence of the chance logarithm on a value in the new coordinates ln θ ( S ) = T β + ε .

2.3.5. Discrete Wavelet Transformation (DWT)

Another method to reduce the amount of data that is redundant is to determine the projection of the signal { x t } 1 t l e q m obtained from the electrodes to an orthogonal basis named as the mother wavelet. By definition, Ψ ( t ) will be an orthogonal wavelet basis. For j , the sequence is defined as
Ψ j k ( t ) = 1 2 j 1 Ψ ( t 2 j k )
and { ϕ j k } k
ϕ j k ( t ) = 1 2 j 1 ϕ ( t 2 j k )
where ϕ ( t ) denotes the scaling function which corresponds to the parent wavelet Ψ . In this case, the time series { x t } 1 t l e q m can be represented as
x t = k = c j k ϕ j k ( t ) + i = j k = d i k Ψ i k ( t )
where the value c j k is the scaling factor, but dik is the specific coefficient. Sequence { x t } 1 t l e q m can be expressed in a different form from the level of decomposition j based on Equation (17).
The functions Ψ j k ( t ) and ϕ j k ( t ) take values which are different from zero on a bounded interval. It follows that the sequence { x t } 1 t m can be presented as follows:
x t = k = 0 s c j k ϕ j k ( t ) + i = 0 j k = 0 s d i k Ψ i k ( t )
where s < < m . According to [28], a projection operator for a time series is defined as { x t } 1 t m at level j on an orthogonal basis { ϕ j k ( t ) } 0 k s , as follows:
P j x t = k = 0 n 2 j 1 c j k ϕ j k ( t )
Much information on DWT can be read in [28,55,56]. The EIT for the sequence of scaling factors from the projection (26) was used as the input variables for the logistic regression. There are different types of wavelet, e.g., Daubechies (‘db1’–‘db18’), Coiflets (‘coif1’–‘coif5’), Symlets (‘sym1’–‘sym18’) and Biorthogonal (‘bior1.1’, ‘bior1.3’, ‘bior1.5’, ‘bior2.2’, ‘bior2.4’, ‘bior2.6’, ‘bior2.8’). Some results are presented in the next section.
To analyse this, we used Daubechies1 decomposition, which is also known as a Haar wavelet. The mother wavelet is defined as follows:
Ψ ( t ) = { 1 , t [ 0 , 1 2 ) , 1 , t [ 1 2 , 1 ) , 0 , t [ 0 , 1 )
but corresponding to her, the father wavelet is defined as follows:
ϕ ( t ) = { 1 , t [ 0 , 1 ) , 0 , t [ 0 , 1 ) .
From (15) and (16), we see that the coefficients Ψ j k ( t ) and ϕ j k ( t ) have different-from-zero values at the interval [ 2 j k , 2 j ( k + 1 ) ) .

2.3.6. Algorithm Implementation

In the classical inverse problem in EIT, we determine the conductivity of each finite element located in the viewing area. Our approach was to determine the inclusions with high probability areas; logistic regression was used for this purpose. The main problem in EIT is the problem of the co-liveness of features (measurements from electrodes). It is impossible to extract a set of stochastically independent features. In order to improve the quality of the classifier, the following techniques were additionally applied: elastic net regularization, principal component analysis and wavelet preprocessing. In this paper, however, only one type of wavelet was used, analyzing the accuracy and AUC curves. This technique is more robust to perturbations of the measurements from the electrodes. Figure 6 shows the learning process of the algorithm, while Figure 7 shows the reconstruction process using machine learning methods.

3. Results

This chapter presents water leakage results from dikes based on the built numerical models and laboratory measurements. Three models of tomographic image reconstruction systems were developed. The results of the presented solutions are presented in this chapter. The inverse and ill-posed problems of electrical tomography are related to the insufficient number of function arguments (measurements/inputs) compared to the required number of outputs (image pixels).
The presented research aims to generate a spatial image of the moisture with a specific pixel resolution from the measurements. Single models are trained with all of the measurement values as the input and only a one-pixel value as the output in order to simplify the generation of multi-pixel images. Therefore, the algorithm must generate as many models as there are pixels in the image. When learning the predictive models, each model obtains different coefficients and hyperparameters that determine the ways in which the input values are transformed to the output.
In the models presented below for each finite element, we determined the model of logistic regression. The measurements obtained from the electrodes are collinear. Thus, the direct application of logistic regression implies that the absolute values of the structural parameters of regression are quite large, and the prediction is poor. Therefore, three approaches were applied to overcome the collinearity problem of the input variables: (1) elastic net regularisation, (2) the estimation of the principal component measurements (PCA application) as input variables in the regression model, (3) the projection of the measurements obtained from the electrodes based on wavelet decomposition at level 4 .

3.1. Image Reconstruction for Model I

Figure 8 shows a numerical model of a dike with electrodes placed on its surface in one row. Model I consists of 16 electrodes, 2159 nodes, 7657 finite elements and point electrodes. Figure 9, Figure 10, Figure 11, Figure 12, Figure 13 and Figure 14 summary the image reconstruction for Model I. Figure 9, Figure 11 and Figure 13 show 3 cases of the reconstruction of the flood embankment image for model I, with different variants of water infiltration, divided into the methods of logistic regression with an elastic net, logistic regression with PCA distribution, and regression logistic with wavelet distribution. Table 3, Table 4 and Table 5 show the results of the exact reconstructions shown in Figure 9, Figure 11 and Figure 13. The accuracy of the statistical ROC model for model I is evaluated in Figure 10, Figure 12 and Figure 14.
The analysis of the reconstruction’s accuracy to the disturbances arising in the measurement obtained from the electrodes signals using the method mentioned above will be presented in Figure 15. For this purpose, instead, for a reconstruction based on signal x m we analysed the reconstruction based on a disturbed signal x + σ ε , where σ 0 and ε : Ω m is a sequence of independent identically distributed random variables with normal distribution N ( 0 , 1 ) . The figure below shows the relationship between the accuracy and AUC (area under the curve) depending on the value σ (standard deviation of disturbances).
In order to verify the performed numerical tests, a laboratory model of the flood embankment was prepared (shown in Figure 16), which was flooded with water. The image reconstruction results for the machine learning methods are shown in Figure 17, Figure 18, Figure 19 and Figure 20.

3.2. Image Reconstruction for Model II

Figure 21 shows a numerical model of a dike with electrodes placed on its surface in one row. Model I consists of 32 electrodes, 2818 nodes, 9805 finite elements and point electrodes. Figure 22, Figure 23, Figure 24, Figure 25, Figure 26 and Figure 27 summary the image reconstruction for Model II. Figure 22, Figure 24 and Figure 26 show three cases of image reconstruction for flood embankments of image reconstruction cases. The examples from model II have different variants of water infiltration divided into logistic regression methods with an elastic net, logistic regression with PCA distribution, and logistic regression with wavelet distribution. Table 6, Table 7 and Table 8 show the results of the exact reconstructions, as shown in Figure 22, Figure 24 and Figure 26. The accuracy of the ROC statistical model for Model II is evaluated in Figure 23, Figure 25 and Figure 27.
Figure 28 shows an analysis of the reconstruction accuracy of the perturbations arising in the measurements obtained from the electrode signals. The image reconstruction results for the machine learning methods are shown in Figure 29, Figure 30, Figure 31 and Figure 32.

3.3. Image Reconstruction for Model III

Figure 33 shows a numerical model of a dike with electrodes placed on its surface in one row. Model I consists of 2 × 16 electrodes, 3844 nodes, 16,240 finite elements and multi-sensors. Figure 34, Figure 35, Figure 36, Figure 37, Figure 38 and Figure 39 summary the image reconstruction for Model III. Three image reconstruction models were tested, and are shown in Figure 34, Figure 36 and Figure 38, with different variations of water infiltration. The data in Table 9, Table 10 and Table 11 show the results of the exact reconstructions shown in Figure 34, Figure 36 and Figure 38. The accuracy of the ROC statistical model for Model III is evaluated in Figure 35, Figure 37 and Figure 39.
The analysis of the accuracy of the reconstruction of the disturbances arising in the measurement obtained from the signals of the depth sensors is shown in Figure 38.
In order to verify the numerical studies performed, a laboratory model with depth sensors was constructed (Figure 39). Then, the object was flooded with water. The image reconstruction results for machine learning methods are shown in Figure 40. Laboratory model with depth sensors for 32 measuring electrodes and the reconstruction based on real measurements were presented in Figure 41 and Figure 42.

4. Discussion

This paper presented a monitoring system for flood embankments, and measurement models for water leakage were prepared. The designed models were equipped with different variants of sensors. In the first two models, point electrodes were used, while in the third model, multi-sensor electrodes were prepared. This solution directly measures specific electrical quantities and their corresponding analysis in the monitoring and damaging of flood embankments. Furthermore, machine learning algorithms were used and appropriately implemented for image reconstruction. Moreover, this is a novel approach in the application of machine learning algorithms, as other methods cannot estimate seepage in the entire volume of the object.
In this study, three flood embankment models with point and multi-sensor electrodes were implemented. The different sensors were evenly distributed along the edge of the object and inside it, depending on the model tested and the number of electrodes applied. The image reconstruction results show the great potential of the proposed solution for leakage analysis. Logistic regression with an elastic net, logistic regression with PCA, and wavelet decomposition methods solved the inverse problem. Furthermore, the solution presented works very quickly (reconstruction takes fractions of a second) and with high accuracy.
The main objectives of the research were to improve the resolution of the reconstructed image and increase the sensitivity of the monitoring system to changes occurring inside the floodbank. Thus, knowing the voltage values, the conductivity of the soil between the electrodes can be calculated. A separate algorithm generates each point of the output image. Due to the volume of data and equations to learn, the implementation of the assumptions required cloud computing. It is worth noting that the primary goal of EIT tomography is not to perfectly estimate the conductivity of the finite elements defined on the section mesh. The real goal is a faithful and accurate representation of the interior of the test object. Thus, it does not matter if the colours in the drawing correspond to the actual conductivity of the finite elements hidden inside the dike. However, it is important that the inclusions can be identified, regardless of their number, shape, and location.
By analysing Table 3, Table 4, Table 5, Table 6, Table 7, Table 8, Table 9, Table 10 and Table 11, it can be observed that the images obtained represent the models and examples studied with high accuracy. From the tables, one can read the error values of the reconstructed images relative to their models. It can be noticed that the majority of the pixels on the mesh do not include any errors. The comparative analysis of the three studied methods in the image reconstruction shows the high accuracy of each algorithm. Depending on the model and example studied, each method proves more or less accurate.
For the model shown in Figure 43, the correlation between the 96 features was determined (the signal contains 96 measurements between the electrodes). As we look at the heat map for these 96 features, we see that they are correlated.
On the other hand, by performing a PCA analysis (see Figure 44 and Table 12), we can see that five principal components explain 95% of the variation in the traits, and 10 principal components explain 98.6% of the variation in the traits.
In Figure 44, we see an exponential relationship between the contribution to the explanation of feature variance and the principal component number. The use of an elastic net shows that the finite element conductivity is best explained by measurements read from the electrodes closest to the finite element. Reconstructions obtained using logistic regression with an elastic net, PCA decomposition, and wavelet preprocessing were compared with results obtained using deterministic methods involving the variance reduction of regression models, such as total variation and Tikhonov regularization. The values obtained by logistic regression are in the interval [0, 1], so additionally, a linear transformation was performed where the value “0” corresponds to the reference value of the conductivity, while the value “1” corresponds to the inclusion conductivity. Figure 45 shows the pattern and reconstruction results.
The following measures were used to compare the reconstruction results with the standard: the total sum of squares (TSS), mean squared error (MSE), mean absolute error (MAE) and mean absolute percentage error (MAPE). The results of the comparisons are presented in the table below (Table 13).
For the model presented here, the use of logistic regression techniques improved the quality of the reconstruction. By comparing the results from the table, we can see that for the logistic regression models, the mean square error (MSE) for each finite element does not exceed 0.05, while in the case of the use of Tikhonov regularization alone, this error is almost 30 times higher, and in the case of the total variation technique it is almost eight times higher. By comparing the MAE and MAPE (mean absolute error and mean percentage error), we can see that they are several times smaller than for the Tikhonov regularization and the total variation for logistic regression. The results of the comparisons are shown in the table below (Figure 46).
For the shaft model (where the electrodes are placed only on one side of the viewing objection), by comparing the results from the table, we can see that the quality of reconstructions obtained by logistic regression with PCA and wavelet, and by total variation are quite good. In contrast, the reconstructions by logistic regression with an elastic net and by Tikhonov regularization are slightly worse. Furthermore, for the case of logistic regression with PCA, the mean absolute error (MAE) is even lower than for total variation, while the mean squared error (MSE) and mean percentage error (MAPE) for the total variation technique is smaller than that for logistic regression with PCA (Table 14).
Logistic regression with wavelet preprocessing is the most robust to noise, while logistic regression with PCA decomposition is the most sensitive to the real model.

5. Conclusions

The main purpose of the work was to develop image reconstruction algorithms for leakage analysis in various flood embankment models. The presented solution enables a fairly accurate assessment of the properties of the tested objects, and gives promising results. Algorithms based on logistic regression machine learning methods with an elastic net regularisation, PCA and wave preprocessing were used to calculate the leakage in order to obtain more accurate and stable image reconstruction, leading to a solution of the inverse problem in EIT. A measurement system and three models for the reconstruction of tomographic images with point and depth electrodes were developed in a laboratory study. Using a system of multiple separate algorithms running simultaneously to image the cross-section of a flood barrier allows the generation of accurate representations of specific patterns. The quality of this data is adequate to identify leakage over time correctly.
Furthermore, given the ability to take measurements at fixed time intervals, it is easy to determine the velocity of the leakage propagation. This information enables an accurate diagnosis to help determine the reliability of the dam. With this information, the tomographic system can plan appropriate actions before dam failure.
The approach presented was used to identify areas of high-probability inclusions, so logistic regression was used for this purpose. The main problem in EIT is the problem of the collinearity of features (measurements from the electrodes). In order to improve the quality of the classifier, additional methods were applied. Using deterministic methods, the conductivity distribution was determined and compared with the machine learning methods. The use of logistic regression for each finite element makes it possible to determine (extract) the inclusion regions. In the case of signals from electrodes without interference, the analysis of the reconstruction results using logistic regression with an elastic net and PCA is slightly better than logistic regression with wavelet preprocessing. However, when additional disturbances are introduced to the electrodes, recognising the inclusion in the visual area is better for logistic regression with wavelet preprocessing.
The study shows that the wavelet technique is a competitive approach compared to PCA and elastic net. Additionally, the reconstructions are more robust to perturbations arising during measurements. Many wavelets are known in the literature. Further studies will investigate the quality of reconstructions using different wavelets.

Author Contributions

Measurement methodology, development of a research project and supervision, writing—review and editing T.R.; development of the system concept, writing—original draft preparation, K.K.; development of the concept of machine learning methods and image reconstruction, E.K.; literature review, formal analysis, general review, and editing of the manuscript, funding acquisition, M.C.-W.; development of the numerical methods and techniques, P.B.; data analysis, preparation of descriptions in the manuscript, T.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Forson, A.; Comas, X.; Whitman, D. Integration of electrical resistivity imaging and ground penetrating radar to investigate solution features in the Biscayne Aquifer. J. Hydrol. 2014, 515, 129–138. [Google Scholar] [CrossRef]
  2. De Donno, G.; Di Giambattista, L.; Orlando, L. High-resolution investigation of masonry samples through GPR and electrical resistivity tomography. Constr. Build. Mater. 2017, 154, 1234–1249. [Google Scholar] [CrossRef]
  3. Bukowska-Belniak, B.; Borecka, A.; Leśniak, A. The continuous thermal imaging of the flood embankment to identify location of the leaks. In Proceedings of the 14th Quantitative InfraRed Thermography Conference, Berlin, Germany, 25–29 June 2018. [Google Scholar]
  4. Crawford, M.M.; Bryson, L.S.; Woolery, E.W.; Wang, Z. Using 2-D electrical resistivity imaging for joint geophysical and geotechnical characterization of shallow landslides. J. Appl. Geophys. 2018, 157, 37–46. [Google Scholar] [CrossRef]
  5. Lesparre, N.; Nguyen, F.; Kemna, A.; Robert, T.; Hermans, T.; Daoudi, M.; Flores-Orozco, A. A new approach for time-lapse data weighting in electrical resistivity tomography. Geophysics 2017, 82, E325–E333. [Google Scholar] [CrossRef]
  6. Hojat, A.; Ferrario, M.; Arosio, D.; Brunero, M.; Ivanov, V.; Longoni, L.; Madaschi, A.; Papini, M.; Tresoldi, G.; Zanzi, L. Laboratory Studies Using Electrical Resistivity Tomography and Fiber Optic Techniques to Detect Seepage Zones in River Embankments. Geosciences 2021, 11, 69. [Google Scholar] [CrossRef]
  7. Ghafoori, Y.; Vidmar, A.; Říha, J.; Kryžanowski, A. A Review of Measurement Calibration and Interpretation for Seepage Monitoring by Optical Fiber Distributed Temperature Sensors. Sensors 2020, 20, 5696. [Google Scholar] [CrossRef]
  8. Bossi, G.; Bersan, S.; Cola, S.; Schenato, L.; De Polo, F.; Menegazzo, C.; Boaga, J.; Cassiani, G.; Donini, F.; Simonini, P. Multidisciplinary analysis and modelling of a river embankment affected by piping. In European Working Group on Internal Erosion; Springer: Cham, Switzerland, 2018; pp. 234–244. [Google Scholar]
  9. Schenato, L. A Review of Distributed Fibre Optic Sensors for Geo-Hydrological Applications. Appl. Sci. 2017, 7, 896. [Google Scholar] [CrossRef]
  10. Bersan, S.; Koelewijn, A.R.; Simonini, P. Effectiveness of distributed temperature measurements for early detection of piping in river embankments. Hydrol. Earth Syst. Sci. Discuss. 2018, 22, 1491–1508. [Google Scholar] [CrossRef] [Green Version]
  11. Habel, W.R.; Krebber, K. Fiber-optic sensor applications in civil and geotechnical engineering. Photonics Sens. 2011, 1, 268–280. [Google Scholar] [CrossRef] [Green Version]
  12. Ghafoori, Y.; Maček, M.; Vidmar, A.; Říha, J.; Kryžanowski, A. Analysis of Seepage in a Laboratory Scaled Model Using Passive Optical Fiber Distributed Temperature Sensor. Water 2020, 12, 367. [Google Scholar] [CrossRef] [Green Version]
  13. Rymarczyk, T. Using electrical impedance tomography to monitoring flood banks. Int. J. Appl. Electromagn. Mech. 2014, 45, 489–494. [Google Scholar] [CrossRef]
  14. Jones, G.; Sentenac, P.; Zielinski, M. Desiccation cracking detection using 2-D and 3-D Electrical Resistivity Tomography: Validation on a flood embankment. J. Appl. Geophys. 2014, 106, 196–211. [Google Scholar] [CrossRef] [Green Version]
  15. Michta, E.; Szulim, R.; Sojka-Piotrowska, A.; Piotrowski, K. IoT-based flood embankments monitoring system. In Photonics Applications in Astronomy, Communications, Industry, and High Energy Physics Experiments 2017; SPIE: Bellingham, WA, USA, 2017; Volume 10445, p. 104455Y. [Google Scholar] [CrossRef]
  16. Sekuła, K.; Połeć, M.; Borecka, A. Innovative solutions in monitoring systems in flood protection. E3S Web Conf. Water Wastewater Energy Smart Cities 2018, 30, 01005. [Google Scholar] [CrossRef] [Green Version]
  17. Rymarczyk, T.; Kozłowski, E.; Kłosowski, G. Electrical impedance tomography in 3D flood embankments testing—Elastic net approach. Trans. Inst. Meas. Control 2020, 42, 680–690. [Google Scholar] [CrossRef]
  18. Rymarczyk, T.; Kłosowski, G. Application of neural reconstruction of tomographic images in the problem of reliability of flood protection facilities. Eksploat. Niezawodn.-Maint. Reliab. 2018, 20, 425–434. [Google Scholar] [CrossRef]
  19. Daniewski, K.; Kosicka, E.; Mazurkiewicz, D. Analysis of the correctness of determination of the effectiveness of maintenance service actions. Manag. Prod. Eng. Rev. 2018, 9, 20–25. [Google Scholar]
  20. Korzeniewska, E.; Sekulska-Nalewajko, J.; Gocawski, J.; Droż, T.; Kiebasa, P. Analysis of changes in fruit tissue after the pulsed electric field treatment using optical coherence tomography. EPJ Appl. Phys. 2020, 91, 30902. [Google Scholar] [CrossRef]
  21. Dusek, J.; Mikulka, J. Measurement-Based Domain Parameter Optimization in Electrical Impedance Tomography Imaging. Sensors 2021, 21, 2507. [Google Scholar] [CrossRef]
  22. Mosorov, V.; Rybak, G.; Sankowski, D. Plug Regime Flow Velocity Measurement Problem Based on Correlability Notion and Twin Plane Electrical Capacitance Tomography: Use Case. Sensors 2021, 21, 2189. [Google Scholar] [CrossRef]
  23. Szczesny, A.; Korzeniewska, E. Selection of the method for the earthing resistance measurement. Przegląd Elektrotech. 2018, 94, 178–181. [Google Scholar]
  24. Liu, S.; Huang, Y.; Wu, H.; Tan, C.; Jia, J. Efficient Multitask Structure-Aware Sparse Bayesian Learning for Frequency-Difference Electrical Impedance Tomography. IEEE Trans. Ind. Inform. 2021, 17, 463–472. [Google Scholar] [CrossRef] [Green Version]
  25. Wajman, R.; Banasiak, R.; Babout, L. On the Use of a Rotatable ECT Sensor to Investigate Dense Phase Flow: A Feasibility Study. Sensors 2020, 20, 4854. [Google Scholar] [CrossRef] [PubMed]
  26. Zou, H.; Hastie, T. Regularization and variable selection via the elastic net. J. R. Stat. Soc. Ser. B 2005, 67, 301–320. [Google Scholar] [CrossRef] [Green Version]
  27. Mevik, B.-H.; Wehrens, R. The pls Package: Principal Component and Partial Least Squares Regression in R. J. Stat. Softw. 2007, 18, 1–23. [Google Scholar] [CrossRef] [Green Version]
  28. Daubechies, I. Orthonormal Bases of Compactly Supported Wavelets. Commun. Pure Appl. Math. 1992, 41, 909–996. [Google Scholar] [CrossRef] [Green Version]
  29. Holder, D.S. Introduction to Biomedical Electrical Impedance Tomography Electrical Impedance Tomography Methods; History and Applications Bristol, Institute of Physics: Bristol, UK, 2005. [Google Scholar]
  30. Rymarczyk, T. New methods to determine moisture areas by electrical impedance tomography. Int. J. Appl. Electromagn. Mech. 2016, 52, 79–87. [Google Scholar] [CrossRef]
  31. Adler, A.; Lionheart, W. Uses and abuses of EIDORS: An extensible software base for EIT. Physiol. Meas. 2006, 27, S25–S42. [Google Scholar] [CrossRef] [Green Version]
  32. Dušek, J.; Hladký, D.; Mikulka, J. Electrical Impedance Tomography Methods and Algorithms Processed with a GPU. In Proceedings of the 2017 Progress in Electromagnetics Research Symposium—Spring (PIERS), St. Petersburg, Russia, 22–25 May 2017; pp. 1710–1714. [Google Scholar]
  33. Kozłowski, E.; Mazurkiewicz, D.; Żabiński, T.; Prucnal, S.; Sęp, J. Assessment model of cutting tool condition for real-time supervision system. Eksploat. Niezawodn.-Maint. Reliab. 2019, 21, 679–685. [Google Scholar] [CrossRef]
  34. Rymarczyk, T.; Kłosowski, G.; Hoła, A.; Sikora, J.; Wołowiec, T.; Tchórzewski, P.; Skowron, S. Comparison of Machine Learning Methods in Electrical Tomography for Detecting Moisture in Building Walls. Energies 2021, 14, 2777. [Google Scholar] [CrossRef]
  35. Rymarczyk, T.; Nita, P.; Vejar, A.; Woś, M.; Stefaniak, B.; Adamkiewicz, P. Wearable mobile measuring device based on electrical tomography. Przegląd Elektrotech. 2019, 95, 211–214. [Google Scholar]
  36. Chen, B.; Abascal, J.F.P.J.; Soleimani, M. Extended Joint Sparsity Reconstruction for Spatial and Temporal ERT Imaging. Sensors 2018, 18, 4014. [Google Scholar] [CrossRef] [Green Version]
  37. Rymarczyk, T.; Kłosowski, G.; Tchórzewski, P.; Cieplak, T.; Kozłowski, E. Area monitoring using the ERT method with multisensor electrodes. Przegląd Elektrotech. 2019, 95, 153–156. [Google Scholar] [CrossRef] [Green Version]
  38. Rybak, G.; Strzecha, K. Short-Time Fourier Transform Based on Metaprogramming and the Stockham Optimization Method. Sensors 2021, 21, 4123. [Google Scholar] [CrossRef] [PubMed]
  39. Voss, A.; Pour-Ghaz, M.; Vauhkonen, M.; Seppänen, A. Retrieval of the saturated hydraulic conductivity of cement-based materials using electrical capacitance tomography. Cem. Concr. Compos. 2020, 112, 103639. [Google Scholar] [CrossRef]
  40. Shi, X.; Tan, C.; Dong, F.; dos Santos, E.N.; da Silva, M.J. Conductance Sensors for Multiphase Flow Measurement: A Review. IEEE Sens. J. 2021, 21, 12913–12925. [Google Scholar] [CrossRef]
  41. Midura, M.; Wróblewski, P.; Wanta, D.; Domański, G.; Stosio, M.; Kryszyn, J.; Smolik, W.T. The system for complex magnetic susceptibility measurement of nanoparticles with 3d printed carcass for integrated receive coils. Inform. Autom. Pomiary Gospod. Ochr. Sr. 2021, 11, 4–9. [Google Scholar] [CrossRef]
  42. Sekulska-Nalewajko, J.; Gocławski, J.; Korzeniewska, E. A method for the assessment of textile pilling tendency using optical coherence tomography. Sensors 2020, 20, 3687. [Google Scholar] [CrossRef]
  43. Rzasa, M.; Czapla-Nielacna, B. Analysis of the Influence of the Vortex Shedder Shape on the Metrological Properties of the Vortex Flow Meter. Sensors 2021, 21, 4697. [Google Scholar] [CrossRef]
  44. Fiala, P.; Bartušek, K.; Dědková, J.; Dohnal, P. EMG field analysis in dynamic microscopic/nanoscopic models of matter. Inform. Autom. Pomiary Gospod. Ochr. Sr. 2019, 9, 4–10. [Google Scholar] [CrossRef]
  45. Kłosowski, G.; Rymarczyk, T.; Wójcik, D.; Skowron, S.; Adamkiewicz, P. The Use of Time-Frequency Moments as Inputs of LSTM Network for ECG Signal Classification. Electronics 2020, 9, 1452. [Google Scholar] [CrossRef]
  46. Fawcett, T. An introduction to ROC analysis. Pattern Recognit. Lett. 2006, 27, 861–874. [Google Scholar] [CrossRef]
  47. Hand, D.J.; Till, R.J. A simple generalisation of the area under the roc curve for multiple class classification problems. Mach. Learn. 2001, 45, 171–186. [Google Scholar] [CrossRef]
  48. Rymarczyk, T.; Kozłowski, E.; Kłosowski, G.; Niderla, K. Logistic Regression for Machine Learning in Process Tomography. Sensors 2019, 19, 3400. [Google Scholar] [CrossRef] [Green Version]
  49. Wehrens, R. Chemometrics with R. Multivariate Data Analysis in the Natural Science and Life Sciences; Springer: New York, NY, USA, 2011. [Google Scholar]
  50. Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction; Springer Science & Business Media: New York, NY, USA, 2009. [Google Scholar]
  51. James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning with Applications in R; Springer: New York, NY, USA, 2013. [Google Scholar]
  52. Tibshirani, R. Regression Shrinkage and Selection Via the Lasso. J. R. Stat. Soc. Ser. B 1996, 58, 267–288. [Google Scholar] [CrossRef]
  53. Friedman, J.; Tibshirani, R.; Hastie, T. Regularization paths for generalised linear models via coordinate descent. J. Stat. Softw. 2010, 33, 1–22. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Yan, X.; Su, X.G. Linear Regression Analysis; World Scientific Publishing Company: London, UK, 2009. [Google Scholar]
  55. Percival, D.B.; Walden, A. Wavelet Methods for Time Series Analysis; Cambridge University Press: Cambridge, UK, 2000; Volume 4. [Google Scholar]
  56. Walnut, D.F. An Introduction to Wavelet Analysis; Springer Nature: Basingstoke, UK, 2004. [Google Scholar]
Figure 1. Monitoring system based on electrical tomography.
Figure 1. Monitoring system based on electrical tomography.
Energies 14 08081 g001
Figure 2. Block diagram of the tomograph.
Figure 2. Block diagram of the tomograph.
Energies 14 08081 g002
Figure 3. Electric tomograph: (a) measuring unit, (b) communication and control unit, (c) inside view, and (d) panel.
Figure 3. Electric tomograph: (a) measuring unit, (b) communication and control unit, (c) inside view, and (d) panel.
Energies 14 08081 g003
Figure 4. Laboratory system with measuring electrodes during measurements with an electrical tomograph.
Figure 4. Laboratory system with measuring electrodes during measurements with an electrical tomograph.
Energies 14 08081 g004
Figure 5. Laboratory model with two multi-sensor electrodes during electrical tomography measurements.
Figure 5. Laboratory model with two multi-sensor electrodes during electrical tomography measurements.
Energies 14 08081 g005
Figure 6. Learning process.
Figure 6. Learning process.
Energies 14 08081 g006
Figure 7. Reconstruction process.
Figure 7. Reconstruction process.
Energies 14 08081 g007
Figure 8. Numerical model I of the flood embankment with 16 measuring electrodes.
Figure 8. Numerical model I of the flood embankment with 16 measuring electrodes.
Energies 14 08081 g008
Figure 9. Model I flood embankment with 16 electrodes, with water infiltration on the left side. Example 1: (a) the pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, and (d) logistic regression with wavelet decomposition.
Figure 9. Model I flood embankment with 16 electrodes, with water infiltration on the left side. Example 1: (a) the pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, and (d) logistic regression with wavelet decomposition.
Energies 14 08081 g009
Figure 10. ROC analysis for model I—example 1.
Figure 10. ROC analysis for model I—example 1.
Energies 14 08081 g010
Figure 11. Model I flood embankment with 16 electrodes, with water infiltration on the right side. Example 2: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, and (d) logistic regression with wavelet decomposition.
Figure 11. Model I flood embankment with 16 electrodes, with water infiltration on the right side. Example 2: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, and (d) logistic regression with wavelet decomposition.
Energies 14 08081 g011
Figure 12. ROC analysis for model I—example 2.
Figure 12. ROC analysis for model I—example 2.
Energies 14 08081 g012
Figure 13. Model I flood embankment with 16 electrodes, with water infiltration throughout the area. Example 3: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, and (d) logistic regression with wavelet decomposition.
Figure 13. Model I flood embankment with 16 electrodes, with water infiltration throughout the area. Example 3: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, and (d) logistic regression with wavelet decomposition.
Energies 14 08081 g013
Figure 14. ROC analysis for model I—example 3.
Figure 14. ROC analysis for model I—example 3.
Energies 14 08081 g014
Figure 15. Influence of disturbances of the reconstruction quality-model I: (a) Accuracy, (b) AUC.
Figure 15. Influence of disturbances of the reconstruction quality-model I: (a) Accuracy, (b) AUC.
Energies 14 08081 g015
Figure 16. A laboratory model of the flooded embankment with 16 measuring sensors.
Figure 16. A laboratory model of the flooded embankment with 16 measuring sensors.
Energies 14 08081 g016
Figure 17. Reconstruction based on real measurements-model I: (a) logistic regression with an elastic net, (b) logistic regression with PCA decomposition, (c) logistic regression with wavelet decomposition.
Figure 17. Reconstruction based on real measurements-model I: (a) logistic regression with an elastic net, (b) logistic regression with PCA decomposition, (c) logistic regression with wavelet decomposition.
Energies 14 08081 g017
Figure 18. Reconstruction based on real measurement - regression with an elastic net and different projections for model I-views: (a) front, (b) back, (c) side, (d) top.
Figure 18. Reconstruction based on real measurement - regression with an elastic net and different projections for model I-views: (a) front, (b) back, (c) side, (d) top.
Energies 14 08081 g018
Figure 19. Reconstruction based on real measurements—regression with PCA and different projections for model I-views: (a) front, (b) back, (c) side, (d) top.
Figure 19. Reconstruction based on real measurements—regression with PCA and different projections for model I-views: (a) front, (b) back, (c) side, (d) top.
Energies 14 08081 g019
Figure 20. Reconstruction based on real measurements—regression with different wavelet projections for model I-views: (a) front, (b) back, (c) side, (d) top.
Figure 20. Reconstruction based on real measurements—regression with different wavelet projections for model I-views: (a) front, (b) back, (c) side, (d) top.
Energies 14 08081 g020
Figure 21. Numerical model II of flood embankments with 32 measuring electrodes.
Figure 21. Numerical model II of flood embankments with 32 measuring electrodes.
Energies 14 08081 g021
Figure 22. Model II flood embankment with 32 electrodes, with water infiltration on the left side. Example 4: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, and (d) logistic regression with wavelet decomposition.
Figure 22. Model II flood embankment with 32 electrodes, with water infiltration on the left side. Example 4: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, and (d) logistic regression with wavelet decomposition.
Energies 14 08081 g022
Figure 23. ROC analysis for model II—example 4.
Figure 23. ROC analysis for model II—example 4.
Energies 14 08081 g023
Figure 24. Model II flood embankment with 32 electrodes, with water infiltration throughout the area. Example 5: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, and (d) logistic regression with wavelet decomposition.
Figure 24. Model II flood embankment with 32 electrodes, with water infiltration throughout the area. Example 5: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, and (d) logistic regression with wavelet decomposition.
Energies 14 08081 g024
Figure 25. ROC analysis for model II—example 5.
Figure 25. ROC analysis for model II—example 5.
Energies 14 08081 g025
Figure 26. Model II flood embankment with 32 electrodes, with water infiltration on the right side. Example 6: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, and (d) logistic regression with wavelet decomposition.
Figure 26. Model II flood embankment with 32 electrodes, with water infiltration on the right side. Example 6: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, and (d) logistic regression with wavelet decomposition.
Energies 14 08081 g026
Figure 27. ROC analysis for model II—example 6.
Figure 27. ROC analysis for model II—example 6.
Energies 14 08081 g027
Figure 28. Influence of disturbances on the reconstruction quality—model II.
Figure 28. Influence of disturbances on the reconstruction quality—model II.
Energies 14 08081 g028
Figure 29. Reconstruction based on real measurements—model II: (a) logistic regression with an elastic net, (b) logistic regression with PCA decomposition, and (c) logistic regression with wavelet decomposition.
Figure 29. Reconstruction based on real measurements—model II: (a) logistic regression with an elastic net, (b) logistic regression with PCA decomposition, and (c) logistic regression with wavelet decomposition.
Energies 14 08081 g029
Figure 30. Reconstruction based on real measurements—regression with an elastic net and different projections for model II-views: (a) front, (b) back, (c) side, (d) top.
Figure 30. Reconstruction based on real measurements—regression with an elastic net and different projections for model II-views: (a) front, (b) back, (c) side, (d) top.
Energies 14 08081 g030
Figure 31. Reconstruction based on real measurements—regression with PCA and different projections for model II-views: (a) front, (b) back, (c) side, (d) top.
Figure 31. Reconstruction based on real measurements—regression with PCA and different projections for model II-views: (a) front, (b) back, (c) side, (d) top.
Energies 14 08081 g031
Figure 32. Reconstruction based on real measurements—regression with different wavelet projections for model II-views: (a) front, (b) back, (c) side, (d) top.
Figure 32. Reconstruction based on real measurements—regression with different wavelet projections for model II-views: (a) front, (b) back, (c) side, (d) top.
Energies 14 08081 g032
Figure 33. Numerical model III of flood embankments with depth sensors.
Figure 33. Numerical model III of flood embankments with depth sensors.
Energies 14 08081 g033
Figure 34. Model III with depth sensors, with water infiltration inside the area. Example 7: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, (d) logistic regression with wavelet decomposition.
Figure 34. Model III with depth sensors, with water infiltration inside the area. Example 7: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, (d) logistic regression with wavelet decomposition.
Energies 14 08081 g034
Figure 35. ROC analysis for model III—example 7.
Figure 35. ROC analysis for model III—example 7.
Energies 14 08081 g035
Figure 36. Model III with depth sensors, with water infiltration on the right side. Example 8: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition (d) logistic regression with wavelet decomposition.
Figure 36. Model III with depth sensors, with water infiltration on the right side. Example 8: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition (d) logistic regression with wavelet decomposition.
Energies 14 08081 g036
Figure 37. ROC analysis for model III—example 8.
Figure 37. ROC analysis for model III—example 8.
Energies 14 08081 g037
Figure 38. Model III with depth sensors, with water infiltration at the top. Example 9: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, and (d) logistic regression with wavelet decomposition.
Figure 38. Model III with depth sensors, with water infiltration at the top. Example 9: (a) pattern, (b) logistic regression with an elastic net, (c) logistic regression with PCA decomposition, and (d) logistic regression with wavelet decomposition.
Energies 14 08081 g038
Figure 39. ROC analysis for model III—example 9.
Figure 39. ROC analysis for model III—example 9.
Energies 14 08081 g039
Figure 40. Influence of disturbances into the reconstruction quality—model III.
Figure 40. Influence of disturbances into the reconstruction quality—model III.
Energies 14 08081 g040
Figure 41. Laboratory model with depth sensors-32 measuring electrodes.
Figure 41. Laboratory model with depth sensors-32 measuring electrodes.
Energies 14 08081 g041
Figure 42. Reconstruction based on real measurements—model III: (a) logistic regression with an elastic net, (b) logistic regression with PCA decomposition, and (c) logistic regression with wavelet decomposition.
Figure 42. Reconstruction based on real measurements—model III: (a) logistic regression with an elastic net, (b) logistic regression with PCA decomposition, and (c) logistic regression with wavelet decomposition.
Energies 14 08081 g042
Figure 43. Heatmap of the correlation between the attributes.
Figure 43. Heatmap of the correlation between the attributes.
Energies 14 08081 g043
Figure 44. The exponential relationship between the contribution to the explanation of feature variance and the principal component number.
Figure 44. The exponential relationship between the contribution to the explanation of feature variance and the principal component number.
Energies 14 08081 g044
Figure 45. Pattern and reconstruction results obtained by the machine learning and deterministic methods for the model with depth electrodes.
Figure 45. Pattern and reconstruction results obtained by the machine learning and deterministic methods for the model with depth electrodes.
Energies 14 08081 g045
Figure 46. Pattern and reconstruction results obtained by machine learning and deterministic methods-model with surface electrodes.
Figure 46. Pattern and reconstruction results obtained by machine learning and deterministic methods-model with surface electrodes.
Energies 14 08081 g046
Table 1. Main advantages and disadvantages of the different methods used in the study of dikes.
Table 1. Main advantages and disadvantages of the different methods used in the study of dikes.
AdvantagesDisadvantages
Point measurementsSimple solution.
Low costs.
Constant monitoring.
No spatial analysis of the study site.
Invasive.
ThermographyAbility to quickly analyze the external shaft.
Low cost.
Non-invasive.
No information on damage, internal seepage.
Unit measurements.
Geophysical methodsLarge area analysis/
Non-invasive.
Low efficiency in testing for leakage and shaft saturation.
Ground penetrating radarQuick and easy measurements.
Low costs.
Non-invasive.
Proper development of radarograms is very complicated.
Optical methodsAbility to analyze the dike along its course.
Continuous monitoring.
No information on area changes.
Electrical Resistance Tomography deviceAbility to analyze 2D/3D parts of the shaft with the ERT device.
Non-invasive.
Using only built-in measurement and reconstruction methods.
Unit measurements.
Distributed electrical impedance tomography systemArea ability analyze any portion of the levee or all of it through a distributed cyber-physical system.
Continuous monitoring.
Non-invasive.
High costs.
Table 2. Positive and negative prediction.
Table 2. Positive and negative prediction.
PositiveNegative
PositivePredictionTPFP
NegativePredictionFNTN
Table 3. The basic characteristic of reconstruction for model I—example 1.
Table 3. The basic characteristic of reconstruction for model I—example 1.
ElasticnetPCAWave
Sensitivity0.96979540.98083790.9632998
Specificity0.99847090.99978160.9912626
Pos Pred Value0.99766120.99966900.9866933
Neg Pred Value0.98006000.98727350.9757041
Precision0.99766120.99966900.9866933
Recall0.96979540.98083790.9632998
F10.98353100.99016390.9748562
Prevalence0.40211570.40211570.4021157
Detection Rate0.38997000.39441030.3873580
Detection Prevalence0.39088420.39454090.3925820
Balanced Accuracy0.98413320.99030970.9772812
AUC0.99957410.99989880.9980552
Table 4. The basic characteristic of the reconstruction for model I—example 2.
Table 4. The basic characteristic of the reconstruction for model I—example 2.
ElasticnetPCAWave
Sensitivity0.97462220.99230110.9751925
Specificity0.99662650.99759040.9795181
Pos Pred Value0.99592070.99713470.9757489
Neg Pred Value0.97893490.99352050.9790462
Precision0.99592070.99713470.9757489
Recall0.97462220.99230110.9751925
F10.98515640.99471200.9754706
Prevalence0.45801230.45801230.4580123
Detection Rate0.44638890.45448610.4466501
Detection Prevalence0.44821730.45579210.4577511
Balanced Accuracy0.98562430.99494570.9773553
AUC0.99947110.99994360.9983900
Table 5. The basic characteristic of the reconstruction for model I—example 3.
Table 5. The basic characteristic of the reconstruction for model I—example 3.
ElasticnetPCAWave
Sensitivity1.00000000.99890630.9963544
Specificity0.86589340.92775740.8858364
Pos Pred Value0.80629040.88529890.8296903
Neg Pred Value1.00000000.99934240.9977080
Precision0.80629040.88529890.8296903
Recall1.00000000.99890630.9963544
F10.89275830.93867760.9054166
Prevalence0.35823430.35823430.3582343
Detection Rate0.35823430.35784250.3569283
Detection Prevalence0.44429930.40420530.4301946
Balanced Accuracy0.93294670.96333190.9410954
AUC0.99650070.99841470.9947318
Table 6. The basic characteristics of the reconstruction for model II—example 4.
Table 6. The basic characteristics of the reconstruction for model II—example 4.
ElasticnetPCAWave
Sensitivity1.00000001.00000001.0000000
Specificity0.76198530.91257760.7507050
Pos Pred Value0.95008280.98106750.9478405
Neg Pred Value1.00000001.00000001.0000000
Precision0.95008280.98106750.9478405
Recall1.00000001.00000001.0000000
F10.97440250.99044330.9732219
Prevalence0.81917390.81917390.8191739
Detection Rate0.81917390.81917390.8191739
Detection Prevalence0.86221320.83498220.8642529
Balanced Accuracy0.88099270.95628880.8753525
AUC0.99953320.99986100.9978940
Table 7. The basic characteristics of the reconstruction for model II—example 5.
Table 7. The basic characteristics of the reconstruction for model II—example 5.
ElasticnetPCAWave
Sensitivity0.99128960.98937750.9923518
Specificity0.97842290.98568070.9354649
Pos Pred Value0.97696820.98456660.9342000
Neg Pred Value0.99184730.99014780.9925078
Precision0.97696820.98456660.9342000
Recall0.99128960.98937750.9923518
F10.98407680.98696620.9623983
Prevalence0.48006120.48006120.4800612
Detection Rate0.47587970.47496180.4763896
Detection Prevalence0.48709840.48240690.5099439
Balanced Accuracy0.98485620.98752910.9639084
AUC0.99948050.99949520.9978997
Table 8. The basic characteristics of the reconstruction for model II—example 6.
Table 8. The basic characteristics of the reconstruction for model II—example 6.
ElasticnetPCAWave
Sensitivity0.97055730.98264980.9613565
Specificity1.00000000.99733380.9951675
Pos Pred Value1.00000000.99573790.9921324
Neg Pred Value0.98167840.98909270.9759765
Precision1.00000000.99573790.9921324
Recall0.97055730.98264980.9613565
F10.98505870.98915060.9765020
Prevalence0.38796530.38796530.3879653
Detection Rate0.37654260.38123410.3729730
Detection Prevalence0.37654260.38286590.3759306
Balanced Accuracy0.98527870.98999180.9782620
AUC0.99983130.99984970.9993635
Table 9. The basic characteristics of the reconstruction for model III—example 7.
Table 9. The basic characteristics of the reconstruction for model III—example 7.
ElasticnetPCAWave
Sensitivity0.99932901.00000000.9793659
Specificity0.98783930.99367640.9857963
Pos Pred Value0.97944760.98921340.9756016
Neg Pred Value0.99960621.00000000.9880070
Precision0.97944760.98921340.9756016
Recall0.99932901.00000000.9793659
F10.98928840.99457750.9774801
Prevalence0.36705670.36705670.3670567
Detection Rate0.36681030.36705670.3594828
Detection Prevalence0.37450740.37105910.3684729
Balanced Accuracy0.99358410.99683820.9825811
AUC0.99985430.99998990.9992118
Table 10. The basic characteristics of the reconstruction for model III—example 8.
Table 10. The basic characteristics of the reconstruction for model III—example 8.
ElasticnetPCAWave
Sensitivity0.96738380.98716170.9906315
Specificity0.98701790.98424970.9722222
Pos Pred Value0.97619050.97181900.9515081
Neg Pred Value0.98214290.99287430.9947260
Precision0.97619050.97181900.9515081
Recall0.96738380.98716170.9906315
F10.97176720.97943020.9706757
Prevalence0.35492610.35492610.3549261
Detection Rate0.34334980.35036950.3516010
Detection Prevalence0.35172410.36052960.3695197
Balanced Accuracy0.97720090.98570570.9814269
AUC0.99893090.99947470.9992595
Table 11. The basic characteristics of the reconstruction for model III—example 9.
Table 11. The basic characteristics of the reconstruction for model III—example 9.
ElasticnetPCAWave
Sensitivity0.99813360.99602370.9982959
Specificity0.85626760.97932090.8593311
Pos Pred Value0.95623110.99344390.9571306
Neg Pred Value0.99318920.98738740.9937998
Precision0.95623110.99344390.9571306
Recall0.99813360.99602370.9982959
F10.97673310.99473220.9772799
Prevalence0.75880540.75880540.7588054
Detection Rate0.75738920.75578820.7575123
Detection Prevalence0.79205670.76077590.7914409
Balanced Accuracy0.92720060.98767230.9288135
AUC0.99810090.99959300.9978249
Table 12. Contributions of the components in explaining the variability of the features.
Table 12. Contributions of the components in explaining the variability of the features.
Percentage of VarianceCumulative Percentage of Variance
comp 175.8875.88
comp 210.4886.37
comp 34.8791.24
comp 43.2694.50
comp 51.3095.80
comp 61.1996.99
comp 70.6097.58
comp 80.5098.09
comp 90.3098.39
comp 100.2598.64
Table 13. Matrix of the prediction accuracy measures.
Table 13. Matrix of the prediction accuracy measures.
MethodsTSSMSEMAEMAPE
Logistic regression with Elasticnet788.3490.0490.0834.201
Logistic regression with PCA425.7860.0260.0281.705
Logistic regression with wavelet611.7910.0380.0493.453
Tikhonov regularization23,993.5771.4770.68928.752
Total variation7051.6890.4340.37924.462
Table 14. Matrix of the prediction accuracy measures.
Table 14. Matrix of the prediction accuracy measures.
MethodsTSSMSEMAEMAPE
Logistic regression with Elasticnet3848.0010.3920.28819.663
Logistic regression with PCA2255.1860.2300.14010.586
Logistic regression with wavelet4683.4200.4780.26211.266
Tikhonov regularization6118.0220.6240.35317.557
Total variation1852.5410.1890.1589.344
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Rymarczyk, T.; Król, K.; Kozłowski, E.; Wołowiec, T.; Cholewa-Wiktor, M.; Bednarczuk, P. Application of Electrical Tomography Imaging Using Machine Learning Methods for the Monitoring of Flood Embankments Leaks. Energies 2021, 14, 8081. https://doi.org/10.3390/en14238081

AMA Style

Rymarczyk T, Król K, Kozłowski E, Wołowiec T, Cholewa-Wiktor M, Bednarczuk P. Application of Electrical Tomography Imaging Using Machine Learning Methods for the Monitoring of Flood Embankments Leaks. Energies. 2021; 14(23):8081. https://doi.org/10.3390/en14238081

Chicago/Turabian Style

Rymarczyk, Tomasz, Krzysztof Król, Edward Kozłowski, Tomasz Wołowiec, Marta Cholewa-Wiktor, and Piotr Bednarczuk. 2021. "Application of Electrical Tomography Imaging Using Machine Learning Methods for the Monitoring of Flood Embankments Leaks" Energies 14, no. 23: 8081. https://doi.org/10.3390/en14238081

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop