1. Introduction
Pipelines are important channels of oil and gas transportation in many developing countries. They are made of ferromagnetic materials which are often vulnerable to corrosion and fatigue damages caused by surrounding environmental effects [
1]. Early damage detection of buried steel pipelines is necessary to confirm safety and reliability during service conditions. To achieve such goals, the primary aim of this study is to develop a non-contact geomagnetic probe using basic principles of magnetic gradient tensor [
2,
3] to detect the geomagnetic field signals. Non-contact geomagnetic detection [
4] is a new kind of non-destructive testing (NDT) technique that needs the Earth’s magnetic field as the stimulus source to locate buried ferromagnetic pipelines and achieves structural defect information, i.e., crack, corrosion and dents, etc., without any excavation. However, the magnetic field data recorded for large scale systems like buried pipelines are often contaminated by several factors when in their service environment, such as: the interference of parallel communication lines along the pipeline in heavy traffic areas, unusual disturbance often caused by the underground subway passages and overhead high voltage lines. Considerable attention in developing such non-parametric methods that can perform quick real-time assessment of the 3-axes magnetic field data is required towards safety and integrity of pipelines.
Several signal processing techniques have been considered in the literature for modal identification and flaw detection. They can perform efficient computation and adaptive implementation such as wavelet transform [
5,
6,
7] and empirical mode decomposition (EMD) also named Hilbert–Huang Transform (HHT) [
8,
9,
10,
11] to analyze non-stationary and nonlinear signals. Adjustment of algorithm parameters requires expert’s attention for successful applications for example, in wavelet transform careful selection of wavelet basis and the scales is important during processing of data. Similarly, the abilities of empirical mode decomposition method are often influenced by the sifting processes and selection of valid modes. In addition, measurement noise presents a challenge to their effectiveness.
Recently, blind source separation (BSS) techniques have been used as promising signal analysis tools in various fields of science [
12,
13,
14]. BSS based algorithms such as independent component analysis (ICA) [
15] and second-order blind identification (SOBI) [
16,
17] are computational methods used for separating a multivariate signal into their individual subcomponents. These methods were applied in structural dynamics for the first time in Reference [
18] to conduct output-only modal identification of structures. Furthermore, comparisons of the BSS based techniques and their limitations for modal analysis have been investigated in References [
14,
19,
20,
21]. BSS techniques are non-parametric data-driven algorithms, which extract modal features and perform structural assessment directly from the available data. BSS based methods are computationally efficient compared to the parametric methods that do not need a mathematical model to describe the physical performance of a system. Unlike parametric methods there is no need to adjust any parameter which keeps away the challenges like modal order problem [
22]. Advances in blind source separation (BSS) techniques for modal identification and damage detection have offered novel opportunities to develop new data-driven methodologies for efficient and effective sensing and processing of large scale health monitoring data.
This study presents a time-domain output data identification model for pipeline magnetic field data using the unsupervised blind source separation technique termed complexity pursuit (CP) [
23] that was independently formulated in Reference [
24]. CP learning algorithms have been successfully applied for system identification and damage detection in References [
22,
25,
26]. The main contribution of this paper is to apply the CP algorithms to the pipelines noisy magnetic field data, towards an accurate time-based modal identification. These non-parametric data driven algorithms has the ability to perform quick even real time, automatic sensing and processing of the large scale pipeline’s magnetic field data sets, that can provide excellent grounds for the inspection of buried ferromagnetic pipelines. The 3-axis magnetic field sensor data are fed as input into the blind source separation model where the complexity pursuit algorithms are applied for an accurate extraction of mode matrix that is then plotted to obtain the time-domain output modal responses. The frequency and damping ratio are computed from the recovered modal responses using Fourier transform and logarithm-decrement technique, respectively. The power spectral densities calculated from the recovered mode matrix show the abrupt variation in frequency due to the defects occurring in the pipeline. A numerical study for multi-degree of freedom systems and detailed indoor and outdoor experimental results show the ability of the non-parametric CP-BSS learning algorithms to accurately extract time-based modal information of the pipeline structures.
3. Stone’s Theorem for Solution of BSS Problem
Stone [
23] proposed that under the effect of some physical laws, the moment of mass in a given time produces possible sources in a system. Likewise, the measured system responses also contains least complex sources, each source is created under the influence of certain physical law. Summarizing, the complexity of a mixture of response signals can be found among the simplest and the most composite constituent sources. This theory was proved in Reference [
27].
This conclusion laid the foundation that source signals are the least complex signals that can be separated from the measured mixture of signals. Therefore, complexity pursuit algorithms search for source signals with least complexity, such that the “hidden” source component
which is obtained by multiplying the mixture
with the demixing row vector
which is the least complex signal.
This approach has been used as a solution towards the BSS problem.
Stone [
23] concluded that the complexity of a signal can be measured by maximizing the temporal predictability of a signal; the mathematical equation for temporal predictability is written by Reference [
23] as:
is the temporal predictability operator that contains the statistical and time-based information of the hidden source signal , that can be measured by finding the logarithmic ratio of .
The term
determines the global statistical information of signal
by computing the ‘overall variability’, estimated by a long-range prediction
. Similarly,
determines the local variance that calculates the time-based information [
23], using a short-range prediction parameter
on the temporal structure of
. The filtration process is performed by the long range prediction parameter and short-range prediction parameter, mathematically expressed as,
where
is termed as a half-life parameter for
and
as long as
[
23].
The significance of the proposed algorithm is to extract the hidden sources with accurate time-based structure. A careful selection of parameters is essential to predict a component with reduced local variance (smoothness) as compared with its global (long-range) variance, as the increase of the global statistical information only will produce a high variance signal, while increasing only will produce a smooth DC signal.
4. System Identification by CP
Combining Equation (2) and Equation (3)
where
and
are the
short-range and long-range covariance matrices among the mixtures, respectively. The elements of these matrices are given by
The matrices and are calculated only once and the terms and are calculated by fast convolution operations. For a given mixture of signals , the complexity pursuit algorithm calculates the de-mixing vector by maximizing the temporal predictability function ;
The derivative of
with respect to
is given by
Using the gradient ascent technique a maximum value of
can be obtained by repeatedly updating
such that the extracted component
, which is “most predictable” is considered as the least complex signal or the simplest source hidden in the mixtures [
23].
Considering the uncertainties of the proposed CP model for extracting only one simplest source can be solved by the deflation scheme [
22]. The sources are simultaneously extracted one after another using Gram–Schmidt de-correlation technique. The first step is to separate the most simplest source present in the mixture, after removing the first source the currently simplest source becomes the second one to be separated by the complexity pursuit algorithm and so on. The solution for gradient of
approaches zero such that,
Equation (9) has the form of a generalized eigenproblem;
can be found as the eigenvectors of matrix
, with corresponding eigenvalue
, [
23]. The de-mixing matrix
is calculated using a generalized eigenvalue routine. All the source signals can be separated by:
is the recovered source matrix with row-wise source signals .
5. Modal Parameters Estimated by CP
The governing equation of motion for a linear time invariant system is given by
where
is mass,
is damping matrix and
is the stiffness matrix, all real valued and symmetric.
is the displacement vector, which are actually the measured system responses.
is the external force acting on the system.
The connection of blind source separation with output modal identification was solved for the first time in Reference [
18], as in the BSS model in Equation (1), the modes of a system can be expanded as a linear combination of n number of modal responses: that can be expressed in Equation (12) as:
The basic phenomenon of Equation (12) is same as in Equation (1),
Φ contains the modal information of a system describing the entire situation of a noise contaminated system.
being the (mode shape) is related to the
ith modal feature column of the mode matrix, and is related with the
ith modal response
of the modal response vector
.
is actually the original source signal that can be obtained by multiplying the inverse of the mode matrix with the
matrix of the measured system responses.
is used to identify the change in the normal condition of a system as an abrupt variation in the modal feature. Therefore, plotting the mode shapes can give important information about the damage occurring in the system under observation.
The idea of “virtual sources” in Reference [
18] states that the recovered modal responses of a system should be considered as independent sources, if the power spectral density is not same or the frequencies are not able to be judged clearly. In such cases the mixing matrix matches with the recovered modal matrix, consequently the hidden sources and unidentified mixing matrix can be obtained by putting the measured system responses from the expanded model in Equation (12) as known mixtures into the blind source separation framework in Equation (1); accordingly the desired modal responses and mode matrix can be achieved.
Equation (12) can be used to classify the motion of a system by its mode matrix
as it provides complete information about the linear system. Putting Equation (12) into Equation (11) and multiplying the transpose of the mode matrix
on both sides,
yields to
where
is the diagonal real-valued modal mass matrix,
is the damping matrix, and
is the stiffness matrix.
is the modal force vector. A multi-DOF system can be decoupled into
n-DOF systems whose motions are given by
Equation (16) defines the basic idea of the complexity pursuit algorithm by targeting the motion of the decoupled single degree of freedom system on ith modal coordinate . The modal parameters of the system i.e. damping ratio is calculated by and resonant frequency of the system is calculated in terms of natural frequency of the ith mode given by .
In free excitation, i.e.,
, the modal responses behave like exponentially decaying sinusoids. The motion of mass at
ith modal coordinate governed by Equation (16) can be written as
The measured mixtures are linear combinations of these modal responses, written as
where
and
are some constants determined by initial conditions.
In case of random excitation, the recovered modal responses are dominant over the measured system response, producing randomly modulated exponentially decaying sinusoids with an envelope function
at the
ith mode [
18],
Hence, the measured responses are given by
In case of highly damped systems with complex valued mode matrix the complexity pursuit algorithms can separate the system into their respective modes until the intrinsic frequency and damping property of the system does not change. In such case the physical system in Equation (12) can be decoupled into Equation (17) in the state-space by the excitation mode matrix
, as well as the modal responses
. Therefore, using Stone’s algorithm the measured mixture
consisting of time based modal responses
can be subsequently separated by CP–BSS model,
and the excitation mode matrix can be estimated by
The frequency and damping ratio can be readily computed from the recovered time-domain modal response
using Fourier transform and logarithm-decrement technique, respectively. Yang and Nagarajaiah [
22] solved the modal order problem in complexity pursuit based blind source separation method by arranging the recovered modes ordering the frequency values, i.e., the first mode can be identified by the modal response with smallest frequency and so on.
6. Numerical Simulations
Using the CP based BSS algorithm, numerical examples are conducted on a 3-DOF system shown in
Figure 2 including different levels of damping.
The system parameters are adjusted to classify different modal identification problems i.e. proportional damping well-separated mode, closely spaced mode and complex mode. Free excitation and random excitation in each case are discussed. Gaussian White Noise (GWN) is used to produce stationary random excitation. Similarly the Gaussian White Noise (GWN) is modulated with a constant exponential decay function to create a non-stationary excitation effect in the system. The time histories of the system responses, i.e., the displacement vector, are calculated by the Newmark-Beta solver. The sampling frequency is set to 10 Hz.
The parameters of complexity pursuit based blind source separation method remains the same throughout the process. The long-range parameter
= 900,000 and short-range parameter
= 1 are taken same as given by Reference [
23]. Fast convolution operations are performed to calculate the long-range and short-range covariance matrices. The demixing matrix which is the eigenvector matrix is calculated by conducting eigenvalue decomposition on the obtained covariance matrices. The excitation mode matrix and the time-domain modal responses are calculated by Equations (21) and (22) respectively. Fourier transform algorithms and logarithm-decrement technique are used to calculate frequency and damping ratio respectively.
A modal assurance criterion is defined in Equation (23) to evaluate the correlation among the recovered mode values
and the theoretical mode values
Ranging from 0 to 1, where 0 means no correlation and 1 indicates perfect correlation.
6.1. Proportional Damping
The parameters of the system shown in
Figure 2 are borrowed from Reference [
18]. In case of proportional damping
is the mass matrix,
is the stiffness matrix and
is the damping matrix. The value of
corresponds to different damping level, (
and
).
in free excitation with initial condition
and
. In case of random excitation the system is excited at the 2nd and 3rd DOFs using stationary and non-stationary Gaussian white noise.
Table 1 and
Table 2 show the obtained results by CP algorithms and the modal assurance criterion values respectively.
Figure 3 shows the measured responses of a 3DOF linear system for
in free excitation proportional damping. After applying CP-BSS the estimated modes in free excitation are given in
Figure 4 and
Figure 5, respectively. The order of the recovered modal responses in each case is not rearranged to show the original results by CP model; (for example the Mode 1 in
Figure 4 simply means the first mode recovered by CP algorithm, not suggesting Mode #1). This is due to modal order problem that can be solved by rearranging the frequency values. Thus it can be observed that the measured responses of the 3DOF system are well-separated into their respective modes. The frequency of the separated modes can be observed from the power spectral densities of the estimated modes.
6.2. Effect of Noise
The calculated signal responses are now contaminated by adding zero-mean Gaussian white noise (with a 10% of the original signal). The results for
in free excitation are shown in
Table 3, addition of noise has no influence on the output of the CP model. The same accuracy has been seen in cases with various damping levels. This means that the CP algorithm provides healthy outputs for noise added signals also.
6.3. Closely Spaced Modes
The complexity pursuit model was implemented on measured system responses of a 3DOF system with closely spaced modes. The mass, stiffness and damping matrix were obtained by modifying the high proportional damping matrix in Reference [
21],
The initial conditions used in free excitation are changed to
and
, the remaining parameters were left the same as used in proportional damping case. Fairly accurate modal identification results are obtained in closely spaced modes shown in
Table 4 and
Table 5, respectively.
A highly damped system
with closely spaced mode in free excitation is shown in
Figure 6. The closely spaced 2nd and 3rd modes of the system responses that can be hardly judged in the power spectral densities are clearly decoupled by the CP model as shown in
Figure 7.
6.4. Non-Proportional Damping
The parameters of a system under non-proportional damping are given as follows,
The model was obtained by slightly changing the damping matrix used in Reference [
21] that results in complex modes. McNeil and Zimmerman [
21] presented a standard method to transform the complex modes into real ones due to the output of the complexity pursuit model to provide a real value demixing matrix.
Table 6 and
Table 7 show the identification results. Fairly well comparison can be seen among the identified and the theoretical results. Equation (23) can be used to evaluate the accuracy of the identified mode shapes. The system responses and recovered modal responses are shown in
Figure 8 and
Figure 9, respectively. Little influence on the output of the CP algorithm has been seen in case of non-proportional damping; still it offers better approximation compared to the theoretical values for complex modes.
6.5. Modal Identification of a 12-DOF System
The performance of complexity pursuit algorithms is further extended towards large scale structures, a 12-DOF system is built up such that the values of constant mass matrix
are given by;
,
,
and values of stiffness matrix
are
and damping matrix is calculated by
with
where
is the damping ratio. The first mode has a theoretical damping ratio of 4.46%. The frequencies of the 12 modes are distributed between 5.3505 and 44.5827 Hz; with a sampling frequency set to 1000 Hz. The system is excited at the 12th DOF, and the time histories of the signals with 5000 samples are measured (the length of the signal can be increased and the accuracy holds). Modal assurance criterion (MAC) values for all 12 modes under different conditions are shown in
Table 8. A high correlation among the approximated modes can be seen with (MAC) values above 0.99 for all damping levels.