Next Article in Journal
Dynamic and Full-Time Acquisition Technology and Method of Ice Data of Yellow River
Next Article in Special Issue
Power Transformer Voltages Classification with Acoustic Signal in Various Noisy Environments
Previous Article in Journal
Visual Perceptual Quality Assessment Based on Blind Machine Learning Techniques
Previous Article in Special Issue
Bearing Fault Diagnosis Using Multidomain Fusion-Based Vibration Imaging and Multitask Learning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multistage Centrifugal Pump Fault Diagnosis Using Informative Ratio Principal Component Analysis

1
Department of Electrical, Electronics and Computer Engineering, University of Ulsan, Ulsan 44610, Korea
2
Predictive Diagnosis Technology Cooperation, Ulsan 44610, Korea
*
Author to whom correspondence should be addressed.
Sensors 2022, 22(1), 179; https://doi.org/10.3390/s22010179
Submission received: 2 December 2021 / Revised: 25 December 2021 / Accepted: 26 December 2021 / Published: 28 December 2021
(This article belongs to the Special Issue Sensing Technologies for Fault Diagnostics and Prognosis)

Abstract

:
This study proposes a fault diagnosis method (FD) for multistage centrifugal pumps (MCP) using informative ratio principal component analysis (Ir-PCA). To overcome the interference and background noise in the vibration signatures (VS) of the centrifugal pump, the fault diagnosis method selects the fault-specific frequency band (FSFB) in the first step. Statistical features in time, frequency, and wavelet domains were extracted from the fault-specific frequency band. In the second step, all of the extracted features were combined into a single feature vector called a multi-domain feature pool (MDFP). The multi-domain feature pool results in a larger dimension; furthermore, not all of the features are best for representing the centrifugal pump condition and can affect the condition classification accuracy of the classifier. To obtain discriminant features with low dimensions, this paper introduces a novel informative ratio principal component analysis in the third step. The technique first assesses the feature informativeness towards the fault by calculating the informative ratio between the feature within the class scatteredness and between-class distance. To obtain a discriminant set of features with reduced dimensions, principal component analysis was applied to the features with a high informative ratio. The combination of informative ratio-based feature assessment and principal component analysis forms the novel informative ratio principal component analysis. The new set of discriminant features obtained from the novel technique are then provided to the K-nearest neighbor (K-NN) condition classifier for multistage centrifugal pump condition classification. The proposed method outperformed existing state-of-the-art methods in terms of fault classification accuracy.

1. Introduction

The multistage centrifugal pump (MCP) converts electrical energy to mechanical energy for industrial processes [1]. MCP is the type of centrifugal pump in which multiple impellers are fitted in series and the fluid flows through the series of impellers. A survey conducted on 437 defective MCPs revealed that the industry went through a maintenance downtime of 6128 h because of the lack of intelligent FD, resulting in a cost of USD 50 million [2]. Defects in the MCP can be categorized into soft defects and hard defects. Hard defects occur abruptly and cause the MCP to stop unexpectedly. Hard defects are physically identifiable and can be addressed by inexpensive and simple analysis; however, soft defects are dangerous as they slowly affect the MCP performance. Thus, soft defects need to be identified quickly using intelligent FD [3]. Mechanical seal (MS)-related defects are responsible for 34% of the MCP soft defects. A defective MS results in MCP soft defects such as shaft wear, flushing of fluid, and fretting, etc. Furthermore, a defective impeller can cause hydraulic soft defects and mechanical soft defects [4,5]. To reduce downtime and cost for MCP maintenance, this paper considers the early fault diagnosis of soft defects because of a MS hole (MSH), MS scratch (MSS), and impeller defect (ID).
A change in the stiffness of the mechanical structure because of a mechanical defect produces an impulse in the VS. Therefore, VS can be used for the condition monitoring of MCP [6]. The impulses in the VS because of a mechanical defect occur at a specific frequency in the Fourier spectrum (FS). However, because of the low energy of the fault impulses, they are often obscured by interference and background noises [7]. Andrei et al. [8] discriminated the bearing fault harmonics from interference noises using the fault-oriented window series of a Gaussian mixture model. However, techniques based on narrowband demodulation are unable to discriminate between interference noise and fault impulses [9,10]. Furthermore, the VS under a defect changes its statistical characteristic over time indicating they are highly complex and nonstationary. Fourier transforms are ideal for stationary signals. To address these concerns, Viet et al. [11] proposed a blind source separation (BSS) technique-based acoustic signal denoising. However, the BSS requires a baseline signal for noise reduction in the subsequent signal. For complex VS, time-frequency domain (TFD) transforms have significant advantages. The TFD wavelet transform (WT) is sensitive to non-stationary defect impulses [12,13,14]. Rapur and Tiwari [15] preprocessed the VS of the MCP using WT and extracted statistical features (SF) for MCP fault diagnosis. For WT, an optimal mother wavelet selection for VS preprocessing is very important, otherwise the WT will suffer from an oscillation effect. Empirical mode decomposition (EMD) [16,17,18], an adaptive signal decomposition technique, can overcome the shortcomings of WT. Unfortunately, EMD has mode mixing and it suffers from extreme interpolation. These drawbacks of EMD make the WT more attractive [19]. To address the above concerns, rather than focusing on a narrow optimal frequency band, this paper first calculates the modes of vibration for MCP defects. To overcome the interference macrostructural vibration noise, these MCP defect modes of vibration are filtered from the MCP vibration spectrum. The filtered mode of vibration forms the FSFB, which is then used for discriminant SF extraction.
After VS preprocessing, feature extraction and feature preprocessing are important steps in intelligent FD [20,21,22]. SF can be extracted from the VS in time, frequency, and TFD [23]. The power of deep learning techniques can be utilized for fault-related discriminant feature extraction and classification [24,25]. Juan et al. [24] proposed a data-driven fault diagnosis strategy for bearing fault diagnosis. The proposed method extracts SF’s from the raw vibration signal in the time domain, frequency domain, and TFD. To obtain the discriminant set of features with reduced dimensions for the identification of bearing working conditions, a novel deep feature learning technique is proposed. However, the interaction of complex fluid and mechanical components inside the pump and the stiffness change in the mechanical component of the CP change the nature of the VS obtained from the CP under soft defect conditions from the vibration signal of the bearing. Thus, SF’s extracted from the raw VS of the CP in multiple domains result in noisy features. Furthermore, they are not capable to represent the fault-related information of the CP. Moreover, the time domain SF’s extracted from the raw vibration signal are either not sensitive to incipient defects or are not appropriate for severe defects [26]. In the case of the frequency domain, SF’s extracted from the frequency spectrum of the raw vibration signal may be noisy because the fault-related frequencies of the CP often occur at lower frequencies, and therefore, it can be overwhelmed by microstructural vibration noise. To address these concerns, a new technique is introduced for CP VS pre-processing which is applied to the raw VS before feature extraction. The technique first calculates the modes of vibration for MCP defects. These MCP defect modes of vibration are filtered from the MCP vibration spectrum. The filtered mode of vibration forms the FSFB, which is then used for discriminant SF’s extraction in time, frequency, and TFD. Yet the study presented in [24] is interesting and can be used for CP fault diagnosis after VS pre-processing. The SF’s extracted from time, frequency, and TFD are combined into a single feature vector called MDFP. The MDFP results in a larger dimension. Furthermore, not all features are best for representing centrifugal pump conditions and they can affect the condition classification accuracy of the classifier. To address this concern, feature preprocessing for discriminant feature extraction is of primary importance [27,28,29,30,31,32,33,34,35]. Several feature dimensionality reduction and discriminancy evaluation techniques have been proposed [36,37,38]. Among them, principal component analysis (PCA) and linear discriminant analysis (LDA) are most common. LDA preprocesses the feature and results into discriminant feature space by reducing within class feature sparseness and increasing the feature distance between classes [39]. Several variants of LDA are proposed such as Pearson LDA [40] and trace ratio LDA (Tr-LDA) [41]. However, the penalty graph representation for between classes distance affects the discriminant of the feature space. In contrast, the PCA constructs low dimensional feature space by considering the variance. Sakthivel et al. [42] evaluated supervised and unsupervised dimensionality reduction techniques for MCP fault classification. The evaluation showed that PCA outperformed all feature preprocessing methods for MCP fault classification. Unfortunately, PCA does not consider between class feature distance or within class feature sparseness. For this reason, this paper proposes a novel Ir-PCA. The novel technique first assesses the feature informativeness towards the fault by calculating the informative ratio between the feature within the class scatteredness and between class distance. To obtain a discriminant set of features with reduced dimensions, PCA is applied to the features with a high informative ratio. The combination of informative ratio-based features assessment and PCA forms the novel Ir-PCA.
The overall contribution of this work can be summarized as follows:
  • To overcome the macrostructural interference noises, this paper first calculates the vibration modes for MCP defects. These MCP defect modes of vibration are filtered from the MCP vibration spectrum. The filtered mode of vibration forms the FSFB, which is used for discriminant SF extraction in time, frequency, and TFD. All of these SF were combined into a single feature victor called MDFP.
  • Ir-PCA was proposed for discriminant feature extraction for MCP fault diagnosis. To the best of our knowledge, Ir-PCA has not been reported. Ir-PCA first assesses the feature informativeness towards the fault by calculating the informative ratio of the features. To obtain a discriminant set of features with reduced dimensions, PCA was applied to the features with a high informative ratio.
  • The MCP vibration signal obtained from a real-world industrial test rig was used for the evaluation of the proposed method.
The paper is organized into the following sections. Section 2 presents the technical review. The pump experimental test rig is presented in Section 3. Section 4 explains the proposed method. Section 5 presents the performance evaluation of the proposed method. Section 6 presents the conclusion of this study.

2. Technical Review

2.1. Review of Principle Component Analysis

PCA is one of the most popular methods when dimensionality reduction is concerned. Two criteria are essential in a dimensionality reduction method: the ability to compress the data and simultaneously maximize the coverage of data variances. PCA’s approach to these tasks is constructed around the finding of the reduced dimensionality’s linear basis.
Given an Nx1 vector x1, x2, …, xn, the general scheme of PCA initiates with the removal of the mean value x ¯ from each feature, which is used for computing the covariance matrix C cov that represents the data distribution. C cov eigenvalues and eigenvectors can be achieved using Equations (1) and (2).
C E V = λ i > λ i + 1 > ... λ N
C E V e c = μ i > μ i + 1 > ... μ N
Thus, (xi x ¯ ) can be rewritten as Equation (3):
x i x ¯ = a 1 μ 1 + a 2 μ 2 + ... + a N μ N = i = 1 N l
where a1, a2, …, an are scalars. Finally, only the corresponding terms of the K largest eigenvalues are retained, as shown in Equation (4):
x i x ¯ = i = 1 N a i μ i   where   K N

2.2. Review of the Wavelet Packet Transform (WPT)

Being an extension of the original wavelet transform (WT), WPT also presents a solution to independent frequency band analysis. Instead of dividing the signal into k levels, such as WT, WPT breaks the signal using low- and high-pass filters while creating 2k nodes at each level. As a result, WPT can exceed WT in terms of resolution, thus obtaining a more comprehensive analysis of time–frequency across the signal spectrum. Its coefficients can be computed as follows:
c k + 1 j ( n ) = c k j × h ( 2 n ) , 0 < j < 2 k 1
d k + 1 2 j + 1 ( n ) = d j k ( n ) × g ( 2 n ) , 0 < j < 2 k 1
where h and g are the low- and high-pass filters, respectively, k is the levels (scale parameter), and 2j, 2j + 1 are the nodes (frequency parameter) in Equations (5) and (6). Figure 1 shows the algorithmic description of WPT decomposition for three levels. A detailed description of WPT can be found in [43].

3. Pump Experimental Test Rig

The MCP test rig setup picture and schematics are shown in Figure 2a,b, respectively. The pump test rig consists of an MCP PMT-4008 (Hanil, Gwangju, Korea) driven by a 5.5 kW motor. A user-friendly control panel was established for controlling the pump speed, power, and flow rate. After turning on the electric power, the MCP starts pumping water from the main tank to the buffer tank through steel pipes. The VS were recorded from the MCP at a speed of 1733 rpm using accelerometers.
A total of four accelerometers were used for recording the VS of the MCP. Two of the accelerometers were attached to the pump casing, one accelerometer near the impeller in the axial direction, and one near the mechanical seal in the axial direction. A 300 Seconds (s) long VS was recorded from the MCP in the normal condition (NC). After collecting the MCP data in NC, the rotating part of the MS was replaced by a defective MS, as shown in Figure 3a. An MSH with a diameter of 2.8 mm and depth of 2.8 mm in the MS was created using an electrical drill, and the VS from the MCP were recorded for 300 s. Similarly, an MSS was created in the rotating part of the MS with a 38 mm inner diameter, as presented in Figure 3b, and the VS from the MCP were collected for 300 s. An ID, as shown in Figure 3c, was created by removing a metal piece from the impeller using an electric device with a length of 18 mm and a depth of 2.8 mm; the VS from the MCP were collected for 300 s.
A total of 1200 s of VS were collected from the MCP. All of the VS were digitized using a NI-9234 DAQ. Figure 4 shows the VS obtained from the MCP in NC and a defective condition.

4. Proposed Fault Diagnosis Method

The proposed method starts with selecting FSFB and ends with MCP health state classification. Figure 5 shows the flow diagram of the proposed method. Following are the steps involved in the proposed method.

4.1. Step 1: Fault Specific Frequency Band Selection

A fault can be identified from the shock produced by the change in stiffness around the mechanical structure when the fault occurs. Although the observation in the frequency spectrum can theoretically allow us to speculate these shocks at the fault-specific frequencies (FSF), it is more sophisticated in the case of the CP. The increased complexity is because of the interactions between the fluid and mechanical components, which can cause a hydraulic defect from the original mechanical defects. As a result, discriminant features might not be achieved with the sole focus on FSF. Furthermore, raw VS can be affected by the macrostructural vibration, thus altering the SF quality. These issues are addressed in this step by the identification and selection of FSFB.
Three main types of frequency are of interest in the CP: generated, excitation, and electronics. The harmonics from the first two types are considered valuable features concerning this study’s objective. The identification of these frequencies is feasible because of the system parameters (i.e., rotating speed, CP geometry, etc.). When the defect is located on the impeller, the generated frequencies are the source. The impeller imbalance caused by this defect can be observed in the VS [44], which is described at an FSF by the following equation:
F I D = n Z
where n is the frequency harmonics and Z is the MCP’s operating speed (Hz) in Equation (7). The FS difference of an MCP in the normal condition and ID condition are demonstrated in Figure 6. As FID’s calculation was taken to the 5th harmonic, the increase in the amplitude of the last three was attributed to the ID. Moreover, in the case of ID, spikes were observed across the FS, which can be explained by the interaction between the defective impeller and fluid.
In the MCP spectrum, an MS defect is associated with the excitation frequency, which reflects the CP’s amplified vibration as a single frequency harmonic [5,45]. The circular ring vibration theory was used to explain the calculation of the excitation frequency, which starts with the calculation of the deformation potential energy (PE) and vibration kinetic energy (KE):
E P E = ( A E u r 2 2 r 2 × 2 π r )
E K E = ( ρ A 2 ) ( ( u r ) 2 × 2 π r )
where A is the cross-sectional area of the ring, ur is the radial displacement, r is the centerline radius of the ring, and E is the elasticity modulus. u/r represents the ring’s unit elongation in Equations (8) and (9). According to the energy conservation method:
d d t ( ( E P E + E K E ) = 0
By solving Equation (10), the equation of motion is achieved as:
( u r ) + ( ω ° 2 ) u r = 0
where ω ° is the angular frequency. Equation (11) can then be solved, which results in the ring fundamental frequency as Equation (12):
f r f = ( 1 2 π r ) E ρ
Appropriate modes are present because the vibration is nonrandom. For a MS, the expressions of in-plane and out-of-plane modes are:
f i n p l a n e = ( 2 n ( n 2 1 ) π ) ( h d 2 ) E ρ ( 12 n 2 ) + ( 2 t h 3 ( 1 + ν ) c )
f i n p l a n e = ( 2 n ( n 2 1 ) π ) ( h d 2 ) E ρ ( 12 n 2 ) + ( 2 t h 3 ( 1 + ν ) c )
where h is the cross-sectional height, t is the ring thickness, d is the diameter, v is Poisson’s ratio, c is the torsion constant, and n is the vibration mode. Generally, high frequencies occur where the fundamental and in-plane vibrations occur. Nevertheless, lower frequencies were witnessed in the case of out-of-plane bending modes of vibration (flexural vibration) of the MS, which were obtained using Equation (14). As shown in Figure 7, with a MS defect, the amplitude of the excitation frequency in the 2nd and 3rd flexural vibration modes was twice as large as when the defect was absent. In this study, a lowpass filter with a cutoff frequency at 4600 Hz was used to obtain the MCP vibration signal, which scales up to the 3rd mode of flexural vibration. The filtered modes of flexural vibration are FSFB. As shown in Figure 6 and Figure 7, the presence of the MCP’s excitation frequencies, impeller defect’s FSF, and corresponding hydraulic defects are covered in this filtered FSFB. The FSFB is used for the extraction of SF in the next step.

4.2. Step 2: Multi-Domain Feature Pool Construction

SF’s are extracted from the FSFB in time, frequency, and TFD. The SF was adopted from a previous study [6]. The representative set of SF’s extracted from the FSFB in the time domain are mean, variance, root amplitude, skewness, root mean square (RMS), clearance factor, impulse factor, kurtosis, shape factor, peak, standard deviation, and crest factor. The time-domain SF’s are presented in Table 1. Continuously, features such as mean frequency, root variance, standard deviation, spectral kurtosis, and root mean frequency are extracted from the FSFB in the frequency domain. Table 2 shows the SF’s extracted from FSFB in the frequency domain. To extract the SF’s from the FSFB in TFD, the FSFB is transformed into TFD using WPT. In this study, Daubechies family db4 mother wavelet is used to decompose the FSFB up to k = 3 levels. As a result, a total of 2 k WPT bases are obtained. The features mentioned in Table 1 are extracted from each base of WPT. Experimental studies on the selection of optimal wavelets revealed that the Daubechies family db4 mother wavelet is sensitive towards the ongoing processes inside the MCP [46,47]. Therefore, in this study, Daubechies family db4 mother wavelet is selected for decomposition of FSFB. All the extracted SF were combined into a single feature vector called MDFP. The MDFP contains the features of time domain, frequency domain, and TFD. Thus, for each MCP condition, the MDFP contains a total of (12 + 5 + (2 k × 12) = 113) features.

4.3. Step 3: Novel Informative Ratio Principal Component Analysis

The MDFP results in a larger dimension; furthermore, not all features are best for representing the MCP condition and they can affect the accuracy of the condition classification. To address this concern, Ir-PCA was proposed for discriminant feature extraction for MCP fault diagnosis. The steps involved in Ir-PCA are:
Step 1. Calculate the inter-class feature sparseness using Equation (15).
I i S = 1 N n = 1 N I n , i ,
Using Equation (16), In,i can be obtained as:
I n , i = 1 S n × ( S n 1 ) j , s = 1 S n | x s , n , i x j , n , i | , where   j , s = 1 , 2 , ... , S n , j s .
where N represents the classes, i is the number of features, x represents the feature, S is the sample number, and I i S is the inter class sparseness of the features in Equations (15) and (16).
Step 2. Compute the inter-class feature mean μ n , i .
Step 3. Determine the distance between the features of different classes using Equation (17).
T l d = 1 N × ( N 1 ) p , q = 1 P | μ q , i μ p , i | , p , q = 1 , 2 , ... , P , p q .
Step 4. The informativeness of the feature is calculated using Equation (18).
I r = T l d I i S
Step 5. Apply PCA to the features with Ir ≥ 0.5.
Ir-PCA = P C A ( f e a t u r e s ( I r 0.5 ) ) .
Using Equation (19), a new set of discriminant features with high between classes distance, reduced within class sparseness, and reduced dimensions were obtained. The new Ir-PCA solves the problem of between class feature distance and within class feature sparseness of traditional PCAs. The new set of features are provided to the K-NN for MCP condition classification where K = 3. In this study, K-NN was selected for MCP condition identification because of the low computational cost and simple architecture.

5. Results and Performance Evaluation

The dataset used for evaluating the performance of the proposed method consists of 1200 VS. These VS were obtained from the MCP in NC, MSH, MSS, and ID conditions. The MDFP was constructed from the SF extracted from the 1200 VS, where the number of SF in MDTF was Nmcp × Vs × Isf. The Nmcp is the classes, VS is the VS instances, and Isf is the SF extracted from VS in each class. A cross-validation (CV) strategy with n-folds (n = 3) was adopted to validate the proposed method. The dataset was partitioned into n-folds where n-fold were used for classifier testing and the rest of the n−1 folds were used for testing the classifier. Out of 1200 samples, 800 were used for classifier training and 400 were used for classifier testing. All samples were selected randomly for each trial. To ensure stability in the classification results, each experiment was performed 20 times.

Performance Comparison of the Proposed and Reference Methods

From the pre-processing of VS and SF using the proposed method, new fault features were extracted. This study provides a comprehensive evaluation of extracted features for fault diagnosis by comparing it with a TFD features extraction method (WPT-MSVM-PCA) [15], an unsupervised feature pre-processing technique (PCA) [42], and a supervised feature pre-processing method (Tr-LDA) [41]. The comparison matrices used in this study are macro-recall (Mrecall) or true-positive rate (T-PR), macro-precision (Mprecision), the average accuracy of the classification (AAC), and error rate for classification (Er). These matrices are calculated using the following equations:
T - PR = 1 k j = 1 k ( ( N T P j , m ) N T P j , m + N F N j , m ) × 100 ( % )
AAC = 1 k j = 1 k ( m = 1 L N T P j , m N s a n p l e s ) × 100 ( % )
E r = 1 k j = 1 k ( ( N T P j , m ) + ( N F N j , m ) ( N T P j , m ) + ( N F N j , m ) + N T N j , m + N F P j , m ) × 100 ( % )
where k is the CV folds, true positives ( N T P j , m ) , true negatives ( N T N j , m ) , false positives ( N F P j , m ) , and false negatives ( N F N j , m ) identified by the classifier as condition m; the iteration of CV folds are j, and N s a m p l e s are the samples in the testing subset as stated in Equations (20)–(22).
The results obtained from the proposed and Reference methods are presented in Table 3. It is evident in Table 3 and Figure 8 that the proposed method best identified the MCP working conditions compared with the performance of the reference methods with 0% Er, 100% AAC, 100% Mrecall, and 100% Mprecision. The performance of the proposed method is expected because it first calculates the modes of vibration for MCP defects. To overcome the interference macrostructural vibration noise, these MCP defect modes of vibration are filtered from the MCP vibration spectrum. The filtered mode of vibration forms the FSFB, which is then used for multi-domain SF extraction. The multi-domain SF are combined into MDFP, which results in a large dimension. Moreover, some of the features may be noisy and can affect the classifier accuracy. The classifier accuracy is directly proportional to discriminant features. The proposed method applies the novel Ir-PCA to the MDFP to extract discriminant features with reduced dimension. Ir-PCA first assesses the feature informativeness towards the fault by calculating the informative ratio of the features. To obtain a discriminant set of features with reduced dimensions, PCA was applied to the features with a high informative ratio. As shown in Figure 9, the feature space obtained from the proposed method was discriminant; furthermore, the same class features were less sparse. Figure 9 also provides evidence for the higher AAC obtained from the proposed method.
The time–frequency domain (WPT-MSVM-PCA) method pre-processes the vibration signal using WPT and then, with the help of PCA, selects the WPT bases to extract the statistical features. The number of bases defines the dimension of the data. Therefore, the signal is decomposed to 2 levels with 4 bases using WPT. According to PCA, the importance of dynamics connected to data is greater if there are variations in the data, which is why the principal components are sorted by decreasing covariance. Because of the correspondence with 70% data covariance, the first two WPT bases are picked, and the other bases are discarded. After the selection of WPT bases, the useful statistical features are extracted, and then, the best features are picked from the features pool by a wrapper model. Feature extraction and classification using this model produced 6.77% Er, 96.39% AAC, 96.39% Mrecall, and 96.20% Mprecision, which are comparatively less than our proposed method, as shown in Table 3 and Figure 8. These results were expected because information loss occurred because of PCA and WPT dependency on signal level decomposition and the use of optimal parent wavelet for WT. Alternatively, the proposed model used all of the bases of WPT, along with the discriminant features obtained from MDFP pre-processing and VS pre-processing. However, sensitivity of WPT-PCA-MSVM to soft defects, such as MSS fault, was observed. The feature space obtained from WPT-PCA-MSVM is shown in Figure 9, where the only separable condition was the NC features. Overall, the AAC of this method was greater than 95% and it can be considered for MCP fault diagnosis.
The unsupervised features processing method PCA is also a dimensionality reduction technique that uses the variance in data to construct data representation in lower dimensions. Application of PCA with a K-NN classifier to our dataset resulted in 10.34% Er, 94% AAC, 93.97% Mrecall, and 94.20% Mprecision, which illustrates its poor performance compared with the proposed method, as shown in Table 3 and Figure 8. Additionally, to keep the fault features data, determining the optimal quantity of components is difficult for the PCA. As shown in Figure 9, the classification of features is not as good as the proposed method, resulting in the high classification error value for PCA.
The underperforming supervised features pre-processing method (Tr-LDA) is a linear dimensionality reduction method, which, with the aid of trace ratio criteria, reduces the intraclass dispersion and enhances the interclass distance. The application of this algorithm to our dataset results in 35.09% Er, 63.48% AAC, 63.48% Mrecall and 63.12% Mprecision, which is underperforming compared with our proposed method. This underperformance corroborated expectations because Tr-LDA transforms the data without consideration of feature processing for extracting the intrinsic discriminant information from raw statistical features. Tr-LDA efficiently reduces the intraclass dispersion; however, it results in 35.09% Er in separating different classes, as shown in Figure 9.
Overall, the proposed method considers feature pre-processing and VS pre-processing. The method is easy to implement and efficient for MCP condition classification.

6. Conclusions

This paper proposed a new condition classification method for a multistage centrifugal pump. In the signal pre-processing step, the proposed method selected the fault-specific frequency band from the raw vibration signal. The new method for selecting the fault-specific frequency band first calculates the defect vibration modes of the multistage centrifugal pump. Furthermore, to overcome interference macrostructural vibration noise these defect modes are filtered from the raw vibration signal. The filtered modes of vibration formed the fault-specific frequency band. Statistical features in time, frequency, and time–frequency domains were extracted from the fault-specific frequency band. All of the features were combined into a single multi-domain feature pool. To extract discriminant features from a multi-domain feature pool with low dimensions, novel Ir-PCA was applied to the multi-domain feature pool in the feature pre-processing step of the proposed method. The novel Ir-PCA first assesses the feature informativeness towards the fault by calculating the informative ratio of the features. To obtain a discriminant set of features with reduced dimensions, principal component analysis is applied to the features having a high informative ratio. The discriminant set of features obtained from Ir-PCA was then classified using the K-NN classifier for MCP condition classification. The proposed method for multistage centrifugal pump fault diagnosis outperformed the state-of-the-art reference method with an average accuracy of classification of 100%. However, the feature space obtained from the proposed method is highly separable and compact. Therefore, the high variance would be an issue if classification algorithms other than K-NN are used. In the future, the proposed method will be applied to fluid-related defects of the multistage centrifugal pump such as incipient cavitation and severe cavitation.

Author Contributions

Conceptualization, Z.A., T.-K.N., S.A. and J.-M.K.; methodology, Z.A., T.-K.N., S.A. and J.-M.K.; validation, Z.A., T.-K.N., S.A. and J.-M.K.; formal analysis, Z.A., T.-K.N., S.A. and J.-M.K.; resources, Z.A., T.-K.N., S.A., C.D.N. and J.-M.K.; writing—original draft preparation, Z.A., T.-K.N. and S.A.; writing—review and editing, J.-M.K.; visualization, Z.A., T.-K.N. and S.A.; supervision, J.-M.K.; project administration, J.-M.K.; funding acquisition, J.-M.K. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Technology development Program(S3106236) funded by the Ministry of SMEs and Startups(MSS, Korea). This work was supported by the Technology Infrastructure Program funded by the Ministry of SMEs and Startups (MSS, Korea).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ntoko, N.-M.; Djidjou, K.; Harrison, N. A Basis for Teaching Centrifugal Pump Characteristics in an Undergraduate Course in Turbomachinery. Int. J. Mech. Eng. Educ. 2005, 33, 283–293. [Google Scholar] [CrossRef]
  2. Rane, S.B.; Narvel, Y.A.M. Re-designing the business organization using disruptive innovations based on blockchain-IoT integrated architecture for improving agility in future Industry 4.0. Benchmarking Int. J. 2021, 28, 1883–1908. [Google Scholar] [CrossRef]
  3. Ahmad, Z.; Rai, A.; Hasan, J.; Kim, C.H.; Kim, J.-M. A Novel Framework for Centrifugal Pump Fault Diagnosis by Selecting Fault Characteristic Coefficients of Walsh Transform and Cosine Linear Discriminant Analysis. IEEE Access 2021, 9, 150128–150141. [Google Scholar] [CrossRef]
  4. Cao, S.; Hu, Z.; Luo, X.; Wang, H. Research on fault diagnosis technology of centrifugal pump blade crack based on PCA and GMM. Measurement 2020, 173, 108558. [Google Scholar] [CrossRef]
  5. Chittora, S.M. Monitoring of Mechanical Seals in Process Pumps. 2018. Available online: http://www.diva-portal.org/smash/get/diva2:1255405/FULLTEXT01.pdf (accessed on 15 June 2021).
  6. Ahmad, Z.; Rai, A.; Maliuk, A.S.; Kim, J.-M. Discriminant Feature Extraction for Centrifugal Pump Fault Diagnosis. IEEE Access 2020, 8, 165512–165528. [Google Scholar] [CrossRef]
  7. Qin, Y.; Member, S.; Jin, L.; Zhang, A.; He, B. Rolling Bearing Fault Diagnosis With Adaptive Harmonic Kurtosis and Improved Bat Algorithm. IEEE Trans. Instrum. Meas. 2021, 70, 3508112. [Google Scholar] [CrossRef]
  8. Maliuk, A.S.; Prosvirin, A.E.; Ahmad, Z.; Kim, C.H.; Kim, J.-M. Novel Bearing Fault Diagnosis Using Gaussian Mixture Model-Based Fault Band Selection. Sensors 2021, 21, 6579. [Google Scholar] [CrossRef]
  9. Prosvirin, A.E.; Ahmad, Z.; Kim, J.-M. Global and Local Feature Extraction Using a Convolutional Autoencoder and Neural Networks for Diagnosing Centrifugal Pump Mechanical Faults. IEEE Access 2021, 9, 65838–65854. [Google Scholar] [CrossRef]
  10. Shao, Y.; Kang, R.; Liu, J. Rolling Bearing Fault Diagnosis Based on the Coherent Demodulation Model. IEEE Access 2020, 8, 207659–207671. [Google Scholar] [CrossRef]
  11. Tra, V.; Kim, J.-M. Pressure Vessel Diagnosis by Eliminating Undesired Signal Sources and Incorporating GA-Based Fault Feature Evaluation. IEEE Access 2020, 8, 134653–134667. [Google Scholar] [CrossRef]
  12. Jalayer, M.; Orsenigo, C.; Vercellis, C. Fault detection and diagnosis for rotating machinery: A model based on convolutional LSTM, Fast Fourier and continuous wavelet transforms. Comput. Ind. 2020, 125, 103378. [Google Scholar] [CrossRef]
  13. Zhang, K.; Ma, C.; Xu, Y.; Chen, P.; Du, J. Feature extraction method based on adaptive and concise empirical wavelet transform and its applications in bearing fault diagnosis. Measurement 2021, 172, 108976. [Google Scholar] [CrossRef]
  14. Wei, H.; Zhang, Q.; Shang, M.; Gu, Y. Extreme Learning Machine-based Classifier for Fault Diagnosis of Rotating Machinery using a Residual Network and Continuous Wavelet Transform. Measurement 2021, 183, 109864. [Google Scholar] [CrossRef]
  15. Rapur, J.S.; Tiwari, R. Experimental fault diagnosis for known and unseen operating conditions of centrifugal pumps using MSVM and WPT based analyses. Measurement 2019, 147, 106809. [Google Scholar] [CrossRef]
  16. Huang, N.E.; Shen, Z.; Long, S.R.; Wu, M.C.; Shih, H.H.; Zheng, Q.; Yen, N.-C.; Tung, C.C.; Liu, H.H. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. R. Soc. A Math. Phys. Eng. Sci. 1998, 454, 903–995. [Google Scholar] [CrossRef]
  17. Alabied, S.; Haba, U.; Daraz, A.; Gu, F.; Ball, A.D. Empirical Mode Decomposition of Motor Current Signatures for Centrifugal Pump Diagnostics. In Proceedings of the 2018 24th International Conference on Automation and Computing (ICAC), Newcastle Upon Tyne, UK, 6–7 September 2018. [Google Scholar] [CrossRef]
  18. Lei, Y.; Lin, J.; He, Z.; Zuo, M.J. A review on empirical mode decomposition in fault diagnosis of rotating machinery. Mech. Syst. Signal Process. 2013, 35, 108–126. [Google Scholar] [CrossRef]
  19. Nguyen, C.D.; Ahmad, Z.; Kim, J.-M. Gearbox Fault Identification Framework Based on Novel Localized Adaptive Denoising Technique, Wavelet-Based Vibration Imaging, and Deep Convolutional Neural Network. Appl. Sci. 2021, 11, 7575. [Google Scholar] [CrossRef]
  20. Hasan, J.; Rai, A.; Ahmad, Z.; Kim, J.-M. A Fault Diagnosis Framework for Centrifugal Pumps by Scalogram Based Imaging and Deep Learning. IEEE Access 2021, 9, 58052–58066. [Google Scholar] [CrossRef]
  21. Saeed, U.; Lee, Y.-D.; Jan, S.; Koo, I. CAFD: Context-Aware Fault Diagnostic Scheme towards Sensor Faults Utilizing Machine Learning. Sensors 2021, 21, 617. [Google Scholar] [CrossRef]
  22. Reges, G.; Fontana, M.; Ribeiro, M.; Silva, T.; Abreu, O.; Reis, R.; Schnitman, L. Electric submersible pump vibration analysis under several operational conditions for vibration fault differential diagnosis. Ocean Eng. 2020, 219, 108249. [Google Scholar] [CrossRef]
  23. Gonçalves, J.P.S.; Fruett, F.; Filho, J.G.D.; Giesbrecht, M. Faults detection and classification in a centrifugal pump from vibration data using markov parameters. Mech. Syst. Signal Process. 2021, 158, 107694. [Google Scholar] [CrossRef]
  24. Saucedo-Dorantes, J.J.; Arellano-Espitia, F.; Delgado-Prieto, M.; Osornio-Rios, R.A. Diagnosis Methodology Based on Deep Feature Learning for Fault Identification in Metallic, Hybrid and Ceramic Bearings. Sensors 2021, 21, 5832. [Google Scholar] [CrossRef]
  25. Ahmad, W.; Ayub, N.; Ali, T.; Irfan, M.; Awais, M.; Shiraz, M.; Glowacz, A. Towards Short Term Electricity Load Forecasting Using Improved Support Vector Machine and Extreme Learning Machine. Energies 2020, 13, 2907. [Google Scholar] [CrossRef]
  26. Lei, Y. Intelligent Fault Diagnosis and Remaining Useful Life Prediction of Rotating Machinery; Butterworth-Heinemann: Amsterdam, The Netherlands, 2016; pp. 12–86. [Google Scholar] [CrossRef]
  27. Hasan, J.; Sohaib, M.; Kim, J.-M. An Explainable AI-Based Fault Diagnosis Model for Bearings. Sensors 2021, 21, 4070. [Google Scholar] [CrossRef]
  28. Jin, T.; Yan, C.; Chen, C.; Yang, Z.; Tian, H.; Wang, S. Light neural network with fewer parameters based on CNN for fault diagnosis of rotating machinery. Measurement 2021, 181, 109639. [Google Scholar] [CrossRef]
  29. Orrù, P.; Zoccheddu, A.; Sassu, L.; Mattia, C.; Cozza, R.; Arena, S. Machine Learning Approach Using MLP and SVM Algorithms for the Fault Prediction of a Centrifugal Pump in the Oil and Gas Industry. Sustainability 2020, 12, 4776. [Google Scholar] [CrossRef]
  30. Yu, D.; Guo, J.; Zhao, Q.; Zhao, D.; Hong, J. Fault diagnosis for underdetermined multistage assembly processes via an enhanced Bayesian hierarchical model. J. Manuf. Syst. 2020, 58, 280–290. [Google Scholar] [CrossRef]
  31. Liu, R.; Yang, B.; Zio, E.; Chen, X. Artificial intelligence for fault diagnosis of rotating machinery: A review. Mech. Syst. Signal Process. 2018, 108, 33–47. [Google Scholar] [CrossRef]
  32. Mondal, P.P.; Ferreira, P.M.; Kapoor, S.G.; Bless, P.N. Monitoring and Diagnosis of Multistage Manufacturing Processes Using Hierarchical Bayesian Networks. Procedia Manuf. 2021, 53, 32–43. [Google Scholar] [CrossRef]
  33. Papananias, M.; McLeay, T.E.; Obajemu, O.; Mahfouf, M.; Kadirkamanathan, V. Inspection by exception: A new machine learning-based approach for multistage manufacturing. Appl. Soft Comput. 2020, 97, 106787. [Google Scholar] [CrossRef]
  34. Tayyab, S.M.; Asghar, E.; Pennacchi, P.; Chatterton, S. Intelligent fault diagnosis of rotating machine elements using machine learning through optimal features extraction and selection. Procedia Manuf. 2020, 51, 266–273. [Google Scholar] [CrossRef]
  35. Palacín, I.; Gibert, D.; Planes, J.; Arena, S.; Orru, P.F.; Melis, M.; Annis, M. Anomaly Detection for Diagnosing Failures in a Centrifugal Compressor Train. In Proceedings of the 23rd International Conference of the Catalan Association for Artificial Intelligence, CCIA 2021, Lleida, Spain, 20–22 October 2021; Volume 339, pp. 217–220. [Google Scholar]
  36. He, F.; Zhang, Z. Nonlinear Fault Detection of Batch Processes Using Functional Local Kernel Principal Component Analysis. IEEE Access 2020, 8, 117513–117527. [Google Scholar] [CrossRef]
  37. Wang, H.; Yao, M. Fault detection of batch processes based on multivariate functional kernel principal component analysis. Chemom. Intell. Lab. Syst. 2015, 149, 78–89. [Google Scholar] [CrossRef]
  38. He, F.; Wang, C.; Fan, S.-K.S. Nonlinear fault detection of batch processes based on functional kernel locality preserving projections. Chemom. Intell. Lab. Syst. 2018, 183, 79–89. [Google Scholar] [CrossRef]
  39. Yang, L.; Liu, X.; Nie, F.; Liu, Y. Robust and Efficient Linear Discriminant Analysis with L 2,1-Norm for Feature Selection. IEEE Access 2020, 8, 44100–44110. [Google Scholar] [CrossRef]
  40. Ahmad, Z.; Prosvirin, A.E.; Kim, J.; Kim, J.-M. Multistage Centrifugal Pump Fault Diagnosis by Selecting Fault Characteristic Modes of Vibration and Using Pearson Linear Discriminant Analysis. IEEE Access 2020, 8, 223030–223040. [Google Scholar] [CrossRef]
  41. Jin, X.; Zhao, M.; Chow, T.W.S.; Pecht, M. Motor Bearing Fault Diagnosis Using Trace Ratio Linear Discriminant Analysis. IEEE Trans. Ind. Electron. 2013, 61, 2441–2451. [Google Scholar] [CrossRef]
  42. Sakthivel, N.; Nair, B.B.; Elangovan, M.; Sugumaran, V.; Saravanmurugan, S. Comparison of dimensionality reduction techniques for the fault diagnosis of mono block centrifugal pump using vibration signals. Eng. Sci. Technol. Int. J. 2014, 17, 30–38. [Google Scholar] [CrossRef] [Green Version]
  43. Nikolaou, N.G.; Antoniadis, I.A. Application of Wavelet Packets in Bearing Fault Diagnosis. In Proceedings of the 5th WSES International Conference on Circuits, Systems, Communications and Computers (CSCC 2001), Rethymno, Greece, 8–15 July 2001; pp. 12–19. [Google Scholar]
  44. Yedidiah, S. (Ed.) Centrifugal Pump User’s Guidebook Problems and Solutions; Springer: Berlin, Germany, 1996. [Google Scholar]
  45. Picolet, L.E. Vibration problems in engineering. J. Frankl. Inst. 1929, 207, 286–287. [Google Scholar] [CrossRef]
  46. Muralidharan, V.; Sugumaran, V. Feature extraction using wavelets and classification through decision tree algorithm for fault diagnosis of mono-block centrifugal pump. Measurement 2013, 46, 353–359. [Google Scholar] [CrossRef]
  47. Muralidharan, V.; Sugumaran, V.; Indira, V. Fault diagnosis of monoblock centrifugal pump using SVM. Eng. Sci. Technol. Int. J. 2014, 17, 152–157. [Google Scholar] [CrossRef] [Green Version]
Figure 1. WPT decomposition tree up to 3 levels.
Figure 1. WPT decomposition tree up to 3 levels.
Sensors 22 00179 g001
Figure 2. Pump test rig: (a) photograph and (b) schematics.
Figure 2. Pump test rig: (a) photograph and (b) schematics.
Sensors 22 00179 g002
Figure 3. MCP defects: (a) MSH, (b) MSS, and (c) IF.
Figure 3. MCP defects: (a) MSH, (b) MSS, and (c) IF.
Sensors 22 00179 g003
Figure 4. VS of the MCP under NC and defective conditions.
Figure 4. VS of the MCP under NC and defective conditions.
Sensors 22 00179 g004
Figure 5. Proposed method for MCP condition monitoring.
Figure 5. Proposed method for MCP condition monitoring.
Sensors 22 00179 g005
Figure 6. MCP spectrum of (a) NC, and (b) ID.
Figure 6. MCP spectrum of (a) NC, and (b) ID.
Sensors 22 00179 g006
Figure 7. MCP spectrum of (a) NC, (b) MSH, and (c) MSS.
Figure 7. MCP spectrum of (a) NC, (b) MSH, and (c) MSS.
Sensors 22 00179 g007
Figure 8. The precision, recall, and errorate obtained from the proposed and reference methods.
Figure 8. The precision, recall, and errorate obtained from the proposed and reference methods.
Sensors 22 00179 g008
Figure 9. (a) Proposed, (b) WPT −PCA−MSVM, (c) PCA−KNN, and (d) Tr−LDA feature spaces.
Figure 9. (a) Proposed, (b) WPT −PCA−MSVM, (c) PCA−KNN, and (d) Tr−LDA feature spaces.
Sensors 22 00179 g009aSensors 22 00179 g009b
Table 1. Time domain SF’s extracted from the FSFB. The x(n) is FSFB in the time domain, N is the sample data points.
Table 1. Time domain SF’s extracted from the FSFB. The x(n) is FSFB in the time domain, N is the sample data points.
Feature NameEquationFeature NameEquation
Mean X m = n = 1 N x ( n ) N Variance X v = n = 1 N ( x ( n ) X m ) 2 ( N 1 )
Root amplitude X r o o t = [ n = 1 N | x ( n ) | N ] 2 Skewness X s k . = n = 1 N ( x ( n ) X m ) 2 ( N 1 ) X s d 3
RMS X r m s = n = 1 N ( x ( n ) ) 2 N Clearance factor X c l e a r a n c e = X p e a k X r o o t
Impulse factor X i m p u l s e = X p e a k 1 N n = 1 N | x ( n ) | Kurtosis X k u r t o s i s = n = 1 N ( x ( n ) X m ) 4 ( N 1 ) X s d 4
Shape factor X s h a p e = X r m s 1 N n = 1 N | x ( n ) | Peak value X p e a k = max | x ( n ) |
Standard deviation X s d = n = 1 N ( x ( n ) X m ) 2 N 1 Crest factor X c r e s t = X p e a k X r m s
Table 2. Frequency domain SF’s extracted from the FSFB. The s(k) is the FSFB spectrum, k is the sample of spectrum, and fk is the value of frequency at spectrum sample k.
Table 2. Frequency domain SF’s extracted from the FSFB. The s(k) is the FSFB spectrum, k is the sample of spectrum, and fk is the value of frequency at spectrum sample k.
Feature NameEquationFeature NameEquation
Mean frequency X m f = k = 1 K s ( k ) K Standard deviation σ f 2 = k = 1 K ( s ( k ) X m f ) 2 K 1
Root variance frequency X r v f = k = 1 K ( s ( K ) X m f ) 2 K Spectral kurtosis X f k u r t o s i s = k = 1 K ( s ( k ) X m f ) 4 ( K ) σ f 4
Root mean square frequency X f r m s = k = 1 K f k 2 ( s ( K ) ) 2 s ( K )
Table 3. Performance of the proposed method and reference methods.
Table 3. Performance of the proposed method and reference methods.
MethodsT-PR (%)AAC (%)
NCMSHMSSHID
Proposed100100100100100
WPT-PCA-MSVM10094.1196.5594.9196.39
PCA-KNN10092.4586.4596.9694
Tr-LDA65.4268.5352.2567.7463.48
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Ahmad, Z.; Nguyen, T.-K.; Ahmad, S.; Nguyen, C.D.; Kim, J.-M. Multistage Centrifugal Pump Fault Diagnosis Using Informative Ratio Principal Component Analysis. Sensors 2022, 22, 179. https://doi.org/10.3390/s22010179

AMA Style

Ahmad Z, Nguyen T-K, Ahmad S, Nguyen CD, Kim J-M. Multistage Centrifugal Pump Fault Diagnosis Using Informative Ratio Principal Component Analysis. Sensors. 2022; 22(1):179. https://doi.org/10.3390/s22010179

Chicago/Turabian Style

Ahmad, Zahoor, Tuan-Khai Nguyen, Sajjad Ahmad, Cong Dai Nguyen, and Jong-Myon Kim. 2022. "Multistage Centrifugal Pump Fault Diagnosis Using Informative Ratio Principal Component Analysis" Sensors 22, no. 1: 179. https://doi.org/10.3390/s22010179

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop