Next Article in Journal
An Improved Algorithm for Optimal Load Shedding in Power Systems
Previous Article in Journal
Investigation and Analysis of Attack Angle and Rear Flow Condition of Contra-Rotating Small Hydro-Turbine
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Iterative Reduced KPCA Hidden Markov Model for Gas Turbine Performance Fault Diagnosis

1
Jiangsu Province Key Laboratory of Aerospace Power Systems, College of Energy & Power Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
2
Aviation Motor Control System Institute, Aviation Industry Corporation of China, Wuxi 214063, China
*
Author to whom correspondence should be addressed.
Energies 2018, 11(7), 1807; https://doi.org/10.3390/en11071807
Submission received: 5 June 2018 / Revised: 3 July 2018 / Accepted: 5 July 2018 / Published: 10 July 2018

Abstract

:
To improve gas-path performance fault pattern recognition for aircraft engines, a new data-driven diagnostic method based on hidden Markov model (HMM) is proposed. A redundant sensor somewhat interferes with fault diagnostic results of the HMM, and it also increases the computational burden. The contribution of this paper is to develop an iterative reduced kernel principal component analysis (IRKPCA) algorithm to extract fault features from original high-dimension observation without large additional calculation load and combine it with the HMM for engine gas-path fault diagnosis. The optimal kernel features are obtained by iterative sequential forward selection of the IRKPCA, and the features with lower dimensions are contracted through a trade-off between the fault information and modeling data scale in reduced kernel space. The similarity degree is designed to simplify the HMM modeling data using fault kernel features. Test results show that the proposed methodology brings a significant improvement in diagnostic confidence and computational efforts in the applications of a turbofan engine fault diagnosis during its steady and dynamic process.

1. Introduction

The gas turbine engine is the power source of aircraft, and its reliability directly affects aircraft safety and performance [1]. The engine is an easy-fault piece of machinery since it has complex structure and runs in harsh operating conditions. During the course of an engine’s life, various physical failures might happen, such as corrosion, erosion, fouling, and foreign object damage [2,3]. These failures lead to gas-path performance degradation, either gradually or abruptly, which is recognized as engine gas-path fault and is greatly harmful to flight safety [4,5]. For the purpose of enhancing operating reliability and reducing maintenance costs of aircraft propulsion systems, engine gas-path fault diagnosis technology has attracted interest.
Generally speaking, gas turbine fault diagnosis approaches are divided into model-based and data-driven ones [6,7]. Variants of Kalman filters are the typical model-based diagnostic methods for gas turbine engines [8,9]. It requires a reliable engine model, which relies on physical characteristics and aero-thermodynamic theory. The engine modeling uncertainties and operation non-linearity during the transient process negatively affects the performance of model-based technologies [10]. Engine-to-engine variation makes it difficult for the general engine model to represent every individual engine. In addition, the observability of general Kalman filters relates to the number of sensor measurements and their types, and it cannot recognize fault patterns as available sensors less than health parameters.
The data-driven approach is another important method of fault diagnosis for complex non-linear systems, especially in the rich data circumstance [11,12,13]. It depends on the collected data of fault modes but not an accurate mathematical model of a gas turbine engine. It is particularly suitable to a complex strong non-linear system, and it is not limited to the sensor measurement number. In the data-driven field, much attention has been paid to neural network approaches, which are theoretically developed from empirical risk minimization with simple mathematical expressions [8]. Bartolini applied neural networks to micro gas turbines, and then Amozegar developed an ensemble of dynamic neural network identifiers for engine fault detection and isolation [14,15]. The desirable topological structure of a neural network is usually selected by experience, and the diagnostic performance by a neural network easily fluctuates with stochastic measurement noise.
Different from the neural network approaches, the hidden Markov model (HMM) is a classic data-driven fault diagnosis for non-linear stochastic systems [16]. The HMM has rigorous theoretical deduction and definite model structure [17]. HMM statistical characteristics in modeling and classification makes it outperform fault pattern recognition for a mechanical system with clear randomness and uncertainties [18]. However, it is noted that the HMM’s computational cost increases dramatically when the measurement dimension increases. The sensor data from various operating cross-sections for engine gas-path fault diagnosis is usually complex and physically correlative in transient process in the flight envelope. Consequently, it is necessary to extract fault features from raw measurement sequences to decrease test data dimensions and simplify the HMM structure.
Principal component analysis (PCA) is introduced into the HMM to extract fault features to reduce computational effort. PCA is achieved by projecting the linear data matrix onto an uncorrelated subspace with less information loss, but its performance is reduced in the plant with strong non-linearity. Kernel-PCA (KPCA) is developed to overcome shortcomings of conventional PCA-to-linear issues [19,20]. Provided a kernel matrix (an n × n matrix where n is the number of the dataset) to map an original dataset into the feature space, the KPCA computational complexity is O(n3) in the principal component feature extraction. Taouali proposed a novel RKPCA to improve the sparsity capability [21], but it is difficult to balance the useful information capacity and data scale to reach the appropriate sample number.
To improve fault diagnostic confidence and computational efforts, a novel fault diagnostic approach is proposed using the combination of iterative reduced KPCA and HMM for engine gas-path fault diagnosis. In this paper, the IRKPCA is developed with a sample-reduction mechanism, which is designed to decline the redundant information in the initial observation sequences for gas-path fault feature extraction. The similarity degree and forward kernel inverse are employed to simplify the sample data, and then the kernel fault feature by the IRKPCA is used by the HMM to perform gas-path fault pattern recognition. The systematical tests are carried out to evaluate fault diagnosis performance of the proposed methodology, and it runs on a two-spool turbofan engine simulation in the steady and transient process at various cycle numbers during its life. The results indicate the superiority of the IRKPCA-HMM, and it supports our viewpoints.
The remainder of paper is organized as follows. The IRKPCA is developed from the basic KPCA, and the comparisons of the involved KPCAs are followed by feature extraction performance using benchmark datasets in Section 2. In Section 3, the IRKPCA-HMM is presented by the combination of IRKPCA and HMM to simplify the fault diagnostic model using reduced-kernel fault features. Simulation and analysis are given on a turbofan engine in dynamics for gas-path fault diagnosis in Section 4. Section 5 draws a conclusion and discusses future research directions.

2. IRKPCA-Based Feature Extraction Algorithm

2.1. KPCA

As the non-linear extension of PCA, KPCA has been applied to various engineering fields and exhibited satisfactory performance in complicated non-linear systems [19,22]. The kernel transformation of the KPCA maps the measured data space to higher-dimension feature space. Given a set of N mean-variance scaled training data, X { x i } , i = 1 , , N , x i R L , i = 1 N x i = 0 , and L is the original dimension of training measurement. The covariance matrix in the feature space F is
C = 1 N i = 1 N ϕ ( x i ) ϕ ( x i ) T
where transformation matrix ϕ (   )   : R L R H is a non-linear map from input vector into H-dimensional feature space F. Non-zero eigenvectors γ of the covariance matrix C is calculated by eigenvalue decomposition
C γ = λ γ = 1 N i = 1 N ϕ ( x i ) , γ ϕ ( x i ) γ = i = 1 N ε i ϕ ( x i )
where λ is the eigenvalues of C in feature space. Combine two expressions in Equation (2) and multiply the kernel matrix ϕ ( x k ) in both sides, and we can obtain
λ i = 1 N ε i ϕ ( x i ) , ϕ ( x k ) = 1 N j = 1 N i = 1 N ε i ϕ ( x i ) , ϕ ( x j ) ϕ ( x j ) , ϕ ( x k )
Then, a kernel matrix K = {kij} with the dimension of N × N is introduced to achieve the inner product in non-linear feature space
k i j = ϕ ( x i ) , ϕ ( x j ) = k ( x i , x j )
A radial basis kernel k ( x i , x j ) = exp ( ( 1 / 2 σ 2 ) x i x j 2 ) is used for Kernel transformation, and σ is a dispersion factor. The kernel matrix K is centralized before kernel transformation
K K 1 N I · K K · 1 N I + 1 N I · K · 1 N I
where I is an identity matrix with the size of N × N. Then, Equation (3) is specified as
K ε = N λ ε
The kernel principal components are selected as the p largest positive eigenvalues λ 1 λ 2 λ p > 0 , and their eigenvectors ɛ1, …, ɛp form the kernel principal component set. The projection parameter of a new data x n e w onto the eigenvectors in the principal component set is called the KPCA-transformed feature variable. The i-th feature zi for x n e w is expressed
z i = γ i T ϕ ( x n e w ) = j = 1 N ε i , j k ( x j , x n e w ) j = 1 , ,   p
From Equation (7), the kernel matrix is computed to obtain the feature vector for each sample in training dataset. The burden of computation and storage would be dramatically serious for drawing fault features as the increase of kernel matrix calculation number and the sample data accumulation. For these purposes, an iterative reduced KPCA (IRKPCA) is developed from the KPCA algorithm.

2.2. IRKPCA

Given the selected samples set from the training dataset X s { x i } ,   i = 1 , , N s , Ns means the selected sample number. The base vectors in feature subspace Φ = { ϕ ( x 1 ) , ,   ϕ ( x s ) } related to the selected samples are sufficient to express the whole transformed data in feature space. Thus, the mapping estimate of any sample vector x i can be represented by a linear combination of these base vectors X s
ϕ ^ ( x i ) = Φ · a i
where ai is the coefficient vector on the feature subspace basis denoted as a i = [ a i , 1 ,   a i , 2 , ,   a i , N s ] T . A new objective function to find these coefficients is designed, and it includes two minimization elements: the estimated mapping errors and coefficient vector norm. The objective function is expressed as
min   J i = ϕ ( x i ) ϕ ^ ( x i ) 2 + ρ a i 2
where 𝜌 is the regularization parameter to weight off the estimation error and coefficient scales. The former part indicates that the modeling-variable mapping is as close as possible to the real mapping, and the latter one compacts the scales. Let ( J i / a i ) = 0 , we have
J i a i = ( ( ϕ ( x i ) Φ a i ) T ( ϕ ( x i ) Φ a i ) + ρ a i T a i ) a i      = ( a i T K s s a i 2 ϕ ( x i ) T Φ a i + ϕ ( x i ) T ϕ ( x i ) + ρ a i T a i ) a i = 0 2 K s s a i 2 K s i + 2 ρ a i = 0 a i = ( K s s + ρ I s ) 1 K s i
where Is is an identity matrix of the size of s × s. Combining Equations (9) and (10), the solution of minimization Ji can be rewritten
min J i = K s i T ( K s s + ρ I ) 1 K s s ( K s s + ρ I ) 1 K s i 2 K s i T ( K s s + ρ I ) 1 K s i + 1 + ρ K s i T ( K s s + ρ I ) 1 I ( K s s + ρ I ) K s i       = 1 K s i ( K s s + ρ I s ) 1 K s i
where Kss represents the kernel matrix of the selected vectors, and Ksi is the dot product between x i and the selected base vectors, Ksi = [k(x1,xi), …, k(xs,xi)]T. Since the calculation procedure of objective function is treated to each sample in the training dataset, the overall objective function is defined by
J = min s ( x i X N { 1 K s i ( K s s + ρ I s ) 1 K s i } )
Equation (12) can be rewritten as
J = max s ( x i X N { K s i ( K s s + ρ I s ) 1 K s i } )
The sample reduction procedure is an iterative sequential forward process. The sample chooses preforms from the training dataset at each step of the procedure, which are combined with the prior selected ones to generate the optimal objective function of Equation (13). The procedure stops as the mean of overall objective is no longer increasing clearly when the sample is added. The mean residual is given by
Δ = mean s + 1 ( J ) mean s ( J )
To reduce the number of training sample, the similarity degree is used. The similarity degree between a new sample x i and the selected samples x j is
cos ( x i , x j ) = ϕ T ( x i ) ϕ ( x j ) ϕ T ( x i ) ϕ ( x i ) ϕ T ( x j ) ϕ ( x j ) j = 1 , ,   s
Kernel matrix inverse is calculated when a new training sample is selected during the iteration. It is very time-consuming to repeatedly calculate the Kss inverse. The forward kernel inverse strategy is used to compute the (s + 1)-th sample given base vectors X s and Rs = (Kss+ ρIs)−1
R s + 1 = [ K s s + ρ I s K s i K s i T k i i + ρ ] 1
Provided an invertible matrix A, and matrices D, V, U, we have
[ A U V D ] 1 = [ A 1 + A 1 U ( D V A 1 U ) 1 V A 1 A 1 U ( D V A 1 U ) 1 ( D V A 1 U ) 1 V A 1 ( D V A 1 U ) 1 ]
Since the matrix Rs has been obtained at the s-th iteration, Rs can be directly acquired by applying Equation (17) to Equation (16) for the matrix inversion
R s + 1 = [ R s 0 0 0 ] + τ [ β 1 ] [ β T   1 ]
where β = R s K s i , τ = ( k i i K s i T β ) , and the kernel inverse number becomes 1 as a new training sample is considered. The reduction scheme of IRKPCA training sample is developed by simplifying training data number for fault feature extraction. In the IRKPCA, the sample similarity degree is checked in advance, and the sample is such that the similarity index larger than its threshold is deleted. The forward kernel inverse strategy is employed to produce the matrix Rs+1 with respect to new training data.

2.3. Feature Extraction Performance Test

The benchmark datasets from UCI Machine Learning [23] are used to validate feature extraction performance of the basic KPCA and proposed IRKPCA. The six datasets include Iris, Wine, Glass, E.coli, Balance and Vehicle. The feature number (#Feature), the number of classes (#Class) and sample number (#SN) of each dataset are listed in Table 1.
The comparisons of basic KPCA and IRKPCA are investigated from reduced features, discriminative power, sparsity, and reduced time in Table 2. The reduced feature number depicts the principal component size after feature extraction by the KPCA. Discriminative power contains the mean correct classification ratio and standard errors using the reduced features in the transformed space. Sparsity is expressed by 2-norm/1-norm (L2/L1), and the larger L2/L1 value is more in favor of classification [24,25]. The reduced-dimension time gives the computational efforts of dataset transformation from the original measurement space to feature space. The regularization parameter ρ and kernel parameter σ are yielded separately from the bound of [2–3, 23] and [20, 210] by the interval 2 times.
From Table 2, we can see that both KPCA algorithms reduce the feature number, and the number by IRKPCA is less than that by KPCA in the most examined datasets. In terms of the discriminate power index, the IRKPCA produces more than 2.70% and 2.89% of mean correct ratio compared to KPCA in the E.coli and Balance datasets, respectively. The differences of mean correct ratios by the two KPCAs in the rest of the datasets are no more than 1%. The indices L2/L1 by IRKPCA is slightly larger than those by KPCA except in the Glass dataset, so the former produces less information loss than the learning feature. The KPCA generates one magnitude larger reduced time than IRKPCA. The reduced time of KPCA increases faster with the sample size, and the indices are 1.2387 s by KPCA and 0.0645 s by IRKPCA in the Vehicle dataset. The IRKPCA consumes less computational time and owns a lower feature number without discriminative power reduce compared to KPCA. Hence, it implies that the IRKPCA might be a better candidate for gas-path fault feature extraction. Figure 1 shows the extraction effect on benchmark datasets of Iris and Wine by the proposed IRKPCA method. The first three dimensions of benchmark datasets are presented in the form of scatter plots in Figure 1, and the points with the same color belongs to the same cluster. From Figure 1 we can see that the distance of different color points increases after the dimension transformation by the IRKPCA. The processed datasets are more easily distinguished than the original dataset in scatter plots, and it supports further classification.

3. IRKPCA-HMM Based Structure

3.1. Hidden Markov Model

Hidden Markov model is extended from Markov chains to yield inferential statistical information on a state sequence [26,27]. It comprises a finite hidden state number, and each state relates to an observation at a time point. There are two kinds of probabilities of every hidden state: transition probability and observation probability. HMM have double stochastic processes: the stochastic transition from one hidden state to another one with transition probability and the stochastic output observation generated from every hidden state with observation probability.
The state sequence of HMM is unobservable, but it can be estimated from the observation sequence. Let hidden state sets S = { s 1 ,   s 2 ,   , s M } and observation vector O = { o 1 ,   o 2 ,   , o T } with o t { v 1 ,   v 2 ,   , v N } , where M is the number of hidden states, N is the number of observation per state and T is the length of observation sequence. The variable q t ( q t S ) indicates the hidden state, and o t ( o t O ) the observation at time t. In general, the compact definition λ = (π, A, B) is used to specify an HMM model, and it includes three components: state transition probability matrix A, observation probability matrix B, and initial state distribution π.
An initial state distribution π illustrates the probability of the starting hidden state si (t = 0)
π = { π i } ,   i = 1 M π i = 1
where πi = Pr(q0 = si), 1 ≤ iM.
A transition probability A contains the element aij
A = { a i j } ,   j = 1 M a i j = 1 ,   ( 1 i M )
where aij is the probability of transition from one hidden state si at time t to another hidden state sj at time t + 1, and it follows aij = Pr(qt+1 = sj|qt = si), 1 ≤ i, jM. The observations are emitted from each state according to the conditional probability, bi(vj) = Pr(ot = vj|qt = si), which is represented as a matrix B{bi(oj)} as well. The elements of B evidently satisfy the constraints:
j = 1 N b i ( v j ) = 1 ,   1 i M
The Baum-Welch algorithm is used to obtain the topological parameters of HMM, where an iterative calculation is implemented to maximize the observed sequence probability Pr(X|λ) from an initial λ. The observed sequence XT×H = [x1, x2, …, xT]T is divided into two sections: Xt×H = [x1, x2, …, xt]T with t measurement data number, and X(T−t)×H = [xt+1, xt+2, …, xT]T with Tt. H is the dimension of measurement vector. Two probabilities corresponding to the forward variable and backward variable are computed. The former is denoted by the probability of observing the partial measurement data Xt×H ending in the state sj, and the latter by the probability of observing the remaining data X(T−t)×H as the state si. The forward variable αt(j) is
α t ( j ) = P r ( x 1 ,   x 2 , ,   x t ,   q t = s j |   λ )   =   [ i = 1 M α t 1 ( i ) · a i j ] b j ( x t ) ,   1 i ,   j M ,   1 t T 1
where α1(i) = πi. bi(x1), 1 ≤ iM. The backward variable βt(i) is
β t ( i ) = P r ( x t + 1 ,   x t + 2 , ,   x T |   q t = s i ,   λ )   =   i = 1 M a i j b j ( x t + 1 ) β t + 1 ( j ) ,   1 i ,   j M ,   T 1 t 1
where βT(i) = 1, 1 ≤ iM. The detailed Baum-Welch algorithm to obtain the probability of sequence O can be referred to in paper [28]. The scalar quantization is to generate the partition vector and the codebook vector according to the quantization distortion of the input before training HMM [29].
The observation sequence of HMM is formed by measurement data from various sensors, which are equipped along a gas path to depict engine operation. As mentioned earlier, redundant information of various operating data will increase fault diagnosis difficulty. The larger dimension of measurement vector xi leads to more complex HMM topology [5,19]. It would not only increase the computational time of forward-backward procedure in Baum-Welch algorithm but also negatively impact on classification accuracy.

3.2. IRKPCA-HMM Algorithm

Gas-path fault feature extraction based on the IRKPCA is combined with HMM for fault diagnosis in this section. The reduced observation sequence X*T×R = [x*1, x*2, …, x*T]T with R feature dimensions (R < H) is generated from the original sequence XT×H. Thus, the re-estimation process of HMM with the new sequence X*T×R is as follows
a ¯ i j = k = 1 R 1 P k t = 1 T 1 a t ( k ) ( i ) a i j b j ( x * t + 1 ( k ) ) β t + 1 ( k ) ( j ) k = 1 R 1 P k t = 1 T 1 a t ( k ) ( i ) β t + 1 ( k ) ( j ) ,   1 i ,   j M
b ¯ j ( l ) = k = 1 R 1 P k t = 1 , x * t = v t T 1 a t ( k ) ( i ) β t ( k ) ( i ) k = 1 R 1 P k t = 1 T 1 a t ( k ) ( i ) β t ( k ) ( i ) ,   1 j M ,   1 l N
P ( X * | λ ) = k = 1 R P ( X * ( k ) | λ ) = k = 1 R P k
where a t ( k ) is k-th forward variable, and β t + 1 ( k ) is k-th backward variable. In IRKPCA-HMM test phase, Equation (26) is specified as
P ( X * | λ ) = k = 1 R i = 1 M a t ( k ) ( i ) β t ( k ) ( i )
The log-likelihood probability (LL) is a fault index defined by the logarithm calculation to P ( X * | λ ) in Equation (27). The larger LL indicates the greater consistency of observation sequence to the HMM.
The IRKPCA-HMM achieves gas-path fault diagnosis in the lower dimension feature space from the original measurement space, and it decreases the calculation effort not only in the training process but also in testing process. The IRKPCA-HMM algorithm schematic is presented for fault diagnosis of aero engines in Figure 2.
The proposed IRKPCA-HMM procedure is as follows:
(1)
Initialization.
Set initial kernel parameters, regulation parameter and information loss threshold. Select the number of hidden states and observation states, and topological structure of HMM.
(2)
Search optimal samples by iterations.
(2.1.)
The original training sample set X0, and let the dataset X s = , X ¯ s = , X t = X 0 .
(2.2.)
Select the first optimal sample. Calculate the objective function Equation (13) for each sample, and the sample x1 with the maximum objective value is added into the sample dataset Xs, X s = X s { x 1 } .
(2.3.)
Compute the sample similarity degree. The similarity degree is computed between the selected sample in Xs and rest samples xi in X t X s X ¯ s . If sample similarity degree is larger than similarity threshold, then X ¯ s = X ¯ s { x i } , i = I + 1, and continue until the rest samples are all checked.
(2.4.)
Calculate the objective Equation (18) for each sample and select the sample of maximum objective value xi. Compute the mean objective residual as Equation (14) for the selected optimal sample xi. If the sample mean objective residual is larger than the difference threshold, go to 1.3, else continue.
(3)
Principal component computation in kernel space.
(3.1.)
Calculate the centralized kernel matrix K with selected samples.
(3.2.)
Compute eigenvectors { ε i } with corresponding eigenvalues { λ i } .
(3.3.)
Choose the p largest eigenvalues until i = 1 p λ i 0.95 //information loss threshold.
(4)
Online fault feature extraction for testing data { x n e w }.
(4.1.)
Calculate Ksi(xnew) = [k(x1, xnew), …, k(xs, xnew)]T.
(4.2.)
Calculate the features according to Equation (7).
(5)
HMM modeling with reduced observation.
(5.1.)
Set initial number of hidden states, observation states and choose topology structure of HMM.
(5.2.)
Re-estimate parameter using Equations (24) and (25).
(5.3.)
Form the HMM libraries for gas-path fault modes.

4. IRKPCA-HMM Based Engine Gas-Path Fault Diagnosis

The proposed IRKPCA-HMM approach to gas-path fault diagnosis is tested on a virtual two-spool turbofan engine developed by the component-level engine model [30,31]. The examined turbofan engine is mainly composed of inlet, fan, compressor, bypass, combustor, high-pressure turbine (HPT) and low-pressure turbine (LPT), mixer and nozzle, and it is illustrated in Figure 3. The inlet supplies airflow into the fan, and then the air is divided to two streams: one flowing into the compressor and the other passing through the bypass. Air leaving the compressor moves to the combustor, where fuel is injected and burns to produce hot gas to drive the turbines. The fan and compressor are driven by the LPT and HPT, respectively. Gas from LPT and air from bypass mix in the mixer, and then leaves the engine through the nozzle. Closed-loop control strategy of spool speed is applied to aero engine with safety protection [32]. The engine station numbers in Figure 3 are as follows: inlet exit marked by 2, compressor inlet by 22, compressor exit by 3, HPT entrance by 43, LPT entrance by 5, and LPT exit by 6.
The data are generated from the numerical engine model [33,34] to evaluate the involved methods in the steady behavior of the maximum power operation and transient behavior including acceleration and deceleration. The involved engine parameters are reported in Table 3. The control variables include fuel flow Wf and Nozzle area A8, which define the operating point of the engine. The health parameters are unmeasurable and represent engine gas-path health, containing indicators of fan efficiency SE1, fan flow SW1, compressor efficiency SE2, compressor flow SW2, HPT efficiency SE3, HPT flow SW3, LPT efficiency SE4, and LPT flow SW4. The available measurements are used to calculate health parameters, and they are low-pressure spool speed NL, high-pressure spool speed NH, compressor inlet temperature T22, compressor inlet pressure P22, compressor outlet pressure P3, compressor outlet temperature T3, LPT inlet temperature T43, LPT inlet pressure P43, LPT outlet pressure P5 and mixing chamber inlet temperature T6 [35]. The maximum power point on the ground is defined as corrected percentage of high-pressure spool speed NHcor = 100%, and corresponds to the corrected normalized values of engine control variables: fuel flow Wf = 100%, nozzle area A8 = 47%. The measurement noise follows time-uncorrelated zero-mean Gaussian noise, and the magnitude of these noises can be referred to in paper [36].
Both gradual and abrupt performance deterioration causes health parameter variations. The health parameters resulting from performance gradual degradation is long term, and all health parameters synchronously diverge from their nominal quantities with the cycle number increase. It starts from a healthy engine (all health parameters at their nominal values) at initial cycle number CN = 0, and with the linearly deviation at the end of cycle number CN = 6000. The first factory overhaul occurs at one quarter of the engine’s whole lifetime, and three cycle number points before this overhaul, including CN = 0, CN = 807 and CN = 1558, are addressed in this paper. Table 4 shows health parameter deviations under gradual degradation with regard to cycle numbers.
The health parameters move suddenly from their nominal values in gas-path fault scenarios, and the shift quantities of each fault case are given in Table 5. There are thirteen operating scenarios in total at one cycle number, including twelve fault cases and a no-derivation case. Sensor malfunction, such as bias or drift, is not considered in this study.
The historical measured data sample is used offline to build up gas-path fault HMM libraries of the engine by the proposed methodology in training stage. Every gas-path fault case relates to one HMM fault library, and they are independent each other. The IRKPCA-HMM libraries are ζ = { ζ 1 ,   ζ 2 , ,   ζ K } as the count of fault case equals to K. The available engine measurements in Table 3 are recorded online in sequence, and IRKPCA-HMM runs in the left-right type [37]. Figure 4 shows a gas-path fault diagnosis framework based on IRKPCA-HMM, and the processed data sequence is fed into HMM libraries and each LL of HMM will be calculated.
The optimal kernel samples are obtained from an online-sensed sequence by IRKPCA in the test stage. The probabilities related to gas-path fault libraries are calculated from reduced observation by the Baum-Welch algorithm. The index LL is used to recognize gas-path fault mode from the observation, and it belongs to the fault library that owns the largest LL [18]. The examined algorithms including HMM, KPCA-HMM and IRKPCA-HMM are performed on a Windows 10 PC with CPU i5-2450 M @2.50 GHZ (Intel, Santa Clara, CA, USA) and 8 GB RAM using MATLAB R2012b software (The MathWorks, Inc., Natick, MA, USA). The Monte-Carlo simulation is conducted, and the performance indices are from ten tries. The correct diagnostic ratio Acc and its standard deviation Std are separated, to assess gas-path fault diagnostic accuracy and stability:
A c c = 1 U i = 1 U N c i N t i S t d = 1 U 1 i = 1 U ( N c i N t i A c c )
where Nc is sample number of correct recognition, Nt is total sample number in one fault scenario, and U is the count of fault scenarios.

4.1. Fault Diagnosis in Steady Process

To evaluate fault diagnosis capability of the examined algorithms in steady process, tests are conducted in engine gas-path fault scenarios mixed with gradual degradation at full power on the ground. Gas-path faults are separately injected into the nominal performance deterioration at CN = 0, CN = 807, CN = 1558 in Table 4. The health parameters deviate with the cycle number accumulated over time, and they have an abrupt shift with the constant bias related to every gas-path fault scenario in the steady process. The available measurements are recorded as Table 3, and the hidden state number of HMMs are obtained by searching from 2 to 8 with unit interval. After several tries, the IRKPCA-HMM iteration stop conditions in training process are as follows: iterative step exceeds 100 or convergence error (LL difference between current step and last step) is below 0.01. The similarity degree is 0.9, and the training data for stochastic modeling is the observation sequence with the length of 100 sampling points. There are 1300 samples in total used as training data. Figure 5 gives the effect of gas-path fault feature extraction by IRKPCA in steady behavior.
Similar to that of benchmark dataset, the first three dimensions of aero engine dataset are presented in the form of scatter plots, where points with the same color belong to the same cluster from Figure 5. We have a more distinct version of thirteen engine fault patterns in steady process after feature extraction by IRKPCA.
The test data of ten gas-path fault scenarios are different from their training observation sequences. Table 6 shows maximum log-likelihood probability LL* and correct recognized number Nc by the HMMs at CN = 1558 in the steady process. The IRKPCA-HMM produces the least absolute LL* except in the case of XI and the largest Nc except in cases VIII and XI among the involved HMMs. It implies that IRKPCA-HMM is superior to HMM and KPCA-HMM with confidence and correct ratio regards at CN = 1558 in the steady process. The performance comparisons of HMMs are presented at various cycle numbers in the steady behavior in Table 6, and it shows average performance indices of HMMs in 13 scenarios.
The fault feature number and optimal hidden state number of basic HMM are the same at three cycle numbers, and they are larger than those of KPCA-HMM and IRKPCA-HMM. From Table 6, KPCA-HMM and IRKPCA-HMM have much simpler topological structure than the basic HMM due to less reduced feature number and hidden state number. The quantities of confusion matrix and transition matrix of IRKPCA-HMM in fault mode 1 are shown in Table 7.
The fault diagnostic accuracy indices of Acc, Std, and execution time ttest by three HMMs are discussed in Table 8. The Acc of KPCA-HMM and IRKPCA-HMM are clearly larger than that of HMM, while Std are smaller than that of HMM at three cycle numbers. The larger Acc represents better confidence of fault diagnosis result, and the less Std illustrates better stability. Hence, both of KPCA-HMM and IRKPCA-HMM have better fault diagnostic confidence and stability than basic HMM, and KPCA-HMM is a little worse than IRKPCA-HMM. When it comes to executing time ttest, KPCA-HMM and IRKPCA-HMM consume less time compared to basic HMM due to the former two having more simplified topology. It is also found that IRKPCA-HMM has almost half the executing time of KPCA-HMM.
The performance index Acc decreases a bit with cycle number accumulation over time, while computational time of three HMMs are hardly changed at three cycle numbers. The IRKPCA-HMM produces the least Std in all cases in Table 8. It implies that IRKPCA-HMM has more outstanding diagnostic accuracy, stability and computational efforts compared to the HMM and KPCA-HMM. Hence, it is a satisfactory method of gas-path fault diagnosis in the steady behavior of turbofan engine. In addition, the tests are simulated in the steady process of high-altitude operation (H = 5000, Ma = 1, Wf = 100%, A8 = 52%), and the results are as shown in Table 9. As seen from Table 9, the performance indices of HMM, KPCA-HMM and IRKPCA-HMM are similar to those in the steady process of ground operation.

4.2. Fault Diagnosis in Transient Process

The transient test is performed including acceleration and deceleration in the flight envelope to further reveal the performance of proposed methodology for gas-path fault diagnosis. The engine starts from the idle (Wf = 68%, A8 = 100%), and gradually increases to full power (Wf = 100%, A8 = 47%). After dwelling 0.5 s it moves sharply back to the idle, and the whole operation lasts 9.5 s on the ground. The variations of control variables are shown in the transient behavior in Figure 6. The deviation quantity of the combination of gradual degradation and abrupt degradation are added into health parameters, and the simulations run at three cycle numbers.
The sampling rate of 10-dimension measurements is 0.1 s, and the length of observation sequence for training is 95. The hidden state number and the iteration stop parameters in the dynamics are set as the same as those in Section 4.1. There are 1235 samples in total used as training data for 13 fault scenarios, and average training time of these fault scenarios by HMM, KPCA-HMM and IRKPCA-HMM are 23.27 s, 19.53 s and 16.89 s, respectively. We can find that the training computational efforts of IRKPCA-HMM are the least among the examined algorithms. The scale of test data for each gas-path fault scenario is 10 observation sequences, and the length of every sequence is the same as the training one. The indices of LL* and Nc by the examined HMMs at CN = 0 in the transient process of ground operation is given in Table 10.
From Table 10, the indices of LL* and Nc by IRKPCA-HMM are the largest ones in the most fault cases as engine experiences from idle to full power and then back to idle. The topological parameters of three HMMs and average performance indices of all fault scenarios at CN = 0, CN = 807, CN = 1558 are reported in Table 11. The fault feature number and optimal hidden state number of basic HMM are clearly larger than the rest HMMs, and it means that the topologies of the latter two HMMs are simplified. This is positive for reducing the computational time of gas-path fault diagnosis.
Both KPCA-HMM and IRKPCA-HMM produce the similar performance indices of Acc and Std, which outperforms basic HMM. The confidence and stability of fault diagnostics are improved by fault feature extraction of KPCAs. When the performance index ttest is concerned, KPCA-HMM is obviously different from IRKPCA-HMM and no longer better than basic HMM. The feature extraction dominates computational time of gas-path fault diagnosis in the transient process. The IRKPCA-HMM consumes less time for feature extraction due to the forward kernel inverse and sample simplification scheme, and it is the best way of weighting off the diagnostic accuracy and computational efforts.
Furthermore, transient performance tests of the proposed methodology are implemented in the flight envelope, and control variables fuel flow Wf and nozzle area A8 change along flight operation H and Ma. The engine starts from the ground point H = 0, Ma = 0, climbs to high altitude (H = 5000, Ma = 1), and the whole operation lasts 10 s. It runs at the full power operation using closed-loop control strategy of spool speed. Figure 7 shows the change route of four input variables during transient process in the flight envelope. The deviation quantities related to each fault scenario are initially added into health parameters as well as that of the ground operation.
The observation sequence length of training data is 100, and there are 1300 samples in total used for 13 fault cases. The average training time of fault cases by HMM, KPCA-HMM and IRKPCA-HMM are 26.75 s, 22.43 s and 17.75 s, respectively. The training time of the IRKPCA-HMM is the least among the examined algorithms. The indices of LL* and Nc by the examined HMMs at CN = 0 in the transient process is given in Table 12.
From Table 12, the indices of LL* and Nc by IRKPCA-HMM at CN = 0 are the largest ones in the most fault cases in the flight envelope. The topological parameters of three HMMs and average performance indices in all fault scenarios at CN = 0, CN = 807, CN = 1558 are reported in Table 12. The fault feature number and optimal hidden state number of basic HMM are clearly larger than the others. It indicates that the topological structure of the HMMs is clearly simplified after feature extraction by the KPCA and IRKPCA, and it is positive for reducing computational time of gas-path fault diagnosis.

5. Conclusions

This paper develops a systematic approach to fault feature extraction and pattern recognition, which leads to an improved data-driven fault diagnosis method. The novelty of this methodology lies in the development of IRKPCA and HMM in combination to facilitate gas-path fault diagnosis for turbofan engines. The reduced samples from IRKPCA in feature space decrease the measurement dimension while the principal information of fault feature is retained. The IRKPCA is evaluated using general benchmark datasets, and the results reveal that IRKPCA is superior to plain KPCA regarding discriminative power, sparsity and reduced dimension time. The simplified observation sequence by IRKPCA is utilized by HMM to develop an IRKPCA-HMM algorithm. The goal of this methodology is to increase gas-path fault diagnostic accuracy and relieve computational effort both in steady and transient behaviors. The proposed methodology is evaluated in the scenarios of gas-path abrupt fault mixed with gradual degradation in the flight envelope, and test data are generated from a dual-spool turbofan engine model. The stochastic diagnostic modeling framework is presented and numerically assessed by several performance indices. The advantage of the proposed methodology is that it does not only produce more reliable results but also consumes less computational efforts of fault diagnosis.
This research establishes a new direction in data-driven fault diagnosis by proposing IRKPCA-HMM technique that is specifically beneficial to gas-path stochastic fault diagnosis for turbofan engine applications. The methodology developed in this study is not only limited to turbofan engine, but also extended to other types of gas turbine engine. There are some important topics for further study related to this work. First, further studies can be carried out to investigate various kernels used to map the measurement space to feature space. Second, extensions of the cases that have more than one gas-path abrupt fault, added to gradual degradation and the tests of semi-physical hardware in the loop, are worthy of future exploration.

Author Contributions

F.L. and J.H. contributed in developing the ideas of this research, J.J. and X.Q. performed this research. All of the authors were involved in preparing this manuscript.

Funding

We are grateful for the financial support of the Fundamental Research Funds for the Central Universities (No. NS2015027).

Acknowledgments

Gratitude is extended to Viliam Makis for useful advices and to China Scholarship Council for supporting the first author to carry out collaborative research in the Department of Mechanical and Industrial Engineering at the University of Toronto.

Conflicts of Interest

The authors declare no conflict of interest. The founding sponsors had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, and in the decision to publish the results.

Nomenclature

PCAPrincipal component analysis
HMMHidden Markov model
HPCHigh-pressure compressor
LPTLow-pressure turbine
HPTHigh-pressure turbine
L1Norm-1
L2Norm-2
LLLog-likelihood
HHeight
MaMach number
SWFlow capacity
SEEfficiency
NLLow-pressure spool speed
NHHigh-pressure spool speed
T22Compressor inlet temperature
P22Compressor inlet Pressure
P3Compressor outlet pressure
T3Compressor outlet temperature
T43low pressure turbine inlet temperature
P43low pressure turbine inlet pressure
P5low pressure turbine outlet pressure
T6mixing chamber inlet temperature
WfFuel flow
A8Nozzle area

References

  1. Zaidan, M.A.; Mills, A.R.; Harrison, R.F.; Fleming, P.J. Gas turbine engine prognostics using Bayesian hierarchical models: A variational approach. Mech. Syst. Signal Process. 2016, 70–71, 120–140. [Google Scholar] [CrossRef]
  2. Li, J.; Fan, D.; Sreeram, V. SFC optimization for aero engine based on hybrid GA-SQP method. Int. J. Turbo Jet-Engines 2013, 30, 383–391. [Google Scholar] [CrossRef]
  3. Kraft, J.; Sethi, V.; Singh, R. Optimization of aero gas turbine maintenance using advanced simulation and diagnostic methods. J. Eng. Gas Turbines Power 2014, 136, 111601. [Google Scholar] [CrossRef]
  4. Salahshoor, K.; Kordestani, M.; Khoshro, M.S. Fault detection and diagnosis of an industrial steam turbine using fusion of SVM (support vector machine) and ANFIS (adaptive neuro-fuzzy inference system) classifiers. Energy 2010, 35, 5472–5482. [Google Scholar] [CrossRef]
  5. Mohammadi, E.; Montazeri-Gh, M. Performance enhancement of global optimization-based gas turbine fault diagnosis systems. J. Propuls. Power 2016, 32, 214–224. [Google Scholar] [CrossRef]
  6. Lu, F.; Ju, H.; Huang, J. An improved extended Kalman filter with inequality constraints for gas turbine engine health monitoring. Aerosp. Sci. Technol. 2016, 58, 36–47. [Google Scholar] [CrossRef]
  7. Wen, H.; Zhu, Z.H.; Jin, D.; Hu, H. Model predictive control with output feedback for a deorbiting electrodynamic tether system. J. Guid. Control Dyn. 2016, 39, 2455–2460. [Google Scholar] [CrossRef]
  8. Tahan, M.; Tsoutsanis, E.; Muhammad, M.; Karim, Z.A.A. Performance-based health monitoring, diagnostics and prognostics for condition-based maintenance of gas turbines: A review. Appl. Energy 2017, 198, 122–144. [Google Scholar] [CrossRef] [Green Version]
  9. Zhang, S.G.; Guo, Y.Q.; Chen, X.L. Performance evaluation and simulation validation of fault diagnosis system for aircraft engine. J. Propuls. Technol. 2013, 34, 1121–1127. [Google Scholar]
  10. Yang, C.; Kong, X.X.; Wang, X. Model-based fault diagnosis for performance degradation of turbofan gas path via optimal robust residuals. In Proceedings of the ASME Turbo Expo: Turbine Technical Conference and Exposition, Seoul, South Korea, 13–17 June 2016; Volume 6. [Google Scholar]
  11. Zhou, D.J.; Yu, Z.Q.; Zhang, H.S.; Weng, S.L. A novel grey prognostic model based on Markov process and grey incidence analysis for energy conversion equipment degradation. Energy 2016, 109, 420–429. [Google Scholar] [CrossRef]
  12. Zhao, X.F.; Liu, Y.B.; He, X. Diagnosis of gas turbine based on fuzzy matrix and the principle of maximum membership degree. Energy Procedia 2012, 16, 1448–1454. [Google Scholar] [CrossRef]
  13. Zhong, S.S.; Luo, H.; Lin, L. An improved correlation-based anomaly detection approach for condition monitoring data of industrial equipment. In Proceedings of the IEEE International Conference on Prognostics and Health Management (ICPHM), Ottawa, ON, Canada, 20–22 June 2016. [Google Scholar]
  14. Amozegar, M.; Khorasani, K. An ensemble of dynamic neural network identifiers for fault detection and isolation of gas turbine engines. Neural Netw. 2016, 76, 106–121. [Google Scholar] [CrossRef] [PubMed]
  15. Bartolini, C.M.; Caresana, F.; Comodi, G.; Pelagalli, L.; Renzi, M.; Vagni, S. Application of artificial neural networks to micro gas turbines. Energy Convers. Manag. 2011, 52, 781–788. [Google Scholar] [CrossRef]
  16. Owsley, L.M.D.; Atlas, L.E.; Bernard, G.D. Self-organizing feature maps and hidden Markov models for machine-tool monitoring. IEEE Trans. Signal Process. 1997, 45, 2787–2798. [Google Scholar] [CrossRef]
  17. Boutros, T.; Liang, M. Detection and diagnosis of bearing and cutting tool faults using hidden Markov models. Mech. Syst. Signal Process. 2011, 25, 2102–2124. [Google Scholar] [CrossRef]
  18. Soualhi, A.; Clerc, G. Hidden Markov models for the prediction of impending faults. IEEE Trans. Ind. Electron. 2016, 63, 3271–3281. [Google Scholar] [CrossRef]
  19. Shi, S.B.; Li, G.N.; Chen, H.X.; Hu, Y.P.; Wang, X.Y.; Guo, Y.B.; Sun, S.B. An efficient VRF system fault diagnosis strategy for refrigerant charge amount based on PCA and dual neural network model. Appl. Therm. Eng. 2018, 129, 1252–1262. [Google Scholar] [CrossRef]
  20. Ni, J.J.; Zhang, C.B.; Yang, S.X. An adaptive approach based on KPCA and SVM for real-time fault diagnosis of HVCBs. IEEE Trans. Power Deliv. 2011, 26, 1960–1971. [Google Scholar] [CrossRef]
  21. Taouali, O.; Jaffel, I.; Lahdhiri, H.; Harkat, M.F.; Messaoud, H. New fault detection method based on reduced kernel principal component analysis (RKPCA). Int. J. Adv. Manuf. Technol. 2015, 85, 1547–1562. [Google Scholar] [CrossRef]
  22. Lu, F.; Wang, Y.F.; Huang, J.Q.; Huang, Y.H. Gas turbine transient performance tracking using data fusion based on an adaptive particle filter. Energies 2015, 8, 13911–13927. [Google Scholar] [CrossRef]
  23. You, C.X.; Huang, J.Q.; Lu, F. Recursive reduced kernel based extreme learning machine for aero-engine fault pattern recognition. Neurocomputing 2016, 214, 1038–1045. [Google Scholar] [CrossRef]
  24. Kasun, L.L.C.; Yang, Y.; Huang, G.B.; Zhang, Z.Y. Dimension reduction with extreme learning machine. IEEE Trans. Image Process. 2016, 25, 3906–3918. [Google Scholar] [CrossRef] [PubMed]
  25. Demiriz, A.; Bennett, K.P.; Shawe-Taylor, J. Linear Programming Boosting via Column Generation. Mach. Learn. 2002, 46, 225–254. [Google Scholar] [CrossRef] [Green Version]
  26. Khaleghei, A.; Makis, V. Model parameter estimation and residual life prediction for a partially observable failing system. Nav. Res. Logist. 2015, 62, 190–205. [Google Scholar] [CrossRef]
  27. Yan, X.; Jia, M.P. Compound fault diagnosis of rotating machinery based on OVMD and a 1.5-dimension envelope spectrum. Meas. Sci. Technol. 2016, 27, 075002. [Google Scholar] [CrossRef]
  28. Windmann, S.; Jungbluth, F.; Niggemann, O. A HMM-based fault detection method for piecewise stationary industrial processes. In Proceedings of the IEEE 20th Conference on Emerging Technologies & Factory Automation, Luxembourg, 8–11 September 2015. [Google Scholar]
  29. Veprek, P.; Bardley, A.B. An improved algorithm for vector quantizer design. IEEE Signal Process. Lett. 2000, 7, 250–252. [Google Scholar] [CrossRef]
  30. Lu, F.; Huang, J.Q.; Lv, Y.Q. Gas path health monitoring for a turbofan engine based on a nonlinear filtering approach. Energies 2013, 6, 492–523. [Google Scholar] [CrossRef]
  31. Chaibakhsh, A.; Amirkhani, S. A simulation model for transient behaviour of heavy-duty gas turbines. Appl. Therm. Eng. 2018, 132, 115–127. [Google Scholar] [CrossRef]
  32. Qi, Y.W.; Bao, W.; Chang, J.T. State-based switching control strategy with application to aeroengine safety protection. J. Aerosp. Eng. 2015, 28, 04014076. [Google Scholar] [CrossRef]
  33. Lu, F.; Jiang, J.P.; Huang, J.Q.; Qiu, X.J. Dual reduced kernel extreme learning machine for aero-engine fault diagnosis. Aerosp. Sci. Technol. 2017, 71, 742–750. [Google Scholar] [CrossRef]
  34. Wang, C.; Li, Y.G.; Yang, B.Y. Transient performance simulation of aircraft engine integrated with fuel and control systems. Appl. Therm. Eng. 2017, 114, 1029–1037. [Google Scholar] [CrossRef]
  35. Aretakis, N.; Roumeliotis, I.; Alexiou, A.; Romesis, C.; Mathioudakis, K. Turbofan engine health assessment from flight data. J. Eng. Gas Turbines Power 2015, 137, 041203. [Google Scholar] [CrossRef]
  36. Borguet, S.; Leonerd, O. Comparison of adaptive filters for gas turbine performance monitoring. J. Comput. Appl. Math. 2010, 234, 2201–2212. [Google Scholar] [CrossRef]
  37. Camci, F.; Chinnam, R.B. Health-state estimation and prognostics in machining processes. IEEE Trans. Autom. Sci. Eng. 2010, 7, 581–596. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Extraction effect on Iris dataset and wine dataset by the IRKPCA.
Figure 1. Extraction effect on Iris dataset and wine dataset by the IRKPCA.
Energies 11 01807 g001
Figure 2. The structure of improved IRKPCA-HMM algorithm.
Figure 2. The structure of improved IRKPCA-HMM algorithm.
Energies 11 01807 g002
Figure 3. A dual-spool turbofan engine gas-path component cross-section diagram.
Figure 3. A dual-spool turbofan engine gas-path component cross-section diagram.
Energies 11 01807 g003
Figure 4. Gas-path fault diagnosis framework for turbofan engine based on IRKPCA-HMM.
Figure 4. Gas-path fault diagnosis framework for turbofan engine based on IRKPCA-HMM.
Energies 11 01807 g004
Figure 5. The effect comparison of gas-path fault feature extraction using IRKPCA.
Figure 5. The effect comparison of gas-path fault feature extraction using IRKPCA.
Energies 11 01807 g005
Figure 6. Variations of fuel flow and nozzle area of the engine in the transient process.
Figure 6. Variations of fuel flow and nozzle area of the engine in the transient process.
Energies 11 01807 g006
Figure 7. The change route of engine input variables in the transient process in flight envelope.
Figure 7. The change route of engine input variables in the transient process in flight envelope.
Energies 11 01807 g007aEnergies 11 01807 g007b
Table 1. Specification of these benchmark datasets.
Table 1. Specification of these benchmark datasets.
DatasetsFeatureClassSN
Iris43150
Wine133178
Glass106214
E.coli75336
Balance43625
Vehicle184846
Table 2. Performance comparisons of basic KPCA and IRKPCA on the benchmark datasets.
Table 2. Performance comparisons of basic KPCA and IRKPCA on the benchmark datasets.
DatasetAlgorithmsModeling ParametersReduced FeaturesDiscriminative PowerL2/L1Reduced Time (s)
IrisKPCAσ = 21020.9564 ± 0.01550.09280.0372
IRKPCAσ = 29 ρ = 2−120.9600 ± 0.01580.09450.0080
WineKPCAσ = 210100.8609 ± 0.03350.08490.0517
IRKPCAσ = 210 ρ = 2180.8621 ± 0.04350.08510.0089
GlassKPCAσ = 2970.6453 ± 0.02900.09670.0726
IRKPCAσ = 210 ρ = 160.6438 ± 0.02840.09740.0115
E.coliKPCAσ = 2760.8847 ± 0.01350.06270.1069
IRKPCAσ = 29 ρ = 2−150.9117 ± 0.01120.07170.0149
BalanceKPCAσ = 2840.8576 ± 0.1360.05120.5873
IRKPCAσ = 28 ρ = 130.8897 ± 0.1300.07910.0288
VehicleKPCAσ = 21080.7198 ± 0.01280.04021.2387
IRKPCAσ = 29 ρ = 2−170.7201 ± 0.01140.04040.0645
Table 3. Turbofan engine control variables, health parameters, and measured variables.
Table 3. Turbofan engine control variables, health parameters, and measured variables.
Control VariablesHealth ParametersMeasured Outputs
WfFuel flowSE1Fan efficiency indicatorNLLPC speed
A8Nozzle areaSW1Fan flow indicatorNHHPC speed
SE2Compressor efficiency indicatorT22Compressor inlet temperature
SW2Compressor flow indicatorP22Compressor inlet pressure
SE3HPT efficiency indicatorP3Compressor outlet pressure
SW3HPT flow indicatorT3Compressor outlet temperature
SE4LPT efficiency indicatorT43Low pressure turbine inlet temperature
SW4LPT flow indicatorP43Low pressure turbine inlet pressure
P5Low pressure turbine outlet pressure
T6Mixer inlet temperature
Table 4. Gradual degradation of turbofan engine performance over time.
Table 4. Gradual degradation of turbofan engine performance over time.
CNSW1 (%)SE1 (%)SW2 (%)SE2 (%)SW3 (%)SE3 (%)SW4 (%)SE4 (%)
000000000
807−0.4125−0.2704−0.5190−0.45930.5212−0.81420.0632−0.0896
1558−0.7886−0.5316−1.1219−0.96890.9032−1.39190.1151−0.1796
Table 5. Turbofan engine gas-path performance fault cases.
Table 5. Turbofan engine gas-path performance fault cases.
ScenarioVariations of Health ParameterFaulty Component
ISE1 −1%Fan
IISE1 −0.5% and SW1 −1%
IIISE2 −1%Compressor
IVSW2 −1%
VSE2 −0.7% and SW2 −1%
VISE3 −1%HPT
VIISW3 +1%
VIIISE3 −1% and SW3 −1%
IXSE4 −1%LPT
XSW4 −1%
XISE4 −0.4% and SW4 −1%
XIISE4 −0.6% and SW4 +1%
Table 6. LL* and Nc by three HMMs in the steady process at CN = 1558.
Table 6. LL* and Nc by three HMMs in the steady process at CN = 1558.
Fault ScenariosHMMKPCA-HMMIRKPCA-HMM
LL*NcLL*NcLL*Nc
I−1195.139−398.6510−302.4710
II−1433.338−453.518−362.978
III−1154.3710−446.148−429.369
IV−1371.258−503.759−399.329
V−1226.5210−415.8910−415.8810
VI−1360.839−404.679−258.159
VII−1203.938−512.188−420.539
VIII−1358.677−530.849−312.828
IX−1217.6110−454.5110−299.7910
X−1367.249−369.519−362.2510
XI−1274.029−419.7810−439.948
XII−1150.728−386.989−250.369
XIII−1269.479−420.058−320.038
Table 7. Parameters of IRKPCA-HMM at CN = 0 in fault mode 1.
Table 7. Parameters of IRKPCA-HMM at CN = 0 in fault mode 1.
Transition MatrixConfusion Matrix
0.750.2500000.0440.0420.0370.0390.0470.2310.2420.2520.0400.020
00.690.310000.0600.0770.0640.0640.057000.0030.3660.304
000.740.26000.0940.0850.0800.3070.3730.0580000
0000.840.1600.4460.4000.1540000000
00000.460.540010000000
000001000.3790.621000000
Table 8. Fault diagnosis comparisons of the HMMs in the ground steady process.
Table 8. Fault diagnosis comparisons of the HMMs in the ground steady process.
CNAlgorithmsReduced FeaturesHidden StatesAcc ± Stdttest
0HMM1050.9108 ± 0.01240.2958
KPCA-HMM640.9305 ± 0.00630.4237
IRKPCA-HMM640.9226 ± 0.00440.2654
807HMM1050.9054 ± 0.01030.3096
KPCA-HMM630.9208 ± 0.00720.4045
IRKPCA-HMM640.9174 ± 0.00530.2596
1558HMM1050.8785 ± 0.01250.3013
KPCA-HMM640.8953 ± 0.00820.4109
IRKPCA-HMM630.8907 ± 0.00680.2432
Table 9. Comparisons of fault diagnosis methods in high altitude steady operation.
Table 9. Comparisons of fault diagnosis methods in high altitude steady operation.
CNAlgorithmsReduced FeaturesHidden StatesAcc ± Stdttest
0HMM1050.8916 ± 0.01030.2809
KPCA-HMM630.9012 ± 0.00700.4254
IRKPCA-HMM630.8983 ± 0.00530.2543
807HMM1050.8795 ± 0.01140.3025
KPCA-HMM640.8902 ± 0.00650.4181
IRKPCA-HMM630.8883 ± 0.00580.2496
1558HMM1050.8501 ± 0.01300.2957
KPCA-HMM640.8676 ± 0.00680.4316
IRKPCA-HMM630.8602 ± 0.00490.2601
Table 10. LL* and Nc by three HMMs at CN = 0 in the transient process of ground operation.
Table 10. LL* and Nc by three HMMs at CN = 0 in the transient process of ground operation.
Fault ScenariosHMMKPCA-HMMIRKPCA-HMM
LL*NcLL*NcLL*Nc
I−1809.8810−926.1610−725.0310
II−1654.7910−759.5510−566.629
III−1804.088−760.0910−658.059
IV−1636.126−893.398−865.8810
V−1804.7210−748.0110−639.4610
VI−1707.477−824.518−723.788
VII−1821.227−780.729−641.119
VIII−1676.868−710.9910−711.8610
IX−1831.409−704.3010−600.9410
X−1685.299−713.1210−766.8310
XI−1696.526−840.639−686.899
XII−1861.6710−795.2710−709.4810
XIII−1685.334−596.599−470.519
Table 11. Fault diagnosis result comparisons in the transient process of ground operation over time.
Table 11. Fault diagnosis result comparisons in the transient process of ground operation over time.
CNAlgorithmsReduced FeaturesHidden StatesAcc ± Stdttest
0HMM1060.7931 ± 0.00530.2938
KPCA-HMM440.9474 ± 0.00360.3992
IRKPCA-HMM430.9446 ± 0.00330.1948
807HMM1050.8157 ± 0.00470.2824
KPCA-HMM430.9402 ± 0.00350.3319
IRKPCA-HMM440.9362 ± 0.00280.1891
1558HMM1050.8023 ± 0.00570.2602
KPCA-HMM440.9386 ± 0.00410.3707
IRKPCA-HMM440.9306 ± 0.00370.2030
Table 12. LL* and Nc by three HMMs during dynamic process in the flight envelope at CN = 0.
Table 12. LL* and Nc by three HMMs during dynamic process in the flight envelope at CN = 0.
Fault ScenariosHMMKPCA-HMMIRKPCA-HMM
LL*NcLL*NcLL*Nc
I−2013.028−1021.6210−925.0310
II−1895.177−872.459−716.439
III−1974.286−853.129−658.529
IV−1862.297−983.437−865.168
V−2084.429−855.099−709.759
VI−1906.546−934.168−815.878
VII−2021.336−870.399−734.278
VIII−1892.187−819.099−756.1310
IX−2011.045−844.6010−694.269
X−1889.256−793.257−706.836
XI−1906.277−841.429−741.149
XII−2056.718−896.718−811.738
XIII−1892.354−696.876−624.296

Share and Cite

MDPI and ACS Style

Lu, F.; Jiang, J.; Huang, J.; Qiu, X. An Iterative Reduced KPCA Hidden Markov Model for Gas Turbine Performance Fault Diagnosis. Energies 2018, 11, 1807. https://doi.org/10.3390/en11071807

AMA Style

Lu F, Jiang J, Huang J, Qiu X. An Iterative Reduced KPCA Hidden Markov Model for Gas Turbine Performance Fault Diagnosis. Energies. 2018; 11(7):1807. https://doi.org/10.3390/en11071807

Chicago/Turabian Style

Lu, Feng, Jipeng Jiang, Jinquan Huang, and Xiaojie Qiu. 2018. "An Iterative Reduced KPCA Hidden Markov Model for Gas Turbine Performance Fault Diagnosis" Energies 11, no. 7: 1807. https://doi.org/10.3390/en11071807

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop