A State Space Modeling Method for Aero-Engine Based on AFOS-ELM

Chen, Hongyi; Li, Qiuhong; Pang, Shuwei; Zhou, Wenxiang

doi:10.3390/en15113903

Open AccessArticle

A State Space Modeling Method for Aero-Engine Based on AFOS-ELM

Jiangsu Province Key Laboratory of Aerospace Power System, College of Energy and Power Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China

^*

Author to whom correspondence should be addressed.

Energies 2022, 15(11), 3903; https://doi.org/10.3390/en15113903

Submission received: 9 May 2022 / Revised: 24 May 2022 / Accepted: 24 May 2022 / Published: 25 May 2022

Download

Browse Figures

Versions Notes

Abstract

:

State space models (SSMs) are important for multi-variable performance analysis and controller design of aero-engines. In order to solve the problems of the traditional state space modeling methods that rely on component-level models (CLMs) and cannot be carried out in real time, an aero-engine state space modeling method based on adaptive forgetting factor online sequential extreme learning machine (AFOS-ELM) is proposed in this paper. The structure of the extreme learning machine (ELM) is determined according to the form of the state space model, and the inverse-free ELM algorithm is used to automatically select the appropriate number of hidden nodes to improve the efficiency of offline initialization. The focus of the ELM on current operation performance is enhanced by the adaptive renewed forgetting factor, which reduces the impact of aero-engine history and deviated data on the current output and improves the accuracy of the model. Then, according to the analytical equation of the ELM model, the state space model of an aero-engine at each sampling time is obtained by using the partial derivative method. The simulation results based on engine test data show that the real-time performance and accuracy of the state space model established online in this paper can meet the needs of aero-engine control system requirement.

Keywords:

aero-engine; state space model; forgetting factor; online sequential extreme learning machine (OS-ELM)

1. Introduction

As a kind of linear mathematical model, the state space model (SSM) is widely used in multi-variable system analysis and controller design processes, especially when the knowledge of modern control theory is applied [1]. Furthermore, in the area of model-predictive control, performance-seeking control, and health evaluation, using SSM can greatly improve the real-time performance of the control system and simplify the process of the optimization and the estimation [2,3,4].

SSM also plays an important role in the design of aero-engine control systems. Therefore, it is of great significance to research its modeling method, especially the method that can accurately capture the individual aero-engine dynamics in real time.

In view of the strong nonlinearity of an aero-engine, its mechanism model (component-level model, CLM) is very complex [5]; the analytical SSM cannot be derived from it. At present, there are two main modeling methods for aero-engine SSM, namely, fitting method and partial derivative method [6,7,8]. Both methods belong to the small disturbance method, and the parameters of the SSM are obtained based on the responses of the CLM to the separate small disturbance of the SSM inputs (and states for partial derivative method). The fitting method is carried out with the principle that the linear system output should be consistent with that of the nonlinear system under the same separate small disturbance inputs, and its model parameters are determined by minimizing the errors between the SSM model outputs and the CLM outputs. Therefore, optimization methods are widely used in the parameters identification process of the fitting method. For example, the SSM of certain turbofan engines is established by the least square optimization in the literature [9], and the model parameters are optimized by genetic algorithm in the literature [10]. The partial derivative method applies disturbances to the control variables and state variables, respectively, on the CLMs, and the partial derivative calculation in the SSM is approximated by the output difference [11,12].

However, both methods have some defects. The fitting method needs to collect the engine dynamic response data offline, and the partial derivative method needs to call the engine CLMs repeatedly for differential calculation. They both cannot be realized in real time and both depend on the CLM. Aero-engine CLM is a complex model, which is composed of multiple component models, co-working equations, and equation solving process [13,14,15]. Due to the assumptions in modeling, the accuracy of CLM is difficult to ensure, and the CLM is mostly established based on the component characteristics at rated condition, which makes it hard to reflect the individual differences and performance degradation of aero-engine. The fitting method and partial derivative method based on the rated CLMs cannot characterize the accurate characteristics of a specific engine. Although the data used in fitting method can be obtained from the engine test, for multivariable control systems, the test data are a set of responses to the simultaneously varying input variables, while the data obtained from the component-level model are several sets of responses to separately varying input variables, which means the test data cannot clearly reflect the characteristics between the inputs and the outputs, and the model built based on the test data lacks reliability even if it can fit the output curve of the selected test point. In addition, the aero-engine works in a wide flight envelope, and the working state spans from start to maximum reheat rating. The traditional method can only be applied to limited steady-state points. For other working points, the SSMs are more dependent on interpolation or parameter scheduling methods, such as TS fuzzy model [16,17] and equilibrium manifold expansion (EME) model [18,19]. Compared with the SSMs directly established at the corresponding points, the accuracy of the model obtained by interpolation or other methods is relatively lower.

Pang et al. present an online exact partial derivation calculation method to solve this problem, which provides a component-level derivative model of engine, and the partial derivations are calculated by digital solution of the component-level derivative model [20,21]. This method can obtain more accurate SSMs in the whole envelope at any working state of the engine, and the SSM can adapt to the engine degradation with the adaptive CLM used. However, this modeling method is also based on the CLM, the calculation process is complex, and the model accuracy will also be affected by the accuracy of CLM.

With the development of artificial intelligence technology, data-driven methods have developed rapidly in recent years. In the field of aero-engine, data-driven methods have been used in degradation prediction, mathematical model establishment, and so on [22,23]. Compared with the traditional modeling methods that rely on CLMs, data-driven modeling method avoids the complex component-level modeling process and eliminates the influence of CLMs modeling accuracy on SSM. Based on the idea of data-driven, some scholars use an MGD neural network modeling method to establish the adaptive dynamic model of turbofan engine [24,25]. The improved MGD neural network based on batch training is used to establish the onboard model of turbofan engine, but the training process is time-consuming and must be carried out offline, which means the model cannot adapt to the engine degradation. The data-driven modeling method combined with intelligent algorithm is more efficient, but there is still little research on aero-engine state space modeling. Only in reference [23] is a linear parameter-varying (LPV) modeling method of turboshaft engine proposed by using a special construction neural network.

This paper proposes an aero-engine modeling method based on adaptive forgetting factor online sequential extreme learning machine (AFOS-ELM), which aims to model the state space equation online based on data-driven method. The main contributions of this paper are as follows: (1) An online state space modeling method based on AFOS-ELM is proposed, which can be carried out in real time with the data gathered online. According to the form of SSM, the inputs and outputs of neural network are determined. By using the online training ability of online sequential extreme learning (OS-ELM), the parameters of the network are updated in real time, and the analytical model of the AFOS-ELM is obtained. Then, the SSM is derived by applying the partial derivative method to the analytical model of the AFOS-ELM. (2) The inverse-free extreme learning machine (ELM) is used to initialize the neural network model offline. The number of hidden nodes and the initial weights of the ELM are determined by the inverse-free method to improve the efficiency and accuracy of the offline training. (3) The neural network is trained by the AFOS-ELM method which can extract the current working state characteristics of the engine and improve the accuracy of the SSM model. (4) Compared with the traditional linearization modeling method, the method proposed in this paper can obtain the SSM based on the aero-engine test data at each sampling time, which can better reflect the individual characteristics of the engine.

The paper is organized as follows. Section 2 introduces the inverse-free ELM used in this paper. Section 3 introduces the AFOS-ELM method used in the modeling process. Section 4 introduces the neural network structure for state space modeling and the deduction of SSM. Section 5 gives the simulation results of the modeling. Section 6 concludes this paper.

2. Inverse-Free ELM

Extreme learning machines are feedforward neural networks which were first proposed by Professor Huang of Nanyang University of Technology in Singapore in 2006 [26]. Compared with traditional machine learning algorithms such as backpropagation (BP) neural network, ELM has faster learning speed and similar generalization performance [27]. The traditional neural network has the problem that the number of hidden nodes needs to be determined by trial. The inverse-free ELM can solve this problem, and it has aroused many concerns [28,29]. It adds hidden nodes through iterative method, which can save the trail time and improve the calculation efficiency.

In the ELM algorithm, given the training set

χ = {x_{ELM, i}, T_{i}}_{i = 1}^{q}

, where input

x_{ELM,}_{i} = {[x_{ELM,}_{i 1}, \dots, x_{ELM,}_{i N}]}^{T} \in R^{N}

, target output

T_{i} = {[T_{i 1}, \dots, T_{i M}]}^{T} \in R^{M}

.

In ELM with l hidden nodes, the output corresponding to the i^th input can be described as

y_{E L M,}_{i} = \sum_{j = 1}^{l} β_{j} f (W_{j} x_{E L M,}_{i} + b_{j})

(1)

where

W_{j} = [W_{j 1}, W_{j 2}, \dots, W_{j N}]

are the weights connecting the hidden layer node j and the input layer nodes. b_j is the bias of the hidden layer node j.

β_{j} = [β_{j 1}, \dots, β_{j M}]

are the weights connecting the hidden layer node j and the output layer nodes. f(x) is the hidden layer activation function.

The learning target of neural network is to minimize the output error, so the cost function of ELM can be described as

\min_{β} J = ‖ T - H β ‖

(2)

where

{\begin{cases} H = {[h_{1}, \dots, h_{l}]}_{q \times l}^{T} \\ β = {[β_{1}, \dots, β_{l}]}_{q \times M}^{T} \end{cases}

(3)

H is the hidden layer output matrix,

h_{i} = f (W x_{E L M, i} + b), i = 1, 2, \dots, l

.

Output weight β can be obtained from the following formula

β = T H^{+} = T H^{T} {(H H^{T})}^{- 1}

(4)

where

H^{+}

is the generalized inverse of H.

In order to avoid singular matrix or overfitting, Tikhonov regularization method was used, so Equation (4) becomes

β = T H^{T} {(H H^{T} + δ^{2} I)}^{- 1}

(5)

where

δ^{2} > 0

is the regularization factor. Equation (4) can be regarded as a special form of Equation (5) where

δ^{2} = 0

.

If an additional hidden node is added to the ELM with l hidden nodes, the hidden layer output matrix of all l+1 nodes becomes

H_{l + 1} = [\begin{matrix} H_{l} \\ h_{l + 1} \end{matrix}]

(6)

where

h_{l + 1} = f (W_{l + 1} x_{E L M} + b_{l + 1})

is the output of the (l+1)^th hidden node. The input weight vector

W_{l + 1}

and bias

b_{l + 1}

corresponding to the newly added node are randomly generated.

With l+1 hidden layer nodes, the weights of the ELM output layer can be calculated as

β_{l + 1} = T H_{l + 1}^{T} {(H_{l + 1} H_{l + 1}^{T} + δ^{2} I)}^{- 1} = T O_{l + 1}

(7)

where

O_{l + 1} = H_{l + 1}^{T} {(H_{l + 1} H_{l + 1}^{T} + δ^{2} I)}^{- 1}

.

We rewrite O_l₊₁ as [30].

O_{l + 1} = [\begin{matrix} {\tilde{O}}_{l} & o_{l + 1} \end{matrix}]

(8)

where

o_{l + 1} = \frac{h_{l + 1} - O_{l} H_{l} h_{l + 1}}{h_{l + 1}^{T} h_{l + 1} + δ^{2} - h_{l + 1}^{T} O_{l} H_{l} h_{l + 1}}

(9)

{\tilde{O}}_{l} = O_{l} - o_{l + 1} h_{l + 1}^{T} O_{l}

(10)

Then

O_{l + 1} = [\begin{matrix} {\tilde{O}}_{l} & o_{l + 1} \end{matrix}] = [\begin{matrix} O_{l} - o_{l + 1} h_{l + 1}^{T} O_{l} & o_{l + 1} \end{matrix}]

(11)

According to Equation (7),

β_{l + 1} = T O_{l + 1} = [\begin{matrix} T {\tilde{O}}_{l} & T o_{l + 1} \end{matrix}] = [\begin{matrix} {\tilde{β}}_{l} & {\overset{⌢}{β}}_{l + 1} \end{matrix}]

(12)

where

{\tilde{β}}_{l} = β_{l} - {\overset{⌢}{β}}_{l + 1} h_{l + 1}^{T} O_{l}

(13)

{\overset{⌢}{β}}_{l + 1} = T o_{l + 1}

(14)

Then, the process of updating the output weights can be described as

β_{l + 1} = [\begin{matrix} {\tilde{β}}_{l} & {\overset{⌢}{β}}_{l + 1} \end{matrix}] = [\begin{matrix} β_{l} - {\overset{⌢}{β}}_{l + 1} h_{l + 1}^{T} O_{l} & T o_{l + 1} \end{matrix}]

(15)

The above calculation process is repeated, and gradually increases the hidden layer nodes until the network training error satisfies the requirement or the number of hidden nodes reaches the maximum value. This method greatly saves the trail time and workload of repeated ELM network training used in adjusting the hidden layer nodes.

3. Adaptive Forgetting Factor OS-ELM (AFOS-ELM)

3.1. OS-ELM Neural Network

He ELM neural network belongs to offline batch processing algorithm, and the training data are obtained offline. When new data arrive, the model will discard the results of previous training and be completely rebuilt, which arouses the increase of the calculation burden. In order to meet the needs of real-time dynamic calculation in various fields, OS-ELM came into being, which has a fast learning speed and can recursively renew the output weight with the highest calculation efficiency, which makes it suitable for online learning [31].

The recursive learning output weight of OS-ELM at time k + 1 can be expressed as

β (k + 1) = β (k) + [T (k + 1) - β (k) H (k + 1)] H^{T} (k + 1) P (k + 1)

(16)

where the initial matrixes are obtained by the inverse-free OS-ELM method.

P (0) = {(δ^{2} I + H_{0} H_{0}^{T})}^{- 1}

(17)

P (k + 1) = P (k) - P (k) H (k + 1) {[I + H^{T} (k + 1) P (k) H (k + 1)]}^{- 1} H^{T} (k + 1) P (k)

(18)

With the data input gradually increasing, the output weights

β

are updated online by using Equations (16)–(18). This is the online learning stage of OS-ELM.

3.2. AFOS-ELM Neural Network

Although OS-ELM avoids the offline repeated training of the past samples and achieves the online learning of the new data, the online modeling accuracy is not significantly improved compared with the traditional offline ELM algorithm. The traditional OS-ELM algorithm does not consider the temporal validity of the samples [32]. An aero-engine is a complex aerothermodynamic system with large flight envelope. When the online neural network is applied, the old samples cannot reflect the current dynamic characteristics of the engine over time. If these failure samples are not processed, the neural network cannot reflect the current dynamic characteristics well. Therefore, this paper adopts an OS-ELM with memory mechanism, which uses adaptive forgetting factor to reduce the influence of past failure samples on the current output, so as to improve the modeling accuracy.

OS-ELM with memory mechanism has the same updating principle as the traditional OS-ELM. It only needs to update the output weight at time

k + 1

.

The cost function with forgetting factor can be described as

J [β (k)] = \sum_{i = 1}^{k} λ^{k - i} g (T (k) - H (k) β (k)) + \frac{1}{2} δ^{2} ‖ β (k) ‖

(19)

where g( ) is the least square cost function [33], and λ∈(0,1] is the forgetting factor.

If

λ

is smaller, the proportion of the past samples impact on the cost function is lower, and the influence of past samples on the current outputs will be decreased greatly. According to Equation (19), applying the memory mechanism, the recursive formula of the output weights can be described as [34]

β (k + 1) = β (k) - δ^{2} (1 - λ) P (k + 1) β (k) + [T (k + 1) - β (k) H (k + 1)] H^{T} (k + 1) P (k + 1)

(20)

where

P (k + 1) = P^{*} - P^{*} H^{T} (k + 1) H (k + 1) P^{*} [1 + H (k + 1) P (k + 1) H^{T} (k + 1)]

(21)

P_{}^{*} = λ^{- 1} P (k) - δ \frac{λ}{λ - 1} P (k) {[I + δ \frac{1 - λ}{λ} P (k)]}^{- 1} P (k)

(22)

Due to the influence of the measurement noise and the sudden change of the system dynamic, the network output errors show great difference. An adaptive forgetting factor based on the relative error of neural network output is adopted. The forgetting factor of the sample with larger output error is relatively smaller, which can strengthen the forgetting effect and weaken its influence on future output.

The maximum relative output error of multi-output neural network can be described as

e_{ELM} (k) = \max (| \frac{y_{ELM} (k) - T (k)}{T (k)} |)

(23)

The forgetting factor is adjusted based on the relative error of network output. The calculation method of forgetting factor can be expressed as follows:

{\begin{cases} λ (k) = λ_{\max} - θ \cdot e_{ELM} (k) \\ i f λ (k) < λ_{\min}, λ (k) = λ_{\min} \end{cases}

(24)

where

λ_{\max}

is the maximum value of forgetting factor, which is generally taken as 1 or slightly less than 1.

θ

is the proportional coefficient. The larger the

θ

is, the greater the influence of relative error on the forgetting factor. If the forgetting factor is less than the minimum value

λ_{\min}

, then it is set to

λ_{\min}

.

If the system dynamics change greatly, the error e_ELM(k) will increase significantly and the forgetting factor will decrease, so that the data will be forgotten faster. At the same time, when the neural network tends to converge and the error gradually decreases,

e_{ELM} (k)

will approach 0. and λ(k) approaches

λ_{\max}

. The steady-state accuracy during the online operation can be guaranteed.

The forgetting factor is widely used in the OS-ELM [32,33,34] and can improve the modeling accuracy, more or less, in the practical problems. It can be said that it is generally effective in online learning. With the adaptive forgetting factor, the adaptive performance of the network on the complex dynamic system is enhanced, and both the static and dynamic accuracy of the system outputs are enhanced.

4. Online State Space Modeling Based on AFOS-ELM

The nonlinear state space model of the aero-engine can be described as follows:

{\begin{array}{l} x (k + 1) = f_{1} (x (k), u (k)) \\ y (k) = f_{2} (x (k), u (k)) \end{array}

(25)

where

x = {[x_{1}, \dots, x_{n}]}^{T}

is the n-dimensional state vector,

y = {[y_{1}, \dots, y_{m}]}^{T}

is the m-dimensional output vector, and

u = {[u_{1}, \dots, u_{r}]}^{T}

is the r-dimensional input vector.

The linearized state space model of the system at the working point

(x_{t 0}, u_{t 0}, y_{t 0})

at time t can be expressed as

{\begin{matrix} Δ x_{t} (k + 1) = A_{t} Δ x_{t} (k) + B_{t} Δ u_{t} (k) \\ Δ y_{t} (k) = C_{t} Δ x_{t} (k) + D_{t} Δ u_{t} (k) \end{matrix}

(26)

where ∆ is the increment symbol,

Δ x_{t} (k) = x (k) - x_{t}_{0}

,

Δ u_{t} (k) = u (k) - u_{t}_{0}

,

Δ y_{m} (k) = y (k) - y_{m 0}

, where t represents the engine operation time when the state space model is established, and k is the variable used to express a discrete time system.

The parameters of the state space matrixes in Equation (26) can be expressed in the form of partial derivative, and the matrixes A_t, B_t, C_t, D_t are also called Jacobian matrixes.

\begin{matrix} A_{t} = {[\begin{matrix} \frac{\partial x_{1} (k + 1)}{\partial x_{1} (k)} & \dots & \frac{\partial x_{1} (k + 1)}{\partial x_{n} (k)} \\ ⋮ & ⋱ & ⋮ \\ \frac{\partial x_{n} (k + 1)}{\partial x_{1} (k)} & \dots & \frac{\partial x_{n} (k + 1)}{\partial x_{n} (k)} \end{matrix}]}_{k = t} & B_{t} = {[\begin{matrix} \frac{\partial x_{1} (k + 1)}{\partial u_{1} (k)} & \dots & \frac{\partial x_{1} (k + 1)}{\partial u_{r} (k)} \\ ⋮ & ⋱ & ⋮ \\ \frac{\partial x_{n} (k + 1)}{\partial u_{1} (k)} & \dots & \frac{\partial x_{n} (k + 1)}{\partial u_{r} (k)} \end{matrix}]}_{k = t} \\ C_{t} = {[\begin{matrix} \frac{\partial y_{1} (k)}{\partial x_{1} (k)} & \dots & \frac{\partial y_{1} (k)}{\partial x_{n} (k)} \\ ⋮ & ⋱ & ⋮ \\ \frac{\partial y_{m} (k)}{\partial x_{1} (k)} & \dots & \frac{\partial y_{m} (k)}{\partial x_{n} (k)} \end{matrix}]}_{k = t} & D_{t} = {[\begin{matrix} \frac{\partial y_{1} (k)}{\partial u_{1} (k)} & \dots & \frac{\partial y_{1} (k)}{\partial u_{r} (k)} \\ ⋮ & ⋱ & ⋮ \\ \frac{\partial y_{m} (k)}{\partial u_{1} (k)} & \dots & \frac{\partial y_{m} (k)}{\partial u_{r} (k)} \end{matrix}]}_{k = t} \end{matrix}

(27)

To obtain the linear state space model with AFOS-ELM, the data used to train the ELM are normalized as

\bar{x} = \frac{x - x_{\min}}{x_{\max} - x_{\min}}

(28)

The inverse-free ELM neural network described in Section 2 is used in the ELM initialization. The inputs of the AFOS-ELM are the input variables u(k) and the state variables x(k) of the state space model in Equation (25). The outputs of the ELM are the state variables x(k + 1) and the output variables y(k) of the SSM. Then, the construction of the AFOS-ELM is shown in Figure 1.

The OS-ELM adopts linear output nodes, and the analytical expression of network output can be described as

y_{E L M} (k) = \sum_{i = 1}^{l} β_{i} (k) f (W_{i} x_{E L M} (k) + b_{i})

(29)

where

y_{E L M} (k) = [\begin{matrix} x^{T} (k + 1) & y^{T} (k) \end{matrix}]

,

x_{E L M} (k) = [\begin{matrix} x^{T} (k) & u^{T} (k) \end{matrix}]

are the neural network output vector and input vector.

The partial derivative of network output to network input is calculated by

\frac{\partial y_{E L M} (k)}{\partial x_{E L M} (k)} = \sum_{j = 1}^{l} \frac{\partial y_{E L M} (k)}{\partial h_{j} (k)} \frac{\partial h_{j} (k)}{\partial x_{E L M} (k)} = \sum_{j = 1}^{l} β_{j} (k) W_{j} \dot{f} (W_{j} x_{E L M} (k) + b_{j})

(30)

\dot{f} (W_{j} x_{E L M} (k) + b_{j}) = \frac{e^{- W_{j} x_{E L M} (k) - b_{j}}}{{(1 + e^{- W_{j} x_{E L M} (k) - b_{j}})}^{2}}

(31)

Therefore, the parameters in Equation (27) can be calculated according to Equations (30) and (31), and then the SSM in Equation (26) is obtained at time k.

5. Simulation and Discussion

5.1. Data Process

In order to verify the method of establishing SSM online based on the neural network proposed in this paper, the test data of a twin spool hybrid exhaust afterburner turbofan aero-engine are used for modeling and validation. The research is carried out in the environment of Win11, Intel(R) CORE(R) i5-12600Kf CPU, and 32G RAM.

Since the data come from the engine test, the measurement noise is inevitable. To avoid the impact of outliers on modeling accuracy, the local singularity estimation capability of wavelets and the residuals of the signal wavelet transform can be used to determine the residual threshold for outlier discrimination [35,36].

The residual of the original signal S after L-layer wavelet decomposition can be described by

R_{e} = S - S_{S}

(32)

where S_S are the low-frequency reconstructed signal after L-layer decomposition.

When the residual is small, the low-frequency reconstructed signal approaches the original signal. When the residual is large, there is a relatively large-amplitude high-frequency noise superimposed on the measured signal, which can be seen as an outlier. Taking the fuel data of an engine test as an example, the “db8” wavelet with scale 7 is used to detect the outliers. A total number of 111,406 data are collected in this test, and the residual distribution of fuel flow W_f is shown in Table 1.

It can be seen from Table 1 that the data in the range of residual [−0.721%, 1.39%] account for 98.54% of the total data, and the data in the range of residual [−2.83%, 3.49%] account for 99.94% of the total data. The data distribution in the range of residual greater than 3.49% or less than −2.83% does not tend to decrease with the increase of residual. Therefore, the threshold for outlier identification is set to −2.83% and 3.49%. The data with residuals beyond this range are considered outliers, which are replaced by the average value of the data before and after them. For continuous outliers, they are replaced by linear interpolation of the reasonable data before and after them. After replacing the outlier data, the “db8” wavelet with scale 7 is adopted to filter the signal. Taking fuel data as an example, the comparison of data before and after filtering is shown in Figure 2a,b.

It can be seen from Figure 2 that the noise in the filtered data is significantly suppressed, which is helpful to establish the SSM in a data-driven way. Then, the data are normalized with Equation (27).

5.2. AFOS-ELM Validation

The input vector for a twin spool turbo-fan engine control system is

u (k) = {[W_{f} (k), A_{8} (k)]}^{T}

, and A₈ is the nozzle throat area. The measured output vector besides rotor speeds is

y (k) = {[P_{3} (k), T_{6} (k), P_{6} (k), F (k)]}^{T}

, where P₃ is the total pressure at compressor outlet, T₆, P₆ are the total temperature and total pressure at low-pressure turbine outlet, and F is the thrust. The state vector is

x (k) = {[n_{1} (k), n_{2} (k)]}^{T}

; n₁ and n₂ are the low-pressure rotor speed and the high-pressure rotor speed.

According to the construction of AFOS-ELM in Figure 1, the input vector of the ELM is

x_{E L M} (k) = {[\begin{matrix} W_{f} (k) & A_{8} (k) & n_{1} (k) & n_{2} (k) \end{matrix}]}^{T}

. The output vector of the ELM is

y_{E L M} (k) = {[n_{1} (k + 1), n_{2} (k + 1), P_{3} (k), T_{6} (k), P_{6} (k), F (k)]}^{T}

.

The first 30% of the engine test data sequence are adopted to initialize the ELM based on the inverse-free ELM offline. When the root mean square error is less than 6.5% or the number of hidden layer nodes increases to more than 100, the offline training stops. The number of hidden layer nodes is automatically set to 86 in this research.

The AFOS-ELM is used to update the output weights online for all the engine test data. The n₂ output of the AFOS-ELM is compared with the engine test data in Figure 3. The relative output errors of n₂ and F are compared with that of the traditional OS-ELM without the forgetting factor in Figure 4. The detail errors of the ELM outputs are listed in Table 2.

From Figure 3 we can see that both the AFOS-ELM and OS-ELM output n₂(k + 1) track the engine test data well in the acceleration process. From Figure 4 we can see that the relative errors of n₂(k + 1) and F(k) are both smaller than 0.05% with the AFOS-ELM method, which is much smaller than that of the OS-ELM method, and the adaptive forgetting factor enhances the modeling accuracy both at the dynamic state and at the steady state.

It can be seen from Table 2 that the errors of two rotor speeds are very small for both AFOS-ELM and OS-ELM, the maximum error occurs at n₁ with OS-ELM method, which is 0.1419%, and the average error is less than 0.002%, which is much better than that of the component-level model [37]. For the other output variables, the AFOS-ELM method shows greater advantage than the OS-ELM method. The maximum error of AFOS-ELM occurs at T₆, which is 0.2747%, and the max average error occurs also occurs at T₆, which is 0.002%, while for OS-ELM, the maximum error and the max average error also occur at T₆, which are 16.52% and 0.5854%, respectively, and the figures are much bigger than that of the AFOS-ELM. AFOS-ELM can model the state space configuration nonlinear model well. It can process the engine failure data and ensure the accuracy of the current output. The adaptive forgetting factor can also deal with the rapid change caused by the large dynamic and maintain a high accuracy.

5.3. State Space Model Validation

Based on the analytical expression of the AFOS-ELM model established above, the SSM can be derived by partial derivative calculation. Taking the calculation of parameter

a_{11} (k)

as an example, according to Equations (3) and (27), we have

a_{11} (k) = \frac{\partial n_{1} (k + 1)}{\partial n_{1} (k)} = \frac{\partial n_{1} (k + 1)}{\partial H (k)} \frac{\partial H (k)}{\partial n_{1} (k)} = \sum_{j = 1}^{l} β_{1 j} W_{1 j} \dot{f} (W_{1 j} n_{1} (k) + b_{j})

(33)

Applying the calculation process of

a_{11}

o the other parameters of the SSM, the calculation of all state space matrix parameters at time k can be achieved, and the SSM at time k is established.

In order to validate the effectiveness of the SSM established in this paper, the models established at different times are selected to demonstrate their output prediction ability over future time, which is essential for model-predictive control and performance-seeking control of aero-engines.

The prediction output of SSM at future time can be obtained through the response calculation of a discrete-time system.

\begin{matrix} x (k + 1) & = A (k) x (k) + B (k) u (k) \\ x (k + 2) & = A (k) x (k + 1) + B (k) u (k + 1) \\ = A^{2} (k) x (k) + A (k) B (k) u (k) + B (k) u (k + 1) \\ \dots \dots \\ x (k + p) & = A^{p} (k) x (k) + \sum_{i = k}^{k + p - 1} A^{k + p - 1 - i} (k) B (k) u (i) \\ y (k + p) & = C (k) A^{p} (k) x (k) + \sum_{i = k}^{k + p - 1} C (k) A^{k + p - 1 - i} (k) B (k) u (i) + D (k) u (k + p) \end{matrix}

(34)

Taking p = 5 as an example, the error between the SSM prediction output and the engine output is compared. The results of prediction are shown in Figure 5a–l, where the “SSM Prediction” represents the prediction output of the SSM proposed in this paper, and “Engine Output” represents the engine test data at k + p.

As can be seen from Figure 5, the maximum prediction error at time k + 5 occurs at

P_{3}

around 1500 s, where a sudden acceleration occurs and the thrust F increases about 40.6%. Except for F, the other variables all reach their biggest prediction error at this moment. Under such a big dynamic, the max prediction error occurred at

P_{3}

and is about 1.1%. The max prediction error of the low-pressure rotor speed is less than 0.3% when the low-pressure rotor speed n₁ increases about 22.9%, while the error is more than 0.4% in the component-level-model-based online state space modeling method in reference [21] for time k + 5 output prediction when the low-pressure rotor speed decreases by about 12% (there is no other common variable that can be compared in reference [21]). The prediction accuracy of the SSM established by the method described in this paper shows great advantage, and it can predict the varying of the engine test data well.

In order to validate the real-time performance of the SSM proposed in this paper, the time consumption of the AFOS-ELM online modeling process and the SSM modeling process are evaluated. The simulation platform is as described above; the modeling process is carried out three times. The neural network is established online based on 111,406 groups of engine test data, and 111,405 SSMs are obtained from the first data to the last-1 data, and the average time for modeling the 111,405 SSMs based on AFOS-ELM is 183.958 s. The online network weights updating and online SSM calculation process of each step takes an average of 1.7 ms. It is lower than the sampling step of the aero-engine, which is about 20 ms. Therefore, this modeling method can meet the real-time requirements of the aero-engine.

6. Conclusions

An online data-driven state space modeling method based on AFOS-ELM is proposed in this paper, which provides a mechanism-model-free state space modeling method for a complex nonlinear system. The inputs and outputs of the AFOS-ELM are determined according to the description of the SSM. The partial derivative calculation based on the analytical model of the AFOS-ELM provides an alternative for deference-based calculation in the Jacobian matrix (state space matrix) calculation of SSM. The OS-ELM with adaptive forgetting factor shows great system dynamic capture ability and achieves high modeling accuracy at both steady state and dynamic state on the engine test dataset, which means the SSMs obtained from it have the ability to predict the system output in future time accurately.

The inverse free-ELM algorithm used in the OS-ELM initialization can automatically determine the most appropriate number of the hidden layer nodes, which reduces the trail time and enhances the modeling efficiency.

The SSM is established online at each sampling time of the engine operation and has high real-time property, which gives it the potential to be further used in model-based control and health management research.

Author Contributions

Conceptualization, Q.L.; methodology, H.C.; software, H.C. and S.P.; validation, H.C., Q.L. and S.P.; formal analysis, H.C.; investigation, H.C.; resources, H.C. and Q.L.; data curation, H.C.; writing—original draft preparation, H.C.; writing—review and editing, H.C. and Q.L.; visualization, H.C.; supervision, Q.L.; project administration, Q.L. and W.Z.; funding acquisition, W.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Postgraduate Research & Practice Innovation Program of NUAA (xcxjh20210214), National Natural Science Foundation of China (51976089), Enhanced Foundation Project (2017-JCJQ-ZD-047-21), the National Science and Technology Major Project (2017-V-0004-0054).

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Zhou, W. Research on Object-Oriented Modeling and Simulation for Aeroengine and Control System. Ph.D. Thesis, Nanjing University of Aeronautics and Astronautics, Nanjing, China, 2006. [Google Scholar]
Alag, G.; Gilyard, G. A proposed Kalman filter algorithm for estimation of unmeasured output variables for an F100 turbofan engine. In Proceedings of the 26th Joint Propulsion Conference, Orlando, FL, USA, 16–18 July 1990. [Google Scholar] [CrossRef] [Green Version]
Pang, S.; Li, Q.; Ni, B. Improved nonlinear MPC for aircraft gas turbine engine based on semi-alternative optimization strategy. Aerosp. Sci. Technol. 2021, 118, 106983. [Google Scholar] [CrossRef]
DeCastro, J.A. Rate-based model predictive control of turbofan engine clearance. J. Propuls. Power 2007, 23, 804–813. [Google Scholar] [CrossRef] [Green Version]
Roberts, R.A.; Eastbourn, S.M. Modeling techniques for a computational efficient dynamic turbofan engine model. Int. J. Aerosp. Eng. 2014, 2014, 283479. [Google Scholar] [CrossRef]
Feng, Z.; Sun, J.; Huang, J.; Cai, H. A New Method for Establishing a State Variable Model of Aeroengine. J. Aerosp. Power 1998, 13, 435–438, 463. [Google Scholar]
Sun, J. Prospects of the Aeroengine Control Development in the Early Time of the 21st Century. J. Aerosp. Power 2001, 16, 97–102. [Google Scholar]
Sugiyama, N. Derivation of ABCD system matrices from nonlinear dynamic simulation of jet engines. In Proceedings of the 28th Joint Propulsion Conference and Exhibit, Nashville, TN, USA, 6–8 July 1992. [Google Scholar] [CrossRef]
Feng, Z.; Sun, J. Modeling of small perturbation state variable model for aeroengines. J. Propuls. Technol. 2001, 22, 54–57. [Google Scholar]
Li, Q.; Sun, J. Aero-Engine State Variable Modeling Based on the Genetic Algorithm. J. Aerosp. Power 2006, 21, 427–431. [Google Scholar]
Sugiyama, N. Derivation of system matrices from nonlinear dynamic simulation of jet engines. J. Guid. Control Dyn. 1994, 17, 1320–1326. [Google Scholar] [CrossRef]
Henrion, D.; Reberga, L.; Bernussou, J.; Vary, F. Linearization and identification of aircraft turbofan engine models. IFAC Proc. Vol. 2004, 37, 1055–1060. [Google Scholar] [CrossRef] [Green Version]
Lytle, J.K. The Numerical Propulsion System Simulation: An Overview. In Proceedings of the Computational Aerosciences Workshop, Moffett Field, CA, USA, 15–17 February 2000; p. E-12152. [Google Scholar]
Parker, K.I.; Guo, T. Development of a Turbofan Engine Simulation in a Graphical Simulation Environment; NASA/TM-2003-212543; NASA: Washington, DC, USA, 2003.
Ma, J.; Chang, J.; Ma, J.; Bao, W.; Yu, D. Mathematical modeling and characteristic analysis for over-under turbine based combined cycle engine. Acta Astronaut 2018, 148, 141–152. [Google Scholar] [CrossRef]
Diao, Y.; Passino, K.M. Stable fault-tolerant adaptive fuzzy/neural control for a turbine engine. IEEE Trans. Control Syst. Technol. 2001, 9, 494–509. [Google Scholar] [CrossRef]
Pan, M.; Wang, H.; Huang, J. T–S Fuzzy Modeling for Aircraft Engines: The Clustering and Identification Approach. Energies 2019, 12, 3284. [Google Scholar] [CrossRef] [Green Version]
Liu, X.; Yuan, Y.; Shi, J.; Zhao, L. Adaptive modeling of aircraft engine performance degradation model based on the equilibrium manifold and expansion form. Proc. Inst. Mech. Eng. Part G J. Aerosp. Eng. 2014, 228, 1246–1272. [Google Scholar] [CrossRef]
Liu, X.; Zhang, L.; Luo, C. Model reference adaptive control for aero-engine based on system equilibrium manifold expansion model. Int. J. Control 2021. [Google Scholar] [CrossRef]
Pang, S.; Li, Q.; Zhang, H. An Exact Derivative Based Aero-Engine Modeling Method. IEEE Access 2018, 6, 34516–34526. [Google Scholar] [CrossRef]
Pang, S.; Li, Q.; Zhang, H. A new online modelling method for aircraft engine state space model. Chin. J Aeronaut 2020, 33, 1756–1773. [Google Scholar] [CrossRef]
Wang, C.; Zhu, Z.; Lu, N.; Cheng, Y.; Jiang, B. A data-driven degradation prognostic strategy for aero-engine under various operational conditions. Neurocomputing 2021, 462, 195–207. [Google Scholar] [CrossRef]
Gu, Z.; Pang, S.; Zhou, W.; Li, Y.; Li, Q. An Online Data-Driven LPV Modeling Method for Turbo-Shaft Engines. Energies 2022, 15, 1255. [Google Scholar] [CrossRef]
Zheng, Q.; Zhang, H.; Li, Y.; Hu, Z. Aero-Engine On-Board Dynamic Adaptive MGD Neural Network Model within a Large Flight Envelope. IEEE Access 2018, 6, 45755–45761. [Google Scholar] [CrossRef]
Zheng, Q.; Fang, J.; Hu, Z.; Zhang, H. Aero-engine on-board model based on batch normalize deep neural network. IEEE Access 2019, 7, 54855–54862. [Google Scholar] [CrossRef]
Huang, G.; Zhu, Q.; Siew, C. Extreme learning machine: Theory and applications. Neurocomputing 2006, 70, 489–501. [Google Scholar] [CrossRef]
Huang, G.; Chen, L.; Siew, C.K. Universal approximation using incremental constructive feedforward networks with random hidden nodes. IEEE Trans. Neural Netw. 2006, 17, 879–892. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Li, S.; You, Z.; Guo, H.; Luo, X.; Zhao, Z. Inverse-free extreme learning machine with optimal information updating. IEEE Trans. Cybern. 2015, 46, 1229–1241. [Google Scholar] [CrossRef] [Green Version]
Jin, B.; Jing, Z.; Zhao, H. Incremental and decremental extreme learning machine based on generalized inverse. IEEE Access 2017, 5, 20852–20865. [Google Scholar] [CrossRef]
Zhu, H.; Wu, Y. Inverse-Free Incremental Learning Algorithms with Reduced Complexity for Regularized Extreme Learning Machine. IEEE Access 2020, 8, 177318–177328. [Google Scholar] [CrossRef]
Liang, N.; Huang, G.; Saratchandran, P.; Sundararajan, N. A fast and accurate online sequential learning algorithm for feedforward networks. IEEE Trans. Neural Netw. 2006, 17, 1411–1423. [Google Scholar] [CrossRef]
Du, Z.; Li, X.; Zheng, Z.; Zhang, G.; Mao, Q. Extreme learning machine based on regularization and forgetting factor and its application in fault prediction. Chin. J. Sci. Instrum. 2015, 36, 1546–1553. [Google Scholar]
Guo, W. Robust adaptive online sequential extreme learning machine for predicting nonstationary data streams with outliers. J. Algorithms Comput. Technol. 2019, 13, 174830261989542. [Google Scholar] [CrossRef]
Guo, W.; Xu, T.; Yu, J.; Tang, K. Online Sequential Extreme Learning Machine Based on M-estimator and Variable Forgetting Factor. J. Electron. Inf. Technol. 2018, 40, 1360–1367. [Google Scholar] [CrossRef]
Torrence, C.; Compo, G.P. A practical guide to wavelet analysis. Bull. Am. Meteorol. Soc. 1998, 79, 61–78. [Google Scholar] [CrossRef] [Green Version]
Bilen, C.; Huzurbazar, S. Wavelet-based detection of outliers in time series. J. Comput. Graph. Stat. 2002, 11, 311–327. [Google Scholar] [CrossRef]
Pang, S.; Li, Q.; Feng, H.; Zhang, H. Joint steady state and transient performance adaptation for aero engine mathematical model. IEEE Access 2019, 7, 36772–36787. [Google Scholar] [CrossRef]

Figure 1. AFOS-ELM neural network construction.

Figure 2. Fuel data comparison before and after filtering. (a) Original fuel data; (b) filtered fuel data.

Figure 3. Comparison of neural network n₂(k + 1) output. (a) AFOS-ELM n₂(k + 1) output; (b) OS-ELM n₂(k + 1) output.

Figure 4. Comparison of relative error between AFOS-ELM and OS-ELM. (a) Error of AFOS-ELM n₂(k + 1) output; (b) error of OS-ELM n₂(k + 1) output; (c) error of AFOS-ELM F(k + 1) output; (d) error of OS-ELM F(k + 1) output.

Figure 5. Prediction result of SSM at time k + 5. (a) n₁ prediction; (b) prediction error of n₁; (c) n₂ prediction; (d) prediction error of n₂; (e) P₃ prediction; (f) prediction error of P₃; (g) T₆ prediction; (h) prediction error of T₆; (i) P₆ prediction; (j) prediction error of P₆; (k) F prediction; (l) prediction error of F.

Table 1. Statistics of wavelet transform residual distribution of W_f.

	Residual Distribution
Residual interval/%	[−∞, −7.04]	[−7.04, −4.94]	[−4.94, −2.83]	[−2.83, −0.721]	[−0.721, 1.39]	[1.39, 3.49]	[3.49, 5.6]	[5.6, 7.71]	[7.71, 9.82]	[9.82, ∞]
Number of data	9	7	16	1348	109,786	207	17	8	2	6

Table 2. Relative error of neural network output (%).

Method	Error Type	Variables
Method	Error Type	n₁(k + 1)	n₂(k + 1)	P₃(k)	T₆(k)	P₃(k)	F(k)
AFOS-ELM	Average error	0.0013	0.0008	0.0016	0.002	0.0014	0.0013
	Maximum error	0.0649	0.0263	0.2084	0.2747	0.0866	0.0479
	Standard deviation	0.0043	0.0026	0.0055	0.0051	0.004	0.0037
OS-ELM	Average error	0.0019	0.0014	0.3218	0.5854	0.2815	0.0831
	Maximum error	0.1419	0.0492	5.8795	16.52	6.8424	1.7649
	Standard deviation	0.0042	0.0028	0.4385	1.0432	0.5142	0.1572

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, H.; Li, Q.; Pang, S.; Zhou, W. A State Space Modeling Method for Aero-Engine Based on AFOS-ELM. Energies 2022, 15, 3903. https://doi.org/10.3390/en15113903

AMA Style

Chen H, Li Q, Pang S, Zhou W. A State Space Modeling Method for Aero-Engine Based on AFOS-ELM. Energies. 2022; 15(11):3903. https://doi.org/10.3390/en15113903

Chicago/Turabian Style

Chen, Hongyi, Qiuhong Li, Shuwei Pang, and Wenxiang Zhou. 2022. "A State Space Modeling Method for Aero-Engine Based on AFOS-ELM" Energies 15, no. 11: 3903. https://doi.org/10.3390/en15113903

APA Style

Chen, H., Li, Q., Pang, S., & Zhou, W. (2022). A State Space Modeling Method for Aero-Engine Based on AFOS-ELM. Energies, 15(11), 3903. https://doi.org/10.3390/en15113903

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A State Space Modeling Method for Aero-Engine Based on AFOS-ELM

Abstract

1. Introduction

2. Inverse-Free ELM

3. Adaptive Forgetting Factor OS-ELM (AFOS-ELM)

3.1. OS-ELM Neural Network

3.2. AFOS-ELM Neural Network

4. Online State Space Modeling Based on AFOS-ELM

5. Simulation and Discussion

5.1. Data Process

5.2. AFOS-ELM Validation

5.3. State Space Model Validation

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI