Remaining Useful Life Prediction with Similarity Fusion of Multi-Parameter and Multi-Sample Based on the Vibration Signals of Diesel Generator Gearbox

Zhou, Shenghan; Xu, Xingxing; Xiao, Yiyong; Chang, Wenbing; Qian, Silin; Pan, Xing

doi:10.3390/e21090861

Open AccessArticle

Remaining Useful Life Prediction with Similarity Fusion of Multi-Parameter and Multi-Sample Based on the Vibration Signals of Diesel Generator Gearbox

by

Shenghan Zhou

¹

,

Xingxing Xu

¹

,

Yiyong Xiao

^1,*

,

Wenbing Chang

¹,

Silin Qian

²

and

Xing Pan

¹

School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China

²

Nanjing Research Institute of Electronics Technology, Nanjing 210039, China

^*

Author to whom correspondence should be addressed.

Entropy 2019, 21(9), 861; https://doi.org/10.3390/e21090861

Submission received: 23 July 2019 / Revised: 24 August 2019 / Accepted: 1 September 2019 / Published: 3 September 2019

(This article belongs to the Special Issue Entropy, Nonlinear Dynamics and Complexity)

Download

Browse Figures

Versions Notes

Abstract

The prediction of electrical machines’ Remaining Useful Life (RUL) can facilitate making electrical machine maintenance policies, which is important for improving their security and extending their life span. This paper proposes an RUL prediction model with similarity fusion of multi-parameter and multi-sample. Firstly, based on the time domain and frequency domain extraction of vibration signals, the performance damage indicator system of a gearbox is established to select the optimal damage indicators for RUL prediction. Low-pass filtering based on approximate entropy variance (Aev) is introduced in this process because of its stability. Secondly, this paper constructs Dynamic Time Warping Distance (DTWD) as a similarity measurement function, which belongs to the nonlinear dynamic programming algorithm. It performed better than the traditional Euclidean distance. Thirdly, based on DTWD, similarity fusion of multi-parameter and multi-sample methods is proposed here to achieve RUL prediction. Next, the performance evaluation indicator Q is adopted to evaluate the RUL prediction accuracy of different methods. Finally, the proposed method is verified by experiments, and the Multivariable Support Vector Machine (MSVM) and Principal Component Analysis (PCA) are introduced for comparative studies. The results show that the Mean Absolute Percentage Error (MAPE) of the similarity fusion of multi-parameter and multi-sample methods proposed here is below 14%, which is lower than MSVM’s and PCA’s. Additionally, the RUL prediction based on the DTWD function in multi-sample similarity fusion exhibits the best accuracy.

Keywords:

remaining useful life (RUL); similarity fusion; dynamic time warping; damage indicators extraction; approximate entropy variance; vibration monitoring

1. Introduction

As a nonlinear dynamical system, a diesel generator’s safe and smooth running is essential to the reliability of systems. The gearbox is a core part of a diesel generator, directly determining its performance. Remaining Useful Life (RUL) prediction can detect faults early and estimate the downtime of diesel generator components, further helping operators to arrange a reasonable maintenance schedule and save operating costs.

Vibration signal analysis is one of the most widely used methods of condition monitoring. Vibration monitoring generally involves arranging sensors at important locations, using the data acquisition card to obtain signals, and finally using the computer to calculate and analyze the data. This article aims at analyzing the degradation trend of machines and predicts their RUL with the vibration signal collected from the sensors online or offline. In this way, the RUL of a diesel generator is achieved during condition monitoring.

One of condition-based maintenance (CBM)’s main missions is to predict a machine’s RUL [1]. RUL prediction counts more than fault diagnosis in the makings of maintenance decisions [2]. According to the data and continuous degradation trend recorded by the condition detection system, RUL is predicted. It will forecast a potential degradation when current faults have been cleared, providing direct references for CBM. As Figure 1 shows, the functional degradation of a and b stands at an even level at t_i−₁. S_a, S_b represent the degree of performance degradation for machine a and b, and f_c means that the machine is incapable of working. Additionally, at t_i, a’s health level is higher than b’s, indicating that a is healthier. After t_i, a’s function degrades faster than b’s and a’s RUL is shorter. Any planned maintenance must be performed on a in advance [3].

RUL is defined as the time span from the present moment to the end of the useful life [4], expressed as

l_{k} = t_{E o l} - t_{k}

, where t_Eol is life termination, t_k is the present moment, and

l_{k}

is the remaining life at t_k.

The primary mission of RUL prediction is to monitor the useful time left before the system loses its working capability according to condition detection information. Based on time series analysis, the accuracy of prediction is the primary factor considered in the choice of prediction method. The existing methods are based on physical models, statistical data, and artificial intelligence [5], as described in the following:

(1) RUL prediction methods based on physical models reflect the life-cycle degradation process of the system by establishing a mathematical model based on the failure mechanism [5]. As a typical physical model, the Paris-Erdogan model (PE) is widely used for RUL prediction. Frank et al. [6] used PE to predict the RUL of two types of pipelines, 80 and 100. Hu et al. [7] used Norton’s law to describe the creep of a turbine and combined the Kalman filtering (KF) and particle filter (PF) to predict RUL; however, the methods based on physical models need the deep understanding and sufficiently accurate judgment of failure mechanism to ensure the accuracy of RUL estimation.

(2) RUL prediction methods based on statistical data fit the observational data into a random coefficient model and a stochastic process model. This method is widely applied as many on-the-shelf statistical models can be applied to fit the data, that is for instance random coefficient models, autoregressive models, Gamma process models, inverse gaussian processes, Markov models, and proportional hazards models. However, Autoregressive models rely heavily on high-quality historical data and are not conducive to RUL prediction under complex operating conditions, Wiener models and Gamma process models is limited by the assumption of Markov, which assume that the future state is only related to the current state but not to the past state, so it is not applicable to some practical situations.

(3) RUL prediction methods based on artificial intelligence concentrate on learning the degradation pattern of the system from observations. Common AI techniques include the artificial neural network (ANN), neural fuzzy (NF), support vector machine/relevance vector machine (SVM/RVM), K-nearest neighbor (KNN) and Gaussian process regression (GPR). Hussain et al. [8] extracted the index of health from the vibration signal, and established the RUL prediction model by the adaptive neural fuzzy inference system and nonlinear autoregression. The NF excels in RUL prediction because it takes advantage of expert knowledge and intelligent ANN, but needs high-quality data sources. There are many different kinds of SVM that are used for machines’ RUL prediction, like one-class SVM and multi-class SVM [9], and Squares-SVM [10]. However, SVM only provides point estimate and does not provide a probability distribution over of points. In order to make up for this shortcoming, RVM was proposed, which has the same functional form as SVM, but provides a full probability distribution over all possible outcomes [11]. However, those methods focus more on data training rather than analysing the mechanism of mechanical failure. The structure and parameters of ANN need to be set artificially, which leads to low generalization ability; Kernel function selection for SVM/RVM with different objects is a huge challenge. Calculation process of GPR is complex and takes a long time.

It can be seen from the above that the three RUL prediction ideas have their own limitations. The methods of RUL prediction are variable, among which, similarity measure of the data-driven prediction is advantageous at avoiding constructing complex functional degradation models. Therefore, this paper will study RUL prediction based on statistical data from the perspective of similarity measure. Research on similarity-based RUL prediction was first proposed in 2012 and has been proved to be a very effective RUL prediction approach [12,13,14,15,16]. However, the methods have not been so widespread until now. The basic idea is that products with similar degradation processes have a similar service life [3]. The RUL of the test sample is determined by observing the similarity between the performance degradation trajectory of the test sample and the reference samples of the known life-cycle degradation process.

There is little literature about RUL prediction based on similarity but they verified the validity of “similarity” idea. You et al. [12] conducted an experiment to predict the RUL of a welding spot under vibrations. He thought if the asset under study is more similar to reference sample “A”, then “A” should play a more important role in RUL estimation of the asset under study. Eker [13] testified the function of similarity-based prediction through data collected from Virkler’s fatigue crack propagation, a degradation data set of drilling, and a turnout system of slide chair degradation. Zhang [14] put forward a method to predict the RUL of a mechanical system based on the similarity of a phase space trajectory and found that the results approximated the actual RUL very closely. Xiong [15] built a one-dimensional damage indicator on an aero engine’s multiple parameters by means of liner regression. He obtained the RUL after matching test engine data to the model base. In the same way, Moghaddass [16] adopted principal components analysis to integrate a turbine engine’s multiple parameters and drew the first principal component to describe the system degradation process.

It can be concluded from the literature review that similarity-based RUL prediction methods so far are almost always built on a single parameter. The latest research is only employed to integrate multiple parameters into a one-dimensional parameter firstly, and then compare the similarity of performance degradation curves with statistical methods or AI methods. There is no research about co-impact both multiple samples and multiple parameters of those samples on RUL prediction. However, performance degradation or malfunction may result from a multitude of reasons. Thus, multiple parameters of different perspectives may provide a more comprehensive reflection of the running process [17]. Especially for a complex system, what a single parameter can present is far less than multiple parameters in describing the degradation of various forms.

Therefore, this paper proposes an RUL prediction method based on the similarity fusion of multiple damage indicators and samples. In contrast to the more traditional methods, the method of multi-parameter and multi-sample similarity fusion estimates RUL by referring to multiple parameters and samples.

The process can be divided into five parts. At first, in Section 2.1 and Section 2.2, the various time and frequency domain features extracted from a vibration signal that will be applied as damage indicators are introduced together with the entropy variance method for fuzzy filtering applied for low pass filtering of the time-domain features. Further, the method used for parameter evaluation in order to select the most significant performance damage indicators to be applied for RUL prediction is discussed. Second, in Section 2.3, we introduce principles of RUL prediction based on similarity and defines four core elements in the RUL prediction based on similarity: Time window D, similarity measurement function S(.), weight function w(.), and performance evaluation indicator Q. Third, in Section 2.4, we introduce the Dynamic Time Warping Distance (DTWD) as the similarity measure function S(.) to discuss the similarity of data degradation trajectory patterns for the first time. Fourth, in Section 2.5, Section 2.6 and Section 2.7, according to combinations of different performance damage indicators, the RUL prediction model based on the similarity fusion of multi-parameter and multi-sample methods is established. Finally, in Section 3, this paper studies a type of heavy high-speed diesel generator produced by the China Shipbuilding Industry Corporation (CSIC), and validates the RUL prediction method proposed here with experimental results. In the meanwhile, Proposed method here are compared with the mature methods of Multivariable Support Vector Machine (MSVM) and Principal Component Analysis (PCA) for comparison analysis in Section 3.3 and Section 3.4.

2. Methodology

At first step, we will select the most significant performance damage indicators which will be the input of RUL estimation from various time and frequency domain features. Then, we will define four core elements in the RUL prediction based on similarity: Time window D, similarity measurement function S(.), weight function w(.), and performance evaluation indicator Q. Next, as the most important core, similarity measurement function S(.) will be established with DTWD and we write the details about DTWD in Section 2.3. At last, the RUL prediction model based on the similarity fusion of multi-parameter and multi-sample methods will be established.

2.1. The Damage Indicators

The various time domain and frequency domain features extracted from the vibration signal will be used as damage indicators in the following RUL prediction. Further, we discuss the method we apply to to define for each individual gearbox under study a subset of most significant damage indicators system, to be applied for RUL prediction for this particular gearbox. The time domain features of the vibration signal effectively reflect the performance degradation of the gearbox [18]. As shown in Table A1 of Appendix A, we have chosen to use 10-time domain features as damage indicators [19]. Further, the Fourier transform is applied to convert the vibration signal into its frequency spectrum representation [20]. We have chosen to use 15 frequency domain features [21], as damage indicator, see Table A2 in Appendix A.

Since the time-series of the various damage indicators are noisy and in order to correctly compare them with the reference samples we need to smooth the series, i.e., low pass filtering. Fuzzy filtering is a low pass filtering method based on fuzzy set theory, which can adjust the filter structure adaptively based on the features of the signal [22]. A large number of studies have shown that this method is easy to implement and has a good filtering effect, which is very suitable for engineering applications.

For time domain features, this paper proposed the low pass filtering based on approximate entropy variance. The time-series of the various damage indicators are rather noisy, we apply low pass filtering techniques to smooth them [23]. For the time domain damage indicators we have applied low-pass filtering with approximate entropy variance (Aev), because approximate entropy [24] is suitable for describing dynamic noise with a small amount of data and has a strong Robustness to observation noise, and the dynamics system is easy to reconstruct. Approximate entropy variance is a statistic measuring the complexity of time series and it can accurately measure the complexity of signals. Especially in the case of small data quantity and noise interference, it also demonstrates statistical stability. The variance could describe the stability in time series. Approximate entropy (Ae) is defined as: For time series

{n (i)} (i = 1, 2, \dots (N)

, x(i) denotes m consecutive values of u starting at point i:

A e (m, r) = \lim_{n \to \infty} [φ^{m} (r) - φ^{m + 1} (r)]

(1)

where:

φ^{m} (r) = {(N - m + 1)}^{- 1} \sum_{i = 1}^{N - m + 1} [\sum_{j = 1}^{N - m + 1} H {r - d_{m} [x_{m} (i), x_{m} (j)]} / (N - m + 1)]

(2)

\begin{array}{l} d_{m} [x_{m} (i), x_{m} (j)] = \max | u (i + k) - u (j + k) | \\ x_{m} (i) = [u (i), u (i + 1), \dots, (u (i + m - 1)] \end{array}

(3)

H() is the Heaviside function, After Ae is calculated, Aev is defined as:

Aev = \sum_{i = 1}^{N} (Ae - \bar{Ae}) / N

(4)

Then low-pass filtering decomposes the damge indicator signal into the parts trend and noise:

X (t_{k}) {= X}_{T} (t_{k}) {+ X}_{R} (t_{k})

(5)

With X(t_k) is the value of the performance damage indicator at time t_k, X_T(t_k) is the trend term, X_R(t_k) the noise term, and t_k = 1, 2,…, N, with N the number of discrete observations made within the measurement time interval.The weighting filter and fuzzy filtering membership function are defined as

u^{'} (x_{n - k}) = f (A e, n - k)

according to [24], the range of u’(x_n−k) is a [0, 1], and f is set to normal distribution function. So XT(t_k) will be ramained while XR(t_k) removed.

To smooth the frequency domain damage indicator over time, a simple moving average filtering is applied. The moving average filtering can reduce random noise while reflect unit step function response of signal [25]. First, the damage indicators are decomposed into two parts just as before in Equation (5), then calculate the average value as the predicted value of the next sub-interval and move forward in turn.

\hat{X} (t_{j})

is the first part of damage indicator with moving average filtering which is defined as the weighted average value of the adjacent N data points.

\hat{X} (t_{j}) = \frac{1}{n} \sum_{i = j - n}^{j - 1} X (t_{i}) j = n + 1, n + 2, L, N + 1

(6)

In Figure 2, as an example of the full signal together with its trend is shown for one of the time/frequency domain indicators. The ideal output can be obtained by wave filtering.

2.2. Defining a Subset of Most Significant Damage Indicators

To define an—asset dependent—subset of most significant damage indicators to be applied for RUL prediction of the asset, so called significance indicators have been defined [26]. By aid of these significance indicators each of the twenty-five damage indicators is evaluated and a score from 0 to 1 is given to each damage indicator as a measure of how significant the parameter is for the RUL prediction for the asset under study. We have defined three significance indicators, Correlation, Monotonicity and Robustness to act as RUL significance indicators, and which will be defined and explained in the following.

The correlation r measures the correlation of a damage indicator with time (that is over the whole time span the vibration measurements have been performed), i.e., it states the normalized slope of the trend of the damage indicator over time, i.e.,

r = (σ X / σ t) b

, with X is the damage indicator, σX the standard deviation of X, σt the standard deviation of the the variable t time, and b the slope of the regression line ‘found by linear regression when viewing X as a function of t.

The Monotonicity indicator reflects the unidirectional trend of time domain features and frequency domain features. The larger the value of Monotonicity, the greater the slope of the parameters, and the more intuitive and obvious the trend of performance degradation. If a parameter rises and falls recurrently in the degradation process, it may be just a cyclic change as the machine vibrates. That does not change in a certain direction as performance degradation occurs.

The Robustness indicator reflects the tolerability of damage indicators for outliers. Robustness measures whether the degradation parameter is capable of resisting random interference [27]. If a parameter is sensitive to external disturbance, it does not contain valuable information even if it fluctuates wildly.

The equations applied to compute each of the indicator indicators are stated in Appendix B.

This study proposes a combination function W with three indicators above as a “ruler” to select several optimal parameters for following RUL prediction.

\max_{X \in Ω} {W = ω}_{1} {Corr (x) + ω}_{2} {Mon (X) + ω}_{3} Rob (X) with : ω_{i} > 0 \cap \sum_{i} ω_{i} = 1 i = 1, 2, 3

(7)

In this equation, W is the combination function, distributed in the range of [0, 1]; Ω represents a set of candidate damage indicators; and ω_i represents the weight of each indicator. The parameter with a larger value of W should be selected for effective RUL prediction. ω_i is determined by two sources: Subjectively, due to the fact that damage indicator is used to describe performance degradation trajectory as time goes, Mon should take up the largest weight. This is in compliance with similarity-based prediction method. So ω_i will be subjectively assigned a value denoted as prior weighing a_i. While objectively, the optimal combination of the chosen damage indicators in essence is about constrained optimization. We adopt the solving model with AMPL, input the permutation and combination of three indicators’ weights (adjustment of weighting is from 0.2~0.8), and determine the posterior weighting b_i. according to the results. At last, considering both prior weighing a_i and posterior weighting b_i, ω_i will be determined, and some more significant damage indicators can be chosen for subsequent RUL estimation.

ω_{i} = α a_{i} + (1 - α) b_{i}, (0 \leq α \leq 1)

(8)

2.3. Similarity-Based RUL Prediction

As Figure 3 shows, the concept of the similarity-based RUL prediction method is that assets that show similar behavior of their damage indicators have similar RUL values [28]. By comparing the damage indicator time series of an asset with corresponding historical reference time-series, the RUL of the asset can be predicted. It is assumed that the assets from which reference indicator curves are available are the same or of closely related type of product or system—and have performed under more or less similar operating environments and conditions—as the asset under study.

The blue curve represents the time-series of one of the damage indicators over time for a reference gearbox, while the red curve is the time-series of same indicator for a gearbox in use on which we wish to make an RUL prediction. Now the similarity concept states that we should find the most similar certain part of blue curve to red curve, which named ‘optimal match’. When an optimal match has been established then as estimate for the RUL of gearbox of interest the length of the time interval of the blue curve which is on the right of the red is applied. Here we always assume that the final available measurement point of any of the reference curves corresponds with the end of the remaining useful life of the reference gearbox.

To apply the similarity prediction method, its four core elements need to be defined. These are the time window D, the similarity measure function S(.), a weight function w(.), and the performance evaluation indicator Q. The time window D refers to the time interval of similarity between the test sample and reference samples, shown as the data block length marked as yellow in Figure 3. The similarity measurement function S(.) quantifies the similarity of the degradation trajectory of the test sample and reference samples. This paper will establish the DTWD-based nonlinear dynamic programming algorithm as S(.) which will be explained in Section 2.4. The weight function w(.) concerns the similarity between the test sample and reference samples, and it gives different weights to different reference samples and different parameters in line with their contributions. The performance evaluation indicator Q is used to describe differences between the RUL estimated value and its actual value, which helps to find the optimal method through comparing different RUL prediction methods. We borrowed 5 indicators as the performance evaluation indicator Q, which are shown in Appendix B (2).

Similarity-based RUL prediction follows four steps:

(1): Define the time window D to be used for each of the damage indicators related to an asset. The right side of the data block is the state of asset under study. The red curve is the time window D of the test sample and the blue curve is the life-cycle degradation state of reference sample. The right boundary line of D is observation point at present for test sample.
(2): Define a similarity measure function S(.) through which the similarity or closeness between two time-series is defined. DTWD algorithm is established as the similarity measure function S(.) in order to find the most similar part in one certain reference asset with the time window D, so one similarrity distance could be obtained. Suppose H most significant damage indicators are selected and L reference assets are compared with the asset under study, which means each reference asset contains H damage indicators. Then H*L similarity distances between each damage indicator in the asset under study and each damage indicator in those reference assets could be obtained by DTWD algorithm.
(3): Based on the thought “the more similar the two-time series is, the larger the weight value is”, we will make weighted summation among those H*L similarity distances. That is normalizing H*L similarity distances and then assigning different weights according to the thought such as closer distances will be given greater weights. The details of weight function w(.) based on multi-parameter and multi-sample refers to Equations (12) and (14) and Equations (16) and (18), respectively.
(4): For those RUL values referring to different parameters or different samples, weighted average method is used to obtain the test sample’s RUL estimation based on the corresponding weights calculated in step (3).

2.4. Similarity Measurement Function S(.): Dynamic Time Warping Distance (DTWD)

The DTWD is a dynamic nonlinear programming idea, and an algorithm that matches time dimension warping with distance optimization planning [29]. DTWD has been widely used in text data matching, voice information processing and other fields in recent years. Compared with the traditional Euclidean distance, it shows better recognition accuracy and robustness in the application of time series. DTWD can compress and bend time series, make the overall distance of two sequences smaller. The DTWD of two time series is defined as the minimum distance between the two series calculated by time dimension bending. when calculating the distance between series A and B, traditional Euclidean Distance takes the distance between two time series A and B at same time point, while DWTD takes the distance between two time series A and B that needn’t at same time point in order to obtain the shortest distance. For example, supposing that time series A = {2,5,2,5,2,3}, B = {0,3,6,0,6,0}, so the traditional Euclidean Distance is calculated as 2 + 2 + 4 + 5 + 4 + 3 = 20, and DTWD is calculated as Figure 4. The gray elements from the upper left corner to the lowest right corner are dynamic time warping path. The lowest right corner element ”12” is the cumulative distance D_twd(A,B) = 12.

Therefore, DTWD is calculated as follows: Setting time series A = (a₁,a₂,…,a_l) and B = (b₁,b₂,…,b_j…,b_k), l and k represent the sequence length of A and B, respectively. The DTWD algorithm needs to first align two time series and establish a l × k matrix D which contains the value d(a_i,b_j) on its ij-th entry. d (a_i,b_j) represents the distance between points a_i and b_j in two time series.

In matrix D, P (P = q₁,q₂,…,q_n,…,q_N) denotes the dynamic time warping path of time series A and B, q_i represents the distance of time series A and B at time point i. Path P needs to meet the following four restraint conditions:

(1): Boundedness: max(l,k) < N < l + k₋₁;
(2): Boundary conditions: q₁ = D(1,1) and q_N = D(l,k), that is, the start and end points of the dynamic warping path can only be on the diagonal of the matrix;
(3): Continuity: For q_n = (a,b) and q_n−1 = (a’,b’), the conditions $a - a^{'} \leq 1$ and $b - b^{'} \leq 1$ must be met;
(4): Monotonicity: For q_n = (a,b) and q_n−1 = (a’,b’), a − a’ = 0 and b − b’ = 0 can’t happen. that is, all line segments representing the dynamic bending paths cannot intersect each other.

For small-scale data, an exhaustive search method can be used to find an optimal dynamic time warping path. For large-scale data, based on the Dynamic Programming Model, the optimal dynamic time warping path can be obtained by a recursive search algorithm with the local optimal solution from point (1,1) to point (i,j). Using DTWD to represent DTWD between time series A and time series B, the computation process is

{\begin{cases} D_{twd} (A, B) = d (a_{1} {, b}_{1}) + \min {\begin{cases} D_{twd} (A, rest (B)) \\ D_{twd} (rest (A), B) \\ D_{twd} (rest (A), rest (B)) \end{cases} \\ d (a, b) = {‖ a - b ‖}_{p} \end{cases}

(9)

In the equation, p denotes the norm, rest(A) = {a₂, a₃…a_l}, rest(B) = {b₂, b₃…b_k}. As Equation (9) showed, d(a_i,b_j) represents the first point’s distance between two time series, then search for each shortest bending path at each rest point(i.e., rest(A) and rest(B)) between two series. The pseudocode of DTWD algorithm is shown in Appendix B (3).

2.5. RUl Estimation by Multi-Parameter Fusion

Multi-parameter similarity fusion focuses on the impact of different parameters on the RUL estimation of the asset under study. As the four steps showed in Section 2.3, Suppose H most significant damage indicators are selected and L reference assets are compared with the asset under study, which means each asset contains H damage indicators. Then H*L similarity distances between each damage indicator in the asset under study and each damage indicator in those reference assets could be obtained by DTWD algorithm. First, according to the weight idea in step (3) of Section 2.3, different weights are arranged to those H*L similarity distances. Second, for each certain damage indicator Hi, we make weighted summation among those Hi from L reference assets respectively, which is called “first fusion” and need to be traversal H times because there are total of H damage indicators. After first fusion there will be H similarity distances formed. Third, for those formed H similarity distances, we make weighted summation among them again based on the weight idea in step (3). This is called “second fusion”. There will be one similarity distances formed called “RUL value”. At last, by finding the corresponding time point of “RUL value”, we can estimate the RUL.

The following is the calculation process of mathematical theory:

For a diesel generator gearbox, with a asset under study(called “test sample’) of X, suppose H performance damage indicators can be obtained with the method in Section 2.2. With the l-th reference sample Y^l, l

(l = 1, 2, \dots, L)

is the label of reference sample and L is the number of reference samples. The idea of multi-parameter similarity fusion is shown in Figure 5.

In Figure 5, by

y_{h}^{l}

we denote the time-series of the h-th damage indicator of the l-th reference gearbox, with l = 1,…, L, and L the total number of reference gearboxes. Further,

U_{h *}^{l}

represents the RUL estimation described by the h-th damage indicator of the l-th reference sample, and

U_{h *}

represents the RUL value estimated by the h-th damage indicator after first fusion.

(1) Calculating similarity distance between each damage indicator in the asset under study and each damage indicator in those reference assets which is denoted by

U_{h *}^{l}

, so we need to run this step H*L times and obtain total of H*L similarity distances. let

S_{h *}^{l}

denotes the optimal similarity distance by DTWD between the h-th damage indicator of the l-th reference sample and the h-th damage indicator of the test sample. The calculation of

U_{h *}^{l}

is as follows:

S_{h *}^{l} = \min_{Δ} D_{t w d} (x_{h *}, y_{h *}^{l} (N - D - Δ))

(10)

U_{h *}^{l} = \arg \min_{Δ} D_{t w d} (x_{h *}, y_{h *}^{l} (N - D - Δ))

(11)

As Figure 3 shows, we need to match the red block such that is most similar (w.r.t. a certain measure) to a part of the blue curve, that is to find an ‘optimal match’. Only when the D_twd attains a minimum, we can conclude that the right boundary line of D which corresponds to a time point of reference sample reflects the RUL of test sample. With the minimum of distance

S_{h *}^{l}

is determined, the

U_{h *}^{l}

is determined.

(2) First fusion:

w_{h *}^{l}

represents the weight of

U_{h *}^{l}

, so Equation (12) is established as weight function w(.) for the first fusion according to the idea” The smaller the distance between the two time series is, the larger the weight value of the parameter is.”, then

U_{h *}^{}

could be obtained as showed in Figure 6.

S_{h *}^{l}

and

U_{h *}^{l}

have been calculated in Equations (10) and (11).

w_{h *}^{l} = \frac{\sum_{l = 1}^{L} S_{h *}^{l}}{\sum_{l = 1}^{L} (\sum_{l = 1}^{L} S_{h *}^{l} / S_{h *}^{l}) \cdot S_{h *}^{l}}

(12)

U_{h *}^{} = \sum_{l = 1}^{L} w_{h *}^{l} \cdot U_{h *}^{l}

(13)

(3) Second fusion: After obtaining a total of H

U_{h *}

,

w_{h}

represents the weight of

U_{h *}

, so Equation (14) is established as weight function w(.) for the second fusion.

w_{h} = \frac{\sum_{h = 1}^{H} \sum_{l = 1}^{L} S_{h *}^{l}}{\sum_{l = 1}^{L} (\frac{\sum_{h = 1}^{H} \sum_{l = 1}^{L} S_{h *}^{l}}{\sum_{l = 1}^{L} S_{h *}^{l}}) \cdot \sum_{l = 1}^{L} S_{h *}^{l}}

(14)

U = \sum_{h = 1}^{H} w_{h} \cdot U_{h *}

(15)

2.6. RUl Estimation by Multi-Sample Fusion

Compared with multi-parameter similarity fusion, multi-sample similarity fusion focuses more on the similarity between reference assets and the asset under study, rather than the similarity among different parameters. Same as Section 2.5, suppose H most significant damage indicators are selected and L reference assets are compared with the asset under study, which means each asset contains H damage indicators. Then H*L similarity distances between each damage indicator in the asset under study and each damage indicator in those reference assets could be obtained by DTWD algorithm. First, according to the weight idea in step (3), different weights are arranged to those H*L similarity distances. Second, for each certain reference sample L_i, we make weighted summation among those Hi, which is called “first fusion” and need to be traversal L times because there are total of L reference samples. After first fusion there will be L similarity distances formed. Third, for those formed L similarity distances, we make weighted summation among them again based on the weight idea in step (3). This is called “second fusion”. There will be one similarity distances formed called “RUL value”. At last, by finding the corresponding time point of “RUL value”, we can estimate the RUL.

The following is the calculation process of mathematical theory:

(1): Repeating the steps (1) in Section 2.5 based on multi-parameter similarity fusion;
(2): First fusion: $\overset{•}{w_{h *}^{l}}$ represents the weight of $U_{h *}^{l}$ , so Equation (16) is established as weight function w(.) for the first fusion, then $U_{h *}^{}$ could be obtained as showed in Figure 7. Unlike multi-parameter fusion, each reference sample is treated as a “unit”, H damage indicators of a certain reference sample will have a fusion firstly in those units.

$\overset{•}{w_{h *}^{l}} = \frac{\sum_{h = 1}^{H} S_{h *}^{l}}{\sum_{h = 1}^{H} (\sum_{h = 1}^{H} S_{h *}^{l} / S_{h *}^{l}) \cdot S_{h *}^{l}}$

(16)

$U_{*}^{l} = \sum_{h = 1}^{H} {\dot{w}}_{h *}^{l} \cdot U_{h *}^{l}$

(17)
(3): Second fusion: For the obtained $U_{*}^{l}$ , We used Equation (18) as weight function w(.) and make weighted summation to integrate L reference sample to the final RUL value $\overset{•}{U}$ :

$w^{l} = \frac{\sum_{h = 1}^{H} \sum_{l = 1}^{L} S_{h *}^{l}}{\sum_{l = 1}^{L} (\frac{\sum_{h = 1}^{H} \sum_{l = 1}^{L} S_{h *}^{l}}{\sum_{h = 1}^{H} S_{h *}^{l}}) \cdot \sum_{h = 1}^{H} S_{h *}^{l}}$

(18)

$\overset{•}{U} = \sum_{l = 1}^{L} w^{l} \cdot U^{l}$

(19)

2.7. Combining the Two Estimates into One

After obtaining the two results of RUL estimation with two methods, it’s feasible to make “third fusion” to combining the two estimates into one. This paper provides another idea about combining yet. As Figure 7 showed, the performance evaluation indicator Q is established to discuss the estimation results of the two methods, and the better RUL estimation result is selected for the diesel generator gearbox.

In addition, for a mechanical system, we will use the both methods and then prefer a more suitable result. Performance evaluation indicator Q is used to measure which result is better, they are some index like deviation of estimation in Appendix B. The two methods make the fusion process from different perspectives and take into account influencing factors comprehensively, so there is no need to fuse the two method’ results.

3. Experimental Results and Comparative Analysis

In this paper, the RUL of a diesel generator gearbox is studied by analyzing the vibration signals of a gearbox shell surface as Figure 8 showed. Data comes from the High Stress Accelerated Life Test of a certain type of heavy high-speed vessel diesel manufactured by the China Shipbuilding Industry Corporation (CSIC), which is collected from the gearbox Monoblock’s accelerometers. The number of teeth of the drive pinion is 17, and the number of teeth of the driven bull gear is 75. The input shaft bearing has a pitch diameter of 60 mm, a rolling element diameter of 19.05 mm, and six steel balls; the output shaft bearing has a diameter of 95 mm, a rolling element diameter of 22.25 mm, and eight steel balls. The data were recorded every 5 or 10 min at a sampling rate of 20 KHz. Four sets of diesel generator gearbox data were recorded during the life-cycle degradation process in Table 1. GU1, GU2, GU3, and GU4 all belong to the same type of component of the system, which are of similar working environments and operating conditions.

Figure 9 depicts the whole vibration signal in a gearbox lifecycle. The amplitude of the vibration signal increases gradually until the gear box fails to work properly.

3.1. Parameters System: Gearbox Performance Degradation Data

With the theory in Section 2.1 and Section 2.2, the evaluation result of 25 damage indicators is showed in Table 2. According to the calculation result of Equation (7), the weights are

ω_{1}

= 0.2,

ω_{2}

= 0.5, and

ω_{3}

= 0.3, respectively. According to the Section 2.2, the first six damage indicators (F_p₉, F_p₁₃, F_s₄, F_p₃, F_s₂, and F_p₁) ranked from large to small according to the W value are selected to construct the performance damage indicator system of the diesel generator gearbox. They will be the input of two RUL estimation methods. Figure 10 shows the life-cycle trajectories of F_p₉, F_p₁₃, F_s₄, F_p₃, F_s₂, and F_p₁.

After establishing the gearbox performance damage indicator system [F_p₉, F_p₁₃, F_s₄, F_p₃, F_s₂, F_p₁], the performance damage indicator data set of four samples (GU1 to GU4) is calculated. Figure 11 indicates that the curves of the same performance damage indicator from different samples have similar states. This proves that the gearbox registers a similar degradation trajectory in line with the running state and environment, which provides strong practical evidence for the subsequent RUL prediction based on multi-parameter and multi-sample similarity fusion. On the other hand, the different characters of F_P₁₃, F_P₃, and F_P₁ exactly reflect the different performance degradation trajectories of four samples. By selecting samples with different performance degradation, the verification of experience could be more convincing. In addition, in the aspect of the sample, F_P₁₃, F_P₃, and F_P₁ from a same sample have similar degradation trajectories, and the amplitude ranges are also so similar. This proves that these three parameters could actually reflect the performance degradation and should be selected for RUL prediction.

Considering the running time and data features, this study sets Sample GU1 as the test sample and GU2, GU3, and GU4 as reference samples to prove the validity of multi-parameter and multi-sample similarity fusion.

3.2. RUL Prediction Results

(1) Results based on multi-parameter similarity fusion

This study unrolled the prediction of a diesel generator’s data starting from the point of 200 h, with the time window D of 30. The details of RUL prediction result based on multi-parameter similarity fusion with Euclidean distance/DTWD are shown in Table A3 and Table A4 of Appendix B. Figure 12 and Figure 13 show the relative error between the actual values and predicted values of RUL.

In RUL prediction based on multi-parameter similarity fusion with DTWD, the relative error between the predicted values and the actual values ranges from −0.88% to −95.82%. Except for very few points with large errors, the relative errors of most of the predicted values are below 30%, which could obtain more accurate values than traditional Euclidean distance.

(2) Result based on multi-sample similarity fusion

The RUL estimation values based on multi-sample similarity fusion during the life-cycle degradation process are shown in Table A5 and Table A6 of Appendix B. Figure 14 and Figure 15 show the relative error between the actual values of RUL and the predicted values with Euclidean distance /DTWD.

With RUL prediction based on multi-sample similarity fusion, the relative error between the predicted values and the actual values ranges from −0.35% to −76%. Except for very few points with large errors, the overall relative error is controlled below 30%, which has a better prediction accuracy than the RUL prediction result based on multi-parameter similarity fusion.

3.3. Comparative Analysis with Single-Parameter RUL Prediction

Unlike methods of single-parameter similarity fusion, the method of multi-parameter similarity fusion generates a combination of results predicted by multiple parameters. In order to prove the validity and rationality of the model, a performance degradation curve is established upon each and every one of the reference samples’ parameters. The calculation adopts that of the single-parameter similarity prediction method and the weight value calculation process of different reference samples is the same as above. The test data set contains six performance damage indicators of Sample GU1: F_p₉, F_p₁₃, F_s₄, F_p₃, F_s₂, and F_p₁. They are compared to parameters F_p₉, F_p₁₃, F_s₄, F_p₃, F_s₂, and F_p₁ of Sample GU2, GU3, and GU4 to determine the RUL.

In this study, Principal Component Analysis (PCA) technology is used to integrate elements of the performance degradation index system [30]. The first principal component PCA-1 and the second principal component PCA-2 were extracted respectively to conduct RUL prediction through the single-parameter life RUL prediction method [31]. This paper takes the life cycle data set of Sample GU1 as an example. Through PCA of its six performance damage indicators, we get the KMO of 0.748, higher than 0.5, indicating that the six parameters are suitable for dimensionality reduction processing.

The curve of the first-order principal component and second-order principal component of the performance damage indicator system of Sample GU1’s life cycle data is shown in Figure 16.

The single-parameter RUL prediction results of the first-order performance principal component PCA-1 and the second-order principal component PCA-2 are shown in Table 3.

It can be seen from the table that there are significant differences in the prediction effects of the six parameters in the diesel generator gearbox performance degradation index system. The prediction accuracy of F_p₁₃ and F_s₄ is higher than the rest, whereas the prediction of all the six first-order principal components of PCA-1 is more accurate than that of a single-parameter. The single-parameter similarity RUL prediction registers a poorer performance. In summary, the multi-parameter fusion-based RUL prediction method proposed in this study has certain advantages and effectiveness.

3.4. Comparative Analysis with AI-Based RUL Method: MSVM

Research on RUL prediction based on artificial intelligence has also been developed, such as Bayesian methods, which are deep learning methods. This paper uses the multivariable support vector machine (MSVM) for comparative analysis. MSVM fully considers the interaction and constraints between multiple variables, and realizes the maximum mining of potential information for small sample data. According to Section 3.1, F_p₉, F_p₁₃, F_s₄, F_p₃, F_s₂, and F_p₁ are selected as the input of MSVM, and a regression function is constructed:

f (x) = (w \cdot x) + b, (w \in R^{n}, b \in R)

(20)

w and b can be obtained by solving the optimum solution of the following equation:

\begin{array}{l} \min \frac{1}{2} ‖ w ‖^{2} + C \sum_{i = 1}^{n} (ξ_{i} + ξ_{i}^{*}) \\ with : 〈 w \cdot x_{i} 〉 + b - y_{i} \leq ξ_{i} + ε; y_{i} - 〈 w \cdot x_{i} 〉 - b \leq ξ_{i}^{*} + ε i = 1, 2, \dots, n; ζ_{i}, ζ_{i}^{*} \geq 0 \end{array}

(21)

C is a penalty factor,

ζ_{i}, ζ_{i}^{*}

are relaxation factors, and

ε

is an unsensitive factor. When the data set shows a nonlinear relationship, a kernel function is introduced into the SVM operation to map the original data into the high-dimensional feature space. The Radial Basis Function (RBF) and Poly kernel function are as follows:

K (x_{i}, x_{j}) = \exp (\frac{- {‖ x_{i} - x_{j} ‖}^{2}}{2 p^{2}})

(22)

P is the index of RBF. The Lagrangian function is introduced to transform the optimization problem into a convex quadratic programming problem.

α_{i}

,

α_{i}^{*}

are Lagrangian multipliers.

\begin{array}{l} \max W (α_{i}, α_{i}^{*}) = - \frac{1}{2} \sum_{i, j = 1}^{n} (α_{i} - α_{i}^{*}) (α_{j} - α_{j}^{*}) K (x_{i}, x_{j}) - ε \sum_{i = 1}^{n} (α_{i} - α_{i}^{*}) + \sum_{i = 1}^{n} y_{i} (α_{i} - α_{i}^{*}) \\ w i t h : \sum_{i = 1}^{n} (α_{i} - α_{i}^{*}) = 0; 0 \leq α_{i} \leq C; 0 \leq α_{i}^{*} \leq C (i = 1, 2, \dots, n) \end{array}

(23)

The calculation results of the comparative analysis are shown in Table 4.

Table 4 indicates that the prediction accuracy of multi-sample similarity fusion is higher than multi-parameter similarity fusion concerning the prediction’s average relative error, and the two methods’ MAPE are both lower than MSVM, validating the effectiveness of the proposed method compared with the AI-based method. In addition, the proposed DTWD-based algorithm performs better than the traditional Euclidean distance.

In parameter similarity fusion, RUL values predicted by the same performance damage indicators are integrated to calculate the RUL of the test sample; while in sample similarity fusion, the RUL values of samples are integrated on the basis of performance damage indicators carried by each sample.

Multi-parameter and multi-sample methods are similar in calculation, but differ in some respects. Multi-parameter similarity fusion depends more on parameters’ feedback on the performance degradation process, while multi-sample similarity fusion relies on the sample data that is similar to the life-cycle trajectory in the gearbox running process. The more similar the test samples are with reference samples in terms of operating methods, conditions, and load environments, the larger the weight value that can be obtained, and the closer the RUL prediction value is to the actual value. Experimental results of the comparison are shown in Figure 17.

3.5. Limitations and Future Work

This paper proposes an RUL prediction model based on multi-parameter and multi-sample fusion, and has verified its effectiveness through analyzing a certain type of heavy high-speed diesel generator manufactured by an affiliate of CSIC. The results show that the proposed method is superior to previous studies in terms of the prediction accuracy. However, there are still some limitations in several respects. First, this paper verifies the proposed model with the diesel generator gearbox as an example, but further efforts should be devoted to testing broader gearbox equipment and even the mechanical rotating equipment. Second, this study does not classify types of malfunction at the termination and identify the degradation trend at different stages. Future researches can focus more on RUL under different malfunctions, grouping and decomposing the performance degradation process to identify test samples’ running stages, and refining the RUL prediction problems and models. Third, the research is conducted on the vibration signal of the diesel generator gearbox. To develop a more comprehensive RUL prediction method, future research should incorporate more data sources, such as performance parameters and environmental parameters.

4. Conclusions

This paper takes a certain type of heavy high-speed diesel generator as the study case. In the first step, through extracting time and frequency domain features of the original vibration and fuzzy filtering based on approximate entropy variance, the diesel generator performance damage indicators system is established. Next, this paper analyses the four core elements of similarity-based RUL prediction and establishes DTWD as the similarity measurement function. Then, we propose the methods of multi-parameter similarity fusion and multi-sample similarity fusion. Based on the two methods, the performance comparison research is carried out. The experimental results show that the MAPE values of the two RUL prediction methods proposed here are below 14%, which are lower than MSVM’s and PCA’s. This fully validates the effectiveness of the proposed method for predicting the RUL.And the RUL prediction based on the dynamic time bending distance function in the sample similarity fusion has the best accuracy which is below 10%. The similarity-based RUL prediction method has the merit of avoiding establishing a system degradation model, and is simple and practical. Moreover, it fully employs effective information provided by vibration signals, considers multiple parameters that can reflect performance degradation, and conducts a comparative analysis of multiple samples. The predicted results are stable as experimental results showed.

In summary, the innovations of this article are mainly as follows:

(1): We put forward the idea of similarity fusion with multi-parameter and multi-sample methods, and established the RUL prediction model. The performance degradation process is multi-dimensional and multifaceted. Multi-parameter similarity fusion takes full consideration of multiple parameters of vibration signals and a whole performance degradation process. Hence, a more comprehensive and accurate prediction is achieved. In contrast, multi-sample similarity fusion considers multiple samples with life-cycle degradation. By integrating RUL prediction values calculated by damage indicators carried with those samples, we improve the stability and credibility of RUL prediction; the MAPE is reduced to less than 14%, the MSE less than 220, the MADM less than 13.
(2): The DTWD-based nonlinear dynamic programming algorithm is established as the distance measure of similarity in RUL prediction. In the time series analysis, it performed better than the traditional Euclidean distance, the average relative errors of DTWD is 17% less than Euclidean distance.
(3): After time domain and frequency domain features extraction, we proposed approximate entropy variance (Aev) for low-pass filtering to remove signal noise.

Author Contributions

This manuscript was written by X.X., under the supervision of S.Z. and W.C. The modeling, data analysis, and software process were executed by S.Q., X.P. And Y.X. is responsible for the data acquisition and model design.

Funding

This work is supported by the National Natural Science Foundation of China (Grant No. 71971013 & 71871003 & 71501007) and the Fundamental Research Funds for the Central Universities (YWF-19-BJ-J-330). The study is also sponsored by the Aviation Science Foundation of China (2017ZG51081), the Technical Research Foundation (JSZL2016601A004) and Civil Aircraft Science Research Fund (MJ-2017-J-92).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

In Table A1, F_ss represents the time domain parameter of s, and x_i represents the amplitude of the vibration signal collected by the gearbox sensor within a certain period (i = 1, 2, …, N₀), where N₀ is the quantity of data points collected within the period.

Table A1. Time domain damage indicators of the gearbox vibration signal.

Damage Indicators	Feature Symbol	Equation	Implication
Average value	$F_{s 1}$	$F_{s 1} = \frac{1}{N_{0}} \sum_{i = 1}^{N_{0}} x_{i}$	Average energy value of gearbox vibration within a certain period
Mean square	$F_{s 2}$	$F_{s 2} = \sqrt{\frac{1}{N_{0}} \sum_{i = 1}^{N_{0}} x_{i}^{2}}$	Better manifesting performance degradation trend of gear and bearing [32,33,34]
Mean-square amplitude	$F_{s 3}$	$F_{s 3} = {[\frac{1}{N_{0}} \sum_{i = 1}^{N_{0}} \sqrt{\| x_{i} \|}]}^{2}$	Sensitive to larger amplitude change [35]
Absolute average	$F_{s 4}$	$F_{s 4} = \frac{1}{N_{0}} \sum_{i = 1}^{N_{0}} \| x_{i} \|$	Calculating absolute value before calculating averages, which can avoid positive-negative offset
Skewness index	$F_{s 5}$	$F_{s 5} = \frac{1}{N_{0}} \sum_{i = 1}^{N_{0}} x_{i}^{3}$	Measuring asymmetry of vibration signals
Waveform index	$F_{s 6}$	$F_{s 6} = \frac{\sqrt{\frac{1}{N_{0}} \sum_{i = 1}^{N_{0}} x_{i}^{2}}}{F_{s 4}}$	Representing deviated and inclined value between present vibration signal and sine wave
Pulsatility index	$F_{s 7}$	$F_{s 7} = \frac{\max (x_{i})}{\frac{1}{N} \sum_{i = 1}^{N_{0}} \| x_{i} \|}$	Manifesting stability and destruction level of gearbox’s degradation and malfunction [36]
Kurtosis index	$F_{s 8}$	$F_{s 8} = \frac{\frac{1}{N_{0}} \sum_{i = 1}^{N_{0}} x_{i}^{4}}{{(\sqrt{\frac{1}{N_{0}} \sum_{i = 1}^{N_{0}} x_{i}^{2}})}^{4}}$	Measuring “bending and arching” level of vibration signal
Peak-peak value	$F_{s 9}$	$F_{s}_{9} = m a x (x_{i}) - m i n (x_{i})$	Reflecting impact vibration resulted from malfunction
Margin index	$F_{s 10}$	$F_{s 10} = \frac{\max (x_{i})}{F_{s 3}}$	Reflecting Abrasion level of gear and bearing [35]

In Table A2, The spectrum of the original signal x_i collected within a certain period is represented as s_j, where j = 1, 2, …, J. J is the spectral line quantity of the spectrum, f_j represents the frequency value of the j-th line, and F_pk below represents the value of the k-th frequency domain damage indicator F_p.

Table A2. Frequency domain features of the vibration signal.

Feature Symbol	Equation	Feature Symbol	Equation	Feature Symbol	Equation
$F_{p 1}$	$\frac{\sum_{j = 1}^{J} {\bar{s}}_{j} \cdot s_{j}}{2 π \sum_{j = 1}^{J} s_{i}^{2}}$	$F_{p 2}$	$\frac{\sum_{j = 1}^{J} {\bar{s}}_{j}^{2}}{4 π^{2} \sum_{j = 1}^{J} s_{i}^{2}}$	$F_{p 3}$	$F_{p}_{2} - F_{p} {_{1}}^{2}$
$F_{p 4}$	$\frac{\sum_{j = 1}^{J} s_{j}}{J}$	$F_{p 5}$	$\frac{\sum_{j = 1}^{J} {(s_{j} - F_{p 4})}^{2}}{J - 1}$	$F_{p 6}$	$\frac{\sum_{j = 1}^{J} {(s_{j} - F_{p 4})}^{3}}{J {(\sqrt{F_{p 5}})}^{3}}$
$F_{p 7}$	$\frac{\sum_{j = 1}^{J} {(s_{j} - F_{p 4})}^{4}}{J F_{p 5}^{2}}$	$F_{p 8}$	$\frac{\sum_{j = 1}^{J} f_{j} s_{j}}{\sum_{j = 1}^{J} s_{j}}$	$F_{p 9}$	$\sqrt{\frac{\sum_{j = 1}^{J} {(f_{j} - F_{p 8})}^{2} s_{j}}{J}}$
$F_{p 10}$	$\sqrt{\frac{\sum_{j = 1}^{J} f_{j}^{2} s_{j}}{\sum_{j - 1}^{J} s_{j}}}$	$F_{p 11}$	$\sqrt{\frac{\sum_{j = 1}^{J} f_{j}^{4} s_{j}}{\sum_{j - 1}^{J} f_{j}^{2} s_{j}}}$	$F_{p 12}$	$\frac{\sum_{j = 1}^{J} f_{j}^{2} s_{j}}{\sqrt{\sum_{j = 1}^{J} s_{j} \sum_{j = 1}^{J} f_{j}^{4} s_{j}}}$
$F_{p 13}$	$\frac{F_{p 9}}{F_{p 8}}$	$F_{p 14}$	$\frac{\sum_{j = 1}^{J} {(f_{j} - F_{p 8})}^{3} s_{j}}{J F_{p 9}^{3}}$	$F_{p 15}$	$\frac{\sum_{j = 1}^{J} {(f_{j} - F_{p 8})}^{4} s_{j}}{J F_{p 9}^{4}}$

Appendix B

(1) Calculation of Corr, Mon and Rob

C o r r (X, T) = \frac{| K \sum_{k} X_{T} (t_{k}) t_{k} - \sum_{k} X_{T} (t_{k}) \sum_{k} t_{k} |}{\sqrt{[K \sum_{k} X_{T} {(t_{k})}^{2} - (\sum_{k} X_{T} (t_{k}))^{2}] [K \sum_{k} t_{k}^{2} - (\sum_{k} t_{k})^{2}]}}

(A1)

M o n (x) = \frac{1}{K - 1} | \sum_{k} δ (X_{T} (t_{k + 1}) - X_{T} (t_{k})) - \sum_{k} δ (X_{T} (t_{k}) - X_{T} (t_{k + 1})) |

(A2)

R o b (X) = \frac{1}{K} \sum_{k} \exp (- | \frac{X_{R} (t_{k})}{X (t_{k})} |)

(A3)

In those equations, K represents the total number of time series and

δ

( ) represents the unit step function. When the value of the independent variable in parentheses is larger than 0, the value of

δ

( ) is 1; otherwise, the value of

δ

( ) is 0.They are all distributed in the range of [0, 1] and positively correlated with time domain features and frequency domain features.

(2) The performance evaluation indicator Q

Mean Absolute Error (MAE): B is the start time of the test sample’s prediction, E is the end time of prediction, and i is the time point of prediction. Δ(i) represents the difference between the predicted value and actual value of the i-th prediction. The smaller the MAE is, the higher the prediction accuracy is.

M A E = \frac{\sum_{i = B}^{E} | Δ (i) |}{E - B + 1}

(A4)

Mean Squared Error (MSE):

M S E = \frac{1}{E - B + 1} \sum_{i = 1}^{E - B + 1} Δ {(i)}^{2}

(A5)

Mean Absolute Percentage Error (MAPE): The concept of Relative Error is introduced in this paper considering the difference between the predicted value and actual value.

M A P E = \frac{1}{E - B + 1} \sum_{i = 1}^{E - B + 1} | \frac{100 Δ (i)}{U (i)} |

(A6)

Error Standard Deviation (ESD): This reflects fluctuation of the error value. The smaller the value is, the more stable the gearbox is.

M = \frac{1}{E - B + 1} \sum_{i = 1}^{E - B + 1} Δ (i)

(A7)

E S D = \sqrt{\frac{\sum_{i = 1}^{E - B + 1} {(Δ (i) - M)}^{2}}{E - B}}

(A8)

Error Standard Deviation (ESD):Me denotes the median of the error value. MADM reflects the deviation degree of the error value from the median value, which applies to cases where the error value does not conform to Normal Distribution.

M A D M = \frac{1}{E - B + 1} \sum_{i - 1}^{E - B + 1} | Δ (i) - M e |

(A9)

(3) The pseudocode of DTWD (Algorithm A1)

Algorithm A1: Main: calculate DTWD

Input: time series A (array [1, l]), B(array [1,k])

Output: DTWD

(1) Matrix D = A^T B.

(2) Set constraint condition:

Boundedness, Boundary conditions, Continuity, and Monotonicity

(3) D_twd = 0

(4) FORi: = 1:

l

DO

DTWD [i, 0]: = ∞

(5) FORi: = 1:

l

DO

DTWD [0, i]: = ∞

(6) DTW [0, 0]: = 0

(7) FORi = 1:

l

DO

{ FORj = 1:

k

DO

d (i, j) = {‖ i - j ‖}_{p}

\cos t : = d (A [i], B [j])

DTWD [i,j]: = cost + minimum (DTWD [i − 1,j], DTWD [i,j − 1], DTWD [i − 1,j − 1])

}

(8) returnDTWD

The details of RUL prediction result based on multi-parameter similarity fusion with Euclidean distance/DTWD.

Table A3. RUL prediction results based on multi-parameter similarity fusion with Euclidean distance.

NO.	RUL		Error	Relative Error (%)	NO.	RUL		Error	Relative Error (%)
NO.	Actual Value	Predicted Value	Error	Relative Error (%)	NO.	Actual Value	Predicted Value	Error	Relative Error (%)
1	267	348	−81	−30.35	28	132	128	4	2.97
2	262	238	24	9.17	29	127	137	−10	−7.50
3	257	351	−94	−36.54	30	122	135	−13	−10.33
4	252	235	17	6.91	31	117	128	−11	−9.35
5	247	236	12	4.67	32	112	115	−3	−2.90
6	242	227	15	6.39	33	107	122	−15	−13.69
7	237	223	14	6.05	34	102	129	−27	−26.31
8	232	218	14	6.03	35	97	116	−19	−19.45
9	227	208	19	8.36	36	92	111	−19	−20.36
10	222	209	13	5.83	37	87	113	−26	−30.06
11	217	207	10	4.73	38	82	103	−21	−25.65
12	212	203	9	4.12	39	77	94	−17	−21.66
13	207	166	42	20.08	40	72	96	−24	−33.46
14	202	167	36	17.61	41	67	92	−25	−37.67
15	197	163	34	17.20	42	62	97	−35	−55.75
16	192	161	31	16.34	43	57	90	−33	−57.49
17	187	161	26	14.14	44	52	97	−45	−86.60
18	182	156	26	14.06	45	47	95	−48	−101.49
19	177	154	23	12.77	46	42	71	−29	−68.44
20	172	152	20	11.77	47	37	71	−34	−91.10
21	167	149	18	10.65	48	32	65	−33	−102.14
22	162	150	12	7.35	49	27	63	−36	−131.98
23	157	147	10	6.16	50	22	51	−29	−130.16
24	152	142	10	6.36	51	17	39	−22	−127.27
25	147	143	4	2.66	52	12	15	−3	−23.33
26	142	143	−1	−0.31	53	7	15	−8	−109.38
27	137	143	−81	−4.34

Table A4. RUL prediction results based on multi-parameter similarity fusion with DTWD.

NO.	RUL		Error	Relative Error (%)	NO.	RUL		Error	Relative Error (%)
NO.	Actual Value	Predicted Value	Error	Relative Error (%)	NO.	Actual Value	Predicted Value	Error	Relative Error (%)
1	267	239	28	10.36	28	132	127	5	3.95
2	262	242	20	7.67	29	127	135	−8	−6.38
3	257	240	17	6.52	30	122	134	−11	−9.38
4	252	237	15	6.13	31	117	126	−9	−7.55
5	247	232	15	6.13	32	112	122	−10	−8.69
6	242	232	10	4.28	33	107	121	−14	−12.84
7	237	222	15	6.52	34	102	110	−8	−7.56
8	232	215	17	7.22	35	97	120	−23	−23.53
9	227	214	13	5.73	36	92	111	−18	−19.95
10	222	215	7	3.05	37	87	100	−13	−14.76
11	217	213	4	1.85	38	82	98	−16	−19.31
12	212	210	2	1.00	39	77	92	−15	−19.27
13	207	204	4	1.72	40	72	86	−14	−19.21
14	202	206	−4	−1.81	41	67	81	−14	−20.64
15	197	205	−8	−3.89	42	62	98	−35	−56.96
16	192	201	−9	−4.66	43	57	95	−38	−66.05
17	187	202	−15	−8.12	44	52	94	−42	−79.67
18	182	202	−20	−11.01	45	47	92	−45	−95.82
19	177	150	27	15.23	46	42	45	−3	−7.55
20	172	149	23	13.36	47	37	25	12	31.81
21	167	191	−24	−14.59	48	32	42	−9	−29.06
22	162	148	14	8.44	49	27	25	2	7.94
23	157	143	14	9.00	50	22	30	−8	−36.89
24	152	141	11	7.10	51	17	30	−12	−71.91
25	147	138	9	5.94	52	12	16	−4	−28.81
26	142	138	5	3.19	53	7	5	2	27.88
27	137	138	−1	−0.88

The details of RUL prediction result based on multi-sample similarity fusion with euclidean distance/DTWD.

Table A5. prediction results based on multi-sample similarity fusion with euclidean distance.

NO.	RUL		Error	Relative Error (%)	NO.	RUL		Error	Relative Error (%)
NO.	Actual Value	Predicted Value	Error	Relative Error (%)	NO.	Actual Value	Predicted Value	Error	Relative Error (%)
1	267	221	46	17.14	28	132	121	11	8.09
2	262	221	41	15.72	29	127	120	7	5.37
3	257	220	37	14.34	30	122	123	0	−0.38
4	252	222	30	11.89	31	117	119	−2	−1.60
5	247	221	26	10.43	32	112	105	8	6.75
6	242	205	37	15.25	33	107	93	14	13.36
7	237	204	33	13.78	34	102	91	11	10.94
8	232	199	33	14.07	35	97	97	0	−0.13
9	227	192	35	15.33	36	92	93	−1	−1.37
10	222	189	33	15.03	37	87	85	2	2.84
11	217	188	30	13.60	38	82	86	−3	−4.12
12	212	163	49	23.18	39	77	73	5	6.01
13	207	157	50	24.30	40	72	76	−4	−5.06
14	202	157	45	22.16	41	67	71	−4	−5.64
15	197	153	44	22.27	42	62	61	1	1.98
16	192	150	42	22.11	43	57	74	−17	−28.88
17	187	155	32	17.17	44	52	76	−24	−46.44
18	182	151	31	17.17	45	47	70	−23	−49.52
19	177	140	37	20.85	46	42	74	−31	−74.72
20	172	141	32	18.35	47	37	58	−21	−57.02
21	167	144	23	13.97	48	32	59	−27	−84.10
22	162	140	22	13.35	49	27	54	−27	−98.48
23	157	137	20	12.60	50	22	35	−13	−57.95
24	152	130	22	14.43	51	17	26	−9	−51.51
25	147	128	19	12.79	52	12	21	−9	−72.66
26	142	127	15	10.83	53	7	8	−1	−11.67
27	137	128	10	6.95

Table A6. RUL prediction results based on multi-sample similarity fusion with DTWD.

NO.	RUL		Error	Relative Error (%)	NO.	RUL		Error	Relative Error (%)
NO.	Actual Value	Predicted Value	Error	Relative Error (%)	NO.	Actual Value	Predicted Value	Error	Relative Error (%)
1	267	284	−17	−6.40	28	132	133	0	−0.35
2	262	263	−1	−0.47	29	127	137	−10	−8.07
3	257	272	−15	−5.79	30	122	136	−14	−11.36
4	252	224	28	11.04	31	117	132	−15	−12.51
5	247	224	23	9.34	32	112	116	−4	−3.29
6	242	216	26	10.58	33	107	113	−6	−5.30
7	237	215	22	9.19	34	102	111	−9	−8.33
8	232	213	19	8.40	35	97	115	−18	−18.84
9	227	207	20	8.90	36	92	109	−17	−17.94
10	222	211	11	4.89	37	87	109	−21	−24.59
11	217	205	12	5.65	38	82	101	−18	−22.37
12	212	200	12	5.59	39	77	91	−14	−18.31
13	207	191	16	7.64	40	72	91	−19	−26.62
14	202	189	13	6.46	41	67	90	−23	−34.15
15	197	187	10	5.09	42	62	92	−30	−48.85
16	192	186	6	3.38	43	57	80	−23	−39.99
17	187	185	2	0.87	44	52	76	−24	−45.74
18	182	182	0	0.26	45	47	55	−8	−16.65
19	177	155	22	12.63	46	42	55	−13	−30.48
20	172	154	18	10.52	47	37	40	−3	−7.66
21	167	162	5	3.18	48	32	40	−8	−24.40
22	162	152	10	6.42	49	27	35	−8	−28.88
23	157	149	9	5.43	50	22	39	−17	−76.00
24	152	146	7	4.34	51	17	18	−1	−4.89
25	147	144	3	1.80	52	12	10	2	17.78
26	142	143	−1	−0.61	53	7	5	2	30.21
27	137	142	−5	−3.83

References

Wang, Z.Q.; Hu, C.H.; Si, X.S.; Enrico, Z. Remaining useful life prediction of degrading systems subjected to imperfect maintenance: Application to draught fans. Mech. Syst. Sig. Process. 2018, 100, 802–813. [Google Scholar] [CrossRef]
Martinez-Morales, J.D.; Palacios-Hernandez, E.R.; Campos-Delgado, D.U. Multiple-fault diagnosis in induction motors through support vector machine classification at variable operating conditions. Electr. Eng. 2018, 100, 59–73. [Google Scholar] [CrossRef]
Wang, T. Trajectory Similarity Based Prediction for Remaining Useful Life Estimation. Ph.D. Thesis, University of Cincinnati, Cincinnati, OH, USA, 2010. [Google Scholar]
Sikorska, J.Z.; Hodkiewicz, M.; Ma, L. Prognostic modelling options for remaining useful life estimation by industry. Mech. Syst. Sig. Process. 2011, 25, 1803–1836. [Google Scholar] [CrossRef]
Lei, Y.; Li, N.; Guo, L.; Li, N.; Yan, T.; Lin, J. Machinery health prognostics: A systematic review from data acquisition to RUL prediction. Mech. Syst. Sig. Process. 2018, 104, 799–834. [Google Scholar] [CrossRef]
Frank, A.X.P.H.; Pinter, G. Numerical Assessment of PE 80 and PE 100 Pipe Lifetime Based on Paris—Erdogan Formula. Macromol. Symp. 2012, 311, 112–121. [Google Scholar] [CrossRef]
Hu, Y.; Baraldi, P.; Maio, F.D.; Enrico, Z. Online Performance Assessment Method for a Model-Based Prognostic Approach. IEEE Trans. Reliab. 2016, 65, 718–735. [Google Scholar] [CrossRef]
Hussain, S.; Gabbar, H.A. Vibration analysis and time series prediction for wind turbine gearbox prognostics. Int. J. Progn. Health Manag. 2013, 4, 718–735. [Google Scholar]
Aroussi, M.E. Bearings prognostic using Mixture of Gaussians Hidden Markov Model and Support Vector Machine. Int. J. Netw. Secur. Appl. 2013, 5, 4. [Google Scholar]
Dong, S.; Luo, T. Bearing degradation process prediction based on the PCA and optimized LS-SVM model. Measurement 2013, 46, 3143–3152. [Google Scholar] [CrossRef]
Wang, P.; Youn, B.D.; Hu, C. A generic probabilistic framework for structural health prognostics and uncertainty management. Mech. Syst. Sig. Process. 2012, 28, 622–637. [Google Scholar] [CrossRef]
You, M.Y.; Meng, G. A generalized similarity measure for similarity-based residual life prediction. Proc. Inst. Mech. Eng. Part E J. Process. Mech. Eng. 2011, 225, 151–160. [Google Scholar] [CrossRef]
Mosallam, A.; Medjaher, K.; Zerhouni, N. Data-driven prognostic method based on Bayesian approaches for direct remaining useful life prediction. J. Intell. Manuf. 2016, 27, 1037–1048. [Google Scholar] [CrossRef]
Zhang, Q.; Tse, W.T.; Wan, X.; Xu, G. Remaining useful life estimation for mechanical systems based on similarity of phase space trajectory. Expert Syst. Appl. 2015, 42, 2353–2360. [Google Scholar] [CrossRef]
Xiong, X.; Yang, H.; Cheng, N.; Li, Q. Remaining Useful Life Prognostics of Aircraft Engines Based on Damage Propagation Modeling and Data Analysis. In Proceedings of the International Symposium on Computational Intelligence and Design, Hangzhou, China, 12–13 December 2015. [Google Scholar]
Moghaddass, R.; Zuo, M.J. An integrated framework for online diagnostic and prognostic health monitoring using a multistate deterioration process. Reliab. Eng. Syst. Saf. 2014, 124, 92–104. [Google Scholar] [CrossRef]
Xiao, Y.; Zhang, R.; Zhang, Q. Permutation flow shop scheduling with order acceptance and weighted tardiness. Appl. Math. Comput. 2015, 270, 312–333. [Google Scholar] [CrossRef]
Epinette, J.; Jolles-Haeberli, B.M. Comparative Results from a National Joint Registry Hip Data Set of a New Cross-Linked Annealed Polyethylene vs. Both Conventional Polyethylene and Ceramic Bearings. J. Arthropl. 2016, 31, 1483–1491. [Google Scholar] [CrossRef] [PubMed]
Liu, Y.; He, B.; Liu, F.; Lu, S.; Zhao, Y.; Zhao, J. Remaining Useful Life Prediction of Rolling Bearings Using PSR, JADE, and Extreme Learning Machine. Math. Prob. Eng. 2016, 2016, 13. [Google Scholar] [CrossRef]
Candan Çağatay Ozaktas, H.M. Sampling and series expansion theorems for fractional Fourier and other transforms. Sig. Process. 2003, 83, 2455–2457. [Google Scholar] [CrossRef]
Lei, Y.; He, Z.; Zi, Y. A new approach to intelligent fault diagnosis of rotating machinery. Expert Syst. Appl. 2008, 35, 1593–1600. [Google Scholar] [CrossRef]
Zhou, S.; Qian, S.; Chang, W.; Xiao, Y.; Cheng, Y. A Novel Bearing Multi-Fault Diagnosis Approach Based on Weighted Permutation Entropy and an Improved SVM Ensemble Classifier. Sensors 2018, 18, 6. [Google Scholar] [CrossRef]
Zheng, J.; Pan, H.; Yang, S.; Cheng, J. Generalized composite multiscale permutation entropy and Laplacian score based rolling bearing fault diagnosis. Mech. Syst. Sig. Process. 2018, 99, 229–243. [Google Scholar] [CrossRef]
Huang, J.; Pei, W.; Cao, D.; Yu, S. Fuzzy Filter Based on Approximate Entropy. J. Data Acquis. Process. 1998, 13, 140–143. [Google Scholar]
Josue, E.; Guillermo, V.; Francisco, R.; Lopez, E.; Gerardo, S.; Jose, A. Efficient predictive vibration control of a building-like structure. Asian J. Control 2018. [Google Scholar] [CrossRef]
Ahmad, W.; Khan, S.A.; Islam, M.M.; Kim, J.M. A reliable technique for remaining useful life estimation of rolling element bearings using dynamic regression models. Reliab. Eng. Syst. Saf. 2018, 184, 67–76. [Google Scholar] [CrossRef]
Tanzi, M.; Pereira, T.; Van Strien, S. Robustness of ergodic properties of non-autonomous piecewise expanding maps. Ergod. Theory Dyn. Syst. 2017, 39, 32. [Google Scholar] [CrossRef]
Gu, M.; Chen, Y. Two improvements of similarity-based residual life prediction methods. J. Intell. Manuf. 2019, 30, 303–315. [Google Scholar] [CrossRef]
Aghabozorgi, S.; Shirkhorshidi, A.S.; Wah, T.Y. Time-series clustering—A decade review. Inf. Syst. 2015, 53, 16–38. [Google Scholar] [CrossRef]
Santos-Ruiz, J.R.; Bermúdez, F.R.; López-Estrada, V.; Puig, L.; Torres, J.A. Diagnosis of Fluid Leaks in Pipelines Using Dynamic PCA. Int. Fed. Autom. Control 2018, 51, 373–380. [Google Scholar] [CrossRef]
Jia, X.; Jin, C.; Buzza, M.; Wang, W. Wind turbine performance degradation assessment based on a novel similarity metric for machine performance curves. Renew. Energy 2016, 99, 1191–1201. [Google Scholar] [CrossRef]
Yao, B.; Su, J.; Wu, L.; Guan, Y. Modified Local Linear Embedding Algorithm for Rolling Element Bearing Fault Diagnosis. Appl. Sci. 2017, 7, 1178. [Google Scholar] [CrossRef]
Zhang, B.; Zhang, L.; Xu, J. Degradation Feature Selection for Remaining Useful Life Prediction of Rolling Element Bearings. Qual. Reliab. Eng. Int. 2016, 32, 547–554. [Google Scholar] [CrossRef]
Wu, B.; Li, W.; Qiu, M.Q. Remaining Useful Life Prediction of Bearing with Vibration Signals Based on a Novel Indicator. Shock Vibr. 2017, 10. [Google Scholar] [CrossRef]
Vladimiroff, T. Ab initio and density functional calculations of mean-square amplitudes of vibration for benzene and cubane. J. Mol. Struct. Theochem. 2000, 507, 111–118. [Google Scholar] [CrossRef]
Ji, M.; Guo, H.J.; Zhang, Y.D.; Li, T.; Gao, L. Hierarchic Analysis Method to Evaluate Rock Burst Risk. Math. Probl. Eng. 2015, 2015, 8. [Google Scholar] [CrossRef]

Figure 1. Performance of the system.

Figure 2. Curves comparison of F_S₁₀ before and after wave filtering.

Figure 3. Principle of similarity-based Remaining Useful Life (RUL) prediction.

Figure 4. The dynamic time warping path.

Figure 5. Similarity fusion of multi-parameter.

Figure 6. Similarity fusion of multi-sample.

Figure 7. Combination of the two methods.

Figure 8. Typical diesel engine (Inside the black frame is a gearbox).

Figure 9. Vibration signal diagram in the life-cycle degradation process.

Figure 10. Life-cycle diagrams of six selected performance damage indicators.

Figure 11. Curves of the six performance damage indicators from different samples.

Figure 12. Actual values and predicted values of RUL based on multi-parameter similarity fusion with Euclidean distance.

Figure 13. Actual values and predicted values of RUL based on multi-parameter similarity fusion with Dynamic Time Warping Distance (DTWD).

Figure 14. Actual values and predicted values of RUL based on multi-sample similarity fusion with Euclidean distance.

Figure 15. Actual values and predicted values of RUL based on multi-sample similarity fusion with DTWD.

Figure 16. First- and second-order principal component of Sample GU’s performance damage indicator system.

Figure 17. Comparison of four RUL prediction methods.

Table 1. Vibration data sets for the diesel generator gearbox.

Gearbox No.	Service Time (Unit: h)	Record Interval (Unit: min)
GU1	467	5 or 10
GU2	390	5 or 10
GU3	410	5 or 10
GU4	408	5 or 10

Table 2. Evaluation results of time-domain and frequency-domain damage indicators.

Damage Indicators	Corr	Mon	Rob	W	Ranking	Damage Indicators	Corr	Mon	Rob	W	Ranking
F_s₁	0.0723	0.0060	0.3936	0.13554	25	F_p₄	0.6379	0.0496	0.9491	0.43711	10
F_s₂	0.7374	0.0641	0.9418	0.46207	5	F_p₅	0.5585	0.0641	0.8725	0.4055	14
F_s₃	0.5224	0.0675	0.9084	0.41075	13	F_p₆	0.9134	0.0051	0.9016	0.45571	8
F_s₄	0.7452	0.0675	0.9514	0.46821	3	F_p₇	0.1305	0.0436	0.5069	0.19997	24
F_s₅	0.7440	0.0051	0.4542	0.28761	22	F_p₈	0.1091	0.0017	0.9956	0.32135	20
F_s₆	0.5463	0.0009	0.9797	0.40362	15	F_p₉	0.8201	0.0513	0.9696	0.48055	1
F_s₇	0.5228	0.0009	0.8678	0.36535	18	F_p₁₀	0.4615	0.0188	0.9928	0.39954	16
F_s₈	0.6418	0.0239	0.8301	0.38934	17	F_p₁₁	0.5650	0.0239	0.9917	0.42246	11
F_s₉	0.7128	0.0474	0.8501	0.42129	12	F_p₁₂	0.0447	0.0265	0.9950	0.32069	21
F_s₁₀	0.8328	0.0248	0.8696	0.43984	9	F_p₁₃	0.8896	0.0615	0.8985	0.47822	2
F_p₁	0.9135	0.0085	0.9071	0.45908	6	F_p₁₄	0.4418	0.0094	0.5156	0.24774	23
F_p₂	0.9135	0.0085	0.9071	0.45908	7	F_p₁₅	0.3534	0.0581	0.8413	0.35212	19
F_p₃	0.9105	0.0077	0.9222	0.46261	4

Table 3. Comparison with single-parameter Remaining Useful Life (RUL) prediction.

Parameter	Performance Evaluation Indicators Q
Parameter	MAE	MSE	MAPE	ESD	MADM
F_p₉	23	751.86	36.51%	24.78	21.10
F_p₁₃	20	709.71	20.30%	26.45	19.97
F_s₄	21	676.81	21.35%	21.98	18.92
F_p₃	41	4142.53	79.74%	63.75	40.72
F_s₂	23	1014.17	40.64%	25.22	19.34
F_p₁	38	3023.67	69.93%	54.82	38.20
PCA-1	13	241.22	18.79%	15.39	13.05
PCA-2	16	490.60	22.69%	21.90	16.02

Table 4. Results of different prediction methods.

RUL Prediction Method	Similariy Measure	Performance Evaluation Indicators Q
RUL Prediction Method	Similariy Measure	MAE	MSE	MAPE	ESD	MADM
multi-parameter similarity fusion	Euclidean distance	23	808.81	30.90%	27.78	22.56
multi-parameter similarity fusion	DTWD function	14	292.11	17.15%	16.82	13.73
multi-sample similarity fusion	Euclidean distance	22	684.30	21.43%	22.41	19.35
multi-sample similarity fusion	DTWD function	12	219.37	14.00%	14.76	12.39
MSVM	Euclidean distance	19	351.02	23.34%	17.83	12.24
MSVM	DTWD function	17	287.34	20.14%	16.21	11.45

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhou, S.; Xu, X.; Xiao, Y.; Chang, W.; Qian, S.; Pan, X. Remaining Useful Life Prediction with Similarity Fusion of Multi-Parameter and Multi-Sample Based on the Vibration Signals of Diesel Generator Gearbox. Entropy 2019, 21, 861. https://doi.org/10.3390/e21090861

AMA Style

Zhou S, Xu X, Xiao Y, Chang W, Qian S, Pan X. Remaining Useful Life Prediction with Similarity Fusion of Multi-Parameter and Multi-Sample Based on the Vibration Signals of Diesel Generator Gearbox. Entropy. 2019; 21(9):861. https://doi.org/10.3390/e21090861

Chicago/Turabian Style

Zhou, Shenghan, Xingxing Xu, Yiyong Xiao, Wenbing Chang, Silin Qian, and Xing Pan. 2019. "Remaining Useful Life Prediction with Similarity Fusion of Multi-Parameter and Multi-Sample Based on the Vibration Signals of Diesel Generator Gearbox" Entropy 21, no. 9: 861. https://doi.org/10.3390/e21090861

APA Style

Zhou, S., Xu, X., Xiao, Y., Chang, W., Qian, S., & Pan, X. (2019). Remaining Useful Life Prediction with Similarity Fusion of Multi-Parameter and Multi-Sample Based on the Vibration Signals of Diesel Generator Gearbox. Entropy, 21(9), 861. https://doi.org/10.3390/e21090861

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Remaining Useful Life Prediction with Similarity Fusion of Multi-Parameter and Multi-Sample Based on the Vibration Signals of Diesel Generator Gearbox

Abstract

1. Introduction

2. Methodology

2.1. The Damage Indicators

2.2. Defining a Subset of Most Significant Damage Indicators

2.3. Similarity-Based RUL Prediction

2.4. Similarity Measurement Function S(.): Dynamic Time Warping Distance (DTWD)

2.5. RUl Estimation by Multi-Parameter Fusion

2.6. RUl Estimation by Multi-Sample Fusion

2.7. Combining the Two Estimates into One

3. Experimental Results and Comparative Analysis

3.1. Parameters System: Gearbox Performance Degradation Data

3.2. RUL Prediction Results

3.3. Comparative Analysis with Single-Parameter RUL Prediction

3.4. Comparative Analysis with AI-Based RUL Method: MSVM

3.5. Limitations and Future Work

4. Conclusions

Author Contributions

Funding

Conflicts of Interest

Appendix A

Appendix B

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI