Stochastic Identification and Analysis of Long-Term Degradation Through Health Index Data

Hamid Shiri; Pawel Zimroz

doi:10.3390/math13121972

Abstract

Timely diagnosis and prognosis based on degradation symptoms are essential steps for condition-based maintenance (CBM) to guarantee industrial safety and productivity. Most industrial machines operate under variable operating conditions. This time-varying operating condition can accelerate the machinery’s degradation process. It may have a massive influence on data and impede the process of diagnosis and prognosis of the machinery. Therefore, in this paper, to address the mentioned problems, we introduced an approach for modelling non-stationary long-term condition monitoring data. This procedure includes separating random and deterministic parts and identifying possible autodependence hidden in the random sequence, as well as potential time-dependent variance. To achieve these objectives, we employ a time-varying coefficient autoregressive (TVC-AR) model within a Bayesian framework. However, due to the limited availability of diverse run-to-failure data sets, we validate the proposed procedure using a simulated degradation model and two widely recognized benchmark data sets (FEMTO and wind turbine drive), which demonstrate the model’s effectiveness in capturing complex non-stationary degradation characteristics.

Keywords:

condition-based maintenance; health index; non-stationary; degradation; time-varying; Gaussian noise; Bayesian framework; long-term data; autoregressive

MSC:

62M10

1. Introduction

The use of long-term monitoring data for diagnostics and prognostics has grown with the development of condition monitoring systems. Efficient use of such data, collected over months or even years, is a crucial element of diagnosis and prognosis. In recent years, many methods have been published in this area, and they can be categorised into three main groups: data-driven approaches (including machine learning (ML)-based approaches [1,2,3,4] and statistics-based approaches [5,6,7,8]), physics-based approaches [9,10], and hybrid approaches [11,12]. Physics-based approaches explain the degradation process through its underlying physics, drawing on damage and fracture mechanics [13]. This family of approaches can provide accurate results. However, exact physical knowledge of complex systems is not always available, or is too expensive to obtain [14], which significantly restricts the use of physics-based approaches in real applications. Data-driven approaches attempt to build a model of the degradation process from historical data. They are usually more effective than physics-based approaches for complex engineering systems such as wind turbines, aircraft, and mining machines. This class is divided into two subclasses: machine learning-based models and statistics-based models. Machine learning-based approaches are powerful tools for modelling, segmenting, and predicting complex systems whose degradation processes are hard to describe with physics-based or statistics-based models. However, these methods require a large amount of historical degradation data for training, which is not available for most practical equipment. In comparison, statistics-based approaches require neither much degradation data nor detailed mechanical knowledge of the equipment. Furthermore, statistics-based models have considerable potential for accounting for degradation uncertainties. Hybrid approaches aim to combine the advantages of physics-based and data-driven approaches through their integration. More detailed information on hybrid approaches can be found in the literature [11,12].

Selecting an appropriate statistical model (stochastic process or random variable) that closely matches the degradation process in the actual application is crucial for addressing diagnostic and prognostic problems. Therefore, different statistical model-based approaches have been developed in the literature. Some of these stochastic approaches are based on Gaussian processes, such as Brownian motion (BM), also known as the Wiener process [15]. Wang et al. [16] developed a degradation model that employs linear BM to calculate the remaining useful life (RUL), assuming that its distribution is inverse Gaussian. Bian et al. [17] introduced a covariance-dependent degradation model. However, models based on BM have difficulty describing long-range dependence. To solve this problem, Xi et al. [18] used a degradation model based on fractional Brownian motion (FBM), which provided an approximate explicit solution for estimating the RUL.

Other stochastic models of the degradation process are based on non-Gaussian processes, such as the Gamma process [19], the generalised Cauchy (GC) process [20], and fractional Lévy stable motion (FLSM) [21].

However, the mentioned stochastic models are practical and suitable only under particular conditions. For example, both BM and FBM are built on stationary increments, and the increments of both follow the same distribution. Consequently, BM and FBM can describe non-stationary degradation processes only when the drift term has a nonlinear trend. In such models, the non-stationarity is reflected only in the expectation of the increments, as the drift term expresses the deterministic part of the degradation process [22]. In general, however, the heterogeneity of actual degradation processes appears in both the deterministic and random parts. Therefore, it is necessary to consider non-stationary characteristics in the random part as well. Some methods describe the non-stationary part with a specific function, such as a power law. However, this assumption is too simple for complex systems [23].

The autoregressive model (AR) is one of the influential and well-known approaches used to model and analyse time series [5,24,25]. Nevertheless, classical AR modelling is suitable only for stationary time series, and in numerous cases, assumptions about stationarity are too restrictive or inappropriate [22]. Nonetheless, Żuławiński et al. [23] introduced a new framework for the long-term HI model, which, in part of this framework, uses a robust AR approach to identify the characteristics of random parts. They used a robust estimator to identify the scale (variance in the Gaussian case) and normalised a random component. Then they fitted the AR model to the stationary part of the signal. This framework can also model both the deterministic and random parts of the degradation process. However, it is difficult to use this model for online applications.

In the case of non-stationary time series, time-varying parameters and adaptive modelling seem to be promising options. Time-varying coefficient autoregressive (TVC-AR) models have been developed since the early 1980s, particularly in the Bayesian framework; see [26] for an excellent review. TVC-AR models are powerful tools for expressing non-stationary time series with quasi-periodic latent features [27]. Such time series appear in several applications, for example, structural health monitoring [28,29], biomedical applications [30,31,32], speech signal processing [33], and financial time series [34,35].

To address the above-mentioned problems, we introduce a framework, based on TVC-AR, for identifying and modelling complex long-term condition monitoring data with non-stationary characteristics. We describe how to identify all components (both deterministic and random) and how to build a model for prognosis and remaining useful life estimation when the signal has non-stationary characteristics in both its deterministic and random components. Additionally, this model captures all time-dependent characteristics, such as location and variance (scale). The contributions of this paper are summarised as follows:

A model of HI data was proposed as a three-segment sequence with non-stationary characteristics in both random and deterministic components to describe the degradation process, which can be used to simulate the artificial data set.
Online identification of the time-varying random characteristics component, like mean (location) and variance (scale), and also the dependency between them, is described for the TVC-AR model.
A long-term data model based on TVC-AR is proposed for identification and modelling, and extensive experiments are carried out on the simulated data set and FEMTO and wind turbine data sets to verify its effectiveness.

The paper is organised as follows. Section 2 defines the theoretical building blocks of the proposed approach. Section 3 presents the simulated model and the corresponding results. Section 4 shows the results of applying the proposed approach to two benchmark data sets, including all intermediate steps. Section 5 discusses these results, and Section 6 concludes the paper.

2. Methodology and Theory

2.1. Degradation Model

In the prognostics and health management (PHM) community, several stochastic models are used to describe the degradation process. These models are usually composed of a deterministic part (which describes the global trend of the degradation process) and a random part (which accounts for its uncertainty). They are usually selected based on the physics of the degradation process, with different deterministic and random trends [23]. In this work, because of the complex trend of a growing crack, we selected a time-varying model with three regimes. The first regime has a constant deterministic trend and corresponds to the healthy state of the system (the machine is working stably). The second regime has a linearly growing trend and corresponds to the degradation of the machine (the crack length grows gradually). The last regime has an exponential trend and corresponds to the critical state of the machine (the crack length grows dramatically). Additionally, as discussed above, the random part is very important for reflecting the uncertainty. Therefore, we propose a model with non-stationary characteristics: the scale (variance) of the random part changes during the degradation process according to the current regime, and the random component may also exhibit dependence over time. Table 1 lists the main characteristics of the data in each stage.

Table 1. Characteristics of the proposed degradation model.

Figure 1 shows an example of a degradation curve generated according to Table 1.

Figure 1. Example of generated degradation curves with the proposed methodology.

2.2. Methodology

The flowchart of this methodology procedure is presented in Figure 2. First, we express the general approach for the proposed framework. Then, we present the algorithms used in this framework in detail.

Figure 2. The flowchart of the proposed methodology (each block of this diagram is described in detail in the following).

2.3. Theory

2.3.1. Deterministic Component

First, we identify the deterministic component in the long-term data. The deterministic component indicates the global trend of the degradation process, which plays a crucial role in describing, analysing, and predicting that process. Most machinery failures are related to the initiation and growth of cracks. Given the nature of crack growth in a material, our preliminary research on real data sets indicates that a single deterministic function cannot adequately describe this component [23]. More precisely, the type of the deterministic component (called the trend here) changes with the regime, corresponding to the good, warning, and alarm regimes. Therefore, in this paper, we compute an empirical location measure over overlapping windows of a given length w to identify the deterministic component without having to segment the long-term data. This step is essential because the identified deterministic component is subsequently removed from the raw data. After removing the deterministic component, the series is denoted as

{y (n)}

.

2.3.2. Separating the Random and Deterministic Component

In the general framework, the first crucial step is identifying the deterministic trend. The classical statistic assumed as the location measure is just a sample mean. In this case, we compute the moving average (MA) for overlapping segments (with the number of overlapping samples o) of length w and assume it as the deterministic component of the signal.
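The moving-average step described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes a centred window that slides by one sample (that is, o = w − 1) and shrinks at the two ends of the series, and the function name `moving_average_trend` and the toy signal are hypothetical.

```python
import numpy as np

def moving_average_trend(hi, w=51):
    """Centred moving average with an odd window length w.

    Assumption: the window slides by one sample (overlap o = w - 1)
    and is truncated near the two ends of the series.
    """
    hi = np.asarray(hi, dtype=float)
    half = w // 2
    trend = np.empty_like(hi)
    for n in range(hi.size):
        lo, up = max(0, n - half), min(hi.size, n + half + 1)
        trend[n] = hi[lo:up].mean()
    return trend

# Toy health index: a linear trend plus Gaussian noise (illustrative only).
rng = np.random.default_rng(0)
t = np.arange(500)
hi = 0.01 * t + rng.normal(0.0, 0.5, t.size)

trend = moving_average_trend(hi, w=51)   # deterministic component
y = hi - trend                           # random component {y(n)}
```

A robust location measure, such as a moving median, could be substituted in the same loop if outliers are a concern.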

2.3.3. Random Component

In this subsection, we try to identify and model random components. Identifying, analysing, and modelling random components are essential because they help us consider the degradation process’s uncertainty, which may be related to the time-varying load, changing environmental conditions, etc.

Time-varying autoregressive model (TVC-AR)

The TVC-AR model of order p is an extension of the autoregressive (AR) model obtained by assuming time-varying (instead of constant) coefficients

{φ_{l} (n); l = 1, . . ., p}

. The evolution of the model is then described with the following equation [36]:

y_{n} = \sum_{l = 1}^{p} φ_{l} (n) y_{n - l} + ε_{n} .

(1)

As usual, we assume that

ε_{n}

is a Gaussian white noise series with zero mean and unknown variance

σ^{2}

.

ε_{n}

is an innovation to the model; thus it is independent from

y_{n - l}

for

l > 0

. The Bayesian framework is used to model the time-varying coefficients of the AR model by assuming that

{φ_{l} (n); l = 1, . . ., p}

are also random variables.

Stochastic constraints (also called smoothness priors) are put on the AR coefficients; they are expressed by stochastic difference equations, as follows:

Δ^{q} φ_{l} (n) = ν_{n l}

(2)

where

Δ^{q}

is the qth-order difference operator defined by

Δ φ_{l} (n) = φ_{l} (n) - φ_{l} (n - 1)

,

Δ^{q} φ_{l} (n) = Δ^{q - 1} (Δ φ_{l} (n))

. The innovation term

ν_{n l}

is a Gaussian white noise sequence (indexed with n) with zero mean and unknown variance

τ^{2}

. Suppose that

ν_{n l}

and

ε_{n}

are independent random variables. The behaviour of the model depends on the difference order q; this hyperparameter is usually chosen from low-order cases such as q = 1 or q = 2.
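For concreteness, expanding the difference operator in Equation (2) for the two low-order cases gives the following recursions. This is only a worked restatement of Equation (2): q = 1 is a random walk on each coefficient, and q = 2 is a locally linear evolution.

```latex
\begin{matrix} q = 1 : \quad φ_{l} (n) = φ_{l} (n - 1) + ν_{n l}, \\ q = 2 : \quad φ_{l} (n) = 2 φ_{l} (n - 1) - φ_{l} (n - 2) + ν_{n l} . \end{matrix}
```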

State space representations

Each TVC-AR model can be expressed by a state space representation, as follows:

\begin{matrix} x_{n} = F x_{n - 1} + G w_{n}, \\ y_{n} = H_{n} x_{n} + ε_{n} . \end{matrix}

(3)

The matrix components of the model depend on a specific order q. Here, we present the case of

q = 1

and

q = 2

. If

q = 1

, the state space model is parameterised as follows:

x_{n} = {(φ_{1} (n), . . ., φ_{p} (n))}^{T}, w_{n} = {(ν_{n 1}, . . ., ν_{n p})}^{T}, F = I_{p}, G = I_{p}, H_{n} = (y_{n - 1}, . . ., y_{n - p}),

(4)

where

{(\cdot)}^{T}

denotes the matrix transposition, and

I_{p}

the

p \times p

identity matrix. For

q = 2

, the state space model is given by the following:

\begin{matrix} x_{n} = {(φ_{1} (n), . . ., φ_{p} (n), φ_{1} (n - 1), . . ., φ_{p} (n - 1))}^{T}, \\ w_{n} = {(ν_{n 1}, . . ., ν_{n p})}^{T}, \\ F = [\begin{matrix} 2 I_{p} & - I_{p} \\ I_{p} & 0 \end{matrix}], \\ G = [\begin{matrix} I_{p} \\ 0 \end{matrix}], \\ H_{n} = (y_{n - 1}, . . ., y_{n - p}, 0, . . ., 0) . \end{matrix}

(5)
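The matrices in Equations (4) and (5) can be assembled as in the sketch below. The helper names `tvcar_state_space` and `observation_row` are hypothetical, and the 0-based indexing convention is an assumption of this illustration.

```python
import numpy as np

def tvcar_state_space(p, q):
    """Return (F, G) of Eq. (3) for a TVC-AR(p) model with q in {1, 2}."""
    I = np.eye(p)
    if q == 1:
        return I.copy(), I.copy()            # Eq. (4): F = G = I_p
    if q == 2:
        Z = np.zeros((p, p))
        F = np.block([[2 * I, -I], [I, Z]])  # Eq. (5)
        G = np.vstack([I, Z])
        return F, G
    raise ValueError("only q = 1 and q = 2 are covered in this sketch")

def observation_row(y, n, p, q):
    """H_n = (y[n-1], ..., y[n-p], 0, ..., 0) for a 0-based index n >= p."""
    lags = np.asarray(y, dtype=float)[n - p:n][::-1]
    return np.concatenate([lags, np.zeros((q - 1) * p)])

F, G = tvcar_state_space(p=2, q=2)
H = observation_row([1.0, 2.0, 3.0, 4.0], n=3, p=2, q=2)
```

For q = 2 the state dimension doubles to 2p because the previous coefficients are carried along in the state vector.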

Covariance matrices of

w_{n}

and

ε_{n}

are respectively given by

Q = E {w_{n} {w_{n}}^{T}} = τ^{2} I_{p}

,

R = E {ε_{n}^{2}} = σ^{2}

. Given the observations

y_{1}, . . ., y_{N}

and the initial conditions

x_{0 | 0}

and

V_{0 | 0}

, we use the Kalman filter algorithm to obtain the conditional mean and covariance matrix of the state vector at each time

n = 1, . . ., N

. This Bayesian procedure consists of two repeated steps, prediction and update (filtering), as follows:

[Prediction]

\begin{matrix} x_{n | n - 1} = F x_{n - 1 | n - 1}, \\ V_{n | n - 1} = F V_{n - 1 | n - 1} F^{T} + τ^{2} G G^{T} . \end{matrix}

(6)

[Update]

\begin{matrix} k_{n} = V_{n | n - 1} H_{n}^{T} {(H_{n} V_{n | n - 1} H_{n}^{T} + σ^{2})}^{- 1}, \\ x_{n | n} = x_{n | n - 1} + k_{n} (y_{n} - H_{n} x_{n | n - 1}), \\ V_{n | n} = (I_{m} - k_{n} H_{n}) V_{n | n - 1} . \end{matrix}

(7)

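Above,

I_{m}

denotes the identity matrix whose dimension m equals that of the state vector (m = p for q = 1 and m = 2 p for q = 2), while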

x_{i | j}

and

V_{i | j}

denote, respectively, the mean and covariance matrix of the state vector (estimated from the Kalman filter)

x_{i}

given the data

y_{1}, . . ., y_{j}

. The term

k_{n}

is called Kalman gain.

After performing the Kalman filter procedure, by the following backward iteration, we can obtain smoothed estimation

x_{n | N}

of the state vector and the corresponding covariance matrix

V_{n | N}

, as follows:

[Smoothing]

\begin{matrix} A_{n} = V_{n | n} F^{T} V_{n + 1 | n}^{- 1}, \\ x_{n | N} = x_{n | n} + A_{n} (x_{n + 1 | N} - x_{n + 1 | n}), \\ V_{n | N} = V_{n | n} + A_{n} (V_{n + 1 | N} - V_{n + 1 | n}) A_{n}^{T} . \end{matrix}

(8)

Note that the state vector

x_{n}

consists of the coefficients

φ_{l} (n)

, and thus, their estimates

{\hat{φ}}_{l} (n)

are directly obtained via

x_{n | N}

.
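The prediction, update, and smoothing recursions of Equations (6)–(8) can be sketched for the case q = 1 as follows. This is a minimal illustration under stated assumptions: the initial state is zero with a diffuse covariance `v0`, the update is skipped for the first p samples (where lagged values are missing), and the function name `tvcar_kalman` and the toy AR(1) signal are hypothetical.

```python
import numpy as np

def tvcar_kalman(y, p=1, sigma2=1.0, tau2=1e-5, v0=10.0):
    """Kalman filter (Eqs. 6-7) and smoother (Eq. 8) for TVC-AR(p), q = 1."""
    y = np.asarray(y, dtype=float)
    N = y.size
    F = np.eye(p)                         # q = 1: F = G = I_p
    Q = tau2 * np.eye(p)                  # tau^2 G G^T
    x, V = np.zeros(p), v0 * np.eye(p)    # assumed x_{0|0}, V_{0|0}
    x_pred = np.zeros((N, p)); V_pred = np.zeros((N, p, p))
    x_filt = np.zeros((N, p)); V_filt = np.zeros((N, p, p))
    for n in range(N):
        xp, Vp = F @ x, F @ V @ F.T + Q               # prediction, Eq. (6)
        if n >= p:
            H = y[n - p:n][::-1]                      # (y[n-1], ..., y[n-p])
            nu2 = H @ Vp @ H + sigma2                 # innovation variance
            k = Vp @ H / nu2                          # Kalman gain
            x = xp + k * (y[n] - H @ xp)              # update, Eq. (7)
            V = (np.eye(p) - np.outer(k, H)) @ Vp
        else:
            x, V = xp, Vp                             # no lags yet: skip update
        x_pred[n], V_pred[n], x_filt[n], V_filt[n] = xp, Vp, x, V
    x_smooth, V_smooth = x_filt.copy(), V_filt.copy()
    for n in range(N - 2, -1, -1):                    # smoothing, Eq. (8)
        A = V_filt[n] @ F.T @ np.linalg.inv(V_pred[n + 1])
        x_smooth[n] = x_filt[n] + A @ (x_smooth[n + 1] - x_pred[n + 1])
        V_smooth[n] = V_filt[n] + A @ (V_smooth[n + 1] - V_pred[n + 1]) @ A.T
    return x_filt, x_smooth

# Toy AR(1) series whose coefficient rises linearly from 0.2 to 0.8.
rng = np.random.default_rng(0)
N = 2000
phi_true = np.linspace(0.2, 0.8, N)
y = np.zeros(N)
for n in range(1, N):
    y[n] = phi_true[n] * y[n - 1] + rng.normal()

x_filt, x_smooth = tvcar_kalman(y, p=1, sigma2=1.0, tau2=1e-5)
mae = np.mean(np.abs(x_smooth[:, 0] - phi_true))      # tracking error
```

Only the filtered estimates are available online; the smoothed estimates require the full record and are therefore an offline refinement.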

Estimation and identification of the model

The conditional density function of

y_{n}

, conditioned on the data

y_{1}, . . ., y_{n - 1}

, is given by the following formula:

f (y_{n} | y_{1}, . . ., y_{n - 1}; σ^{2}, τ^{2}) = {(2 π ν^{2} (n))}^{- \frac{1}{2}} exp \{- \frac{{(y_{n} - H_{n} x_{n | n - 1})}^{2}}{2 ν^{2} (n)}\},

(9)

where

ν^{2} (n) = H_{n} V_{n | n - 1} H_{n}^{T} + σ^{2}

denotes the variance of the difference between measurement

y_{n}

and prediction

H_{n} x_{n | n - 1}

, based on observations

y_{1}, . . ., y_{n - 1}

. Since, given p and q (the order of the model and the differentiation order), the joint density function of the random vector

y_{1}, . . ., y_{N}

is the following:

f (y_{1}, . . ., y_{N} | σ^{2}, τ^{2}) = \prod_{n = 1}^{N} f (y_{n} | y_{1}, . . ., y_{n - 1}; σ^{2}, τ^{2}),

(10)

the log-likelihood for the hyperparameters

σ^{2}

and

τ^{2}

can be approximated with the following:

ℓ (σ^{2}, τ^{2}; p, q) = - \frac{1}{2} \{N log 2 π + \sum_{n = 1}^{N} (log ν^{2} (n) + \frac{{(y_{n} - H_{n} x_{n | n - 1})}^{2}}{ν^{2} (n)})\} .

(11)

In order to estimate the hyperparameters

σ^{2}

and

τ^{2}

, the method of maximization of the log-likelihood is applied. For more in-depth details about TVC-AR and the estimation of its hyperparameters, the reader can look into [36].
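As an illustration of the likelihood in Equation (11), the sketch below accumulates the one-step prediction errors of the Kalman filter (q = 1) and selects the hyperparameters on a coarse grid. The grid search is a simplification assumed here; the text does not specify the optimiser. The sum starts at the first sample for which all p lags exist, and the function name `tvcar_loglik`, the grid values, and the toy signal are hypothetical.

```python
import numpy as np

def tvcar_loglik(y, p, sigma2, tau2, v0=10.0):
    """Log-likelihood of Eq. (11) for TVC-AR(p) with q = 1 (F = G = I_p)."""
    y = np.asarray(y, dtype=float)
    x, V = np.zeros(p), v0 * np.eye(p)     # assumed initial conditions
    ll = 0.0
    for n in range(p, y.size):
        V = V + tau2 * np.eye(p)           # prediction (state mean unchanged)
        H = y[n - p:n][::-1]
        nu2 = H @ V @ H + sigma2           # variance of the prediction error
        err = y[n] - H @ x
        ll += -0.5 * (np.log(2 * np.pi) + np.log(nu2) + err**2 / nu2)
        k = V @ H / nu2                    # update
        x = x + k * err
        V = V - np.outer(k, H) @ V
    return ll

# Toy AR(1) series with unit noise variance and a slowly varying coefficient.
rng = np.random.default_rng(1)
N = 1500
phi_true = np.linspace(0.2, 0.8, N)
y = np.zeros(N)
for n in range(1, N):
    y[n] = phi_true[n] * y[n - 1] + rng.normal()

sigma2_grid = [0.25, 1.0, 4.0]
tau2_grid = [1e-6, 1e-5, 1e-4, 1e-3]
best_ll, best_sigma2, best_tau2 = max(
    (tvcar_loglik(y, 1, s, t), s, t) for s in sigma2_grid for t in tau2_grid
)
```

In practice a numerical optimiser over log-transformed variances would replace the grid, and the same criterion can be compared across candidate orders p and q.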

3. Simulation

Due to the unavailability of publicly accessible run-to-failure data sets with complete and free access to all physical axes, a synthetic run-to-failure data set is employed in the following section. These synthetic data allow us to evaluate the effectiveness and robustness of the proposed approach under controlled degradation scenarios.

3.1. Generating the Degradation Data

In this section, the health index is generated following Section 2.1. As discussed there, the health index (HI) is constructed from two time-varying parts (a deterministic part and a random part) as follows:

\begin{matrix} H I (t) = D (t) + R (t), \end{matrix}

(12)

where

D (t)

and

R (t)

represent the deterministic and random parts of the degradation process, respectively. Both parts follow the three-regime assumption of Table 1. The deterministic part in Equation (12) has the following form:

\begin{matrix} D (t) = \{\begin{matrix} c_{1} & 0 < t \leq τ_{1}, \\ a_{1} t + c_{2} & τ_{1} < t \leq τ_{2}, \\ a_{2} exp (b_{1} t) + c_{3} & τ_{2} < t \leq N, \end{matrix} \end{matrix}

(13)

where

τ_{1}, τ_{2}

denote the change points between regimes.

c_{1}

is a constant value that is used to represent a healthy regime. Additionally, the constants

a_{1}

,

a_{2}

, and

b_{1}

are used to construct the linear and exponential trend, which corresponds to the degradation and critical regimes. Furthermore, the parameters of

c_{2}

and

c_{3}

are tuned so that the degradation curve (health index) is continuous at the change points.

In the simulation part, we consider Gaussian distributions for the random component of the degradation process. The time-varying random component corresponding to

R (t)

is generated in the following way:

\begin{matrix} R (t) = S C (t) \hat{R} (t), \end{matrix}

(14)

where

S C (t)

corresponds to the scale (variance) of the random part, which is constructed in the following way:

\begin{matrix} S C (t) = \{\begin{matrix} a_{3} t + b_{2} & 0 < t \leq τ_{1}, \\ a_{4} t + b_{3} & τ_{1} < t \leq τ_{2}, \\ a_{5} exp (b_{4} t) & τ_{2} < t \leq N, \end{matrix} \end{matrix}

(15)

where the values

a_{3}, b_{2}, a_{4}, b_{3}, a_{5}, b_{4}

are constants tuned so that

S C (1) = σ_{1}

,

S C (τ_{1}) = σ_{2}

,

S C (τ_{2}) = σ_{3}

, and

S C (N) = σ_{4}

.

Additionally,

\hat{R} (t)

is an AR(1) process with a time-varying coefficient, which is constructed as follows:

\begin{matrix} \hat{R} (t + 1) = a_{6} t \hat{R} (t) + ϵ (t), \end{matrix}

(16)

where

ϵ (t)

is a sequence of independent and identically distributed (iid) random variables with the Gaussian distribution

ϵ (t) \sim N (0, 1)

. For simplicity, we assume that the distribution is the same in every regime and that the time-varying AR coefficient increases linearly; however, as mentioned in Section 2.1, in practice, it may be different for different regimes.

In panels (b) and (d) of Figure 3, we present the deterministic component

D (t)

and the scale function

S C (t)

for the following values of the parameters:

τ_{1} = 1500

,

τ_{2} = 2500

,

N = 3000

,

σ_{1} = 0.25

,

σ_{2} = 1

,

σ_{3} = 3

,

σ_{4} = 12

, and

c_{1} = 0

.
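Equations (12)–(16) can be simulated as in the sketch below. The change points, N, the four scale levels, and c_1 are the values listed above; the constants a_1, a_2, b_1, and a_6, the zero initial value of the AR(1) sequence, and the function name `simulate_hi` are hypothetical choices made only for this illustration, since they are not stated in the text.

```python
import numpy as np

def simulate_hi(N=3000, tau1=1500, tau2=2500, c1=0.0,
                sig=(0.25, 1.0, 3.0, 12.0),
                a1=0.02, a2=1e-5, b1=0.005, a6=0.8 / 3000, seed=0):
    """Simulate HI(t) = D(t) + SC(t) * R_hat(t) for t = 1, ..., N."""
    rng = np.random.default_rng(seed)
    t = np.arange(1, N + 1, dtype=float)

    # Deterministic part, Eq. (13); c2 and c3 enforce continuity.
    c2 = c1 - a1 * tau1
    c3 = a1 * tau2 + c2 - a2 * np.exp(b1 * tau2)
    D = np.where(t <= tau1, c1,
        np.where(t <= tau2, a1 * t + c2, a2 * np.exp(b1 * t) + c3))

    # Scale function, Eq. (15), passing through sigma_1, ..., sigma_4.
    s1, s2, s3, s4 = sig
    a3 = (s2 - s1) / (tau1 - 1); b2 = s1 - a3
    a4 = (s3 - s2) / (tau2 - tau1); b3 = s2 - a4 * tau1
    b4 = np.log(s4 / s3) / (N - tau2); a5 = s3 * np.exp(-b4 * tau2)
    SC = np.where(t <= tau1, a3 * t + b2,
         np.where(t <= tau2, a4 * t + b3, a5 * np.exp(b4 * t)))

    # AR(1) with a linearly increasing coefficient a6 * t, Eq. (16).
    eps = rng.normal(0.0, 1.0, N)
    R_hat = np.zeros(N)                  # assumed R_hat(1) = 0
    for i in range(N - 1):
        R_hat[i + 1] = a6 * t[i] * R_hat[i] + eps[i]

    return t, D, SC, R_hat, D + SC * R_hat   # Eqs. (12) and (14)

t, D, SC, R_hat, HI = simulate_hi()
```

With the assumed a_6, the AR coefficient a_6 t stays below 0.8 over the whole record, which keeps the simulated sequence from diverging.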

Figure 3. Generated HI: (a) simulated health index (HI), (b) deterministic component of simulated HI, (c) random component of simulated HI, (d) scale (variance) of simulated HI, (e) AR coefficient of simulated HI.

3.2. Results of Proposed Approach

This subsection applies the proposed methodology to data generated by the suggested model; see Figure 3. The results are presented in Figure 4. Panel (a) shows the identified trend (deterministic component) of the simulated health index; the proposed method identifies the deterministic component well. Panel (b) illustrates the random component of the generated health index after removing the identified deterministic part. Panel (c) shows the identified time-varying scale (variance) of the random component; comparing the true scale (variance) with the identified one demonstrates the efficacy of the proposed approach. Finally, panel (d) shows the identified time-varying AR coefficient, which the proposed method also tracks well.

Figure 4. Results of the proposed approaches on simulated health index (HI): (a) trend (deterministic component) identification, (b) identified random component, (c) identified scale (variance) of random component, and (d) identified AR coefficient.

The proposed procedure was repeated for 100 simulations from the same model to examine its performance, and the results are presented in Figure 5. The left column illustrates the results for the trend, variance, and AR(1) coefficient over the 100 simulated series. The true values are presented in blue, and the identified values are shown in cyan. The proposed method detects and follows the actual values with acceptable performance. Additionally, in the right column, the RMSE computed for each plot in the left column is shown as a boxplot.

Figure 5. Results of the proposed procedures on 100-simulation health index (trend, variance (scale), AR (1) coefficient). Left column: the proposed method. Right column: the evaluation of the results by RMSE (trend and variance (scale)) by boxplots.

4. Real Data Analysis

This section will apply and evaluate the proposed methodology for available real data sets. These data are typically employed as benchmark data sets for different papers and competitions and have a specific behaviour corresponding to noise properties and deterministic trends. In the following, essential information about objects, experiments, and data will be recalled, and appropriate references are provided.

4.1. FEMTO Data Sets

The FEMTO data set was collected on the PRONOSTIA test rig (see Figure 6) by the Franche-Comté Electronics Mechanics Thermal Science and Optics–Sciences and Technologies Institute (FEMTO). The data were collected using two acceleration sensors and one temperature sensor during the test. The acceleration data were sampled at 25,600 Hz, with 0.1 s of data recorded every 10 s [37]. This data set has been used in many studies in recent years for health index construction [38,39,40,41,42,43,44,45], health index evaluation or segmentation [46,47,48,49,50,51], and the prediction of RUL [52,53,54,55,56,57]. The raw signal from one case study in the FEMTO data set is shown in subfigure (a) of Figure 7, and the RMS of each segment has been extracted and used as a health index, as shown in subfigure (b) of the same figure.
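The RMS health index described above amounts to one value per 0.1 s record. The sketch below illustrates this on synthetic records; the sampling frequency and record length come from the text, while the function name `rms_health_index`, the array layout (one record per row), and the sinusoidal test signal are assumptions of this illustration and not the actual FEMTO file format.

```python
import numpy as np

FS = 25_600                       # sampling frequency in Hz (from the text)
RECORD_S = 0.1                    # record duration in seconds (from the text)
SAMPLES = round(FS * RECORD_S)    # samples per record

def rms_health_index(records):
    """RMS of each record; `records` has shape (n_records, n_samples)."""
    records = np.asarray(records, dtype=float)
    return np.sqrt(np.mean(records**2, axis=1))

# Synthetic stand-in for vibration records: 100 Hz sinusoids of growing amplitude.
time = np.arange(SAMPLES) / FS
amps = np.array([1.0, 1.5, 2.0, 3.0, 5.0])
records = amps[:, None] * np.sin(2 * np.pi * 100.0 * time)[None, :]

hi = rms_health_index(records)    # one HI value per record
```

Since one record is taken every 10 s, consecutive HI values are 10 s apart.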

Figure 6. FEMTO test rigs [37].

Figure 7. FEMTO data set: (a) raw bearing run-to-failure vibration signals and (b) HI (RMS).

4.2. Wind Turbine Data Set

This data set was collected from a sensor mounted on the high-speed bearing shaft of a wind turbine; see Figure 8. The extraction procedure for the HI is shown in Figure 9. More information about the construction of the health index can be found in [58]. In this data set, the inner race energy of the bearing is calculated every 10 min for more than 50 days; see Figure 9. Eventually, an inner race bearing fault occurred, which was confirmed by inspection; see Figure 8. Additionally, this data set (which we call the wind turbine data set) has been used to predict RUL in several papers in recent years; see, e.g., [58,59,60,61].

Figure 8. Wind turbine test rigs [58].

Figure 9. (a) Procedure for extracting HI from wind turbine data [58]; (b) extracted HI showing inner race degradation.

4.3. Result for FEMTO Data Set

The results for the FEMTO data are illustrated in Figure 10. To detect the trend (deterministic) component (panel (a)), we selected a window length of 51. As can be seen, the detected trend changes over time: in the first stage, it is nearly constant; it then changes its nature and behaves as a linear function, while in the last stage, it behaves as an exponential function. This identified trend supports the assumption of a three-stage (constant, linear, and exponential) degradation model discussed in Section 2.1. After separating the deterministic trend from the health index, we obtain the random component (panel (b)). As can be seen, it is a non-homogeneous sequence with a non-constant (time-varying) scale. Panel (c) presents the identified scale of the random component, and we selected

[0.995, 0.9]

as the discount factors for the AR coefficients and the scale (variance), respectively. These results confirm our assumption that the random component increases over time, and the scale of the random part grows in a nonlinear manner. In the last part of the curve, the amplitude of the scaled noise increases dramatically, which can significantly affect the quality of the prognosis. The first four time-varying AR coefficients are presented in panel (d); note that the AR order (four coefficients) was selected as a hyperparameter. As can be seen in panel (d), these coefficients fluctuate over time. However, their values are not significant. The residual of the TVC-AR model is illustrated in panel (e), while its empirical ACF is presented in panel (f). The empirical autocorrelation function shows that the residuals can be considered independent observations. Finally, to check our initial assumption of Gaussian noise, we plotted the empirical and theoretical tails (panel (g)), including well-known non-Gaussian distributions, namely the stable and Student’s t distributions. More information on these non-Gaussian distributions can be found in Appendix A. We conclude that the residual series corresponds to the α-stable distribution with

α = 1.902

. Additionally, the probability density functions (PDF) of the theoretical distributions of the residual signal are presented in panel (h). This result also rejected our assumption about Gaussian noise. However, the level of impulsivity is relatively low.
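The independence check based on the empirical ACF can be sketched as follows. This is an illustration on synthetic white noise rather than on the actual residuals; the function name `empirical_acf` is hypothetical, and the ±1.96/√N band is the usual large-sample approximation for finite-variance white noise, so it should be read with caution when the residuals are heavy-tailed.

```python
import numpy as np

def empirical_acf(x, max_lag=30):
    """Sample autocorrelation of x at lags 0, ..., max_lag."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()
    denom = np.dot(x, x)
    return np.array([np.dot(x[:x.size - k], x[k:]) / denom
                     for k in range(max_lag + 1)])

# Synthetic stand-in for the TVC-AR residual series.
rng = np.random.default_rng(0)
resid = rng.normal(size=2000)

acf = empirical_acf(resid, max_lag=30)
band = 1.96 / np.sqrt(resid.size)            # approximate 95% band
inside = np.mean(np.abs(acf[1:]) <= band)    # share of lags within the band
```

If most lags fall inside the band, the residuals show no evidence of remaining linear dependence.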

Figure 10. Results of the applied methodology for FEMTO case study data set.

To confirm our results, we analyse quantile lines constructed from the model fitted to the FEMTO data set with the identified parameters. Presenting quantile lines constructed from the fitted model is one of the most typical techniques for validating that a proposed model fits the data properly, and it is frequently employed in both the literature and practical applications. The procedure for building the quantile lines is as follows: first, the model is fitted to the real data; then a number of trajectories are simulated from the fitted model, and for every time point, the quantiles (at appropriate levels) are estimated. These are called quantile lines. If the real data fall into the constructed intervals (with reasonable probability), we can conclude that the fitted model is appropriate. The fitted models have been simulated, and the results are illustrated in Figure 11. In this figure, the real data are shown by purple lines, and the blue lines are the quantile lines constructed at the levels of

5 %

and

95 %

obtained from 400 simulated trajectories of the fitted models. Based on the results presented in Figure 11, it can be concluded that the fitted models preserve the specific nature of the data sets and can be considered suitable, for example, for prognosis.
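The quantile-line construction described above can be sketched as follows. The fitted model is replaced here by a hypothetical stand-in (a linear trend with a growing scale and Gaussian noise), so only the pointwise quantile step mirrors the procedure in the text; the function name `quantile_lines` and the toy trajectories are assumptions.

```python
import numpy as np

def quantile_lines(trajectories, levels=(0.05, 0.95)):
    """Pointwise quantiles over simulated trajectories (one per row)."""
    return np.quantile(np.asarray(trajectories, dtype=float), levels, axis=0)

# Hypothetical stand-in for a fitted model: trend + scale * Gaussian noise.
rng = np.random.default_rng(0)
N, M = 300, 400                                   # time points, trajectories
trend = np.linspace(0.0, 5.0, N)
scale = np.linspace(0.2, 1.0, N)
sims = trend + scale * rng.normal(size=(M, N))    # M simulated trajectories
observed = trend + scale * rng.normal(size=N)     # plays the role of real data

q05, q95 = quantile_lines(sims)
coverage = np.mean((observed >= q05) & (observed <= q95))
```

For a well-fitted model, roughly 90% of the observations are expected to lie between the 5% and 95% lines.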

Figure 11. Quantile lines (blue) at the levels of

5 %

and

95 %

constructed on the basis of simulated trajectories corresponding to the fitted proposed model.

4.4. Result for Wind Turbine

The results for the wind turbine data are illustrated in Figure 12. To detect the trend (deterministic) component (panel (a)), we selected a window length of 51. As can be seen, the detected trend (deterministic component) changes over time, as for the FEMTO data set. After separating the deterministic trend from the health index, we obtain the random component (panel (b)). As can be seen, it is a non-homogeneous sequence with a non-constant (time-varying) scale. Panel (c) presents the identified scale of the random component, and we selected

[0.995, 0.99]

as the AR and scale (variance) discount factors, respectively. These results confirm our assumption that the random component increases over time, and the scale of the random part grows in a nonlinear manner. The scale increases differently in each regime (0–2000 and 2000–5000), after which the situation becomes more complicated. The first ten time-varying AR coefficients are presented in panel (d); note that the AR order (ten coefficients) was selected as a hyperparameter. As seen in panel (d), these coefficients converge to constant values over time. The values of these coefficients, particularly the first coefficient of the AR model, are non-negligible. The residual of the TVC-AR model is illustrated in panel (e), while its empirical ACF is presented in panel (f). The empirical autocorrelation function shows that the residuals can be considered independent observations. Finally, we plotted the empirical and theoretical tails (panel (g)), including well-known non-Gaussian distributions, namely the stable and Student’s t distributions. We conclude that the residual series corresponds to a non-Gaussian distribution. Additionally, the probability density functions (PDF) of the theoretical distributions of the residual signal are presented in panel (h). This result also rejects our assumption of Gaussian noise. However, the level of impulsivity is relatively low.

Figure 12. Results of the applied methodology for wind turbine data set.

In addition, as in the previous case, we analyse quantile lines constructed from the fitted model for the wind turbine data set to confirm our results. Trajectories were simulated from the fitted model, and the results are illustrated in Figure 13. In this figure, the real data set is shown in purple, and the blue lines are the quantile lines at the 5% and 95% levels, obtained from 400 simulated trajectories of the fitted model. Based on the results presented in Figure 13, it can be concluded that the fitted model preserves the specific nature of the data set and can be considered well suited, for instance, for prognosis.
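The construction of quantile lines from simulated trajectories can be sketched as follows. This is a minimal illustration, assuming the trajectories are stored as an array of shape (number of trajectories, number of time points); the toy generator below (exponential trend plus white Gaussian noise with a growing standard deviation) is a hypothetical stand-in for the fitted model, and only the number of trajectories (400) and the levels (5% and 95%) come from the text.

```python
import numpy as np

def quantile_lines(trajectories, levels=(5, 95)):
    """Pointwise percentiles across trajectories of shape (n_traj, n_time)."""
    return np.percentile(np.asarray(trajectories, dtype=float), levels, axis=0)

# Hypothetical stand-in for a fitted model: trend plus growing-scale noise.
rng = np.random.default_rng(1)
t = np.arange(500)
trend = 1.0 + 0.5 * np.exp(t / 250.0)
noise_std = 0.1 + 0.002 * t

def simulate(n_traj):
    """Draw n_traj independent trajectories from the toy model."""
    return trend + noise_std * rng.standard_normal((n_traj, t.size))

lower, upper = quantile_lines(simulate(400))

# Fraction of a fresh trajectory that falls inside the 5%-95% band.
observed = simulate(1)[0]
coverage = float(np.mean((observed >= lower) & (observed <= upper)))
print(coverage)
```

If the model describes the data well, roughly 90% of the observations should lie between the two lines, which gives a quick visual and numerical sanity check.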

Figure 13. Quantile lines (blue) at the 5% and 95% levels, constructed on the basis of simulated trajectories corresponding to the fitted proposed model.

5. Discussion

The results presented for both real data sets confirm the efficiency of the proposed approach. The deterministic trend, scale (variance), and AR coefficients are identified as time-varying functions in both cases. For the FEMTO data set, the deterministic part follows the assumed pattern of constant, linear, and exponential trends in the degradation process. For the wind turbine data set, on the other hand, the exponential trend cannot be clearly seen, which may be because this data set is not an entirely run-to-failure data set [58]. In addition, the deterministic trend of the wind turbine data set shows a few fluctuations that may originate from various phenomena, such as self-healing. Likewise, the variation of the scale (variance) over time for the FEMTO and wind turbine data sets confirms our first assumption of a time-varying scale (variance). The scale of the FEMTO data set follows the assumed constant, linear, and exponential pattern, whereas the scale of the wind turbine data set shows a more complex trend, including fluctuations after t = 2000; this can reduce the efficiency of the model in prediction applications through over- and underestimation. We should also bear in mind that we assumed the observation noise to be Gaussian, which may not be a correct assumption for data with non-Gaussian characteristics. Moreover, the Kalman filter used in our proposed model is not robust against strongly non-Gaussian noise, and a robust version may be preferable. We will address this in our future work.
The AR coefficients are likewise time-varying in both cases. For the wind turbine data set, however, they are not large and, after some time, become approximately constant, which is to be expected because this data set was gathered under constant speed and operating conditions. Additionally, we presented the residual of the TVC-AR model and fitted different distributions (Gaussian and non-Gaussian) to investigate whether the assumption of Gaussian noise is valid for these data sets. As seen in the Results section, the residuals for the FEMTO and wind turbine data sets are more compatible with non-Gaussian distributions, such as the stable and Student’s t distributions, although they are not far from Gaussian noise.
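One simple way to judge how far a residual series is from Gaussian is to compare its empirical tail probabilities with the Gaussian ones. The sketch below is an illustration only: the helper names are hypothetical, the residuals are synthetic (standard normal and Student's t with 5 degrees of freedom), and it is not the tail-fitting procedure used in the paper.

```python
import math
import numpy as np

def empirical_tail(x, k):
    """Fraction of standardized samples whose magnitude exceeds k."""
    z = (x - x.mean()) / x.std()
    return float(np.mean(np.abs(z) > k))

def gaussian_tail(k):
    """Two-sided tail probability P(|Z| > k) for a standard normal Z."""
    return math.erfc(k / math.sqrt(2.0))

# Synthetic residuals (assumed for illustration, not the paper's data).
rng = np.random.default_rng(0)
gauss = rng.standard_normal(200_000)
heavy = rng.standard_t(5, size=200_000)

for k in (3, 4):
    print(k, gaussian_tail(k), empirical_tail(gauss, k), empirical_tail(heavy, k))
```

For Gaussian residuals the empirical and theoretical values agree closely, while heavy-tailed residuals exceed the Gaussian tail by a growing factor as k increases; a mild excess corresponds to the low level of impulsivity reported for both data sets.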

Finally, we compared the results with the method introduced by Żuławiński et al. [23], which accounts for non-Gaussian characteristics, non-homogeneity, and autodependence in the time series but assumes constant AR coefficients; see [23] for more information on this model. For this comparison, we used quantile lines constructed from the models fitted to the FEMTO and wind turbine data sets. As can be seen in panels (a) and (b) of Figure 14, both models preserve the specific nature of the data sets; however, in panel (b), the 5% quantile obtained by the proposed method is slightly more accurate than that of the Żuławiński et al. model. To highlight the difference between the two models, we also present the 95% quantile lines constructed by both models together with their difference; see panels (c) and (d) in Figure 14. The quantiles for the first and second regimes (healthy stage and degradation stage) take the same values, whereas in the critical stage the Żuławiński et al. model takes higher values than our proposed model. It should be noted, however, that our proposed method is not as robust as the Żuławiński et al. model: since it is developed on the basis of Bayes' theorem, it can tolerate mild non-Gaussian characteristics but can be affected by strong ones. On the other hand, our proposed method can identify and model time-varying AR coefficients, which is crucial, particularly for applications operating under non-stationary conditions, whereas the Żuławiński et al. model considers constant AR coefficients. Furthermore, the proposed method can be used in online applications, which is a critical feature for diagnosis and prognosis.

Figure 14. Comparison between the proposed model and the model of Żuławiński et al. for (a) 5th and 95th quantiles on the FEMTO data sets, (b) 5th and 95th quantiles on the wind turbine data sets, (c) 95th quantiles on the FEMTO data sets, and (d) 95th quantiles on the wind turbine data sets.

6. Conclusions

This paper presents a methodology for modelling, identifying, and analysing long-term condition monitoring data exhibiting non-stationary characteristics. The primary objective is to introduce an online approach to identify the most informative aspects of such data. This enables decision makers and monitoring experts to visualize underlying patterns, gain deeper insights into system behaviour, and facilitate the development of models for segmentation and predictive maintenance. In this research, we proposed an alternative way to model long-term data that generalizes the classical model and has the potential to be used in online applications. Moreover, we considered that the characteristics of the random part, such as a growing scale (variance), might be time-varying. The last novelty was modelling the random component by a time-varying autoregressive time series, which makes it possible to describe the time-varying dependency of the random component. We then suggested reliable algorithms to identify the components mentioned above. Finally, we presented an approach to identify each component and synthesise all of them to simulate exemplary trajectories of the suggested model. The proposed procedure has been applied to a simulated data set and to two real data sets (FEMTO and wind turbine). The simulated data set was designed based on the three-stage model (see Figure 1), with a changing scale (variance) and coloured noise so that it is closer to the real data sets. The results of the proposed approach on the simulated data set confirmed the method’s efficacy in identifying and modelling the time-varying deterministic part as well as the time-varying scale (variance) and AR coefficients of the random part.

Additionally, the proposed procedure has been applied to two real data sets (FEMTO and wind turbine). The results for these two data sets support our initial assumptions about non-homogeneity and time-varying autodependence in the time series. Furthermore, we observed non-Gaussian characteristics in the residuals of the TVC-AR model, which can influence the results of the proposed approach. For these two real cases, the non-Gaussian characteristics were not strong, and the proposed approach could cope with their effects; however, if the level of non-Gaussianity increases, the efficiency of the method will decrease. Therefore, we will try to deal with this problem in future work.

Author Contributions

Conceptualization, H.S. and P.Z.; methodology, H.S.; software, H.S.; validation, H.S.; formal analysis, H.S.; investigation, H.S.; resources, H.S.; data curation, H.S.; writing—original draft preparation, H.S.; writing—review and editing, H.S. and P.Z.; visualization, H.S. and P.Z.; theoretical refinement, P.Z.; project administration, H.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data used in this study are from publicly available, open-source benchmark data sets.

Acknowledgments

The research presented in this paper was conducted while the author was affiliated with the Wroclaw University of Science and Technology, Poland.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Heavy-Tailed Probability Density Function

Appendix A.1. Stable Distribution

The stable distribution (also known as the Lévy alpha-stable distribution) is defined by its characteristic function and is characterised by four parameters: α (stability), β (skewness), σ (scale), and μ (location). Considering the symmetric case with a standardized scale, we can assume β = μ = 0 and σ = 1, and the corresponding characteristic function reduces to the following equation:

\begin{matrix} E [e^{i t X}] = e^{- {| t |}^{α}} . \end{matrix}

(A1)

The parameter α is called the stability index and takes values in the interval (0, 2]. It should be noted that the stable distribution reduces to the Gaussian distribution when α = 2. As α decreases, the distribution becomes increasingly non-Gaussian and heavy-tailed [62].
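As an illustration of how the stability index controls the tails, the sketch below draws samples from the standardized symmetric stable law with characteristic function (A1) using the Chambers–Mallows–Stuck method. This sampler is a standard technique that is not described in the paper, and the function name `symmetric_stable_sample` and the chosen values of α are assumptions made for illustration. With this parameterisation, α = 2 gives a Gaussian distribution with variance 2, since exp(−t²) is the characteristic function of N(0, 2).

```python
import math
import random

def symmetric_stable_sample(alpha, rng):
    """One draw from the symmetric stable law with E[exp(itX)] = exp(-|t|**alpha)."""
    u = rng.uniform(-math.pi / 2, math.pi / 2)   # uniform angle
    w = rng.expovariate(1.0)                     # unit-mean exponential
    if alpha == 1.0:
        return math.tan(u)                       # Cauchy case
    return (math.sin(alpha * u) / math.cos(u) ** (1.0 / alpha)
            * (math.cos((1.0 - alpha) * u) / w) ** ((1.0 - alpha) / alpha))

def tail_fraction(samples, threshold):
    """Fraction of samples whose magnitude exceeds the threshold."""
    return sum(abs(v) > threshold for v in samples) / len(samples)

rng = random.Random(0)
n = 50_000
gaussian_case = [symmetric_stable_sample(2.0, rng) for _ in range(n)]
heavy_case = [symmetric_stable_sample(1.5, rng) for _ in range(n)]

variance_alpha2 = sum(v * v for v in gaussian_case) / n
print(variance_alpha2)                       # close to 2 for alpha = 2
print(tail_fraction(gaussian_case, 6.0), tail_fraction(heavy_case, 6.0))
```

Lowering α from 2 to 1.5 makes observations beyond a fixed threshold far more frequent, which is the heavy-tailed behaviour described above.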

Appendix A.2. Student’s t Distribution

The Student’s t distribution is a one-parameter family of curves. This distribution is usually employed to test hypotheses about the population mean when the population standard deviation is unknown. The probability density function (PDF) of the Student’s t distribution is the following:

\begin{matrix} y = f (x | ν) = \frac{Γ (\frac{ν + 1}{2})}{Γ (\frac{ν}{2})} \frac{1}{\sqrt{ν π}} \frac{1}{{(1 + \frac{x^{2}}{ν})}^{\frac{ν + 1}{2}}} \end{matrix}

(A2)

where ν is the number of degrees of freedom and Γ(·) is the Gamma function. The result y is the probability density at a particular value x of the Student’s t distribution with ν degrees of freedom.

It should be noted that the mean of the Student’s t distribution is μ = 0 for degrees of freedom ν greater than 1. If ν is less than or equal to 1, then the mean is undefined.

The scale (variance) of the Student’s t distribution is \frac{ν}{ν - 2} for degrees of freedom ν greater than 2. If ν is less than or equal to 2, then the scale (variance) is not finite: it is infinite for 1 < ν ≤ 2 and undefined for ν ≤ 1.
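The density and its moments can be checked numerically. The following minimal sketch evaluates the standard Student's t density, whose normalising constant is Γ((ν + 1)/2) / (Γ(ν/2) √(νπ)), and verifies by trapezoidal integration that it integrates to one and has variance ν/(ν − 2) for ν = 5. The function names, the integration range, and the choice ν = 5 are illustrative assumptions.

```python
import math

def student_t_pdf(x, nu):
    """Student's t density with nu degrees of freedom, computed in log space."""
    log_norm = (math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2)
                - 0.5 * math.log(nu * math.pi))
    return math.exp(log_norm - (nu + 1) / 2 * math.log1p(x * x / nu))

def trapezoid(f, a, b, n):
    """Composite trapezoidal rule with n equal sub-intervals on [a, b]."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b)) + sum(f(a + i * h) for i in range(1, n))
    return total * h

nu = 5
mass = trapezoid(lambda x: student_t_pdf(x, nu), -200.0, 200.0, 40_000)
second_moment = trapezoid(lambda x: x * x * student_t_pdf(x, nu),
                          -200.0, 200.0, 40_000)

print(mass)                          # close to 1
print(second_moment, nu / (nu - 2))  # both close to 5/3
print(student_t_pdf(0.0, 1))         # Cauchy case, equals 1/pi
```

For ν = 1 the density reduces to the Cauchy density 1/(π(1 + x²)), which is a convenient spot check of the normalising constant.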

References

1. Kourou, K.; Exarchos, T.P.; Exarchos, K.P.; Karamouzis, M.V.; Fotiadis, D.I. Machine learning applications in cancer prognosis and prediction. Comput. Struct. Biotechnol. J. 2015, 13, 8–17.
2. Thomsen, K.; Iversen, L.; Titlestad, T.L.; Winther, O. Systematic review of machine learning for diagnosis and prognosis in dermatology. J. Dermatol. Treat. 2020, 31, 496–510.
3. Diez-Olivan, A.; Del Ser, J.; Galar, D.; Sierra, B. Data fusion and machine learning for industrial prognosis: Trends and perspectives towards Industry 4.0. Inf. Fusion 2019, 50, 92–111.
4. Moosavi, F.; Shiri, H.; Wodecki, J.; Wyłomańska, A.; Zimroz, R. Application of Machine Learning Tools for Long-Term Diagnostic Feature Data Segmentation. Appl. Sci. 2022, 12, 6766.
5. Si, X.S.; Wang, W.; Hu, C.H.; Zhou, D.H. Remaining useful life estimation–a review on the statistical data driven approaches. Eur. J. Oper. Res. 2011, 213, 1–14.
6. Ye, Z.S.; Xie, M. Stochastic modelling and analysis of degradation for highly reliable products. Appl. Stoch. Model. Bus. Ind. 2015, 31, 16–32.
7. Kucharczyk, D.; Wyłomańska, A.; Obuchowski, J.; Zimroz, R.; Madziarz, M. Stochastic modelling as a tool for seismic signals segmentation. Shock Vib. 2016, 2016, 8453426.
8. Shiri, H.; Zimroz, P.; Wodecki, J.; Wyłomańska, A.; Zimroz, R. Data-driven segmentation of long term condition monitoring data in the presence of heavy-tailed distributed noise with finite-variance. Mech. Syst. Signal Process. 2023, 205, 110833.
9. Heng, A.; Zhang, S.; Tan, A.C.; Mathew, J. Rotating machinery prognostics: State of the art, challenges and opportunities. Mech. Syst. Signal Process. 2009, 23, 724–739.
10. Kan, M.S.; Tan, A.C.; Mathew, J. A review on prognostic techniques for non-stationary and non-linear rotating systems. Mech. Syst. Signal Process. 2015, 62, 1–20.
11. Liao, L.; Köttig, F. Review of hybrid prognostics approaches for remaining useful life prediction of engineered systems, and an application to battery life prediction. IEEE Trans. Reliab. 2014, 63, 191–207.
12. Zhao, Z.; Wu, J.; Li, T.; Sun, C.; Yan, R.; Chen, X. Challenges and opportunities of AI-enabled monitoring, diagnosis & prognosis: A review. Chin. J. Mech. Eng. 2021, 34, 1–29.
13. Lei, Y.; Li, N.; Guo, L.; Li, N.; Yan, T.; Lin, J. Machinery health prognostics: A systematic review from data acquisition to RUL prediction. Mech. Syst. Signal Process. 2018, 104, 799–834.
14. Xiong, J.; Fink, O.; Zhou, J.; Ma, Y. Controlled physics-informed data generation for deep learning-based remaining useful life prediction under unseen operation conditions. Mech. Syst. Signal Process. 2023, 197, 110359.
15. Yan, B.; Ma, X.; Huang, G.; Zhao, Y. Two-stage physics-based Wiener process models for online RUL prediction in field vibration data. Mech. Syst. Signal Process. 2021, 152, 107378.
16. Wang, W.; Carr, M.; Xu, W.; Kobbacy, K. A model for residual life prediction based on Brownian motion with an adaptive drift. Microelectron. Reliab. 2011, 51, 285–293.
17. Bian, L.; Gebraeel, N. Stochastic methodology for prognostics under continuously varying environmental profiles. Stat. Anal. Data Min. ASA Data Sci. J. 2013, 6, 260–270.
18. Xi, X.; Chen, M.; Zhou, D. Remaining useful life prediction for degradation processes with memory effects. IEEE Trans. Reliab. 2017, 66, 751–760.
19. Ling, M.; Ng, H.; Tsui, K. Bayesian and likelihood inferences on remaining useful life in two-phase degradation models under gamma process. Reliab. Eng. Syst. Saf. 2019, 184, 77–85.
20. Liu, H.; Song, W.; Niu, Y.; Zio, E. A generalized Cauchy method for remaining useful life prediction of wind turbine gearboxes. Mech. Syst. Signal Process. 2021, 153, 107471.
21. Song, W.; Liu, H.; Zio, E. Long-range dependence and heavy tail characteristics for remaining useful life prediction in rolling bearing degradation. Appl. Math. Model. 2022, 102, 268–284.
22. Zhang, H.; Jia, C.; Chen, M. Remaining Useful Life Prediction for Degradation Processes With Dependent and Nonstationary Increments. IEEE Trans. Instrum. Meas. 2021, 70, 1–12.
23. Żuławiński, W.; Maraj-Zygmąt, K.; Shiri, H.; Wyłomańska, A.; Zimroz, R. Framework for stochastic modelling of long-term non-homogeneous data with non-Gaussian characteristics for machine condition prognosis. Mech. Syst. Signal Process. 2023, 184, 109677.
24. Liu, D.; Luo, Y.; Liu, J.; Peng, Y.; Guo, L.; Pecht, M. Lithium-ion battery remaining useful life estimation based on fusion nonlinear degradation AR model and RPF algorithm. Neural Comput. Appl. 2014, 25, 557–572.
25. Su, C.; Chen, H. A review on prognostics approaches for remaining useful life of lithium-ion battery. IOP Conf. Ser. Earth Environ. Sci. 2017, 93, 012040.
26. Prado, R.; Huerta, G.; West, M. Bayesian time-varying autoregressions: Theory, methods and applications. Resenhas Inst. Mat. Estat. Univ. São Paulo 2000, 4, 405–422.
27. Prado, R.; West, M. Time Series: Modeling, Computation, and Inference; Chapman and Hall/CRC: Boca Raton, FL, USA, 2010.
28. Krishnan, M.; Bhowmik, B.; Hazra, B.; Pakrashi, V. Real time damage detection using recursive principal components and time varying auto-regressive modeling. Mech. Syst. Signal Process. 2018, 101, 549–574.
29. Zhang, L.; Xiong, G.; Liu, H.; Zou, H.; Guo, W. Time-frequency representation based on time-varying autoregressive model with applications to non-stationary rotor vibration analysis. Sadhana 2010, 35, 215–232.
30. Amir, N.; Gath, I. Segmentation of EEG during sleep using time-varying autoregressive modeling. Biol. Cybern. 1989, 61, 447–455.
31. Wei, H.L.; Billings, S.A.; Liu, J.J. Time-varying parametric modelling and time-dependent spectral characterisation with applications to EEG signals using multiwavelets. Int. J. Model. Identif. Control 2010, 9, 215–224.
32. Sui, Y.; Holan, S.H.; Yang, W.H. Bayesian Circular Lattice Filters for Computationally Efficient Estimation of Multivariate Time-Varying Autoregressive Models. arXiv 2022, arXiv:2206.12280.
33. Rudoy, D.; Quatieri, T.F.; Wolfe, P.J. Time-varying autoregressions in speech: Detection theory and applications. IEEE Trans. Audio Speech Lang. Process. 2010, 19, 977–989.
34. Noda, A. A test of the adaptive market hypothesis using a time-varying AR model in Japan. Financ. Res. Lett. 2016, 17, 66–71.
35. Xu, K.L.; Phillips, P.C. Adaptive estimation of autoregressive models with time-varying variances. J. Econom. 2008, 142, 265–280.
36. Jiang, X.Q. Time Varying Coefficient AR and VAR Models. In The Practice of Time Series Analysis; Akaike, H., Kitagawa, G., Eds.; Springer: New York, NY, USA, 1999; pp. 175–191.
37. Nectoux, P.; Gouriveau, R.; Medjaher, K.; Ramasso, E.; Chebel-Morello, B.; Zerhouni, N.; Varnier, C. PRONOSTIA: An experimental platform for bearings accelerated degradation tests. In Proceedings of the IEEE International Conference on Prognostics and Health Management, PHM’12, Denver, CO, USA, 18–21 June 2012; IEEE Catalog Number: CPF12PHM-CDR. pp. 1–8.
38. Mosallam, A.; Medjaher, K.; Zerhouni, N. Time series trending for condition assessment and prognostics. J. Manuf. Technol. Manag. 2014, 25, 550–567.
39. Loutas, T.H.; Roulias, D.; Georgoulas, G. Remaining useful life estimation in rolling bearings utilizing data-driven probabilistic e-support vectors regression. IEEE Trans. Reliab. 2013, 62, 821–832.
40. Javed, K.; Gouriveau, R.; Zerhouni, N.; Nectoux, P. Enabling health monitoring approach based on vibration data for accurate prognostics. IEEE Trans. Ind. Electron. 2014, 62, 647–656.
41. Singleton, R.K.; Strangas, E.G.; Aviyente, S. Extended Kalman filtering for remaining-useful-life estimation of bearings. IEEE Trans. Ind. Electron. 2014, 62, 1781–1790.
42. Zhang, B.; Zhang, L.; Xu, J. Degradation feature selection for remaining useful life prediction of rolling element bearings. Qual. Reliab. Eng. Int. 2016, 32, 547–554.
43. Hong, S.; Zhou, Z.; Zio, E.; Wang, W. An adaptive method for health trend prediction of rotating bearings. Digit. Signal Process. 2014, 35, 117–123.
44. Lei, Y.; Li, N.; Gontarz, S.; Lin, J.; Radkowski, S.; Dybala, J. A model-based method for remaining useful life prediction of machinery. IEEE Trans. Reliab. 2016, 65, 1314–1326.
45. Nie, Y.; Wan, J. Estimation of remaining useful life of bearings using sparse representation method. In Proceedings of the 2015 Prognostics and System Health Management Conference (PHM), Beijing, China, 21–23 October 2015; pp. 1–6.
46. Shiri, H.; Zimroz, P.; Wodecki, J.; Wyłomańska, A.; Zimroz, R.; Szabat, K. Using long-term condition monitoring data with non-Gaussian noise for online diagnostics. Mech. Syst. Signal Process. 2023, 200, 110472.
47. Kimotho, J.K.; Sondermann-Wölke, C.; Meyer, T.; Sextro, W. Machinery Prognostic Method Based on Multi-Class Support Vector Machines and Hybrid Differential Evolution–Particle Swarm Optimization. Chem. Eng. Trans. 2013, 33.
48. Zurita, D.; Carino, J.A.; Delgado, M.; Ortega, J.A. Distributed neuro-fuzzy feature forecasting approach for condition monitoring. In Proceedings of the 2014 IEEE Emerging Technology and Factory Automation (ETFA), Barcelona, Spain, 16–19 September 2014; pp. 1–8.
49. Guo, L.; Gao, H.; Huang, H.; He, X.; Li, S. Multifeatures fusion and nonlinear dimension reduction for intelligent bearing condition monitoring. Shock Vib. 2016, 2016, 4632562.
50. Jin, X.; Sun, Y.; Que, Z.; Wang, Y.; Chow, T.W. Anomaly detection and fault prognosis for bearings. IEEE Trans. Instrum. Meas. 2016, 65, 2046–2054.
51. Shiri, H.; Wodecki, J.; Zimroz, R. Robust switching Kalman filter for diagnostics of long-term condition monitoring data in the presence of non-Gaussian noise. IOP Conf. Ser. Earth Environ. Sci. 2023, 1189, 012007.
52. Li, H.; Wang, Y. Rolling bearing reliability estimation based on logistic regression model. In Proceedings of the 2013 International Conference on Quality, Reliability, Risk, Maintenance and Safety Engineering (QR2MSE), Chengdu, China, 15–18 July 2013; pp. 1730–1733.
53. Huang, Z.; Xu, Z.; Ke, X.; Wang, W.; Sun, Y. Remaining useful life prediction for an adaptive skew-Wiener process model. Mech. Syst. Signal Process. 2017, 87, 294–306.
54. Wang, Y.; Peng, Y.; Zi, Y.; Jin, X.; Tsui, K.L. A two-stage data-driven-based prognostic approach for bearing degradation problem. IEEE Trans. Ind. Inform. 2016, 12, 924–932.
55. Shiri, H.; Zimroz, P.; Wyłomańska, A.; Zimroz, R. Estimation of machinery’s remaining useful life in the presence of non-Gaussian noise by using a robust extended Kalman filter. Measurement 2024, 235, 114882.
56. Wang, L.; Zhang, L.; Wang, X.Z. Reliability estimation and remaining useful lifetime prediction for bearing based on proportional hazard model. J. Cent. South Univ. 2015, 22, 4625–4633.
57. Xiao, L.; Chen, X.; Zhang, X.; Liu, M. A novel approach for bearing remaining useful life estimation under neither failure nor suspension histories condition. J. Intell. Manuf. 2017, 28, 1893–1914.
58. Bechhoefer, E.; Schlanbusch, R. Generalized Prognostics Algorithm Using Kalman Smoother. IFAC-PapersOnLine 2015, 48, 97–104. In Proceedings of the 9th IFAC Symposium on Fault Detection, Supervision and Safety for Technical Processes SAFEPROCESS, Paris, France, 2–4 September 2015.
59. Saidi, L.; Ali, J.B.; Bechhoefer, E.; Benbouzid, M. Wind turbine high-speed shaft bearings health prognosis through a spectral Kurtosis-derived indices and SVR. Appl. Acoust. 2017, 120, 1–8.
60. Saidi, L.; Bechhoefer, E.; Ali, J.B.; Benbouzid, M. Wind turbine high-speed shaft bearing degradation analysis for run-to-failure testing using spectral kurtosis. In Proceedings of the 2015 16th International Conference on Sciences and Techniques of Automatic Control and Computer Engineering (STA), Monastir, Tunisia, 21–23 December 2015; pp. 267–272.
61. Ali, J.B.; Saidi, L.; Harrath, S.; Bechhoefer, E.; Benbouzid, M. Online automatic diagnosis of wind turbine bearings progressive degradations under real experimental conditions based on unsupervised machine learning. Appl. Acoust. 2018, 132, 167–181.
62. Burnecki, K.; Wyłomańska, A.; Beletskii, A.; Gonchar, V.; Chechkin, A. Recognition of stable distribution with Lévy index α close to 2. Phys. Rev. E 2012, 85, 056711.

Figure 1. Example of generated degradation curves with the proposed methodology.

Figure 2. The flowchart of the proposed methodology (each block of this diagram is described in detail in the following).

Figure 3. Generated HI: (a) simulated health index (HI), (b) deterministic component of simulated HI, (c) random component of simulated HI, (d) scale (variance) of simulated HI, (e) AR coefficient of simulated HI.

Figure 4. Results of the proposed approaches on simulated health index (HI): (a) trend (deterministic component) identification, (b) identified random component, (c) identified scale (variance) of random component, and (d) identified AR coefficient.

Figure 5. Results of the proposed procedures on 100-simulation health index (trend, variance (scale), AR (1) coefficient). Left column: the proposed method. Right column: the evaluation of the results by RMSE (trend and variance (scale)) by boxplots.

Figure 6. FEMTO test rigs [37].

Figure 7. FEMTO data set: (a) raw bearing run-to-failure vibration signals (b) and HI (RMS).

Figure 8. Wind turbine test rigs [58].

Figure 9. (a) Procedure for extracting HI from wind turbine data [58]; (b) extracted HI showing inner race degradation.

Figure 10. Results of the applied methodology for FEMTO case study data set.

Figure 11. Quantile lines (blue) at the 5% and 95% levels, constructed on the basis of simulated trajectories corresponding to the fitted proposed model.

Table 1. Characteristics of the proposed degradation model.

Property	Regime 1	Regime 2	Regime 3
Trend	Constant	Linear	Exponential or polynomial
Scale	Nearly constant	Linearly growing	Exponential or polynomial growing
Autodependence of noise	White/coloured	White/coloured	White/coloured
Noise distribution	Gaussian	Gaussian	Gaussian

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
