Next Article in Journal
Go Source Code Vulnerability Detection Method Based on Graph Neural Network
Previous Article in Journal
LLM-Based Cyberattack Detection Using Network Flow Statistics
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modeling of Distorted Degradation Data Based on Oil Analysis

1
School of Statistics, Capital University of Economics and Business, Beijing 100070, China
2
Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100864, China
3
School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(12), 6531; https://doi.org/10.3390/app15126531
Submission received: 6 May 2025 / Revised: 5 June 2025 / Accepted: 9 June 2025 / Published: 10 June 2025

Abstract

Degradation data are important in judging a machine’s health condition and providing early warning of machine failure. However, interference factors (e.g., oil top-ups) may distort degradation observations, causing the observed data to deviate from the actual physical degradation curve that engineers rely on. To address this distortion problem, this paper proposes a statistical correction framework to recover the actual degradation curve. The main contributions are as follows: First, we developed a degradation correction model that automatically identifies oil top-up events while globally rectifying distorted data, achieving an accurate reconstruction of physical degradation curves. Second, we developed a three-step explosion search algorithm for robust parameter estimation. Notably, though our methodology was initially developed for wear degradation analysis, this data-driven framework demonstrates adaptability to broader degradation scenarios. Finally, numerical simulations and case studies confirmed the practical effectiveness of the proposed method.

1. Introduction

Prognostics and Health Management (PHM) focuses on condition monitoring and the extrapolation of degradation trends based on recent observations [1,2,3]. Among the established techniques for monitoring the state of machinery, oil analysis is the most popular. Oil monitoring is mainly concerned with the analysis of sampled oil to detect wear-related problems [4,5,6,7]. Shell in its advisory report indicates that oil contamination causes approximately 70% of in-service failures in diesel engines, of which 50% are the result of wear-related problems [8]. The wear is complex, including multiple friction pairs, such as wet clutches, transmission gears, sealing rings, bearings, and confluence planetary gear trains.
The wear describes the degradation of mechanical systems to some degree. However, unless the wear is directly observable, such as in the case of a brake pad, it is difficult to quantify it in general. For most mechanical systems, wear is not directly observable and can only be assessed via other measured condition information data such as the concentrations of metal elements. The concentration of metal within oil samples is a good indicator of wear and has been widely used for wear evaluation [8,9,10,11,12]. Wang [8] used metal concentrations to describe the deterioration of a maintained plant. Vališ et al. [9] considered the metal concentration as a potential failure indicator and then proposed a linear regression model to determine a linear course of metal particle generation. Zheng et al. [10] used the Wiener process to model metal concentrations and thus optimized the planned maintenance interval of the power shift steering transmission.
However, in practical scenarios, many factors such as oil top-ups, wear particle removal, and different measurement methods can disturb the observation of the metal concentration, thereby causing the observed data to deviate from the actual physical degradation. This distortion (deviation) indicates that it is infeasible to directly use the observed data to evaluate the wear condition. In other words, it is necessary to recover the actual physical degradation curve based on the observed data.
Macián et al. [13] considered the influence of oil consumption and replenishment and developed an analytical approach to the wear rate of internal combustion engines based on spectrometric oil analysis. Bagshaw et al. [14] used oil volume correction to transform the observed data into absolute wear values. Feng et al. [15] considered the effect of the time-varying wear coefficient. A model of wear debris concentration was built based on Kragelsky’s method with different wear coefficients in corresponding wear stages. However, the literature mentioned above assumes that the refilling is carried out at predetermined maintenance intervals [13,14,15,16] and does not consider dynamic, non-periodic oil top-ups driven by real-time operational conditions. Moreover, the above models were established based on particular mechanical structures such as an internal combustion engine, a continuous flow stirred tank reactor (CFSTR), or a gearbox, thereby restricting their applications.
Considering the disturbance caused by non-periodic oil top-ups, a piecewise linear method was employed to address the resulting variations in oil spectrum analysis [17,18]. The Hermite interpolation approach was used to model the wear characteristics [19]. The grey model and AR time series model were also adopted [20]. However, these methods rely on subjective judgments to identify oil top-up points and use local observed information to correct the data. This indicates that these methods not only lack objectivity but also do not make full use of existing observation information.
This paper focuses on rectifying observed degradation data distorted by non-periodic oil top-ups. As shown in Figure 1 (Vehicle 1 dataset), the observed data exclusively contain Cu (copper) concentration measurements with the corresponding sampling time but lack documentation of oil top-up times and quantities. Detailed experimental information is provided in Section 4. It is evident that the acquired Cu concentration exhibits non-monotonic behavior despite the inherent monotonicity of mechanical wear processes. This discrepancy stems from dilution effects caused by fresh oil additions. The contributions of this paper can be summarized as follows:
  • A statistical correction model is proposed to process degradation data that are disturbed by non-periodic oil top-ups. The proposed model can not only automatically identify the locations of oil top-up points but also globally correct the observed degradation data.
  • A three-step explosion search algorithm is developed, enabling robust parameter estimation through both parametric and non-parametric approaches.
  • The corrected degradation data can reflect the actual physical degradation; thus, these corrected data can be used to assess system conditions and predict a failure.
Notably, while the methodology is developed through wear degradation analysis, its data-driven architecture ensures generalizability to various degradation monitoring scenarios requiring measurement distortion correction.
The rest of the paper is organized as follows: The proposed methodology is presented in detail in Section 2. In Section 3, we perform some numerical simulations. Case studies are shown in Section 4. Section 5 provides discussion and conclusions.

2. Methodology

2.1. Data and Model

The actual physical degradation serves as a “health indicator” for system wear; however, the observation of the physical degradation process is affected by oil top-ups. To be specific, once the fresh oil is added, the observed degradation data, such as the concentration of metal elements, will be diluted. Therefore, it is necessary to correct the observed degradation data. Before establishing the model, we have the following assumptions: If the mass of the oil added is unknown, then the observed degradation data, namely, the concentrations of metal elements, are diluted in the same ratio each time fresh oil is added. This indicates that the oil top-up effect can be described by a constant.
The definitions of all variables and mathematical symbols are listed in Table 1. If the observed error cannot be neglected, then the observed degradation is random, namely,
y i = f i + ε i ,
where ε i is the random error and follows a normal distribution with zero mean and standard deviation σ .
We established a model to describe the relationship between observed degradation data without measurement errors and the actual physical degradation process. Typically, we consider three cases: complete observation data, partially missing data, and completely missing data.
Case I: The observed data consist of t i , y i , z i , δ i i = 1 n , representing complete observations. If fresh oil is added after the ( i 1 ) th observation, then the concentration becomes z i f i 1 ; otherwise, it remains f i 1 . Therefore, neglecting measurement errors, the observed degradation f i consists of two components: the cumulative degradation z i δ i f i 1 and the increment in the degradation process h i h i 1 . Formally, the correction model can be expressed as follows:
f i = h i , i = 1 ; h i h i 1 + z i δ i f i 1 , i = 2 , , n .
Case II: The observed data consist of t i , y i , δ i i = 1 n , which are partial missing observations. Since the effects of oil top-ups are unknown, their mean value α is adopted. Therefore, in this case, the model can be represented as follows:
f i = h i , i = 1 ; h i h i 1 + α δ i f i 1 , i = 2 , , n .
Case III: The observed data consist of t i , y i i = 1 n , which are complete missing observations. The formula of the model is also given by Equation (3). However, we note that δ i is unknown in this case.

2.2. Parameter Estimation

Indeed, engineers primarily focus on the actual degradation data h i . Case II can be considered a bridge, and it links Case I and Case III. Thus, we start with Case II to explore the parameter estimation approaches.

2.2.1. Parametric Estimation Approach

If the parametric representation of the physical degradation process can be derived based on available physical theories, then the parametric estimation approach is a good choice. Here, the physical degradation h is a function of the observed time t and the unknown parameter vector θ , given by h ( t , θ ) .
Through a simple derivation, Equation (3) for Case II can be reformulated as follows:
f ( t i , θ , α ) = h ( t i , θ ) , i = 1 , h ( t i , θ ) + α δ i 1 h ( t i 1 , θ ) i = 2 , h ( t i , θ ) + α δ i 1 h ( t i 1 , θ ) + k = 1 i 2 α δ k + 1 1 h ( t k , θ ) j = k + 2 i α δ j , otherwise ,
where f ( t i , θ , α ) f i and h ( t i , θ ) h i . Under the assumption that the observed error follows the normal distribution, the likelihood function is given by
L n ( θ , α , σ ) = 1 ( 2 π ) 2 / n σ n exp i = 1 n f ( t i , θ , α ) y i 2 2 σ 2 .
We use traditional optimization methods such as the Gauss–Newton iteration method to maximize this likelihood function. Then, the maximum likelihood estimations of the unknown parameters obtained are denoted by ( θ ^ , α ^ , σ ^ ) .
Similarly, we can obtain the maximum likelihood estimations ( θ ^ , σ ^ ) for Case I. Compared with Case I, there is an additional parameter α under Case II.
Under Case III, the locations of oil top-up points are unknown; therefore, it is necessary to identify them. Inspired by the monotonic non-decreasing nature of the physical degradation, we propose an explosion search algorithm, which detects the oil top-up points from the decreased points. Note that the decreased point refers to the point where the observed degradation is lower than the previous point.
More specifically, the indicator of whether the fresh oil is added may be 1 or 0 at the decreased point, while it is fixed to 0 at the increased point. Therefore, with d decreased points, there are 2 d candidate degradation paths. For each candidate degradation path, we calculate its maximum likelihood estimation based on Equation (4). Then, we obtain 2 d estimations and select one with the minimum residual sum of squares as the final estimation. The procedure for the parametric estimation approach for Case III is described by Algorithm 1 in detail.
Algorithm 1: The three-step explosion search algorithm
  • Step 1:
  • Find the candidate oil top-up paths:
  • Find the locations of the decreased points;
  • Calculate the number of the decreased points, denoted by d;
  • Define the candidate oil top-up paths δ i , i = 1 , , 2 d .
  • Step 2:
  • Calculate the maximum likelihood estimation of each candidate oil top-up path:
  • for all  i = 1 , , 2 d  do
  •     Calculate maximum likelihood estimations ( θ ^ i , α ^ i , σ ^ i ) by Equation (4), where δ = δ i ;
  •     Calculate the residual sum of squares, r e i = j = 1 n y j f ( t j , θ ^ i , α ^ i ) 2 .
  • end for
  • Step 3:
  • Obtain the final estimation:
  • j = arg min i r e i , θ ^ = θ ^ j , α ^ = α ^ j , σ ^ = σ ^ j .
In addition to point estimation, engineers may be also interested in interval estimation. We take Case II as an example to illustrate how to use the percentile bootstrap to construct confidence intervals.
(1)
Generate bootstrap sample y b = ( y 1 b , y 2 b , , y n b ) , b = 1 , , B , where y i b = f ( t i , θ ^ , α ^ ) + ε i b , i = 1 , , n , and ε 1 b , , ε n b are independent and identically distributed residuals generated from N ( 0 , σ ^ 2 ) .
(2)
Calculate the maximum likelihood estimation θ ^ b of the bth bootstrap sample y b , b = 1 , , B .
(3)
The 100 ( 1 α ) % percentile bootstrap interval of h ( t i , θ ) is given by
[ h l ( t i ) , h u ( t i ) ] ,
where h l ( t i ) and h u ( t i ) are the lower and upper α / 2 quantiles of { h ( t i , θ ^ b ) , b = 1 , , B } , respectively.

2.2.2. Non-Parametric Estimation Approach

To avoid possible model misspecification in a parametric analysis, we consider an alternative approach, namely, the non-parametric estimation approach.
Under Case II, we use a natural cubic smoothing spline [21] to fit the actual physical degradation curve. The natural cubic smoothing spline is a smooth and continuous twice differentiable curve that minimizes the penalized sum of squares. The penalized sum of squares consists of two parts: the residual sum of squares that quantifies the goodness-of-fit to the data, and the roughness penalty term based on the second derivative. More specifically, we have the following optimization problem:
min α , h i = 1 n ( f i y i ) 2 + λ h ( t ) 2 d t subject   to   h i h i 1 0 , i = 2 , , n , 0 α 1 ,
where h = ( h 1 , , h n ) T , λ is a positive smoothing parameter, controlling the trade-off between fidelity to the data and the roughness of the function estimation, and the constraint condition h i h i 1 0 , to some extent, ensures that the estimation of the actual physical degradation curve is monotonically non-decreasing. Then, the interior-point method is used to solve this problem (see Appendix A for more details).
Under Case III, due to the absence of δ = ( δ 1 , , δ n ) T , the three-step explosion search algorithm in the previous subsection is used in the non-parametric estimation approach. Note that its second step is replaced by the non-parametric estimation based on Equation (5). Moreover, similarly to the parametric estimation approach, we can compute the bootstrap confidence interval for non-parametric estimation.

2.3. Failure Prediction

The estimation of the actual physical degradation curve, which is also a correction to the observed degradation, can reflect the actual physical degradation well. Consequently, this estimation can be used to assess the system state and predict a failure. A failure occurs when the degradation curve h ( t ) hits the failure threshold ω that is empirically predetermined by engineers. According to the concept of First Hitting Time (FHT), the lifetime T of the system is defined as
T = inf { t : h ( t ) ω | h ( 0 ) ω } .
Moreover, the Remaining Useful Life (RUL) is predicted; that is, T t n .

2.4. Implementation Procedure

In order to illustrate how to use our methodology, we show its implementation procedure in Figure 2.

3. Numerical Simulations

In order to compare the proposed methodology with the piecewise linear method [17], two kinds of degradation functions are considered in this subsection.
Specifically, the observed degradations y i is given by y i = f i + ε i , where f i = h i h i 1 + z i δ i f i 1 . Here, the actual degradation h i equals h ( t i ) . The observed points t i ( i = 1 , . . . , n ) with n = 30 are equally spaced in [ 0 , 1 ] . The measure error ε i follows a normal distribution with zero mean and standard deviation σ = 0.5 . δ i follows Bernoulli distribution with the parameter p = 0.3 . z i follows the normal distribution with mean α and variance 0.01 , and we set α = 0.6 . We simulate 1000 datasets based on the above parameter configuration and calculate the estimations for each dataset.
Since it is impossible to enumerate all degradation patterns, here, we will only choose certain representative patterns for illustration. We consider the following two degradation scenarios for all the three cases defined in Section 2:
  • Scenario (I): The actual degradation h ( t ) is linear given by h ( t ) = a + b t where a = 10 , b = 30 .
  • Scenario (II): The actual degradation h ( t ) is non-linear given by h ( t ) = a + b t c where a = 12 , b = 28 , c = 0.5 .
For the parameter estimations obtained by the parametric approach, we calculate the mean of these 1000 estimations, as well as the standard deviation (SD) and the mean square error (MSE). The results of Scenario (I) and those of Scenario (II) are shown in Table 2 and Table 3, respectively. We can see that the SD and MSE are very small for each parameter. Moreover, the mean values of estimations are very close to the true values. These results show that the proposed parametric approach indeed works well.
The results obtained by the non-parametric approach in Scenario (I) and Scenario (II) are, respectively, plotted in Figure 3 and Figure 4. The top, middle, and bottom rows display the non-parametric estimations for Case I, Case II, and Case III, respectively. The 95% point-wise confidence intervals are computed by taking the 2.5 th percentile and the 97.5 th percentile of 1000 estimations at each observed time point. It can be seen that the mean values of the non-parametric estimations are very close to the actual degradation curve and the confidence intervals are narrow. These results indicate that the proposed non-parametric approach performs well.
Moreover, the piecewise linear method [17] is also used in our simulation. The results obtained by the piecewise linear method are also shown in Figure 3 and Figure 4. We can see that the following is true: (a) The mean values of estimations obtained by the piecewise linear method are farther away from the actual degradation curve than those obtained by our non-parametric approach. (b) The confident intervals obtained by the piecewise linear method are wider than those obtained by our non-parametric approach. These results imply that the performances of our non-parametric approach are better than those of the piecewise linear method.
In addition, from Table 2 and Table 3 and Figure 3 and Figure 4, we can conclude that the estimations for Case I are more precise than those for Case II, and the estimates for Case II are better than those for Case III. This is reasonable, because the more complete the observed data, the more accurate the obtained estimations. Moreover, we computed the coverage probabilities for the 95 % bootstrap confidence intervals, and we list them in Table 4 and Table 5. We can observe that the coverage probabilities are all around 95%. This indicates that our model and the corresponding estimation approaches are effective.
Overall, simulation studies show that our model and the corresponding estimation approaches perform well whether the degradation is linear or non-linear. Moreover, our methodology can recover the actual degradation curve better than the piecewise linear method; therefore, we can conclude that our methodology is superior to the piecewise linear method.

4. Case Studies

In this section, we validate the practical applicability of our proposed degradation model by investigating wear degradation data in the power-shift steering transmission (PSST) of tracked vehicles. PSST is characterized by its high loading capacity. The wear in PSST is complex, occurring by the plastic displacement of surface and near-surface material and by the detachment of particles. The metal particles of various shapes and sizes that are formed by the wear are distributed in the oil as contaminants.
Our experiment is an on-road test rather than a laboratory test. To be specific, the tracked vehicle runs on the road and the oil is sampled at discrete time points. Then, the spectrometric oil analysis technique is used to measure concentrations of metal particles within oil samples. In consideration of oil consumption, at some sampling points, fresh oil is added to the PSST after sampling. The Cu (copper) element represents the overall wear level. Therefore, the observations for Cu in the PSST of Vehicle 1 are shown in Figure 1 and listed in Table 6. We can see that the concentrations of Cu are non-monotone although the actual wear process is monotone. This is due to the fact that the addition of fresh oil reduces the concentration of Cu in the PSST.
Consequently, it is necessary to correct the observed data and thus recover the actual degradation curve. Unfortunately, as shown in Table 6, the data obtained only include the sampling time and the observed concentration, without the oil top-up time and the oil top-up quantity. Thus, we model these observed data based on Case III defined in Section 2 and then, respectively, use the parametric and non-parametric approaches to estimate the parameters of the model.
In the parametric approach, the parametric representation of the physical degradation process is critical. The fresh oil is added after running for some time; therefore, the first few observed concentrations are not affected by the fresh oil and can represent the actual physical degradation. According to the first few observed concentrations in Figure 1, we can assume that the running time and actual physical degradation are non-linearly expressed as follows:
h ( t ) = a + b t c ,
where t is the time and a , b , c are unknown parameters.
In the non-parametric approach, as introduced in Section 2, we use a natural cubic smoothing spline to fit the actual physical degradation curve.
Figure 5 shows the results for the parametric and non-parametric approaches, including the observed data, the detected oil top-up points, the actual degradation (the corrected curve), and the bootstrap confidence intervals. We can see that the number of oil top-ups is relatively large. Frequent oil top-ups may be due to high-intensity work or a high leakage rate. Moreover, we notice that there is a big gap between the observed degradation and the corrected one. The “uncorrected” (observed) data are non-monotone and relative while the “corrected” (actual) data are monotone and absolute. This implies that the corrected curve reflects the actual physical degradation process better than the observed data. Therefore, it is meaningful to use our methodology to correct the observed data.
Based on actual experience, the failure threshold of Cu is set to 160 ppm. Figure 5 also shows the FHT for failure occurrence based on parametric and non-parametric approaches. Furthermore, we can see that the actual wear degradation has not reached the failure threshold yet, and thus Vehicle 1 can still work normally for a while.
The observed degradation data in the PSST of Vehicle 2 are shown in Table 7. Figure 6 shows the results obtained, respectively, by the parametric approach and the non-parametric approach. It can be seen that the number of oil top-ups is relatively small, and there is a small gap between the observed degradation and the corrected one.
According to the above two examples, our methodology can correct the distortion data and obtain an estimation of the actual degradation curve. The “uncorrected” (observed) concentration only indicates that wear is taking place while the “corrected” (actual) concentration reflects the degree of wear. Therefore, our methodology can avoid the misdiagnosis of wear condition to some degree.
Furthermore, we compared our methodology with the piecewise linear method [17]. The results shown in Figure 7 and Figure 8 can be summarized as follows:
  • The estimation of the actual degradation obtained by the piecewise linear method is the minimum among the three methods. This indicates that the possibility of missing alarm is relatively high and the possibility of false alarm is relatively low if the piecewise linear method is used. Therefore, if engineers’ goal is to minimize the possibility of false alarms, then the piecewise linear method is a good choice.
  • The estimation of the actual degradation obtained by the non-parametric approach is the maximum among the three methods. This means that the possibility of missing an alarm is relatively low and the possibility of a false alarm is relatively high if the non-parametric approach is used. Consequently, when engineers want to pay more attention on the possibility of missing an alarm, the non-parametric approach should be used.
  • The estimation of the actual degradation obtained by the parametric approach is located in the middle of the three methods. Therefore, if engineers are very certain about the parametric representation of the physical degradation process, it is recommended to use the parametric approach.
It should be noted that the piecewise linear method has three drawbacks: Firstly, it uses local observations to piecewisely correct the observed degradation, rather than global observations. Secondly, it uses engineers’ subjective determination to identify the disturbed points, thereby lacking objectivity. Thirdly, it assumes that the actual degradation curve is linear. However, the actual degradation is non-linear in many scenarios.
In conclusion, compared with the piecewise linear method, our model and the corresponding estimate approaches have some obvious advantages, as follows:
  • Our model is established based on the global observed data, so it can locate more valuable information in the observed data.
  • Our model can identify the disturbed points automatically, thereby avoiding subjective judgment and improving the objectivity and accuracy.
  • Both the parametric approach and the non-parametric approach can describe various kinds of degradation patterns, whether linear or non-linear. Therefore, our approaches have wider applicability and practicability.

5. Discussion and Conclusions

As we all know, most of the actual degradation process is monotone. However, in some circumstances, such as the observation being disturbed by some factors, the observed degradation data lose their monotonicity and are distorted. In this paper, we propose a model to characterize this distorted case and come up with the corresponding approach to solving this model.
Compared with the piecewise linear method [17], the simulation results demonstrate that our methodology can automatically identify disturbed points, thereby avoiding the subjective judgment required for the piecewise linear method, and can also more accurately recover the actual degradation curves. Importantly, our method is flexible enough to model both linear and non-linear degradation patterns, which expands its applicability and practical utility beyond that of the piecewise linear method. Case studies demonstrate that our methodology can describe the effect of disturbed factors, as well as recovering the actual degradation curve and automatically identifying the disturbed points. This is significantly meaningful for condition monitoring, because it avoids a false diagnosis of degradation to some degree.
We remark that our proposed model can be generalized to a situation where the physical degradation process is monotone while the observed process is not monotone due to the disturbance of a point process. Specifically, without observed error, the observation is a perturbation of the actual physical degradation process, as follows:
f ( t ) = h ( t ) + 0 t f ( τ ) d N ( τ ) ,
where the actual physical degradation h ( t ) is a monotone process, such as a Gamma process or inverse Gaussian process, N ( t ) = t i t z i δ i 1 is a marked point process, the random variable z i indicates the influence of disturbance, and the random variable δ i takes one of two values, 0 or 1; it equals 1 when there is an existing disturbance at the ith observed time t i and equals 0 otherwise. If the observed error cannot be neglected, then the observed degradation is as follows:
y ( t ) = f ( t ) + ε ( t ) ,
where ε ( t ) is the stochastic observed error with E ( ε ( t ) ) = 0 .
Particle morphology influences wear mechanism identification. Therefore, these morphological parameters should be incorporated into future investigations. Furthermore, while the current validation is limited to the PSST, extending model verification to multiple machinery types would strengthen the generalizability. Additionally, integrating data-driven methodology with equipment-specific engineering knowledge represents a critical direction for subsequent research.

Author Contributions

Conceptualization, Y.C.; methodology, Y.C.; supervision, J.S.; writing—review and editing, Y.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Youth Academic Innovation Team Construction project of Capital University of Economics and Business, grant No.QNTD202303.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Based on Equation (3), with trivial derivation, the first term in Equation (5) can be rewritten as ( f y ) T ( f y ) = ( A α h y ) T ( A α h y ) , where f = ( f 1 , , f n ) T , A α is an n × n matrix with respect to α given by
A α = 1 0 0 0 α δ 2 1 1 0 0 ( α δ 2 1 ) j = 3 n 1 α δ j ( α δ 3 1 ) j = 4 n 1 α δ j 1 0 ( α δ 2 1 ) j = 3 n α δ j ( α δ 3 1 ) j = 4 n α δ j α δ n 1 1 .
Note that we use a natural cubic smoothing spline to fit the data. Therefore, the roughness penalty has the form
h ( t ) 2 d t = h T K h ,
where K is an n × n matrix given by K = Q T R 1 Q . Here, Q is an ( n 2 ) × n matrix of second differences with elements Q i i = 1 / τ i , Q i , i + 1 = 1 / τ i 1 / τ i + 1 and Q i , i + 2 = 1 / τ i + 1 , R is an ( n 2 ) × ( n 2 ) symmetric tri-diagonal matrix with elements R i 1 , i = R i , i 1 = τ i / 6 and R i i = ( τ i + τ i + 1 ) / 3 , where τ i = t i + 1 t i is the distance between successive knots.
Above all, Equation (5) can be transformed into the following form:
min α , h ( A α h y ) T ( A α h y ) + λ h T K h subject   to   B h 0 , 0 α 1 ,
where B is an ( n 1 ) × n matrix given by B = 1 1 0 0 0 0 1 1 0 0 0 0 0 1 1 .
Obviously, the inter-point method can be adapted to the above optimization problem. In addition, the grid search method is used to select the parameter λ .

References

  1. Sikorska, J.Z.; Hodkiewicz, M.; Ma, L. Prognostic modelling options for remaining useful life estimation by industry. Mech. Syst. Signal Process. 2011, 25, 1803–1836. [Google Scholar] [CrossRef]
  2. Zhang, Z.; Si, X.; Hu, C.; Lei, Y. Degradation data analysis and remaining useful life estimation: A review on Wiener-Process-based methods. Eur. J. Oper. Res. 2018, 271, 775–796. [Google Scholar] [CrossRef]
  3. Cao, W.; Dong, G.; Chen, W.; Wu, J.; Xie, Y.B. Correction strategies of debris concentration for engine wear monitoring via online visual ferrograph. Proc. Inst. Mech. Eng. Part J J. Eng. Tribol. 2015, 229, 1319–1329. [Google Scholar] [CrossRef]
  4. Zhu, X.; Zhong, C.; Zhe, J. Lubricating oil conditioning sensors for online system health monitoring—A review. Tribol. Int. 2017, 109, 473–484. [Google Scholar] [CrossRef]
  5. Zhu, J.; Yoon, J.; He, D.; Qiu, B.; Bechhoefer, E. Online condition monitoring and remaining useful life prediction of particle contaminated lubrication oil. In Proceedings of the IEEE Conference on Prognostics and Health Management (PHM), Gaithersburg, MD, USA, 24–27 June 2013; pp. 1–14. [Google Scholar]
  6. Du, Y.; Wu, T.; Makis, V. Parameter estimation and remaining useful life prediction of lubricating oil with HMM. Wear 2017, 376–377, 1227–1233. [Google Scholar] [CrossRef]
  7. Fan, B.; Li, B.; Feng, S.; Mao, J.; Xie, Y.B. Modeling and experimental investigations on relationship between wear debris concentration and wear rate lubrication systems. Tribol. Int. 2017, 109, 114–123. [Google Scholar] [CrossRef]
  8. Wang, W. A prognosis models for wear prediction based on oil-based monitoring. J. Oper. Res. Soc. 2007, 58, 887–893. [Google Scholar] [CrossRef]
  9. Vališ, D.; Žák, L.; Pokora, O. Failure prediction of diesel engine based on occurrence of selected wear particles in oil. Eng. Fail. Anal. 2015, 256, 501–511. [Google Scholar] [CrossRef]
  10. Zheng, C.S.; Liu, P.; Liu, Y.; Zhang, Z.L. Oil-based maintenance interval optimization for power-shift steering transmission. Adv. Mech. Eng. 2018, 10, 1–8. [Google Scholar] [CrossRef]
  11. Pradhan, D.; Mishra, A.K. Analysis of ISO VG 68 bearing oil for condition monitoring collected from an externally pressurized ball bearing system. Mater. Today Proc. 2021, 44, 4602–4606. [Google Scholar] [CrossRef]
  12. Wang, W.; Hussin, B. Plant residual time modelling based on observed variables in oil samples. J. Oper. Res. Soc. 2009, 60, 789–796. [Google Scholar] [CrossRef]
  13. Macián, V.; Tormos, B.; Olmeda, P.; Montoro, L. Analytical approach to wear rate determination for internal combustion engine condition monitoring based on oil analysis. Tribol. Int. 2003, 36, 771–776. [Google Scholar] [CrossRef]
  14. Bagshawa, J.A.; Fox, M.F.; Jones, C.J.; Picken, D.J.; Seare, K.D.R. The continuous flow stirred tank reactor (CFSTR) model and used oil volume corrections in condition monitoring. Tribol. Int. 1997, 30, 271–274. [Google Scholar] [CrossRef]
  15. Feng, S.; Fan, B.; Mao, J.; Xie, Y. Prediction on wear of a spur gearbox by on-line wear debris concentration monitoring. Wear 2015, 336–337, 1–8. [Google Scholar] [CrossRef]
  16. Wang, C.J.; Hu, Q.P.; Yu, D. Reliability evaluation under accelerated degradation testing with recovery capability considered. In Proceedings of the 2019 Annual Reliability and Maintainability Symposium (RAMS), Orlando, FL, USA, 28-31 January 2019; pp. 1–5. [Google Scholar]
  17. Xu, T.F.; Yan, X.P.; Sheng, C.X. The research of avoiding the disturbing of oil changing in spectroscopic analysis on diesel engine oil. Lub. Eng. 2006, 7, 71–75. [Google Scholar]
  18. Ji, S.D.; Yang, T.J.; Liu, X.H.; Pei, W. The treatment of changing oil disturbance in diesel engine oil spectrum analysis. Veh. Eng. 2011, 2, 86–89. [Google Scholar]
  19. Ji, S.D.; Xu, S.Y.; Liu, X.H.; Yang, T.J. The pretreatment of spectral analysis data in diesel engine oil diagnosis. Lub. Eng 2011, 36, 105–108. [Google Scholar]
  20. Gao, J.W.; Zhang, Y.T.; Ren, G.Q.; Zhang, X. Study on diesel engine oil spectrum analysis and prediction model. Des. Manuf. Diesel Engine 2004, 3, 26–28. [Google Scholar]
  21. Chong, G. Smoothing Spline ANOVA Models; Springer: Berlin/Heidelberg, Germany, 2002. [Google Scholar]
Figure 1. The concentration of Cu in the power shift steering transmission of Vehicle 1.
Figure 1. The concentration of Cu in the power shift steering transmission of Vehicle 1.
Applsci 15 06531 g001
Figure 2. The implementation procedure of our methodology.
Figure 2. The implementation procedure of our methodology.
Applsci 15 06531 g002
Figure 3. Scenario (I): The results obtained by the non-parametric approach and the piecewise linear method.
Figure 3. Scenario (I): The results obtained by the non-parametric approach and the piecewise linear method.
Applsci 15 06531 g003
Figure 4. Scenario (II): The results obtained by the non-parametric approach and the piecewise linear method.
Figure 4. Scenario (II): The results obtained by the non-parametric approach and the piecewise linear method.
Applsci 15 06531 g004
Figure 5. Vehicle 1: the relationship between time and degradation.
Figure 5. Vehicle 1: the relationship between time and degradation.
Applsci 15 06531 g005
Figure 6. Vehicle 2: the relationship between time and degradation.
Figure 6. Vehicle 2: the relationship between time and degradation.
Applsci 15 06531 g006
Figure 7. The performance comparison for Vehicle 1.
Figure 7. The performance comparison for Vehicle 1.
Applsci 15 06531 g007
Figure 8. The performance comparison for Vehicle 2.
Figure 8. The performance comparison for Vehicle 2.
Applsci 15 06531 g008
Table 1. Nomenclature: summary of all variables and mathematical symbols.
Table 1. Nomenclature: summary of all variables and mathematical symbols.
DescriptionSymbol
The ith observed time t i
Actual physical degradation at ith observed time h i
Observed degradation with observed error at ith observed time y i
Observed degradation without observed error at ith observed time f i
Observed error at ith observed time ε i
Indicator of whether the fresh oil is added δ i
Dilution effect of fresh oil at ith observed time z i
Mean of the dilution effect α
Number of observationn
Standard deviation of observed errors σ
Table 2. Scenario (I): The results obtained by the parametric approach for Case I, Case II, and Case III.
Table 2. Scenario (I): The results obtained by the parametric approach for Case I, Case II, and Case III.
Data TypeParameterTrue ValueMean of EstimationsSDMSE
Case Ia109.99910.26110.0682
b3029.91610.45830.2171
σ 0.50.47880.05680.0037
Case IIa1010.01750.29620.0881
b3030.04011.29641.6823
α 0.60.59850.01580.0003
σ 0.50.47890.05510.0035
Case IIIa1010.02770.30530.0939
b3029.51392.50236.4975
α 0.60.60040.01890.0003
σ 0.50.49070.06810.0047
Table 3. Scenario (II): The results obtained by the parametric approach for Case I, Case II, and Case III.
Table 3. Scenario (II): The results obtained by the parametric approach for Case I, Case II, and Case III.
Data TypeParameterTrue ValueMean of EstimationsSDMSE
Case Ia1212.04950.45880.2129
b2827.87620.58910.3623
c0.50.50170.01750.0003
σ 0.50.46990.05540.0040
Case IIa1212.04510.46790.2210
b2827.88831.46272.1520
c0.50.50180.02110.0004
α 0.60.60050.01480.0002
σ 0.50.47040.05350.0037
Case IIIa1212.04350.48710.2392
b2827.52322.55416.7508
c0.50.52270.10920.0124
α 0.60.60190.01560.0002
σ 0.50.49050.10770.0117
Table 4. Scenario (I): The coverage probabilities for h ( t ) with t = 0.25 , 0.50 , 0.75 , and 1.00 .
Table 4. Scenario (I): The coverage probabilities for h ( t ) with t = 0.25 , 0.50 , 0.75 , and 1.00 .
TimeThe Parametric ApproachThe Non-Parametric Approach
Case ICase IICase ICase II
0.250.9510.9610.9480.946
0.500.9600.9510.9530.946
0.750.9700.9610.9510.949
1.000.9620.9520.9520.947
Table 5. Scenario (II): The coverage probabilities for h ( t ) with t = 0.25 , 0.50 , 0.75 , and 1.00 .
Table 5. Scenario (II): The coverage probabilities for h ( t ) with t = 0.25 , 0.50 , 0.75 , and 1.00 .
TimeThe Parametric ApproachThe Non-Parametric Approach
Case ICase IICase ICase II
0.250.9500.9600.9480.945
0.500.9530.9590.9520.948
0.750.9620.9560.9500.951
1.000.9500.9520.9650.943
Table 6. The observed data for Vehicle 1.
Table 6. The observed data for Vehicle 1.
Serial NumberSampling Time [h]Observed Concentration [ppm]
1027.7000
213.540.1500
32649.6500
44157.9000
55047.3000
663.543.9500
77642.9500
89144.7500
910045.9000
10113.523.6500
1112626.6500
1214125.4500
1315034.2500
14163.523.7500
1517627.7000
1619130.0500
1720037.2500
18213.523.0000
1922625.5000
2024119.1000
2125022.7000
22263.523.6000
2327618.8000
2429119.8000
2530020.4000
Table 7. The observed data for Vehicle 2.
Table 7. The observed data for Vehicle 2.
Serial NumberSampling Time [h]Observed Concentration [ppm]
12.515.0000
21527.4000
31832.1250
42441.1500
53039.8500
637.548.2750
742.550.6500
84547.2000
95050.1250
1057.556.8000
116559.2500
126860.3000
138067.1250
1482.567.2250
1587.570.1500
1692.570.5750
179573.0250
1810072.4750
19102.576.9500
20107.579.7667
2111567.4750
2213071.1250
23132.574.7250
24137.572.5250
25142.578.6750
2614575.4000
2715080.1500
28152.582.3750
29157.585.2250
3016582.1750
3118095.5500
32182.5104.7750
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, Y.; Shi, J. Modeling of Distorted Degradation Data Based on Oil Analysis. Appl. Sci. 2025, 15, 6531. https://doi.org/10.3390/app15126531

AMA Style

Chen Y, Shi J. Modeling of Distorted Degradation Data Based on Oil Analysis. Applied Sciences. 2025; 15(12):6531. https://doi.org/10.3390/app15126531

Chicago/Turabian Style

Chen, Yue, and Jian Shi. 2025. "Modeling of Distorted Degradation Data Based on Oil Analysis" Applied Sciences 15, no. 12: 6531. https://doi.org/10.3390/app15126531

APA Style

Chen, Y., & Shi, J. (2025). Modeling of Distorted Degradation Data Based on Oil Analysis. Applied Sciences, 15(12), 6531. https://doi.org/10.3390/app15126531

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop