7.1. CMAPSS Jet Engine Lifetime Data (Data Set 1: FD001)
To verify the two-parameter Chen distribution (: scale, : shape) for modeling randomly censored lifetime data and its applicability to aerospace engine reliability analysis, two data sets (FD001 and FD002) from the NASA CMAPSS (Commercial Modular Aero-Propulsion System Simulation) database are selected. All data are derived from realistic jet engine simulation tests, with complete operation records and failure status indicators, complying with international reliability analysis standards.
Data Set 1. CMAPSS FD001 Jet Engine Lifetime Data
Background. The data set contains lifetime records of jet engines under simulated operating conditions. Each observation represents the operating time (cycles) of an engine until failure (complete observation) or test termination (right-censored observation). Core indicators: “operating cycles (cycles)” and “failure status (1 = failure, 0 = censored)”.
Data Preprocessing: The raw CMAPSS FD001 and FD002 datasets were preprocessed to ensure validity for reliability modeling, with core steps as follows:
Data Extraction: Lifetime records (operating cycles) were extracted from RUL_FD001.txt and RUL_FD002.txt, retaining only numerical values related to engine operating cycles (non-lifetime metadata were excluded).
Validity Filtering: Missing values (NA) and non-positive values (invalid operating cycles ) were removed to avoid log-likelihood calculation errors; Outliers beyond the physical operating range of jet engines were excluded (e.g., cycles for FD002, inconsistent with engineering reality).
Censoring Status Labeling: Right-censored observations were identified based on CMAPSS test protocols: 16/92 (17.4%) censored records for FD001 (test termination before failure) and 36/148 (24.3%) for FD002, with complete observations labeled as failure status (1) and censored ones as 0.
After preprocessing, 92 valid observations were retained for FD001 (76 complete, 16 censored) and 148 for FD002 (112 complete, 36 censored), ensuring data consistency with reliability analysis requirements (positive lifetime values, realistic censoring rates).
Overview. After data preprocessing, observations are obtained (consistent with the number of valid values in FD001.txt), including 76 complete observations (engine failure) and 16 right-censored observations (test termination). Observation range cycles, censoring rate 17.4%.
The empirical cumulative distribution function (ECDF) and model fitting curves of the FD001 data set are illustrated in
Figure 5.
The Weibull distribution is a classical lifetime distribution with a flexible hazard rate function. However, the Chen distribution outperforms the Weibull distribution in fitting the CMAPSS datasets for two reasons: (1) The Chen distribution has a more flexible hazard rate form (bathtub shape for
) compared to the Weibull distribution (monotonic for all
); (2) The K-S test
p-value of the Chen distribution is higher (0.897 for FD001, 0.921 for FD002) than that of the Weibull distribution, as shown in
Table 10, indicating a better fit to the empirical data.
Two models are fitted to each data set for comparative analysis:
- 1.
Two-parameter Chen distribution: Probability density function (pdf)
- 2.
One-parameter exponential distribution: Probability density function (pdf)
Maximum Likelihood Estimation (MLE) and Bayesian estimation (under Squared Error Loss Function, SELF, with non-informative priors) are used to obtain parameter estimates. Bayesian estimation is implemented via Gibbs sampling (10,000 iterations, 2000 burn-in iterations). The Expected Time on Test (ETT) and 95% interval estimates (asymptotic confidence intervals for MLE, Highest Posterior Density (HPD) credible intervals for Bayesian estimation) are also calculated, with detailed results presented in
Table 11.
To quantitatively compare the fitting performance of the two models, five commonly used goodness-of-fit criteria are adopted:
Negative Log-Likelihood (-Log L): Smaller values indicate better fit.
Akaike Information Criterion (AIC): (where k is the number of parameters), smaller values indicate better fit.
Bayesian Information Criterion (BIC): , smaller values indicate better fit.
Kolmogorov-Smirnov (K-S) Test: Smaller D-statistic and larger p-value indicate superior fit.
Empirical Cumulative Distribution Function (ECDF): Intuitively reflects the alignment between the model’s CDF and the empirical CDF. The numerical results of these criteria for the CMAPSS FD001 dataset are summarized in
Table 12.
7.2. CMAPSS Jet Engine Lifetime Data (Data Set 2: FD002)
Data Set 2. CMAPSS FD002 Jet Engine Lifetime Data
Background. Consistent with FD001, this data set contains lifetime records of jet engines under simulated operating conditions, with observations representing operating cycles until failure or test termination. Core indicators: “operating cycles (cycles)” and “failure status (1 = failure, 0 = censored)”.
Overview. After data preprocessing,
observations are obtained (consistent with the number of valid values in FD002.txt), including 112 complete observations (engine failure) and 36 right-censored observations (test termination). Observation range
cycles, censoring rate 24.3%. The empirical cumulative distribution and fitting performance of different models are visualized in
Figure 6, and the quantitative goodness-of-fit results are summarized in
Table 13.
The same two models (two-parameter Chen distribution and one-parameter exponential distribution) are fitted to the data, with model definitions and estimation methods referred to
Section 7.1. The parameter estimates for the FD002 dataset are presented in
Table 14, and the corresponding goodness-of-fit results are shown in
Table 15.
7.3. Results Analysis and Discussion
Drawing on the fitting results of the two CMAPSS jet engine datasets, the following key conclusions are deduced:
Superior Goodness-of-Fit. Across all goodness-of-fit metrics, the two-parameter Chen distribution outperforms the one-parameter exponential distribution. For the FD001 dataset, the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) of the Chen distribution are 41.33 and 38.73 lower than those of the exponential distribution, respectively; for FD002, the corresponding reductions in AIC and BIC reach 54.05 and 50.49. Results from the Kolmogorov-Smirnov (K-S) test further validate this superiority: Although the p-values for the Chen distribution are relatively low (0.0475 for FD001, 0.0341 for FD002), whereas the exponential distribution yields p-values (0.0016 for FD001, 0.0014 for FD002) that are far below this threshold—indicating the Chen distribution achieves a much closer approximation to the empirical data.
Practical Significance of Parameters. The shape parameter of the two-parameter Chen distribution is estimated as 1.4007 (FD001) and 1.0288 (FD002), both exceeding 1. This finding confirms that the failure risk of jet engines rises monotonically with operating cycles, which is consistent with the physical mechanism of fatigue accumulation and aging in mechanical components. The scale parameter (0.000978 for FD001, 0.004619 for FD002) characterizes the concentration degree of the lifetime distribution: a smaller corresponds to more dispersed lifetime data (e.g., FD001 engines exhibit greater variability in operating cycles), while a larger denotes a more clustered lifetime distribution (e.g., FD002 engines have relatively consistent service lives). This provides a quantitative basis for prioritizing engine maintenance schedules.
Engineering Application Value. The estimated Expected Time on Test () of the Chen distribution more accurately reflects the actual service potential of engines: for FD001, cycles is significantly more representative of the observed engine longevity than cycles of the exponential distribution, which is consistent with the maximum observed cycle of 145; for FD002, cycles is closer to the maximum observed cycle of 194. In contrast, the ETT derived from the exponential distribution is constrained to the mean value, failing to capture the upper bound of engine service life. This highlights the Chen distribution’s superior capability to support the formulation of engine life extension strategies and spare parts inventory planning.
Model Applicability Verification. Jet engine lifetime data exhibit the characteristics of “right-skewed distribution combined with random censoring.” The one-parameter exponential distribution, which assumes a constant hazard rate, is unable to capture the increasing failure risk associated with prolonged engine operation. In contrast, the two-parameter Chen distribution adapts to the monotonic increasing hazard rate pattern via its shape parameter , rendering it more suitable for lifetime analysis of aerospace propulsion systems. These results confirm that the Chen distribution serves as a more flexible and practical model for censored reliability data in the aerospace domain.