1. Introduction
Complex equipment typically refers to precision equipment that has a complicated mechanical and electronic structure that allows for accurate measurement or control [
1,
2,
3]. Complex equipment has wide applications in manufacturing, medical, aerospace, communications, military equipment, and many other fields. Therefore, it is important to predict the health state of complex equipment to ensure its long-term stable operation [
4,
5,
6].
Currently, there are three main types of methods for complex equipment health-state assessment. (1) The first category is the model-based approach. The core idea of this method is to build a physical or mathematical model based on the failure modes and performance degradation mechanisms of the complex equipment and identify model parameters by reducing the error between actual and model outputs. Hu et al. [
7] utilized the great likelihood estimation method to estimate the model parameters, and established a wind power bearing performance degradation model based on the Wiener process. Zhang et al. [
8] established the state of predicted residuals between the weight optimization unscented Kalman filter (WOUKF) and the true capacity of the battery. (2) The second category relies on methodologies driven by data analysis. The core idea of the method is to acquire a large amount of complex equipment test data and then directly establish the mapping relationship between test statistics and health states. Jiang et al. [
9] proposed a fusion of a discrete entropy-based multiscale sequence aggregation scheme and a long short-term memory neural network to predict the aero-engine’s health evolution state. Chen et al. [
10] integrated the sparrow search intelligent algorithm to optimize the training effectiveness of the neural network, achieving an appropriate mean squared error between the predicted values and the actual values. (3) The third classification is based on fusion information methods. In most cases, a health-state assessment model can be developed for complex equipment based on quantitative data and qualitative knowledge. These methods mainly include: ① Belief Rule-Based approach (BRB), such as Zheng et al. [
11], utilized both quantitative and qualitative information to complete health assessment for complex systems. ② Fuzzy neural network approach, Khashi et al. [
12] based on the basic concepts of approximate nearest neighbor search (ANNs) and fuzzy regression modeling, proposed a new hybrid approach that allows for more accurate results in the presence of incomplete datasets. ③ The evidential reasoning approach. Based on the above analysis, “black-box” models built on quantitative data require qualitative knowledge to give them physical meaning for practical engineering applications. Methods based on semi-quantitative information enable the integrated use of quantitative data and qualitative knowledge, ensuring both the accuracy and interpretability of the assessment. As such, this paper develops a health-state assessment model for complex equipment based on a typical fusion information method, the evidential reasoning rule.
In 1994, Yang and Singh first proposed the evidential reasoning approach, which contains both quantitative and qualitative information, providing ideas for effectively solving multi-attribute decision-making problems [
13,
14,
15]. On this basis, Yang et al. proposed the evidential reasoning rule (ER rule) in 2013 [
16], which considers the weight of evidence and reliability, thereby enhancing the ER method’s capability to handle ambiguity, uncertainty, and incompleteness problems. Due to its excellent performance in integrating multi-source data and processing uncertain information, this paper establishes the health-state assessment model based on the ER rule.
However, the ER rule is valid only if the pieces of evidence are independent of one another; otherwise, they will not satisfy the commutative and associative laws, and the evidence fusion process cannot proceed. In the process of health-state assessment based on ER rule, health indicators need to be transformed into evidence and then fused. Due to the causality-informed correlation of subsystem-level health indicators, the problem of handling correlated evidence also arises. Currently, research on correlated evidence follows three directions. (1) Modifying the evidence fusion rule method. This method introduces a new evidence fusion rule that modifies existing evidence combination rules to eliminate the requirement for evidence independence [
17,
18,
19]. (2) Methods based on relevant source evidence models. This method posits that two pieces of evidence are relevant because they were both updated from the same evidence source. Therefore, its core principle is to prevent duplicate counting of the same evidence source [
20,
21,
22]. (3) Methods based on discount adjustment models. The core concept of this method involves applying a discounting approach to evidence, thereby transforming correlated evidence into independent evidence [
23,
24,
25].
Based on the assessment of the health state of complex equipment, analyze the correlated evidence theories mentioned above ①. The method of modifying fusion rules can theoretically solve the impact of relevant evidence, but its current proposed method only addresses the fusion of correlated evidence in certain special situations and lacks universality. ② Method based on relevant evidence sources is relatively simple in theoretical understanding and computationally efficient. However, determining the relevant or approximately relevant evidence sources in engineering practice remains a challenging problem. ③ Method based on discount adjustment model, despite the issue that the correlation coefficient may not accurately reflect the actual dependence between pieces of evidence, is convenient for engineering applications and suitable for various scenarios involving evidence correlation. Therefore, the method based on the discounting and correction model represents a more feasible solution at the current stage. Accordingly, this paper adopts the discount adjustment model-based approach to determine the discount factor according to the causality-informed correlation among subsystem-level health indicators, and integrates this method into the ER rule to establish a health-state assessment model.
Currently, health-state assessment methods tend to explore the causal relationships behind the features of the data [
26,
27,
28]. The purpose is to move from explaining complex “phenomena” from a statistical perspective to analyzing “causes” from a system dynamics perspective, thus improving the model’s learning efficiency and interpretability. For complex equipment, it is usually composed of multiple subsystems, and the subsystems work together in a coupled manner to accomplish a set task. Causal coupling is a relatively common type of coupling, where the output of one subsystem is the input of another subsystem. In this paper, the statistical correlation that results from the causal coupling relationship in the dynamics of the system is defined as a causality-informed correlation.
Below is a further explanation of causality-informed correlation from the perspective of complex equipment health-state assessment. The complex equipment health indicator system is typically structured into three levels: ① device-level indicator (primary indicator); ② subsystem-level indicators (secondary indicators); and ③ underlying indicators (tertiary indicators). Causality-informed correlation exists between subsystem-level indicators in the complex equipment indicator system when causal coupling is present in the system dynamics, i.e., the health information of one indicator is transmitted to another along the causal pathway, and in turn propagates further, ultimately leading to health information redundancy within the entire layer of subsystem-level indicators. Therefore, a key challenge is to identify the causal coupling relationships between subsystems and quantitatively analyze the strength of statistical dependence arising from such causal relationships, to reduce health information redundancy and improve the accuracy of health-state assessment.
Currently, there are three primary methods for conducting analysis of complex equipment internal causality coupling relationships: (1) the Granger causality analysis method. Cheng et al. [
29] proposed a Granger causality analysis method based on generalized radial basis function (GRBF) neural networks for fault root cause diagnosis in industrial systems, enhancing the accuracy of diagnostic results. Zhang and Wu [
30] proposed a graph neural network (GNN)-based bearing fault diagnosis method incorporating Granger causality, aiming to enhance the accuracy of bearing fault diagnosis under real-world operating conditions. (2) TF causal analysis method. Zhang et al. [
31] proposed a delay-sensitive causal inference method to mitigate alarm overload issues in industrial control systems (ICS). Liu et al. [
32] proposed a fault root cause analysis method based on Liang-Kleeman information flow, which can infer the location and cause of fault mechanisms by analyzing causal relationships between variables. (3) Convergent cross-mapping (CCM) method. Sharma et al. [
33] applied CCM technology to nonlinear dynamic systems for detecting process anomalies or failures and identifying their root causes. Tian et al. [
34] employed CCM to construct causal networks for root cause tracing in alarm systems of nonlinear industrial processes.
The three methods are compared as follows: First, the Granger causality analysis method is only applicable to linear systems. However, the system dynamic equations of complex equipment are typically nonlinear. Before applying the Granger method, the system dynamic equations must undergo linearization processing, a process that may result in the loss of critical information. Alternatively, one could use the nonlinear Granger causality method [
35]; however, this method is sensitive to noise and places a heavy computational burden on multivariate systems. Subsequently, although the TF method is applicable to nonlinear systems, it requires estimating complex probability density functions. For multivariate systems, this entails a significant computational burden and poses challenges in meeting the requirements for lightweight deployment in engineering applications. In summary, the CCM method is suitable for analyzing causal relationships within complex equipment due to its high applicability to nonlinear systems and the relatively low computational burden of state-space reconstruction. A more detailed comparison of causal inference methods will be conducted in
Section 6.1 using specific data.
Building upon the ER rule, this paper incorporates causality-informed correlation among subsystems to establish the evidential reasoning rule considering causality-informed correlation (ERr-CIC). The modeling process of the ERr-CIC model is illustrated in
Figure 1. First, we demonstrate how causal coupling relationships between subsystems lead to causality-informed correlation among subsystem-level health indicators (
Figure 1a). Second, the CCM method is employed to analyze causal coupling relationships among subsystems (
Figure 1b). Subsequently, based on linear or nonlinear coupling relationships between subsystem outputs, the conditional hybrid correlation coefficient (CHCC) is calculated to quantitatively assess the magnitude of causality-informed correlation (
Figure 1c). Finally, evidence fusion from the underlying health indicators to subsystem-level health indicators is performed based on the evidential reasoning rule to assess the health state of complex equipment (
Figure 1d).
The innovations of this study are highlighted in several aspects. First, the introduction of causality-informed correlation effectively quantifies the causal coupling between subsystems, reducing redundancy in health information and improving assessment accuracy. Second, the CCM method is employed to identify nonlinear causal relationships among subsystems of complex equipment, providing a scientific basis for health-state assessment. Third, a health-state assessment model based on ERr-CIC is proposed, which integrates multi-indicator information through a discount factor and a CHCC, with the fusion order derived from signaling sequences to effectively handle causal correlations. Moreover, sensitivity and robustness analyses are conducted to identify key parameters and evaluate the model’s reliability under parameter perturbations. Finally, experimental validation on a PAMD simulation device, along with comparisons to typical assessment methods, demonstrates the model’s comprehensive advantages in terms of stability, interpretability, and accuracy.
3. The Modeling Process
3.1. Causality-Informed Correlation Among Subsystem-Level Health Indicators
This section primarily addresses how causal coupling relationships in system dynamics lead to statistical causality-informed correlation, as illustrated in
Figure 1a. Below, causality-informed correlation is analyzed from the perspectives of system dynamics and health-state assessment.
- (1)
From the perspective of system dynamics
In the process of complex equipment health assessment, if there exists a causal relationship between subsystems, the entire system can be abstracted as a finite-dimensional coupled dynamical system:
where
denotes the global state vector and
is a continuously differentiable dynamical mapping.
denotes the derivative of the state vector with respect to time
t. Assume the equipment consists of
N subsystems
. The state vector can be partitioned as
where
represents the state vector of subsystem
, and
.
For subsystem
, the system dynamics can be decomposed into:
where
is dynamic functions for subsystem
.
denotes factors other than system state, such as environmental noise and external inputs. When subsystems
and
exhibit causal coupling relationships, there is:
This indicates that the state variables of subsystem
exert a direct dynamical influence on the evolution of subsystem
. It can be further expressed as:
where
denotes the state transfer function determined by system dynamics.
Based on system dynamics analysis, the following discussion addresses health-state assessment under conditions of causal relationships. Assume that the subsystem-level health indicator for subsystem
is
, and the underlying health indicators is
. These health indicators serve as a basis for reflecting the health state, and the subsystem health degradation evolution function is expressed as:
where
and
represent the observed values of
and
at time
t, respectively.
denotes the subsystem health degradation mapping function.
Remark 1.
The underlying health indicator is essentially part of the state vector . However, during health-state assessment, a subset of states from that are observable and sufficiently representative of the subsystem’s health condition are typically selected as health indicators. For clarity in subsequent reasoning, and are presented separately here.
Similarly, for subsystem
j, its health degradation evolution function is expressed as:
where
,
, and
represent the subsystem-level health indicators, underlying health indicators, and state vector for subsystem
, respectively. Since
depends on
, and
, therefore
will be affected by
.
- (2)
From the perspective of health assessment
As shown in the preceding analysis, subsystem-level indicator
is influenced by
. This relationship leads to redundancy in health information during the health-state assessment process, as explained below. Health state
can be understood as a latent variable reflecting the degradation of the equipment’s health. During the health-state assessment process, the amount of information that
and
can provide to
is
where
represents the entropy of
.
denotes the entropy of
given the observation of
and
.
is the mutual information between
,
and
, indicating the extent to which the uncertainty regarding
is reduced after observing
and
.
If
and
are independent of each other, then
can be approximated as:
However, since there is redundancy in health information between
and
, therefore
. Redundant health information is represented as
where
represents redundant health information between
and
. Since
, Equation (
10) can be rewritten as
It is evident that
essentially depends on the degree of information dependence between
and
, i.e., the magnitude of the mutual information
and
is directionless. At the same time,
satisfies the mapping relationship described below.
where
denotes the correlation coefficient of
and
.
indicates the mapping function from
to
.
Therefore, redundant health information can be reflected by the correlation coefficient. Based on the above analysis, the causal relationship between and leads to redundancy in health information during the health-state assessment process, and this redundant health information can be reflected by the correlation coefficient. This correlation is referred to as causality-informed correlation in this paper.
Based on the above analysis, calculating causality-informed correlation between subsystems requires addressing the following two problems: ① Determining whether dynamic causal coupling relationships exist between subsystems; ② Quantifying the magnitude of causality-informed correlation among health indicators.
3.2. Convergent Cross-Mapping for Causal Relationships Inference
Sugihara proposed the convergent cross-mapping (CCM) method in 2012. For complex equipment, the intricate coupling relationships among internal subsystems often make it difficult to establish accurate parametric models. CCM can detect directed causal relationships within coupled dynamic systems without requiring explicit parametric models, making it well-suited for identifying causal relationships between subsystems in complex equipment.
Based on the findings of Butler [
36] and Cummins [
37] on the applicability of state-space reconstruction (SSR) and convergent cross-mapping (CCM), the conditions for subsystems to be eligible for causal coupling analysis are established as follows: Condition ①: Each subsystem performs a single function, meaning a subsystem ultimately has only one output. The main purpose of dividing subsystems according to a single output is to ensure the clarity of the causal analysis chain, thereby making it easier to identify the dominant factor at any given moment. Condition ②: No closed-loop feedback structures exist between subsystems. According to Butler’s research, if there is a feedback loop between two objects, their influences on each other will be coupled, making it impossible to distinguish which one is the driving factor, thereby rendering causal analysis meaningless. Condition ③: External inputs to a subsystem can be treated as constants, i.e.,
, where
is a constant parameter. If the system is subject to external inputs that change over time, it is impossible to form a stable manifold, and consequently, state-space reconstruction cannot be performed, rendering the basic conditions for the CCM method invalid.
The process of determining the causal coupling relationship within the subsystem is illustrated in
Figure 1b. Assume that the time series generated by projecting systems
and
onto a one-dimensional space is:
,
. According to the Takens embedding theorem [
38], let the embedding dimension be
m and the delay time be
. The reconstructed state vector is represented as:
where
m and
are fixed positive integers.
m is determined using the pseudo-neighborhood method [
39], and
is determined using the average mutual information method [
40]. The reconstructed manifolds are, respectively:
,
.
According to research by Butler et al., when the reconstructed state space satisfies both Auto-predictability fraction (AF) and Recurrence fraction (RF) metrics on top of meeting the conditions mentioned earlier, CCM can be employed to determine causal relationships. Specifically, AF and RF should approach 1. For detailed calculation methods, please refer to reference [
36], and further elaboration is omitted here.
For a specific time
t, find the
nearest neighbor points
on
that are closest to
, with the corresponding time index being
. Map
to
, where the corresponding sample point is
. Calculate the estimated value
of
.
where
denotes Euclidean distance.
represents the nearest neighbor distance. Define
as the cross-mapping from
to
of
. Calculate the correlation coefficient
between
and
using the equation:
where
Z denotes the sample length.
,
represent the mean of the actual values and the mean of the estimated values, respectively. As the sample length
Z increases,
gradually converges toward
, and the correlation coefficient ultimately converges to [0, 1]. This convergence suggests the existence of a causal relationship from subsystem
to subsystem
, represented as
; The coefficient
is then computed to assess the reverse causality. Specifically, if
, there is
. Conversely, if
, no causal relationship is inferred between
and
, denoted as
. This paper represents the causal relationships between subsystems in the form of a directed acyclic graph (DAG), referred to as a causal relationship graph.
Definition 1.
Subsystem causal relationship graph , is the set of vertices. corresponds to subsystem ; is the set of directed edges connecting the vertices, used to indicate the causal direction between two subsystems.
For example, suppose the causal relationships among subsystems
,
, and
are as follows:
,
,
. Then the causal relationship graph of the subsystems is represented as
Figure 2.
3.3. Calculation of Conditionally Hybrid Correlation Coefficients
In
Section 3.2, the analysis of causal relationships among subsystems is conducted using the CCM method. However, CCM can only determine the direction of causality and cannot quantitatively analyze the magnitude of the causal driving effect between subsystems. This poses a challenge for all current causal analysis methods. However, for the subsequent health-state assessment task, what is needed is not the magnitude of the causal effect itself, but the strength of statistical dependence among subsystem-level indicators induced by the identified coupling structure. This dependence encompasses both linear and nonlinear correlations. This paper utilizes CHCC to represent the correlation between indicators. The calculation process is shown in
Figure 1c.
Assume the causal relationship between subsystems
and
is
. The one-dimensional projection of state space of subsystems
and
are
and
. From the perspective of system dynamics, if the relationship between
and
is linear, which means
and
satisfy the relation equation:
where
a,
b are arbitrary constants. Then Pearson’s correlation coefficient is used to calculate the correlation coefficient between
and
.
where
denotes linear correlation coefficient.
and
indicate the average of
and
, respectively.
,
stand for the variance of
and
.
When there is a nonlinear relationship between
and
, the correlation coefficient is calculated using the empirical distance covariance [
41]:
where
denotes nonlinear correlation coefficient.
,
,
stand for the empirical distance covariance between
,
, and
, respectively.
Specifically, the calculation of the correlation coefficient is determined by the function
:
where
is a generalized representation of the correlation coefficient between
and
.
In practice, the following methods can be used to determine whether the relationship between
and
is linear or nonlinear. First, perform a linear regression fit on
to obtain the residual sequence
.
where
represents the fitted value.
indicates the residual sequence. If
and
are linearly related, then
should not contain any systematic structure. Calculate the empirical distance correlation coefficient between
and
. The criterion for determining whether the relationship between
and
is linear or nonlinear is
represents a minimum value constant.
Without loss of generality, define the projection vector of the
N subsystems
as
. The correlation matrix which denoted as
K is obtained as follows:
The initial correlation coefficient
for subsystem
is:
Then the CHCC
of
is:
Remark 2.
The parameter increases monotonically with the indicator correlation strength. When calculating the discount factor for an indicator, is inverted and normalized so that the larger the value of , the smaller the value of , and accordingly, the discount for that indicator is approximately larger. takes values in the range of [0, 1]. = 0 means that the health information of subsystem can be completely represented by other subsystems. = 1 means that subsystem is completely independent from other subsystems. means that subsystem has causality-informed correlation with other subsystems.
3.4. Calculation of Other Parameters
For the subsystem with causality-informed correlation, it consists of mutually independent underlying indicators . In the ER rule, the process of transforming indicators into evidence requires the determination of three other key parameters: confidence level, weight, and reliability.
Assume there are M health grade, represented as:
.
is called the assessment framework.
where
represents the probability that indicator
is assessed as health grade
at time
t, which is called confidence level in the ER rule.
represents the reference value for health grade
and meets
.
represents the monitored value of indicator
at time
t. Through the above processing, health indicator
monitoring data can be transformed into an evidence-distribution form.
where
represents the evidence distribution of indicator
at time
t.
The coefficient of the variation-based weighting (CVBW) can effectively capture the fluctuation of indicators [
42], which reflects the level of attention given to each indicator. Therefore, this study employs CVBW to calculate the weights of the underlying indicators.
Indicator weight
is calculated as:
where
,
denote the mean and standard deviation of the monitoring data of
, respectively.
represents the coefficient of variation.
The reliability of the evidence
consists of a static reliability
and a dynamic reliability
. The exact value of the static reliability
is given by the expert. Dynamic reliability
is calculated using the distance-based calculation method [
43]. As indicated in reference [
44], the evidence reliability
r can be derived by combining the static reliability
and dynamic reliability
through the perturbation coefficient. The specific calculation method is beyond the focus of this paper and will not be discussed further.
3.5. Evidence Fusion Process
This section obtains the health-state assessment results of complex equipment through evidence fusion from underlying indicators to subsystem-level indicators, as shown in
Figure 1d.
According to the ER rule [
16], before performing evidence fusion, the evidence needs to be converted into the weighted belief distribution with reliability (WBDR) form. The elements of WBDR are called basic probability mass, and the basic probability mass of evidence
is represented as
where
denotes the power set of
.
indicates the hybrid weight of health indicator
.
The WBDR can be represented by
The evidence is integrated in a pairwise fusion manner, and the specific process of the final fusion is shown as Equations (31)–(33).
where
denote the unnormalized basic probability mass after fusing the
health indicators of subsystem
and assessed as health grade
.
represents the unnormalized basic probability mass of the power set. Similarly,
and
indicate the fusion results of
indicators.
is the normalized basic probability mass.
The confidence level for health grade
after fusing
underlying indicators is
According to Equation (
28), the hybrid weight after the fusion of
indicators is calculated as
where
denote the hybrid weight after the fusion of
indicators.
According to Equation (
35), the fusion results of the hybrid weights of the underlying indicators of each subsystem
can be obtained. Normalize the above hybrid weights to obtain subsystem-level health indicator weights.
where
indicates the weight of subsystem-level health indicator
.
Remark 3.
If the hybrid weights obtained from the fusion of underlying indicators are directly used as the weights for subsystem-level health indicators, it is essentially equivalent to directly fusing the underlying indicators of all subsystems. Reference [45] demonstrates that when using the ER rule for evidence fusion, the more evidence is fused, the greater the risk of overfitting the fusion result. Therefore, this paper adopts normalization to reduce the risk of overfitting. In summary, the WBDR of subsystem-level health indicator
can be expressed as:
where
denote the WBDR of subsystem-level health indicator
.
represents the basic probability mass of
being assessed at health grade
.
indicates the CHCC of
.
By repeating the fusion process of Equations (31)–(33), performing evidence fusion on subsystem-level health indicators. The fusion process is abbreviated as follows:
where
denotes the basic probability mass assigned to the health grade
after the fusion of
N subsystem-level health indicators. ⊗ represents the fusion symbol.
Based on Equation (
34), the confidence level for health grade
after fusing
N subsystem-level indicators is
Ultimately, it can yield the evidence distribution (or referred to as the health-state distribution) results of complex equipment.
where
denotes the evidence distribution of complex equipment.
represents the confidence level that complex equipment is assessed as health grade
.
represents the fusion of
N subsystem-level health indicators.
Since the evidence is correlated, the evidence fusion will no longer satisfy the law of exchange and the law of union. The problem of determining the fusion order is needed when evidence with correlation is fused [
46]. This paper proposes a method to determine the fusion order based on the signaling sequence.
Let the signaling relationship between the subsystems with causality-informed correlation be: the output of is the input of and the output of serve as the input of …Then, the signaling sequence is: . denotes the signaling sequence number of subsystem .
Let
denote the fusion order of
, and
represents that piece of evidence is fused first. Then there are:
The fusion order reflects the priority assigned to each indicator, where a higher priority indicates greater attention. For indicators with causality-informed correlation, the health information of upstream indicators in the signaling sequence is transmitted downstream. Accordingly, upstream indicators should be assigned higher priority. The fusion order follows the signaling sequence derived from the causal relationship graph and equipment working mechanism, corresponding to a topological ordering that ensures causally consistent information propagation and gives sufficient priority to upstream indicators.
The distribution form shown in Equation (
41) is not conducive to model optimization or comparison. Therefore, further defuzzification processing is required to convert the distribution form into a deterministic numerical form. According to the utility theory proposed by Yang [
14], the output utility of complex equipment is:
where
is the utility value of health grade
and is typically obtained based on expert knowledge or statistical results from actual engineering data.
indicates the output utility of the complex equipment at time
t.
3.6. Optimization of Model Parameters
The parameters of ERr-CIC , , and are calculated based on the indicator monitoring data or determined by expert knowledge. However, the adaptability of the model parameters decreases due to factors such as disturbances and environmental changes that are inevitable during complex equipment health monitoring. Therefore, the model needs to be trained with multiple sets of data to optimize the model parameters.
Mean squared error (MSE) is widely used for evaluating model accuracy. It quantifies the precision of the ERr-CIC model by calculating the average of the squares of the difference between the actual output utility
and the expected output utility
.
is typically determined by domain experts based on the evaluation scenario, historical statistical analysis, and industry standards. The optimization objective functions are defined as:
The corresponding parameter constraints are shown in Equations (45)–(47).
6. Comparison Experiment
To further illustrate the feasibility and superiority of the method proposed in the article, this paper conducts comparative experiments from two aspects: the comparison of causal inference methods and the health-state assessment methods.
6.1. Comparison of Causal Inference Methods
To justify the use of the CCM method, this section compares it with two mainstream approaches for causal inference in nonlinear dynamic systems: transfer entropy (TE) and nonlinear Granger (NG).
The TF of
is defined as
where
,
represent the output data of the subsystems
and
with lengths
l and
k, respectively. In this study,
was adopted to ensure robust probability estimation under limited samples and to provide a fair baseline comparison with CCM. The joint and conditional probabilities were estimated using a histogram-based discretization method. When
occurs, then
holds.
For the NG method, a restricted model and an unrestricted model were constructed:
where
b represents the lag order. In this study, the lag order was set to
for all subsystem pairs to balance predictive capability and estimation stability under the limited sample size.
are nonlinear prediction functions.
,
are the residuals of the two models, respectively. In this study, Gaussian-kernel nonlinear regression was used to implement both models.
indicates that the addition of the historical information of reduces the prediction error of , implying holds. To evaluate statistical significance, a permutation test was further performed by randomly shuffling the source series and recalculating the causality statistic to generate the null distribution.
The causal relationship graph obtained by methods CCM, TF, and NG is shown in
Figure 7.
As shown in
Figure 7, CCM and TF produce the same causal graph, whereas the NG method misses the edge from
to
. From the PAMD working mechanism, the three subsystems operate in a serial signal-processing chain and are dynamically coupled. In such a nonlinear coupled system, the state information of upstream subsystems can propagate through intermediate subsystems and remain embedded in the dynamics of downstream subsystems. In this sense, the edge
is physically and dynamically reasonable. In summary, from the perspectives of consistency in results and mechanistic analysis, the causal relationship graph obtained using the CCM and TF methods is more reliable.
The three methods were then compared in terms of computational efficiency and the accuracy of health-state assessments. Computational efficiency is defined as the runtime of each method, recorded over 20 independent runs, with the mean and variance of runtime subsequently calculated. Health-state assessment accuracy is compared using the following process: First, the CHCC is calculated separately for each causal relationship graph derived from each method. Second, the CHCC is input into the ERr-CIC model, and health-state assessment is performed using the underlying indicator monitoring data obtained in
Section 5.2, following the same model optimization process. Finally, the RMSE between the model’s output utility and the expected output utility is compared.
The comparison results are shown in
Table 4.
The NG method achieved the lowest accuracy in health-state assessment, with an RMSE of 0.078. The primary reason for this may be that discrepancies in the causal relationship graph analysis led to errors in the CHCC calculation, which in turn affected the accuracy of the health-state assessment results. At the same time, the accuracy of the health-state assessment results further demonstrates the rationality of the CCM method and the TF method in analyzing causal relationship graphs. Although CCM and TF methods have the same assessment accuracy, the TF method requires probability density estimation, resulting in significantly lower computational efficiency compared to CCM. In summary, the CCM is suitable for the causal inference of nonlinear dynamical systems such as complex equipment.
6.2. Comparison of Health-State Assessment Methods
Comparison models include: ① CNN-Transformer model [
58]. This model uses a CNN network to achieve end-to-end feature extraction between health indicator monitoring data and output utility, and then utilizes the attention mechanism of the Transformer network to deeply capture nonlinear relationships, such as correlations between features, achieving health-state assessment of complex equipment. ② Graph Convolution Network (GCN) model [
59]. Compared with traditional deep learning methods, this model can learn from graphs, a non-matrix-based information representation method, which is suitable for capturing the complex causal relationship features from the subsystem causal relationship graph. ③ T-S fuzzy model [
60]. The T-S model, as a typical semi-quantitative information method, can effectively utilize quantitative monitoring data and qualitative knowledge and convert them into fuzzy rules to achieve complex equipment health-state assessment. The hyperparameters of the comparison models are shown in
Table 5.
To ensure a fair comparison, all models were constructed using the same input-output setting. Specifically, the input of all models consisted of the same six underlying indicators as shown in
Table 3, while the output was the same output utility
. All models used the same training and validation sets. Since the compared models belong to different methodological categories, their parameters were optimized using the solvers commonly adopted for each type of model. The CNN-Transformer and GCN models employed the Adam optimizer with a learning rate of
, which is reduced by a factor of 0.5 every 10 epochs. The momentum parameter is set to 0.9, and L2 regularization with a weight decay of
is applied to prevent overfitting. The T-S fuzzy model used the same SQP optimization algorithm as the ER model, with identical optimization parameter settings: the maximum number of iterations was set to
and the optimization terminates when the step size falls below
. The model convergence curve and training loss rate are shown in
Figure 8.
Firstly, the CNN-Transformer model exhibits a rapid decrease in error during the initial training phase, indicating its strong ability in feature extraction and fitting. However, there is a certain degree of oscillation in the middle and later stages, with a more pronounced fluctuation in the loss reduction rate, suggesting that the stability of its optimization process needs improvement. Secondly, the ERr-CIC model converges quickly overall, with a small gap between training and validation errors, demonstrating good generalization ability. Its loss reduction process is relatively smooth, with only slight fluctuations in the early stages, and then quickly stabilizes, indicating that the model optimization process is efficient and stable. For the GCN model, although there is a significant decrease in error during the initial training phase, the overall fluctuation is large, especially in the loss reduction rate curve, which shows frequent oscillations. This suggests that the model is susceptible to gradient fluctuations during the optimization process and has relatively weak stability. Finally, the T-S fuzzy model demonstrates the smoothest and most stable convergence process. Its training and validation errors decrease uniformly and quickly stabilize, with minimal fluctuation in the loss reduction rate, indicating that the model has good robustness and convergence performance during the optimization process. In summary, compared to other models, the T-S fuzzy model performs optimally in terms of convergence speed and stability, while the ERr-CIC model has certain advantages in terms of generalization ability; the CNN-Transformer and GCN models, although possessing strong fitting capabilities, still have room for improvement in training stability.
The comparison of model output performance results is shown in
Figure 9a. It can be seen that among the four models, the CNN-Transformer has the highest accuracy, with an RMSE of 0.030. The reason the GCN model, also as a deep learning method, performs poorly in terms of accuracy may be that it does not adequately capture the relationship between causal coupling in system dynamics and statistical correlations, resulting in biased evaluations.
To verify whether the differences in model performance are statistically significant and to minimize performance fluctuations caused by random factors, this paper conducts a Friedman test on the model results. The specific steps are as follows: ① The monitoring data obtained in subsection A are randomly split into training and validation sets over 20 independent trials. ② In each trial, the model uses the same training and validation sets. ③ Record the mean and standard deviation of RMSE for each trial. The distribution of the model results is shown in
Figure 10. It can be seen that the ERr-CIC model and the T-S fuzzy model produce more stable results. The statistics of 20 trials for each model are shown in
Table 6.
Based on the statistical measures derived from the above results, a Friedman test was conducted. Specifically, the models were first ranked according to their RMSE values from each run, with lower RMSE values corresponding to higher rankings. Subsequently, the average rank for each model was calculated as shown in Equation (
67).
where
represents the ranking of model
j in its
ith run.
indicates the average rank of the model
j. The Friedman statistic is expressed as:
where
denotes the number of runs.
represents the number of models.
approximately follows a chi-squared distribution with
degrees of freedom. After calculation,
, there is:
. In summary, the Friedman test indicates that the differences in RMSE among the compared models are statistically significant.
However, model performance cannot be fully assessed based solely on accuracy. Below, we conduct a comprehensive evaluation of each model, including interpretability, stability, efficiency, and accuracy four comparison metrics. ① Model accuracy is determined by the average RMSE of the results from 20 runs. ② Model stability is determined by the variance of the RMSE across 20 model runs. ③ Efficiency is assessed by the average time over 20 runs. ④ Model interpretability is assessed by 5 experts based on the traceability of output, and the comprehensibility of results. Each model was evaluated using the final converged parameters obtained after optimization. The results of the model comparison are shown in
Table 7.
To make the comparison results clearer, each result has been mapped to the 0–1 range, as shown in
Figure 9b. While ERr-CIC exhibits lower model accuracy compared to CNN-Transformer, it significantly outperforms it in terms of interpretability and efficiency. When making maintenance decisions, methods with high interpretability are preferred. At the same time, although the classic T-S model performs the worst in terms of model accuracy, it has advantages in efficiency and interpretability. Overall, the ERr-CIC model performs relatively evenly in all aspects, making it more suitable for assessing the health state of complex equipment with causality-informed correlation in engineering practice.