A Causality-Informed Correlation-Aware Health-State Assessment for Complex Equipment

Li, Wenbo; Feng, Zhichao; Sun, Yijie; Zhang, Xinyi

doi:10.3390/e28050533

Open AccessArticle

A Causality-Informed Correlation-Aware Health-State Assessment for Complex Equipment

Graduate School of Rocket Force University of Engineering, Xi’an 710025, China

^*

Author to whom correspondence should be addressed.

Entropy 2026, 28(5), 533; https://doi.org/10.3390/e28050533

Submission received: 12 March 2026 / Revised: 20 April 2026 / Accepted: 24 April 2026 / Published: 7 May 2026

(This article belongs to the Special Issue Causal Representation Learning with Its Applications)

Download

Browse Figures

Versions Notes

Abstract

Health-state assessment is a critical component of prognostics and health management (PHM) for complex equipment. Previous studies on assessing the health state of complex equipment have overlooked the statistical dependence arising from causal coupling relationships between subsystems, which is defined as causality-informed correlation in this study. This correlation introduces redundancy in health information, leading to assessment bias. To address these limitations, this study proposes a health-state assessment model based on the evidential reasoning rule considering causality-informed correlation (ERr-CIC). First, the causal coupling relationships in dynamics and their effects on health-assessment results are analyzed. Based on this analysis, the convergent cross-mapping (CCM) method is employed to examine causal coupling between subsystems. Subsequently, a health-assessment model based on ERr-CIC is developed. This model incorporates a discount factor to quantify the causality-informed correlation among indicators, realized using a conditionally hybrid correlation coefficient (CHCC), and a fusion order derived from signaling sequences. Furthermore, a sensitivity and robustness analysis of the model output to the CHCC is conducted to identify the key parameters governing system behavior and to assess the reliability of the model results under parameter perturbations. Finally, experiments are performed on the PAMD simulation device for validation, and the proposed model is compared with three other typical health-state assessment models. The results show that the ERr-CIC model proposed in this paper achieves relatively balanced performance in terms of stability and interpretability while maintaining competitive model accuracy.

Keywords:

evidential reasoning rule; health-state assessment; causal coupling; convergent cross-mapping

1. Introduction

Complex equipment typically refers to precision equipment that has a complicated mechanical and electronic structure that allows for accurate measurement or control [1,2,3]. Complex equipment has wide applications in manufacturing, medical, aerospace, communications, military equipment, and many other fields. Therefore, it is important to predict the health state of complex equipment to ensure its long-term stable operation [4,5,6].

Currently, there are three main types of methods for complex equipment health-state assessment. (1) The first category is the model-based approach. The core idea of this method is to build a physical or mathematical model based on the failure modes and performance degradation mechanisms of the complex equipment and identify model parameters by reducing the error between actual and model outputs. Hu et al. [7] utilized the great likelihood estimation method to estimate the model parameters, and established a wind power bearing performance degradation model based on the Wiener process. Zhang et al. [8] established the state of predicted residuals between the weight optimization unscented Kalman filter (WOUKF) and the true capacity of the battery. (2) The second category relies on methodologies driven by data analysis. The core idea of the method is to acquire a large amount of complex equipment test data and then directly establish the mapping relationship between test statistics and health states. Jiang et al. [9] proposed a fusion of a discrete entropy-based multiscale sequence aggregation scheme and a long short-term memory neural network to predict the aero-engine’s health evolution state. Chen et al. [10] integrated the sparrow search intelligent algorithm to optimize the training effectiveness of the neural network, achieving an appropriate mean squared error between the predicted values and the actual values. (3) The third classification is based on fusion information methods. In most cases, a health-state assessment model can be developed for complex equipment based on quantitative data and qualitative knowledge. These methods mainly include: ① Belief Rule-Based approach (BRB), such as Zheng et al. [11], utilized both quantitative and qualitative information to complete health assessment for complex systems. ② Fuzzy neural network approach, Khashi et al. [12] based on the basic concepts of approximate nearest neighbor search (ANNs) and fuzzy regression modeling, proposed a new hybrid approach that allows for more accurate results in the presence of incomplete datasets. ③ The evidential reasoning approach. Based on the above analysis, “black-box” models built on quantitative data require qualitative knowledge to give them physical meaning for practical engineering applications. Methods based on semi-quantitative information enable the integrated use of quantitative data and qualitative knowledge, ensuring both the accuracy and interpretability of the assessment. As such, this paper develops a health-state assessment model for complex equipment based on a typical fusion information method, the evidential reasoning rule.

In 1994, Yang and Singh first proposed the evidential reasoning approach, which contains both quantitative and qualitative information, providing ideas for effectively solving multi-attribute decision-making problems [13,14,15]. On this basis, Yang et al. proposed the evidential reasoning rule (ER rule) in 2013 [16], which considers the weight of evidence and reliability, thereby enhancing the ER method’s capability to handle ambiguity, uncertainty, and incompleteness problems. Due to its excellent performance in integrating multi-source data and processing uncertain information, this paper establishes the health-state assessment model based on the ER rule.

However, the ER rule is valid only if the pieces of evidence are independent of one another; otherwise, they will not satisfy the commutative and associative laws, and the evidence fusion process cannot proceed. In the process of health-state assessment based on ER rule, health indicators need to be transformed into evidence and then fused. Due to the causality-informed correlation of subsystem-level health indicators, the problem of handling correlated evidence also arises. Currently, research on correlated evidence follows three directions. (1) Modifying the evidence fusion rule method. This method introduces a new evidence fusion rule that modifies existing evidence combination rules to eliminate the requirement for evidence independence [17,18,19]. (2) Methods based on relevant source evidence models. This method posits that two pieces of evidence are relevant because they were both updated from the same evidence source. Therefore, its core principle is to prevent duplicate counting of the same evidence source [20,21,22]. (3) Methods based on discount adjustment models. The core concept of this method involves applying a discounting approach to evidence, thereby transforming correlated evidence into independent evidence [23,24,25].

Based on the assessment of the health state of complex equipment, analyze the correlated evidence theories mentioned above ①. The method of modifying fusion rules can theoretically solve the impact of relevant evidence, but its current proposed method only addresses the fusion of correlated evidence in certain special situations and lacks universality. ② Method based on relevant evidence sources is relatively simple in theoretical understanding and computationally efficient. However, determining the relevant or approximately relevant evidence sources in engineering practice remains a challenging problem. ③ Method based on discount adjustment model, despite the issue that the correlation coefficient may not accurately reflect the actual dependence between pieces of evidence, is convenient for engineering applications and suitable for various scenarios involving evidence correlation. Therefore, the method based on the discounting and correction model represents a more feasible solution at the current stage. Accordingly, this paper adopts the discount adjustment model-based approach to determine the discount factor according to the causality-informed correlation among subsystem-level health indicators, and integrates this method into the ER rule to establish a health-state assessment model.

Currently, health-state assessment methods tend to explore the causal relationships behind the features of the data [26,27,28]. The purpose is to move from explaining complex “phenomena” from a statistical perspective to analyzing “causes” from a system dynamics perspective, thus improving the model’s learning efficiency and interpretability. For complex equipment, it is usually composed of multiple subsystems, and the subsystems work together in a coupled manner to accomplish a set task. Causal coupling is a relatively common type of coupling, where the output of one subsystem is the input of another subsystem. In this paper, the statistical correlation that results from the causal coupling relationship in the dynamics of the system is defined as a causality-informed correlation.

Below is a further explanation of causality-informed correlation from the perspective of complex equipment health-state assessment. The complex equipment health indicator system is typically structured into three levels: ① device-level indicator (primary indicator); ② subsystem-level indicators (secondary indicators); and ③ underlying indicators (tertiary indicators). Causality-informed correlation exists between subsystem-level indicators in the complex equipment indicator system when causal coupling is present in the system dynamics, i.e., the health information of one indicator is transmitted to another along the causal pathway, and in turn propagates further, ultimately leading to health information redundancy within the entire layer of subsystem-level indicators. Therefore, a key challenge is to identify the causal coupling relationships between subsystems and quantitatively analyze the strength of statistical dependence arising from such causal relationships, to reduce health information redundancy and improve the accuracy of health-state assessment.

Currently, there are three primary methods for conducting analysis of complex equipment internal causality coupling relationships: (1) the Granger causality analysis method. Cheng et al. [29] proposed a Granger causality analysis method based on generalized radial basis function (GRBF) neural networks for fault root cause diagnosis in industrial systems, enhancing the accuracy of diagnostic results. Zhang and Wu [30] proposed a graph neural network (GNN)-based bearing fault diagnosis method incorporating Granger causality, aiming to enhance the accuracy of bearing fault diagnosis under real-world operating conditions. (2) TF causal analysis method. Zhang et al. [31] proposed a delay-sensitive causal inference method to mitigate alarm overload issues in industrial control systems (ICS). Liu et al. [32] proposed a fault root cause analysis method based on Liang-Kleeman information flow, which can infer the location and cause of fault mechanisms by analyzing causal relationships between variables. (3) Convergent cross-mapping (CCM) method. Sharma et al. [33] applied CCM technology to nonlinear dynamic systems for detecting process anomalies or failures and identifying their root causes. Tian et al. [34] employed CCM to construct causal networks for root cause tracing in alarm systems of nonlinear industrial processes.

The three methods are compared as follows: First, the Granger causality analysis method is only applicable to linear systems. However, the system dynamic equations of complex equipment are typically nonlinear. Before applying the Granger method, the system dynamic equations must undergo linearization processing, a process that may result in the loss of critical information. Alternatively, one could use the nonlinear Granger causality method [35]; however, this method is sensitive to noise and places a heavy computational burden on multivariate systems. Subsequently, although the TF method is applicable to nonlinear systems, it requires estimating complex probability density functions. For multivariate systems, this entails a significant computational burden and poses challenges in meeting the requirements for lightweight deployment in engineering applications. In summary, the CCM method is suitable for analyzing causal relationships within complex equipment due to its high applicability to nonlinear systems and the relatively low computational burden of state-space reconstruction. A more detailed comparison of causal inference methods will be conducted in Section 6.1 using specific data.

Building upon the ER rule, this paper incorporates causality-informed correlation among subsystems to establish the evidential reasoning rule considering causality-informed correlation (ERr-CIC). The modeling process of the ERr-CIC model is illustrated in Figure 1. First, we demonstrate how causal coupling relationships between subsystems lead to causality-informed correlation among subsystem-level health indicators (Figure 1a). Second, the CCM method is employed to analyze causal coupling relationships among subsystems (Figure 1b). Subsequently, based on linear or nonlinear coupling relationships between subsystem outputs, the conditional hybrid correlation coefficient (CHCC) is calculated to quantitatively assess the magnitude of causality-informed correlation (Figure 1c). Finally, evidence fusion from the underlying health indicators to subsystem-level health indicators is performed based on the evidential reasoning rule to assess the health state of complex equipment (Figure 1d).

The innovations of this study are highlighted in several aspects. First, the introduction of causality-informed correlation effectively quantifies the causal coupling between subsystems, reducing redundancy in health information and improving assessment accuracy. Second, the CCM method is employed to identify nonlinear causal relationships among subsystems of complex equipment, providing a scientific basis for health-state assessment. Third, a health-state assessment model based on ERr-CIC is proposed, which integrates multi-indicator information through a discount factor and a CHCC, with the fusion order derived from signaling sequences to effectively handle causal correlations. Moreover, sensitivity and robustness analyses are conducted to identify key parameters and evaluate the model’s reliability under parameter perturbations. Finally, experimental validation on a PAMD simulation device, along with comparisons to typical assessment methods, demonstrates the model’s comprehensive advantages in terms of stability, interpretability, and accuracy.

2. Problem Formulation

This section outlines the two key problems addressed in this paper:

Problem 1: Traditional health-assessment models neglect the correlation between subsystem-level indicators. Excluding such causality-informed correlation in health-assessment methods results in redundant computation of health-state information during indicator fusion, which in turn leads to an overestimation problem that systematically degrades assessment accuracy. In this paper, the discount factor method is employed to address the causality-informed correlation among health indicators. In summary, the objective of problem 1 is to establish the following model:

O (\cdot) = Ξ (I (a_{1}), I (a_{2}), \dots, I (a_{m}), Γ, v)

(1)

where

O (\cdot)

denotes the final output of the complex equipment health-state assessment model;

I (a_{1}), I (a_{2}), \dots, I (a_{m})

denote the input values of health indicators

1, 2, \dots, m

;

Γ = δ_{i} ∣ i \in [1, N]

represents the discount factor that accounts for causality-informed correlation, with N being the number of subsystems; and

v

denotes the vector consisting of the other parameters in the health-assessment model. Problem 2: Current health-state assessment methods for complex equipment lack performance analysis specifically targeting models that account for causality-informed correlation. The objective of model performance analysis is to evaluate the impact of key parameters on model output, thereby providing data support for prognostics and health management (PHM). Analyzing the effect of causality-informed correlation among health indicators entails investigating the influence of the discount factor

Γ

on the health-state assessment outcome

O (\cdot)

in Equation (1). This task inevitably involves complex mathematical computations and in-depth analysis of internal mechanisms, rendering it more challenging to implement.

3. The Modeling Process

3.1. Causality-Informed Correlation Among Subsystem-Level Health Indicators

This section primarily addresses how causal coupling relationships in system dynamics lead to statistical causality-informed correlation, as illustrated in Figure 1a. Below, causality-informed correlation is analyzed from the perspectives of system dynamics and health-state assessment.

(1): From the perspective of system dynamics

In the process of complex equipment health assessment, if there exists a causal relationship between subsystems, the entire system can be abstracted as a finite-dimensional coupled dynamical system:

\dot{X} = F (X)

(2)

where

X \in R^{n}

denotes the global state vector and

F (\cdot) : R^{n} \to R^{n}

is a continuously differentiable dynamical mapping.

\dot{X} = \frac{d X}{d t}

denotes the derivative of the state vector with respect to time t. Assume the equipment consists of N subsystems

X_{1}, X_{2}, \dots, X_{N}

. The state vector can be partitioned as

X = (X_{1}, X_{2}, \dots, X_{N})

where

X_{i} \in R^{n_{i}}

represents the state vector of subsystem

X_{i}

, and

X_{i} = (x_{i 1}, x_{i 2}, \dots, x_{i n_{i}})

.

For subsystem

X_{i}

, the system dynamics can be decomposed into:

{\dot{X}}_{i} = f_{i} (X_{1}, X_{2}, \dots, X_{M}, χ (t))

(3)

where

f_{i} (\cdot) : R^{n} \to R^{n_{i}}

is dynamic functions for subsystem

X_{i}

.

χ (t)

denotes factors other than system state, such as environmental noise and external inputs. When subsystems

X_{i}

and

X_{j}

exhibit causal coupling relationships, there is:

\frac{\partial f_{i}}{\partial X_{j}} \neq 0 (i \neq j)

(4)

This indicates that the state variables of subsystem

X_{j}

exert a direct dynamical influence on the evolution of subsystem

X_{i}

. It can be further expressed as:

X_{i} (t) = g (X_{j} (t), χ (t))

(5)

where

g (\cdot)

denotes the state transfer function determined by system dynamics.

Based on system dynamics analysis, the following discussion addresses health-state assessment under conditions of causal relationships. Assume that the subsystem-level health indicator for subsystem

X_{i}

is

ϕ_{i} \in R

, and the underlying health indicators is

h_{i} = (h_{i 1}, h_{i 2}, . . ., h_{i K_{i}}) \in R^{K_{i}}

. These health indicators serve as a basis for reflecting the health state, and the subsystem health degradation evolution function is expressed as:

ϕ_{i} (t) = Ω_{i} (h_{i} (t), X_{i} (t))

(6)

where

ϕ_{i} (t)

and

h_{i} (t)

represent the observed values of

ϕ_{i}

and

h_{i}

at time t, respectively.

Ω_{i} (\cdot) : R^{K_{i}} \times R^{n_{i}} \to R

denotes the subsystem health degradation mapping function.

Remark 1.

The underlying health indicator

h_{i}

is essentially part of the state vector

X_{i}

. However, during health-state assessment, a subset of states from

X_{i}

that are observable and sufficiently representative of the subsystem’s health condition are typically selected as health indicators. For clarity in subsequent reasoning,

h_{i}

and

X_{i}

are presented separately here.

Similarly, for subsystem j, its health degradation evolution function is expressed as:

ϕ_{j} (t) = Ω_{j} (h_{j} (t), X_{j} (t))

(7)

where

ϕ_{j} (t) \in R

,

h_{j} (t) \in R^{K_{j}}

, and

X_{j} (t) \in R^{n_{j}}

represent the subsystem-level health indicators, underlying health indicators, and state vector for subsystem

X_{j}

, respectively. Since

X_{i}

depends on

X_{j}

, and

ϕ_{i} (t) = Ω_{i} (h_{i} (t), X_{i} (t))

, therefore

ϕ_{i}

will be affected by

X_{j}

.

(2): From the perspective of health assessment

As shown in the preceding analysis, subsystem-level indicator

ϕ_{i}

is influenced by

ϕ_{j}

. This relationship leads to redundancy in health information during the health-state assessment process, as explained below. Health state

Ω

can be understood as a latent variable reflecting the degradation of the equipment’s health. During the health-state assessment process, the amount of information that

ϕ_{i}

and

ϕ_{j}

can provide to

Ω

is

I (Ω; ϕ_{i}, ϕ_{j}) = H (Ω) - H (Ω |ϕ_{i}, ϕ_{j})

(8)

where

H (Ω)

represents the entropy of

Ω

.

H (Ω |ϕ_{i}, ϕ_{j})

denotes the entropy of

Ω

given the observation of

ϕ_{i}

and

ϕ_{j}

.

I (Ω; ϕ_{i}, ϕ_{j})

is the mutual information between

Ω

,

ϕ_{i}

and

ϕ_{j}

, indicating the extent to which the uncertainty regarding

Ω

is reduced after observing

ϕ_{i}

and

ϕ_{j}

.

If

ϕ_{i}

and

ϕ_{j}

are independent of each other, then

I (Ω; ϕ_{i}, ϕ_{j})

can be approximated as:

I (Ω; ϕ_{i}, ϕ_{j}) = I (Ω; ϕ_{i}) + I (Ω; ϕ_{j})

(9)

However, since there is redundancy in health information between

ϕ_{i}

and

ϕ_{j}

, therefore

I (Ω; ϕ_{i}, ϕ_{j}) < I (Ω; ϕ_{i}) + I (Ω; ϕ_{j})

. Redundant health information is represented as

R (ϕ_{i}, ϕ_{j}) = I (Ω; ϕ_{i}) + I (Ω; ϕ_{j}) - I (Ω; ϕ_{i}, ϕ_{j})

(10)

where

R (ϕ_{i}, ϕ_{j})

represents redundant health information between

ϕ_{i}

and

ϕ_{j}

. Since

I (Ω; ϕ_{i}, ϕ_{j}) = I (Ω; ϕ_{i}) + I (Ω; ϕ_{i} |ϕ_{j})

, Equation (10) can be rewritten as

R (ϕ_{i}, ϕ_{j}) = I (Ω; ϕ_{j}) - I (Ω; ϕ_{i} |ϕ_{j})

(11)

It is evident that

R (ϕ_{i}, ϕ_{j})

essentially depends on the degree of information dependence between

ϕ_{i}

and

ϕ_{j}

, i.e., the magnitude of the mutual information

I (ϕ_{i}, ϕ_{j})

and

I (ϕ_{i}, ϕ_{j})

is directionless. At the same time,

I (ϕ_{i}, ϕ_{j})

satisfies the mapping relationship described below.

I (ϕ_{i}, ϕ_{j}) = f (K (ϕ_{i}, ϕ_{j})), \frac{d f}{d K} > 0

(12)

where

K (ϕ_{i}, ϕ_{j})

denotes the correlation coefficient of

ϕ_{i}

and

ϕ_{j}

.

f (\cdot)

indicates the mapping function from

K (ϕ_{i}, ϕ_{j})

to

I (ϕ_{i}, ϕ_{j})

.

Therefore, redundant health information can be reflected by the correlation coefficient. Based on the above analysis, the causal relationship between

ϕ_{i}

and

ϕ_{j}

leads to redundancy in health information during the health-state assessment process, and this redundant health information can be reflected by the correlation coefficient. This correlation is referred to as causality-informed correlation in this paper.

Based on the above analysis, calculating causality-informed correlation between subsystems requires addressing the following two problems: ① Determining whether dynamic causal coupling relationships exist between subsystems; ② Quantifying the magnitude of causality-informed correlation among health indicators.

3.2. Convergent Cross-Mapping for Causal Relationships Inference

Sugihara proposed the convergent cross-mapping (CCM) method in 2012. For complex equipment, the intricate coupling relationships among internal subsystems often make it difficult to establish accurate parametric models. CCM can detect directed causal relationships within coupled dynamic systems without requiring explicit parametric models, making it well-suited for identifying causal relationships between subsystems in complex equipment.

Based on the findings of Butler [36] and Cummins [37] on the applicability of state-space reconstruction (SSR) and convergent cross-mapping (CCM), the conditions for subsystems to be eligible for causal coupling analysis are established as follows: Condition ①: Each subsystem performs a single function, meaning a subsystem ultimately has only one output. The main purpose of dividing subsystems according to a single output is to ensure the clarity of the causal analysis chain, thereby making it easier to identify the dominant factor at any given moment. Condition ②: No closed-loop feedback structures exist between subsystems. According to Butler’s research, if there is a feedback loop between two objects, their influences on each other will be coupled, making it impossible to distinguish which one is the driving factor, thereby rendering causal analysis meaningless. Condition ③: External inputs to a subsystem can be treated as constants, i.e.,

χ (t) = χ_{0}

, where

χ_{0}

is a constant parameter. If the system is subject to external inputs that change over time, it is impossible to form a stable manifold, and consequently, state-space reconstruction cannot be performed, rendering the basic conditions for the CCM method invalid.

The process of determining the causal coupling relationship within the subsystem is illustrated in Figure 1b. Assume that the time series generated by projecting systems

X_{i}

and

X_{j}

onto a one-dimensional space is:

Y_{i} = {Y_{i} (t) |t \in [1, T]}

,

Y_{j} = {Y_{j} (t) |t \in [1, T]}

. According to the Takens embedding theorem [38], let the embedding dimension be m and the delay time be

τ

. The reconstructed state vector is represented as:

{\tilde{Y}}_{i} (t) = [Y_{i} (t), Y_{i} (t - τ), \dots, Y_{i} (t - (m - 1) τ)]

(13)

{\tilde{Y}}_{j} (t) = [Y_{j} (t), Y_{j} (t - τ), \dots, Y_{j} (t - (m - 1) τ)]

(14)

where m and

τ

are fixed positive integers. m is determined using the pseudo-neighborhood method [39], and

τ

is determined using the average mutual information method [40]. The reconstructed manifolds are, respectively:

M_{i} = {{\tilde{Y}}_{i} (t) |t = (m - 1) τ + 1, \dots, T}

,

M_{i} = {{\tilde{Y}}_{j} (t) |t = (m - 1) τ + 1, \dots, T}

.

According to research by Butler et al., when the reconstructed state space satisfies both Auto-predictability fraction (AF) and Recurrence fraction (RF) metrics on top of meeting the conditions mentioned earlier, CCM can be employed to determine causal relationships. Specifically, AF and RF should approach 1. For detailed calculation methods, please refer to reference [36], and further elaboration is omitted here.

For a specific time t, find the

Q = m + 1

nearest neighbor points

{{\tilde{Y}}_{i} (t_{q})}_{q = 1}^{Q}

on

M_{i}

that are closest to

{\tilde{Y}}_{i} (t)

, with the corresponding time index being

t_{k}

. Map

{{\tilde{Y}}_{i} (t_{q})}_{q = 1}^{Q}

to

Y_{j} = {Y_{j} (t) |t \in [1, T]}

, where the corresponding sample point is

{Y_{j} (t_{k})}_{k = 1}^{K}

. Calculate the estimated value

{\hat{Y}}_{j} (t)

of

Y_{j} (t)

.

{\hat{Y}}_{j} (t) = \sum_{q = 1}^{Q} v_{q} Y_{j} (t_{q})

(15)

v_{q} = \frac{exp (- d_{q} / d_{1})}{\sum_{ι = 1}^{Q} exp (- d_{ι} / d_{1})}

(16)

d_{q} = ∥\tilde{Y} (t) - \tilde{Y} (t_{q})∥

(17)

where

∥\tilde{Y} (t) - \tilde{Y} (t_{q})∥

denotes Euclidean distance.

d_{1}

represents the nearest neighbor distance. Define

{\hat{Y}}_{j} (t)

as the cross-mapping from

Y_{i}

to

Y_{j}

of

Y_{j} (t)

. Calculate the correlation coefficient

r_{i j}

between

{\hat{Y}}_{j} (t)

and

Y_{j} (t)

using the equation:

r_{i j} = \frac{\sum_{t = 1}^{Z} (Y_{j} (t) - {\bar{Y}}_{j} (t)) ({\hat{Y}}_{j} (t) - {\bar{Y}}_{j}^{'} (t))}{\sqrt{\sum_{t = 1}^{Z} {(Y_{j} (t) - {\bar{Y}}_{j} (t))}^{2} \sum_{t = 1}^{Z} {({\hat{Y}}_{j} (t) - {\bar{Y}}_{j}^{'} (t))}^{2}}}

(18)

where Z denotes the sample length.

{\bar{Y}}_{j} (t)

,

{\bar{Y}}_{j}^{'} (t)

represent the mean of the actual values and the mean of the estimated values, respectively. As the sample length Z increases,

{\hat{Y}}_{j} (t)

gradually converges toward

Y_{j} (t)

, and the correlation coefficient ultimately converges to [0, 1]. This convergence suggests the existence of a causal relationship from subsystem

X_{i}

to subsystem

X_{j}

, represented as

X_{i} \to X_{j}

; The coefficient

r_{j i}

is then computed to assess the reverse causality. Specifically, if

r_{j i} \in [0, 1]

, there is

X_{j} \to X_{i}

. Conversely, if

r_{i j}, r_{j i} \notin [0, 1]

, no causal relationship is inferred between

X_{i}

and

X_{j}

, denoted as

X_{i} ⊥ X_{j}

. This paper represents the causal relationships between subsystems in the form of a directed acyclic graph (DAG), referred to as a causal relationship graph.

Definition 1.

Subsystem causal relationship graph

G = (V, E)

,

V = {V_{i} |i \in [1, M]}

is the set of vertices.

V_{i}

corresponds to subsystem

X_{i}

;

E = {E_{i j} |i, j \in [1, M]}

is the set of directed edges connecting the vertices, used to indicate the causal direction between two subsystems.

For example, suppose the causal relationships among subsystems

X_{1}

,

X_{2}

, and

X_{3}

are as follows:

X_{1} \to X_{2}

,

X_{1} \to X_{3}

,

X_{2} ⊥ X_{3}

. Then the causal relationship graph of the subsystems is represented as Figure 2.

3.3. Calculation of Conditionally Hybrid Correlation Coefficients

In Section 3.2, the analysis of causal relationships among subsystems is conducted using the CCM method. However, CCM can only determine the direction of causality and cannot quantitatively analyze the magnitude of the causal driving effect between subsystems. This poses a challenge for all current causal analysis methods. However, for the subsequent health-state assessment task, what is needed is not the magnitude of the causal effect itself, but the strength of statistical dependence among subsystem-level indicators induced by the identified coupling structure. This dependence encompasses both linear and nonlinear correlations. This paper utilizes CHCC to represent the correlation between indicators. The calculation process is shown in Figure 1c.

Assume the causal relationship between subsystems

X_{i}

and

X_{j}

is

X_{i} \to X_{j}

. The one-dimensional projection of state space of subsystems

X_{i}

and

X_{j}

are

Y_{i}

and

Y_{j}

. From the perspective of system dynamics, if the relationship between

Y_{i}

and

Y_{j}

is linear, which means

Y_{i}

and

Y_{j}

satisfy the relation equation:

Y_{j} = a Y_{i} + b

(19)

where a, b are arbitrary constants. Then Pearson’s correlation coefficient is used to calculate the correlation coefficient between

Y_{i}

and

Y_{j}

.

L (Y_{i}, Y_{j}) = \frac{\sum_{t = 1}^{N} (Y_{i} (t) - {\bar{Y}}_{i}) (Y_{j} (t) - {\bar{Y}}_{j})}{σ_{i} σ_{j}}

(20)

where

L (Y_{i}, Y_{j})

denotes linear correlation coefficient.

{\bar{Y}}_{i}

and

{\bar{Y}}_{j}

indicate the average of

Y_{i}

and

Y_{j}

, respectively.

σ_{i}

,

σ_{j}

stand for the variance of

Y_{i}

and

Y_{j}

.

When there is a nonlinear relationship between

X_{i}

and

X_{j}

, the correlation coefficient is calculated using the empirical distance covariance [41]:

R (Y_{i}, Y_{j}) = \frac{v^{2} (Y_{i}, Y_{j})}{\sqrt{v^{2} (Y_{i}, Y_{i}) v^{2} (Y_{j}, Y_{j})}}

(21)

where

R (Y_{i}, Y_{j})

denotes nonlinear correlation coefficient.

v^{2} (Y_{i}, Y_{j})

,

v^{2} (Y_{i}, Y_{i})

,

v^{2} (Y_{j}, Y_{j})

stand for the empirical distance covariance between

(Y_{i}, Y_{j})

,

(Y_{i}, Y_{i})

, and

(Y_{j}, Y_{j})

, respectively.

Specifically, the calculation of the correlation coefficient is determined by the function

F (Y_{i}, Y_{j})

:

K (Y_{i}, Y_{j}) = F (Y_{i}, Y_{j}) = \{\begin{matrix} L (Y_{i}, Y_{j}) & Y_{j} = a Y_{i} + b \\ R (Y_{i}, Y_{j}) & e l s e \end{matrix}

(22)

where

K (Y_{i}, Y_{j})

is a generalized representation of the correlation coefficient between

Y_{i}

and

Y_{j}

.

In practice, the following methods can be used to determine whether the relationship between

Y_{i}

and

Y_{j}

is linear or nonlinear. First, perform a linear regression fit on

Y_{j}

to obtain the residual sequence

ε

.

Y_{j} = a Y_{i} + b + ε

(23)

ε = Y_{j} - {\hat{Y}}_{j}

(24)

where

{\hat{Y}}_{j}

represents the fitted value.

ε

indicates the residual sequence. If

Y_{i}

and

Y_{j}

are linearly related, then

ε

should not contain any systematic structure. Calculate the empirical distance correlation coefficient between

Y_{i}

and

ε

. The criterion for determining whether the relationship between

Y_{i}

and

Y_{j}

is linear or nonlinear is

\{\begin{matrix} R (Y_{i}, ε) < ι & linear \\ R (Y_{i}, ε) ⩾ ι & nolinear \end{matrix}

(25)

ι

represents a minimum value constant.

Without loss of generality, define the projection vector of the N subsystems

X_{1}, X_{2}, \dots, X_{N}

as

Y_{1}, Y_{2}, \dots, Y_{N}

. The correlation matrix which denoted as K is obtained as follows:

K = [\begin{matrix} K (Y_{1}, Y_{1}) \\ K (Y_{2}, Y_{1}) & K (Y_{2}, Y_{2}) \\ \dots & \dots & \dots \\ K (Y_{N}, Y_{1}) & K (Y_{N}, Y_{2}) & \dots & K (Y_{N}, Y_{N}) \end{matrix}]

(26)

The initial correlation coefficient

k_{i}

for subsystem

X_{i}

is:

k_{i} = \sum_{j = 1}^{i} K (Y_{i}, Y_{j})

(27)

Then the CHCC

δ_{i}

of

X_{i}

is:

δ_{i} = \frac{1 / k_{i}}{\sum_{i = 1}^{N} 1 / k_{i}}

(28)

Remark 2.

The parameter

k_{i}

increases monotonically with the indicator correlation strength. When calculating the discount factor for an indicator,

k_{i}

is inverted and normalized so that the larger the value of

k_{i}

, the smaller the value of

δ_{i}

, and accordingly, the discount for that indicator is approximately larger.

δ_{i}

takes values in the range of [0, 1].

δ_{i}

= 0 means that the health information of subsystem

X_{i}

can be completely represented by other subsystems.

δ_{i}

= 1 means that subsystem

δ_{i}

is completely independent from other subsystems.

0 < δ_{i} < 1

means that subsystem

X_{i}

has causality-informed correlation with other subsystems.

3.4. Calculation of Other Parameters

For the subsystem

X_{i}

with causality-informed correlation, it consists of

K_{i}

mutually independent underlying indicators

h_{i} \in R^{K_{i}}

. In the ER rule, the process of transforming indicators into evidence requires the determination of three other key parameters: confidence level, weight, and reliability.

Calculation of confidence level $β$

Assume there are M health grade, represented as:

Θ = {φ_{k} |k \in [1, M]}

.

Θ

is called the assessment framework.

β_{φ_{k}, i z} (t) = \{\begin{matrix} \frac{U (φ_{n + 1}) - h_{i z} (t)}{U (φ_{n + 1}) - U (φ_{n})} & k = n, if U (φ_{n}) ⩽ h_{i z} (t) ⩽ U (φ_{n + 1}) \\ \frac{h_{i z} (t) - U (φ_{n})}{U (φ_{n + 1}) - U (φ_{n})} & k = n + 1 \\ 0 & k \neq n, n + 1 \end{matrix}

(29)

where

β_{φ_{k}, i z} (t)

represents the probability that indicator

h_{i z} (i \in [1, N], z \in [1, K_{i}])

is assessed as health grade

φ_{k}

at time t, which is called confidence level in the ER rule.

U (φ_{k})

represents the reference value for health grade

φ_{k}

and meets

U (φ_{1}) < U (φ_{2}) <, \dots, < U (φ_{M})

.

h_{i z} (t)

represents the monitored value of indicator

h_{i z}

at time t. Through the above processing, health indicator

h_{i z}

monitoring data can be transformed into an evidence-distribution form.

e_{i z} (t) = {(φ_{k}, β_{φ_{k}, i z} (t)) |\forall φ_{k} \subseteq Θ}

(30)

where

e_{i z} (t)

represents the evidence distribution of indicator

h_{i z}

at time t.

Calculation of weight w

The coefficient of the variation-based weighting (CVBW) can effectively capture the fluctuation of indicators [42], which reflects the level of attention given to each indicator. Therefore, this study employs CVBW to calculate the weights of the underlying indicators.

Indicator weight

w_{i z}

is calculated as:

w_{i z} = ε_{i z} / \sum_{ι = 1}^{K_{i}} ε_{i ι}

(31)

ε_{i z} = σ_{i z} / {\bar{h}}_{i z}

(32)

where

{\bar{h}}_{i z}

,

σ_{i} z

denote the mean and standard deviation of the monitoring data of

h_{i z}

, respectively.

ε_{i z}

represents the coefficient of variation.

Calculation of the reliability r

The reliability of the evidence

r_{i z} (t)

consists of a static reliability

{r_{i z}}^{S}

and a dynamic reliability

{r_{i z}}^{D} (t)

. The exact value of the static reliability

{r_{i z}}^{S}

is given by the expert. Dynamic reliability

{r_{i z}}^{D} (t)

is calculated using the distance-based calculation method [43]. As indicated in reference [44], the evidence reliability r can be derived by combining the static reliability

{r_{i z}}^{S}

and dynamic reliability

{r_{i z}}^{D} (t)

through the perturbation coefficient. The specific calculation method is beyond the focus of this paper and will not be discussed further.

3.5. Evidence Fusion Process

This section obtains the health-state assessment results of complex equipment through evidence fusion from underlying indicators to subsystem-level indicators, as shown in Figure 1d.

According to the ER rule [16], before performing evidence fusion, the evidence needs to be converted into the weighted belief distribution with reliability (WBDR) form. The elements of WBDR are called basic probability mass, and the basic probability mass of evidence

e_{i z} (t)

is represented as

m_{φ_{k}, i z} (t) = \{\begin{matrix} 0 & φ_{k} = \emptyset \\ {\tilde{w}}_{i z} (t) β_{φ_{k}, i z} (t) & φ_{k} \subseteq Θ, φ_{k} \neq \emptyset \\ 1 - {\tilde{w}}_{i z} (t) & φ_{k} = P (Θ) \end{matrix}

(33)

{\tilde{w}}_{i z} (t) = \frac{w_{i z}}{1 + w_{i z} + r_{i z} (t)}

(34)

where

P (Θ)

denotes the power set of

Θ

.

{\tilde{w}}_{i z} (t)

indicates the hybrid weight of health indicator

h_{i z}

.

The WBDR can be represented by

m_{i z} (t) = {(φ_{k}, m_{φ_{k}, i z} (t)), \forall φ_{k} \subseteq Θ; (P (Θ), m_{P (Θ), i z} (t))}

(35)

The evidence is integrated in a pairwise fusion manner, and the specific process of the final fusion is shown as Equations (31)–(33).

{\hat{m}}_{φ_{k}, i (K_{i})} (t) = [(1 - r_{K_{i}} (t)) {\hat{m}}_{φ_{k}, i (K_{i} - 1)} (t) + {\hat{m}}_{P (Θ), i (K_{i} - 1)} (t) m_{φ_{k}, i K_{i}} (t)]

(36)

{\hat{m}}_{P (Θ), i (K_{i})} (t) = (1 - r_{i K_{i}} (t)) {\hat{m}}_{P (Θ), i (K_{i} - 1)} (t)

(37)

m_{φ_{k}, i (K_{i})} (t) = \frac{{\hat{m}}_{φ_{k}, i (K_{i})} (t)}{\sum_{E \subseteq Θ} {\hat{m}}_{E, i (K_{i})} (t) + {\hat{m}}_{P (Θ), i (K_{i})}}

(38)

where

{\hat{m}}_{φ_{k}, i (K_{i})} (t)

denote the unnormalized basic probability mass after fusing the

K_{i}

health indicators of subsystem

X_{i}

and assessed as health grade

φ_{k}

.

{\hat{m}}_{P (Θ), i (K_{i})}

represents the unnormalized basic probability mass of the power set. Similarly,

{\hat{m}}_{φ_{k}, i (K_{i} - 1)} (t)

and

{\hat{m}}_{P (Θ), i (K_{i} - 1)} (t)

indicate the fusion results of

K_{i} - 1

indicators.

m_{φ_{k}, i (K_{i})} (t)

is the normalized basic probability mass.

The confidence level for health grade

φ_{k}

after fusing

K_{i}

underlying indicators is

β_{φ_{k}, i (K_{i})} (t) = \frac{{\hat{m}}_{φ_{k}, i (K_{i})} (t)}{\sum_{E \subseteq Θ} {\hat{m}}_{E, i (K_{i})} (t)}

(39)

According to Equation (28), the hybrid weight after the fusion of

K_{i}

indicators is calculated as

{\tilde{w}}_{i (K_{i})} (t) = \frac{m_{φ_{k}, i (K_{i})} (t)}{β_{φ_{k}, i (K_{i})} (t)} = \frac{\sum_{E \subseteq Θ} {\hat{m}}_{E, i (K_{i})} (t)}{\sum_{E \subseteq Θ} {\hat{m}}_{E, i (K_{i})} (t) + {\hat{m}}_{P (Θ), i (K_{i})} (t)}

(40)

where

{\tilde{w}}_{i (K_{i})} (t)

denote the hybrid weight after the fusion of

K_{i}

indicators.

According to Equation (35), the fusion results of the hybrid weights of the underlying indicators of each subsystem

{\tilde{w}}_{1 (K_{1})} (t), {\tilde{w}}_{2 (K_{2})} (t), \dots, {\tilde{w}}_{N (K_{N})} (t)

can be obtained. Normalize the above hybrid weights to obtain subsystem-level health indicator weights.

{\bar{w}}_{i} (t) = {\tilde{w}}_{i (K_{i})} (t) / \sum_{ι = 1}^{N} {\tilde{w}}_{ι (K_{ι})} (t)

(41)

where

{\bar{w}}_{i} (t)

indicates the weight of subsystem-level health indicator

ϕ_{i}

.

Remark 3.

If the hybrid weights obtained from the fusion of underlying indicators are directly used as the weights for subsystem-level health indicators, it is essentially equivalent to directly fusing the underlying indicators of all subsystems. Reference [45] demonstrates that when using the ER rule for evidence fusion, the more evidence is fused, the greater the risk of overfitting the fusion result. Therefore, this paper adopts normalization to reduce the risk of overfitting.

In summary, the WBDR of subsystem-level health indicator

ϕ_{i}

can be expressed as:

m_{i} (t) = {(φ_{k}, m_{φ_{k}, i} (t)), \forall φ_{k} \subseteq Θ; (P (Θ), m_{P (Θ), i} (t))}

(42)

m_{φ_{k}, i} (t) = \{\begin{matrix} 0 & φ_{k} = \emptyset \\ {\bar{w}}_{i} (t) δ_{i} β_{φ_{k}, i (K_{i})} (t) & φ_{k} \subseteq Θ, φ_{k} \neq \emptyset \\ 1 - {\bar{w}}_{i} (t) & φ_{k} = P (Θ) \end{matrix}

(43)

where

m_{i} (t)

denote the WBDR of subsystem-level health indicator

ϕ_{i}

.

m_{φ_{k}, i} (t)

represents the basic probability mass of

ϕ_{i}

being assessed at health grade

φ_{k}

.

δ_{i}

indicates the CHCC of

ϕ_{i}

.

By repeating the fusion process of Equations (31)–(33), performing evidence fusion on subsystem-level health indicators. The fusion process is abbreviated as follows:

m_{φ_{k}, (N)} (t) = m_{φ_{k}, 1} (t) \otimes m_{φ_{k}, 2} (t) \otimes, \dots, m_{φ_{k}, N} (t)

(44)

where

m_{φ_{k}, (N)} (t)

denotes the basic probability mass assigned to the health grade

φ_{k}

after the fusion of N subsystem-level health indicators. ⊗ represents the fusion symbol.

Based on Equation (34), the confidence level for health grade

φ_{k}

after fusing N subsystem-level indicators is

β_{φ_{k}, (N)} = \frac{{\hat{m}}_{φ_{k}, (N)} (t)}{\sum_{E \subseteq Θ} {\hat{m}}_{E, (N)} (t)}

(45)

Ultimately, it can yield the evidence distribution (or referred to as the health-state distribution) results of complex equipment.

e_{a l l} (t) = {(φ_{k}, β_{φ_{k}, (N)} (t)) |\forall φ_{k} \subseteq Θ}

(46)

where

e_{a l l} (t)

denotes the evidence distribution of complex equipment.

β_{φ_{k}, (N)} (t)

represents the confidence level that complex equipment is assessed as health grade

φ_{k}

.

(N)

represents the fusion of N subsystem-level health indicators.

Since the evidence is correlated, the evidence fusion will no longer satisfy the law of exchange and the law of union. The problem of determining the fusion order is needed when evidence with correlation is fused [46]. This paper proposes a method to determine the fusion order based on the signaling sequence.

Let the signaling relationship between the subsystems

X_{1}, X_{2}, \dots, X_{N}

with causality-informed correlation be: the output of

X_{1}

is the input of

X_{2}

and the output of

X_{2}

serve as the input of

X_{3}

…Then, the signaling sequence is:

1, 2, \dots, N

.

S e q (i)

denotes the signaling sequence number of subsystem

X_{i}

.

Let

S e q F (i)

denote the fusion order of

S_{i}

, and

S e q F (i) = 1

represents that piece of evidence is fused first. Then there are:

S e q F (i) = S e q (i)

(47)

The fusion order

S e q F (i)

reflects the priority assigned to each indicator, where a higher priority indicates greater attention. For indicators with causality-informed correlation, the health information of upstream indicators in the signaling sequence is transmitted downstream. Accordingly, upstream indicators should be assigned higher priority. The fusion order follows the signaling sequence derived from the causal relationship graph and equipment working mechanism, corresponding to a topological ordering that ensures causally consistent information propagation and gives sufficient priority to upstream indicators.

The distribution form shown in Equation (41) is not conducive to model optimization or comparison. Therefore, further defuzzification processing is required to convert the distribution form into a deterministic numerical form. According to the utility theory proposed by Yang [14], the output utility of complex equipment is:

η (t) = \sum_{φ_{k} \subseteq Θ} η (φ_{k}) β_{φ_{k}, (N)} (t)

(48)

where

η (φ_{k})

is the utility value of health grade

φ_{k}

and is typically obtained based on expert knowledge or statistical results from actual engineering data.

η (t)

indicates the output utility of the complex equipment at time t.

3.6. Optimization of Model Parameters

The parameters of ERr-CIC

Γ = {δ_{i} |i \in [1, N]}

,

U = {U (φ_{k}) |k \in [1, M]}

, and

Ψ = {η (φ_{k}) |k \in [1, M]}

are calculated based on the indicator monitoring data or determined by expert knowledge. However, the adaptability of the model parameters decreases due to factors such as disturbances and environmental changes that are inevitable during complex equipment health monitoring. Therefore, the model needs to be trained with multiple sets of data to optimize the model parameters.

Mean squared error (MSE) is widely used for evaluating model accuracy. It quantifies the precision of the ERr-CIC model by calculating the average of the squares of the difference between the actual output utility

η (t)

and the expected output utility

η_{exp} (t)

.

η_{exp} (t)

is typically determined by domain experts based on the evaluation scenario, historical statistical analysis, and industry standards. The optimization objective functions are defined as:

min_{Γ, U, Ψ} M S E (η (t) - η_{exp} (t))

(49)

The corresponding parameter constraints are shown in Equations (45)–(47).

0 ⩽ δ_{i} ⩽ 1, \sum_{i = 1}^{N} δ_{i} = 1

(50)

U (φ_{1}) < U (φ_{2}) <, \dots, U (φ_{k})

(51)

η (φ_{1}) < η (φ_{2}) <, \dots, η (φ_{k})

(52)

4. Sensitivity and Robustness Analysis

To improve the model’s interpretability and provide a quantitative analysis of model parameters for practical engineering applications, this section presents a sensitivity and robustness analysis.

4.1. Sensitivity Analysis

For problem 2, based on the reasoning process outlined earlier, this section analyzes the sensitivity of the output confidence level to CHCC within the complex equipment. In addition, through the mathematical derivation process of the sensitivity analysis, the traceability analysis of the model output is also realized.

Proposition 1.

According to engineering practice, the less dependent an indicator is on other indicators, the greater its impact on the overall health state. In the ERr-CIC model, a higher CHCC of an indicator at the same evaluation level corresponds to a higher overall output utility of the equipment. Thus, it is speculated that a negative correlation exists between indicator causality-informed correlation and complex equipment health state.

Proof of the Proposition 1.

The sensitivity analysis is aimed at the sensitivity of the output to the relative overall correlation coefficient (in the case of

δ_{i}

), denoted as

\frac{\partial β_{φ_{k}, (N)} (t)}{\partial δ_{i}}

. For convenience of notation, the variable t will be omitted in the subsequent derivation:

\frac{\partial β_{φ_{k}, (N)}}{\partial δ_{i}} = [\frac{\partial β_{φ_{k}, (N)}}{\partial {\hat{m}}_{φ_{k}, (N)}}, \frac{\partial β_{φ_{k}, (N)}}{\partial {\hat{m}}_{P (Θ), (N)}}] \cdot [\begin{matrix} \frac{\partial {\hat{m}}_{φ_{k}, (N)}}{\partial δ_{i}} \\ \frac{\partial {\hat{m}}_{P (Θ), (N)}}{\partial δ_{i}} \end{matrix}]

(53)

\begin{matrix} \frac{\partial {\hat{m}}_{φ_{k}, (N)}}{\partial δ_{i}} = \\ [\begin{matrix} \frac{\partial {\hat{m}}_{φ_{k}, (N)}}{\partial m_{P (Θ), i}} \\ \frac{\partial {\hat{m}}_{φ_{k}, (N)}}{\partial {\hat{m}}_{φ_{k}, (N - 1)}} \\ \frac{\partial {\hat{m}}_{φ_{k}, (N)}}{\partial {\hat{m}}_{P (Θ), (N - 1)}} \\ \frac{\partial {\hat{m}}_{φ_{k}, (N)}}{\partial m_{φ_{k}, i}} \\ \frac{\partial {\hat{m}}_{φ_{k}, (N)}}{\partial \sum_{C \cap D = φ_{k}} m_{B, (N - 1)} m_{C, i}} \end{matrix}] \cdot {[\begin{matrix} \frac{\partial m_{P (Θ), i}}{\partial δ_{i}} \\ \frac{\partial {\hat{m}}_{φ_{k}, (N - 1)}}{\partial δ_{i}} \\ \frac{\partial {\hat{m}}_{P (Θ), (N - 1)}}{\partial δ_{i}} \\ \frac{\partial m_{φ_{k}, i}}{\partial δ_{i}} \\ \frac{\partial \sum_{C \cap D = φ_{k}} m_{B, (N - 1)} m_{C, i}}{\partial δ_{i}} \end{matrix}]}^{T} \\ = [\begin{matrix} {\hat{m}}_{φ_{k}, (N - 1)} \\ m_{P (Θ), i} \\ m_{φ_{k}, i} \\ {\hat{m}}_{P (Θ), (N - 1)} \\ 1 \end{matrix}] \cdot {[\begin{matrix} - {\bar{w}}_{i} \\ 0 \\ 0 \\ {\bar{w}}_{i} β_{φ_{k}, i} \\ \sum_{C \cap D = φ_{k}} m_{B, (N - 1)} {\bar{w}}_{i} β_{φ_{k}, i} \end{matrix}]}^{T} \end{matrix}

(54)

Equation (48) is derived using the chain rule, where the output confidence

β_{φ_{k}, (N)}

depends on the basic probability mass

{\hat{m}}_{φ_{k}, (N)}

and

{\hat{m}}_{P (Θ), (N)}

, which are further influenced by the CHCC parameter

δ_{i}

. Therefore, the sensitivity is decomposed into the contributions of these intermediate variables. The term

\frac{\partial {\hat{m}}_{φ_{k}, (N)}}{\partial δ_{i}}

captures how the combined mass assigned to health grade

φ_{k}

changes with respect to

δ_{i}

. Since the ER-rule fusion is recursive, this derivative is further decomposed into contributions from the previous fusion step and the current evidence source.

\begin{matrix} \frac{\partial {\hat{m}}_{P (Θ), (N)}}{\partial δ_{i}} = \frac{\partial \prod_{ι = 1}^{N} (1 - {\bar{w}}_{ι} δ_{ι})}{\partial δ_{i}} \\ = - {\bar{w}}_{i} \prod_{ι = 1}^{N - 1} (1 - {\bar{w}}_{ι} δ_{ι}) \end{matrix}

(55)

This term represents the sensitivity of the basic probability mass of power sets

{\hat{m}}_{P (Θ), (N)}

with respect to

δ_{i}

. Indicating that the impact of

δ_{i}

propagates multiplicatively through all fused evidence.

Therefore, the sensitivity of CHCC to the confidence level

β_{φ_{k}, (N)} (t)

can be determined by Equation (51).

ξ (t) = \sum_{i = 1}^{N} \sum_{k = 1}^{M} \frac{\partial β_{φ_{k}, (N)} (t)}{\partial δ_{i}}

(56)

In practical engineering fields,

ξ

is a sensitivity factor of the output confidence to the CHCC, indicating the validity of the subsystem’s causality-informed correlation in the model output.

Based on the above analysis, we can further derive the sensitivity factor of output utility

η (t)

with respect to

η (t) = \sum_{φ_{k} \subseteq Θ} η (φ_{k}) β_{φ_{k}, (N)} (t)

, define the output utility sensitivity factor

ξ^{(η)}

as

ξ^{(η)} = |\frac{\partial η}{\partial δ_{i}} \cdot \frac{δ_{i}}{η}|

(57)

Based on Equation (52), we can determine the tolerance range for CHCC. Specifically, factors such as external disturbances can cause fluctuations in the calculated value of

δ_{i}

, which in turn affect the output utility

η (t)

. In engineering practice, a tolerance coefficient

ρ

is typically defined to represent the maximum acceptable range of fluctuation in the output utility, as shown in Equation (53).

|Δ η| ⩽ ρ η

(58)

where

|Δ η|

represents the range of fluctuations in output utility. The specific value of the tolerance coefficient

ρ

is determined based on the actual equipment and operating conditions.

Under conditions of small disturbances

Δ η \approx \frac{\partial η}{\partial δ_{i}} \cdot Δ δ_{i}

(59)

Therefore, as long as Equation (55) is satisfied, the fluctuations in output utility will not exceed

ρ η

.

|\frac{\partial η}{\partial δ_{i}} \cdot Δ δ_{i}| ⩽ ρ η

(60)

This gives the upper bound of the fluctuation of

δ_{i}

|Δ δ_{i}| ⩽ \frac{ρ η}{|\frac{\partial η}{\partial δ_{i}}|} = \frac{ρ δ_{i}}{ξ^{(η)}}

(61)

In summary, the tolerance range for

δ_{i}

is

{\tilde{δ}}_{i} \in [{δ_{i}}^{(up)}, {δ_{i}}^{(low)}] = [δ_{i} - \frac{ρ δ_{i}}{ξ^{(η)}}, δ_{i} + \frac{ρ δ_{i}}{ξ^{(η)}}] \cap [0, 1]

(62)

where

{\tilde{δ}}_{i}

represents the disturbed CHCC of the subsystem

X_{i}

. □

4.2. Robustness Analysis

The analysis of model robustness focuses primarily on scenarios where the conditions for the subsystem are outlined in Section 3.2 are not met. Condition ① is relatively common in system design and will not be discussed in detail here.

(1): Robustness analysis for Condition ②

First, analyze the scenario where Condition ② is not met, i.e., the case where “No closed-loop feedback structures exist between subsystems” is not satisfied. First, according to the research by Butler [36] and Quian [47], when there is a feedback loop between systems

X_{i}

and

X_{j}

, the driving relationships between

X_{i}

and

X_{j}

become mutually coupled and difficult to distinguish, rendering causal analysis of

X_{i}

and

X_{j}

ineffective. Current approaches use non-system dynamics methods (such as structural equation modeling (SEM) [48]) to construct cyclical causal graphs, thereby enabling causal analysis that accounts for closed-loop feedback. However, constructing cyclical causal graphs requires long-term, stable monitoring data to yield accurate results, which is difficult to achieve for complex equipment with variable operating conditions and inherent performance degradation.

For cases where Condition ② is not met, the following solution is provided. In the presence of a feedback loop between subsystems

X_{i}

and

X_{j}

, they are consolidated into a single subsystem

\hat{X} i j

, from which a subsystem-level health indicator

\hat{ϕ} i j

is derived. Additionally,

{\hat{X}}_{i j}

includes all the underlying indicators

h_{i}

and

h_{j}

of

X_{i}

and

X_{j}

. The above processing procedure is shown in Figure 3.

From the perspective of model robustness, treating

X_{i}

and

X_{j}

as a single subsystem

{\hat{X}}_{i j}

introduces the risk of erroneous causal relationship judgments, indicating uncertainty in the causal structure of the subsystem. Specifically, the aforementioned operation will inevitably lead to changes in the correlation matrix

K

:

K = [\begin{matrix} K (Y_{i}, Y_{i}) \\ K (Y_{j}, Y_{i}) & K (Y_{j}, Y_{j}) \\ K (Y_{v}, Y_{i}) & K (Y_{v}, Y_{j}) & K (Y_{v}, Y_{v}) \end{matrix}] \Rightarrow \hat{K} = [\begin{matrix} K ({\hat{Y}}_{i j}, {\hat{Y}}_{i j}) \\ K (Y_{v}, {\hat{Y}}_{i j}) & K (Y_{v}, Y_{v}) \end{matrix}]

(63)

where

\hat{K}

indicates the transformed correlation matrix.

{\hat{Y}}_{i j}

denote the output signal of subsystem

{\hat{X}}_{i j}

, and

{\hat{Y}}_{i j} = Y_{j}

. Therefore,

\hat{K}

can be equivalently expressed as:

\hat{K} = [\begin{matrix} K (Y_{j}, Y_{j}) \\ K (Y_{v}, Y_{j}) & K (Y_{v}, Y_{v}) \end{matrix}]

(64)

According to Equations (22) and (23), the CHCC values before and after transformation can be calculated separately from

K

and

\hat{K}

, respectively. Then the CHCC variation can be obtained, denoted as

Δ δ_{i}

. However, according to Equations (54) and (55), it can be deduced that

Δ η ⩽ |\frac{\partial η}{\partial δ_{i}}| \cdot Δ δ_{i}

(65)

Equation (60) indicates that the output utility variation

Δ η

caused by

Δ δ_{i}

is bounded. Even if causal directions are partially misidentified, their effect is reflected as perturbations in the CHCC values. Therefore, robustness to CHCC variation implies robustness to structural uncertainty.

Based on the tolerance range of CHCC obtained in Section 4.1, it can be known that when using the ERr-CIC model to assess the health state of complex equipment with feedback loops in subsystems, the solution proposed in this subsection can be adopted to calculate the value of

Δ δ_{i}

. If

Δ δ_{i}

is within the tolerance range, the output utility obtained by the model can be adopted.

(2): Robustness analysis for Condition ③

The CCM-based causal inference framework assumes that external inputs remain constant. However, in real-world engineering systems, subsystems are often subject to time-varying external control signals. To address this problem, the external control signal is used to control the device to perform periodic and stable actions. In this case, the control signal can be regarded as an unchanged external excitation. According to CCM theory, such deterministic inputs can be incorporated into the reconstructed state space without destroying the underlying manifold structure.

From the perspective of robustness, periodic external control signals introduce structured and bounded perturbations, rather than stochastic disturbances. These perturbations do not alter the intrinsic coupling relationships among subsystems. Therefore, the inferred causal relationships remain stable under such conditions.

5. Case Study

5.1. Experiment Setting

As a reliable and precise measuring device, the electronic theodolite is widely used in hydraulic engineering, resource exploration, and military fields. The PAMD, serving as the core component of the electronic theodolite, is tasked with the measurement, computation, and display of angles. Therefore, assessing the health state of the PAMD is of significant importance. A specific model of PAMD was selected for the experimental analysis.

The PAMD consists of three main subsystems [49,50]: a photoelectric conversion unit (subsystem 1), a differential amplification unit (subsystem 2), and a microprocessor (subsystem 3). The photoelectric conversion unit converts received optical signals into electrical signals that encode angular information. These signals are then passed to the differential amplification unit, which performs filtering, transformation, and amplification. The processed signals are subsequently fed into the microprocessor, which includes a microcontroller unit (MCU) and peripheral circuits. The MCU executes computational algorithms to determine the theodolite’s rotation angle and displays the measurement results.

However, PAMDs are typically enclosed in electronic theodolites with high precision. It is difficult to obtain the output signal from the PAMD within an electronic theodolite without damaging its internal optical structure. Therefore, the experiment employed simulated experimental equipment that shares the same subsystems, working mechanism, and dynamic coupling relationships as the actual PAMD.

The system dynamics equations of the simulation equipment were established with reference to previous studies on the dynamic modeling of theodolite systems [51,52] to ensure that the coupling relationships among the simulated subsystems were consistent with the main dynamic interaction characteristics of the actual PAMD. The dynamic equations for the simulation system were set up as follows:

\{\begin{matrix} Y_{1} (t + 1) = a_{1} Y_{1} (t) + b_{1} G (θ (t), L (t)) + c_{1} + w_{1} (t) \\ Y_{2} (t + 1) = a_{2} Y_{2} (t) + b_{2} Y_{1} (t) + c_{2} + w_{2} (t) \\ Y_{3} (t + 1) = a_{3} Y_{3} (t) + b_{3} ϕ (Y_{2} (t)) + c_{3} + w_{3} (t) \end{matrix}

(66)

where

Y_{i}

denotes the output of subsystem

X_{i}

at time t;

a_{i}, i \in [1, 3]

represents the dynamic retention coefficient;

c_{i}

is the bias term;

b_{i}

denotes the gain term;

L (t)

represents the incident light intensity, and

G (\cdot)

indicates the optical signal modulation function;

ϕ (\cdot)

denotes the calibrated angular solution function; and

w_{i}

represents the noise term, which accounts for potential electronic fluctuations in the actual equipment as well as measurement uncertainties. The noise term was set to Gaussian white noise

w_{i} \sim N (0, {σ_{i}}^{2})

in this paper to avoid systematic bias and minimize the possibility of introducing correlations other than those caused by causal coupling. The variance

σ_{i}

was set to

σ_{i} = ℏ_{i} \cdot {\bar{Y}}_{i}

, where

ℏ_{i}

represents the noise ratio coefficient and

{\bar{Y}}_{i}

represents the mean of the subsystem’s output time series.

By analyzing the dynamic equations of the simulated equipment system, it can be concluded that the formulated equations capture the key causal interactions among subsystems, including optical signal modulation, inter-subsystem coupling, and nonlinear transformation processes. The parameters

a_{i}

and

b_{i}

characterize the intrinsic dynamic retention and coupling strength, respectively, while the functions

G (\cdot)

and

ϕ (\cdot)

represent the physical transformation mechanisms corresponding to optical modulation and angular solution processes in the actual equipment. Therefore, the model retains the core dynamic behaviors and interaction pathways observed in real PAMD equipment.

The position of the PAMD in the electronic theodolite and the composition of the simulation experimental equipment are shown in Figure 4a.

5.2. Causal Relationships Inference

Section 3.2 of this paper discusses the applicability conditions of complex equipment causal analysis. Through mechanism analysis of PAMD, it is evident that it meets conditions ① and ②: Each subsystem of PAMD has only one output, and the subsystems are in an open-loop structure with no feedback elements. To satisfy condition ③, this paper periodically collects the output of the subsystem, so that the external input can be regarded as a constant “0”.

Using the subsystem output time series as the projection of the subsystem state space in one-dimensional space. The output signals are directly acquired from the built-in output ports of the simulated experimental platform. This acquisition mechanism ensures that the recorded data faithfully reflects the intrinsic evolution of each subsystem, avoiding additional measurement noise or distortion introduced by external sensing processes. Periodically collect the outputs of each subsystem, collecting a total of two cycles as shown in Figure 4b. The outputs of subsystems 1, 2, and 3 are set as

Y_{1} = {Y_{1} (t) |t \in [1, 500]}

,

Y_{2}

, and

Y_{3}

, respectively.

According to the average mutual information method and the false nearest neighbors method, the embedding dimension and delay time are determined as

m = 4

,

τ = 5

. The reconstructed state space is represented as

M_{1}

,

M_{2}

, and

M_{3}

. It is worth noting that the choice of two operational cycles represents a trade-off between data sufficiency and computational efficiency. Empirically, this length is adequate to capture the dominant dynamic patterns of the system while avoiding redundancy and excessive computational overhead in later processing stages. To verify whether the subsystem output data obtained from monitoring can be used for causal analysis, AF and RF metrics of

M_{1}

,

M_{2}

, and

M_{3}

are validated. The test results are shown in Table 1. It can be seen that the three state spaces of the reconstruction, AF and RF values, all approach 1, meeting the requirements.

Using CCM to infer the causal relationships of subsystems, the results are shown in Figure 4c. From the figure, it can be seen that

r_{12}

and

r_{23}

quickly converge to “1” when the sample size increases to around 50.

r_{13}

begins to converge when the sample size increases to around 150.

r_{21}

and

r_{31}

, however, fluctuate around “0” and do not converge. In summary, the causal relationship among subsystems 1, 2, and 3 is:

X_{1} \to X_{2}

,

X_{1} \to X_{3}

,

X_{2} \to X_{3}

. The causal graph is shown in Figure 4d.

5.3. Health-State Assessment

PAMD’s health indicator system is constructed as follows [53,54]: ① Primary indicator: health state of PAMD. ② Subsystem-level indicators: Based on the operational mechanism analysis, primary functional assessment, and expert knowledge, the secondary indicators of the PAMD are determined as: sensitivity, amplification capability, and computational resolution. ③ Underlying indicators: according to the knowledge of experts and product specifications, six underlying indicators are selected: standard deviation of measurement (sdm), temperature (temp), responsiveness (res), lossy voltage (lv), magnifying power (mp), and offset error (oe). The health indicator system is shown in Figure 4e. Based on GB/T 5080 [55] and product instructions, PAMD health state is categorized into the following three health grades:

Θ = {φ_{1}, φ_{2}, φ_{3}} = {G o o d, M i d d l e, P o o r}

(67)

By varying the internal state parameters of the simulation equipment, the system was able to generate monitoring data ranging from healthy to poor health grades. The simulation device was internally equipped with a Hall effect sensor to measure current and voltage values at a measurement frequency of 100–250 kHz, and a temperature sensor with a measurement frequency of 100 Hz. Monitoring data of underlying health indicators were collected using the built-in sensors of the simulation equipment (Figure 4f). Sampling was performed based on the actual duration of a single electronic theodolite measurement. Since different sensors had varying sampling frequencies, the monitoring data in Figure 4f were first aligned on the timeline, resulting in a total of 600 time points. As can be seen from Figure 4f, the indicators deviated from the healthy values to varying degrees as the duration of use increased.

According to GB/T 36537-2018 [56], the reference values of each underlying health indicator are obtained as Table 2.

According to the indicator reference value and Equation (24), the indicator test data can be transformed into the form of a confidence level. The CHCC for evidence of subsystem-level indicators 1, 2, and 3 were calculated using Equations (14)–(23), with the results as:

δ_{1} = 1, δ_{2} = 0.5067, δ_{3} = 0.3619

. It can be seen that subsystem-level indicator 1 is independent of the other subsystem-level health indicators. Based on the signaling sequence between the subsystems, the fusion order of subsystem 1, 2, and 3 can be determined as:

S e q F (1) = 1, S e q F (2) = 2, S e q F (3) = 3

.

According to the ERr-CIC model, the health state of the PAMD is obtained as Figure 4g. The curves in the graph are color-coded, with green, yellow, and purple representing Good, Middle, and Poor health states, respectively. The overall health state of PAMD demonstrates a gradual decline over time, consistent with the expected performance degradation of the equipment.

According to Equation (43), the original output utility of PAMD is shown in Figure 4h. Industry experts determine the expected output utility based on PAMD’s actual operational scenarios and in conjunction with standard GB/T 37084-2018 [57]. It can be seen that the original output utility basically follows the same trend as the expected output utility. The root mean square error (RMSE) between the original output and the expected output is 0.056. However, during the transition period of changes in health state, the deviations are generally larger. Optimize the original ERr-CIC model according to the objective function and constraints shown in Equations (44)–(47). The ERr-CIC model utilized the sequential quadratic programming (SQP) algorithm, with a maximum number of iterations set to

1 \times 10^{3}

and the optimization terminates when the step size falls below

1 \times 10^{- 6}

. The parameter optimization results are shown in Table 3. The RMSE of the optimized model is 0.032. RMSE decreases from 0.056 to 0.032 after optimization, corresponding to a relative reduction of

75 %

.

5.4. Numerical Validation of Model Performance

This subsection conducts numerical verification of the sensitivity and robustness analysis result presented in Section 4, and specifically analyzes the practical application value of sensitivity factors in the health-state assessment of complex equipment.

First, the sensitivity factor is calculated based on the health-state assessment results from Section 5.2, as shown in Figure 5.

Figure 5a shows the sensitivity analysis results of confidence levels for various health grades on

δ_{i}

. The greater the result, the more sensitive the change in health grade confidence level

β_{φ_{k}, (N)}

to

δ_{i}

. Figure 5a indicates that each health state exhibits a certain degree of sensitivity to

δ_{1}

, particularly when the complex equipment is at health grade

φ_{3}

. To further clarify the impact of

δ_{i}

on the health-state assessment results, the distribution of sensitive factor results is presented in the form of a box plot, as shown in Figure 5b. For each

δ_{i}

, a longer length of the box plot indicates a greater impact on the confidence level of that health grade. As can be seen more clearly from Figure 5b,

δ_{1}

has the greatest impact on the health-state assessment results, and especially for those with a health grade of Poor. This further demonstrates the conclusion from Section 4.1 that subsystems with larger

δ_{i}

values have a greater impact on the health-state assessment results. Since

δ_{2}

and

δ_{3}

are numerically close, their sensitivity analysis results are also relatively close. However, Figure 5b still shows that the assessment results are more sensitive to

δ_{2}

than to

δ_{3}

.

Furthermore, the tolerance range of sensitive factors was calculated, and the results are shown in Figure 6.

Since the tolerance range analysis step is identical for all

δ_{i}

, we use

δ_{2}

as an example for analysis. Firstly, change the value of

δ_{2}

by 0.1 steps to analyze its impact on the output utility, as shown in Figure 6a. As can be seen from Figure 6a, the impact of changes in

δ_{2}

on output utility is not proportional. Assuming the tolerance coefficient

ρ = 5 %

, the tolerance range of

δ_{2}

can be obtained as

{\tilde{δ}}_{2} \in [0.474, 0.581]

according to Equation (57), as shown in Figure 6b. This result provides a clear quantitative guideline for parameter selection in practical deployment.

From a robustness perspective, this nonlinearity implies that small perturbations in

δ_{2}

do not lead to significant fluctuations in the output, reflecting the stability of the ERr-CIC model structure. In particular, within certain intervals, the output utility changes smoothly, suggesting the existence of locally insensitive regions where the model maintains consistent performance despite parameter disturbances. The tolerance range shown in Figure 6b demonstrates that within a certain variation range of

δ_{2}

, the deviation of the output utility remains within an acceptable threshold. In other words, the model output is insensitive to moderate perturbations of

δ_{2}

, further verifying the robustness of the model.

6. Comparison Experiment

To further illustrate the feasibility and superiority of the method proposed in the article, this paper conducts comparative experiments from two aspects: the comparison of causal inference methods and the health-state assessment methods.

6.1. Comparison of Causal Inference Methods

To justify the use of the CCM method, this section compares it with two mainstream approaches for causal inference in nonlinear dynamic systems: transfer entropy (TE) and nonlinear Granger (NG).

The TF of

X_{i} \to X_{j} (i, j \in [1, 3])

is defined as

T E_{X_{i} \to X_{j}} = \sum p (Y_{i} (t + 1), {Y_{i}}^{(l)}, {Y_{j}}^{(k)}) log \frac{p (Y_{i} (t + 1) |{Y_{i}}^{(l)}, {Y_{j}}^{(k)})}{p (Y_{i} (t + 1) |{Y_{i}}^{(l)})}

(68)

where

{Y_{i}}^{(l)} = (Y_{i} (t), Y_{i} (t - 1), \dots, Y_{i} (t - l + 1))

,

{Y_{j}}^{(k)} = (Y_{j} (t), Y_{j} (t - 1), \dots, Y_{j} (t - k + 1))

represent the output data of the subsystems

X_{i}

and

X_{j}

with lengths l and k, respectively. In this study,

l = k = 1

was adopted to ensure robust probability estimation under limited samples and to provide a fair baseline comparison with CCM. The joint and conditional probabilities were estimated using a histogram-based discretization method. When

T E_{X_{i} \to X_{j}} > 0

occurs, then

X_{i} \to X_{j}

holds.

For the NG method, a restricted model and an unrestricted model were constructed:

Y_{j} (t + 1) = f (Y_{j} (t), Y_{j} (t - 1), \dots, Y_{j} (t - b + 1)) + {ε_{t}}^{(r)}

(69)

Y_{j} (t + 1) = g (Y_{j} (t), Y_{j} (t - 1), \dots, Y_{j} (t - b + 1), Y_{i} (t), Y_{i} (t - 1), \dots, Y_{i} (t - b + 1)) + {ε_{t}}^{(u)}

(70)

where b represents the lag order. In this study, the lag order was set to

b = 2

for all subsystem pairs to balance predictive capability and estimation stability under the limited sample size.

f (\cdot) : R^{p} \to R, g (\cdot) : R^{2 p} \to R

are nonlinear prediction functions.

{ε_{t}}^{(r)}

,

{ε_{t}}^{(u)}

are the residuals of the two models, respectively. In this study, Gaussian-kernel nonlinear regression was used to implement both models.

G C_{X_{i} \to X_{j}} = ln \frac{V a r ({ε_{t}}^{(r)})}{V a r ({ε_{t}}^{(u)})}

(71)

G C_{X_{i} \to X_{j}} > 0

indicates that the addition of the historical information of

X_{i}

reduces the prediction error of

Y_{j} (t + 1)

, implying

X_{i} \to X_{j}

holds. To evaluate statistical significance, a permutation test was further performed by randomly shuffling the source series and recalculating the causality statistic to generate the null distribution.

The causal relationship graph obtained by methods CCM, TF, and NG is shown in Figure 7.

As shown in Figure 7, CCM and TF produce the same causal graph, whereas the NG method misses the edge from

X_{1}

to

X_{3}

. From the PAMD working mechanism, the three subsystems operate in a serial signal-processing chain and are dynamically coupled. In such a nonlinear coupled system, the state information of upstream subsystems can propagate through intermediate subsystems and remain embedded in the dynamics of downstream subsystems. In this sense, the edge

X_{1} \to X_{3}

is physically and dynamically reasonable. In summary, from the perspectives of consistency in results and mechanistic analysis, the causal relationship graph obtained using the CCM and TF methods is more reliable.

The three methods were then compared in terms of computational efficiency and the accuracy of health-state assessments. Computational efficiency is defined as the runtime of each method, recorded over 20 independent runs, with the mean and variance of runtime subsequently calculated. Health-state assessment accuracy is compared using the following process: First, the CHCC is calculated separately for each causal relationship graph derived from each method. Second, the CHCC is input into the ERr-CIC model, and health-state assessment is performed using the underlying indicator monitoring data obtained in Section 5.2, following the same model optimization process. Finally, the RMSE between the model’s output utility and the expected output utility is compared.

The comparison results are shown in Table 4.

The NG method achieved the lowest accuracy in health-state assessment, with an RMSE of 0.078. The primary reason for this may be that discrepancies in the causal relationship graph analysis led to errors in the CHCC calculation, which in turn affected the accuracy of the health-state assessment results. At the same time, the accuracy of the health-state assessment results further demonstrates the rationality of the CCM method and the TF method in analyzing causal relationship graphs. Although CCM and TF methods have the same assessment accuracy, the TF method requires probability density estimation, resulting in significantly lower computational efficiency compared to CCM. In summary, the CCM is suitable for the causal inference of nonlinear dynamical systems such as complex equipment.

6.2. Comparison of Health-State Assessment Methods

Comparison models include: ① CNN-Transformer model [58]. This model uses a CNN network to achieve end-to-end feature extraction between health indicator monitoring data and output utility, and then utilizes the attention mechanism of the Transformer network to deeply capture nonlinear relationships, such as correlations between features, achieving health-state assessment of complex equipment. ② Graph Convolution Network (GCN) model [59]. Compared with traditional deep learning methods, this model can learn from graphs, a non-matrix-based information representation method, which is suitable for capturing the complex causal relationship features from the subsystem causal relationship graph. ③ T-S fuzzy model [60]. The T-S model, as a typical semi-quantitative information method, can effectively utilize quantitative monitoring data and qualitative knowledge and convert them into fuzzy rules to achieve complex equipment health-state assessment. The hyperparameters of the comparison models are shown in Table 5.

To ensure a fair comparison, all models were constructed using the same input-output setting. Specifically, the input of all models consisted of the same six underlying indicators as shown in Table 3, while the output was the same output utility

η (t)

. All models used the same training and validation sets. Since the compared models belong to different methodological categories, their parameters were optimized using the solvers commonly adopted for each type of model. The CNN-Transformer and GCN models employed the Adam optimizer with a learning rate of

1 \times 10^{- 3}

, which is reduced by a factor of 0.5 every 10 epochs. The momentum parameter is set to 0.9, and L2 regularization with a weight decay of

1 \times 10^{- 4}

is applied to prevent overfitting. The T-S fuzzy model used the same SQP optimization algorithm as the ER model, with identical optimization parameter settings: the maximum number of iterations was set to

1 \times 10^{3}

and the optimization terminates when the step size falls below

1 \times 10^{- 6}

. The model convergence curve and training loss rate are shown in Figure 8.

Firstly, the CNN-Transformer model exhibits a rapid decrease in error during the initial training phase, indicating its strong ability in feature extraction and fitting. However, there is a certain degree of oscillation in the middle and later stages, with a more pronounced fluctuation in the loss reduction rate, suggesting that the stability of its optimization process needs improvement. Secondly, the ERr-CIC model converges quickly overall, with a small gap between training and validation errors, demonstrating good generalization ability. Its loss reduction process is relatively smooth, with only slight fluctuations in the early stages, and then quickly stabilizes, indicating that the model optimization process is efficient and stable. For the GCN model, although there is a significant decrease in error during the initial training phase, the overall fluctuation is large, especially in the loss reduction rate curve, which shows frequent oscillations. This suggests that the model is susceptible to gradient fluctuations during the optimization process and has relatively weak stability. Finally, the T-S fuzzy model demonstrates the smoothest and most stable convergence process. Its training and validation errors decrease uniformly and quickly stabilize, with minimal fluctuation in the loss reduction rate, indicating that the model has good robustness and convergence performance during the optimization process. In summary, compared to other models, the T-S fuzzy model performs optimally in terms of convergence speed and stability, while the ERr-CIC model has certain advantages in terms of generalization ability; the CNN-Transformer and GCN models, although possessing strong fitting capabilities, still have room for improvement in training stability.

The comparison of model output performance results is shown in Figure 9a. It can be seen that among the four models, the CNN-Transformer has the highest accuracy, with an RMSE of 0.030. The reason the GCN model, also as a deep learning method, performs poorly in terms of accuracy may be that it does not adequately capture the relationship between causal coupling in system dynamics and statistical correlations, resulting in biased evaluations.

To verify whether the differences in model performance are statistically significant and to minimize performance fluctuations caused by random factors, this paper conducts a Friedman test on the model results. The specific steps are as follows: ① The monitoring data obtained in subsection A are randomly split into training and validation sets over 20 independent trials. ② In each trial, the model uses the same training and validation sets. ③ Record the mean and standard deviation of RMSE for each trial. The distribution of the model results is shown in Figure 10. It can be seen that the ERr-CIC model and the T-S fuzzy model produce more stable results. The statistics of 20 trials for each model are shown in Table 6.

Based on the statistical measures derived from the above results, a Friedman test was conducted. Specifically, the models were first ranked according to their RMSE values from each run, with lower RMSE values corresponding to higher rankings. Subsequently, the average rank for each model was calculated as shown in Equation (67).

{\bar{R}}_{j} = \frac{1}{N} \sum_{i = 1}^{N} r_{i j}

(72)

where

r_{i j}

represents the ranking of model j in its ith run.

{\bar{R}}_{j}

indicates the average rank of the model j. The Friedman statistic is expressed as:

{χ^{2}}_{F} = \frac{12 N}{k (k + 1)} \sum_{j = 1}^{k} {\bar{R}}_{j}^{2} - 3 N (k + 1)

(73)

where

N = 20

denotes the number of runs.

k = 4

represents the number of models.

{χ^{2}}_{F}

approximately follows a chi-squared distribution with

k - 1

degrees of freedom. After calculation,

{χ^{2}}_{F} = 10.8

, there is:

p = P ({χ^{2}}_{3} ⩾ 10.8) = 0.0128

. In summary, the Friedman test indicates that the differences in RMSE among the compared models are statistically significant.

However, model performance cannot be fully assessed based solely on accuracy. Below, we conduct a comprehensive evaluation of each model, including interpretability, stability, efficiency, and accuracy four comparison metrics. ① Model accuracy is determined by the average RMSE of the results from 20 runs. ② Model stability is determined by the variance of the RMSE across 20 model runs. ③ Efficiency is assessed by the average time over 20 runs. ④ Model interpretability is assessed by 5 experts based on the traceability of output, and the comprehensibility of results. Each model was evaluated using the final converged parameters obtained after optimization. The results of the model comparison are shown in Table 7.

To make the comparison results clearer, each result has been mapped to the 0–1 range, as shown in Figure 9b. While ERr-CIC exhibits lower model accuracy compared to CNN-Transformer, it significantly outperforms it in terms of interpretability and efficiency. When making maintenance decisions, methods with high interpretability are preferred. At the same time, although the classic T-S model performs the worst in terms of model accuracy, it has advantages in efficiency and interpretability. Overall, the ERr-CIC model performs relatively evenly in all aspects, making it more suitable for assessing the health state of complex equipment with causality-informed correlation in engineering practice.

7. Conclusions

In this paper, a health-state assessment model for complex equipment is developed based on evidential reasoning rules. Two main problems are addressed. The first is that the health-state assessment model does not consider the causality-informed correlation among health indicators and the fusion order of indicators. The second is the sensitivity analysis of the performance of the health state model. The main innovations can be categorized into the following four points:

Aiming at the problem that current health-state assessment models do not consider the causality-informed correlation of subsystem-level indicators, the CCM is used to determine causal coupling relationships, followed by calculating the CHCC to quantify the magnitude of causality-informed correlation.
To address the problem that current health-state assessment models do not consider the fusion order of indicators, this paper presents a method to determine the fusion order based on signaling sequences.
This study contributes to the development of a health-state assessment model utilizing ERr-CIC, which integrates subsystem causality-informed correlation into the model parameters, providing a possible approach to addressing the complexity of equipment health-state assessments.
The lack of performance analysis of assessment models in current complex equipment health-state assessment methods is addressed. In this paper, the sensitivity of the output results to the CHCC for the proposed ERr-CIC model is analyzed. Based on actual engineering requirements, the tolerance range of the CHCC is further obtained. The sensitivity analysis conclusion is validated through experiments in the case study.

However, the ERr-CIC model still has the following limitations:

The model assumes that the underlying indicators of the subsystems are independent of each other. However, in some cases, correlations may also exist between underlying indicators. Such correlations can lead to redundant health information as well, resulting in biased health-state assessment results. In the future, further research will be conducted on the correlations between underlying indicators of subsystems, and a complete correlation processing system will be established from underlying indicators to the subsystem level.
There are three conditions that need to be met when conducting subsystem causal analysis in the current model. When these conditions are not met, the results of causal analysis may be affected, which in turn affects the accuracy of the CHCC. Extending the causal analysis method of the ERr-CIC model to cyclic causal diagrams to enhance the applicability of the model in different scenarios is one of the important research directions for the future.
The validation in this study is conducted based on a mechanism-driven simulation model. Although the simulation is constructed from system dynamics equations and measurement principles to approximate realistic operating conditions, validation on real-world equipment is still necessary. Future work will focus on applying the proposed method to practical engineering datasets to further demonstrate its applicability and robustness.

Author Contributions

Conceptualization, W.L. and Z.F.; methodology, W.L.; validation, Y.S. and X.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 62573349 and 62203461.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

ERr-CIC	Evidential reasoning rule considering causality-informed correlation
CCM	Convergent cross-mapping
CHCC	Conditionally hybrid correlation coefficient
PAMD	Photoelectric angle measurement device

References

Zhang, A.; Tang, L.; Hu, G. Micro Fault Diagnosis of Driving Motor Bearings Based on Multi-Residual Neural Networks and Evidence Reasoning Rule. Entropy 2026, 28, 53. [Google Scholar] [CrossRef]
Feng, Q.; Liu, Y.; Li, Y.; Chang, G.; Liang, X.; Su, Y.; Cao, G. Study on a Fault Diagnosis Method for Heterogeneous Chiller Units Based on Transfer Learning. Entropy 2025, 27, 1049. [Google Scholar] [CrossRef] [PubMed]
Zhang, Q.; Zhang, Y.; Qin, J.; Duan, J.; Zhou, Y. Dynamic MAML with Efficient Multi-Scale Attention for Cross-Load Few-Shot Bearing Fault Diagnosis. Entropy 2025, 27, 1063. [Google Scholar] [CrossRef] [PubMed]
Yang, R.H.; Gong, X.; Feng, Z.C.; Hao, Y.H. Distributed Fault-Tolerant for Leader-Following Multi-Unmanned Aerial Vehicle Systems with Faulty Sensors based on Belief Rule Base. Eng. Appl. Artif. Intell. 2025, 157, 111388. [Google Scholar] [CrossRef]
Zhou, K.; Zhang, Z.; Xu, H.; Wang, L.; Shi, Y. Reliability Analysis Based on Evidential Likelihood for Uncertain Mixed Weibull Distribution. IEEE Trans. Reliab. 2026, 75, 1020–1034. [Google Scholar] [CrossRef]
Gao, J.; Zhou, K.; Zhu, Y.; Wu, K. Importance Ranking in Complex Networks via Influence-Aware Causal Node Embedding. IEEE Trans. Netw. Sci. Eng. 2026, 13, 6754–6771. [Google Scholar] [CrossRef]
Hu, Y.; Li, H.; Shi, P.; Chai, Z.; Wang, K.; Xie, X.; Chen, Z. A prediction method for the real-time remaining useful life of wind turbine bearings based on the Wiener process. Renew. Energy 2018, 127, 452–460. [Google Scholar] [CrossRef]
Zhang, Y.; Tu, L.; Xue, Z.; Li, S.; Tian, L.; Zheng, X. Weight optimized unscented Kalman filter for degradation trend prediction of lithium-ion battery with error compensation strategy. Energy 2022, 251, 123890. [Google Scholar] [CrossRef]
Jiang, W.; Xu, Y.; Chen, Z.; Zhang, N.; Xue, X.; Zhou, J. Measurement of health evolution tendency for aircraft engine using a data-driven method based on multi-scale series reconstruction and adaptive hybrid model. Measurement 2022, 199, 111502. [Google Scholar] [CrossRef]
Chen, R.; Chen, G.; Xu, X.; Hu, X.; Zhang, Y. Bearing performance degradation trend prediction sparrow search algorithm optimization bidirectional gating cycle unit. J. Vib. Shock. 2023, 42, 12–18. [Google Scholar]
Lian, Z.; Zhou, Z.-J.; Hu, C.-H.; Wang, J.; Zhang, C.-C.; Zhang, C.-L. A health assessment method with attribute importance modeling for complex systems using belief rule base. Reliab. Eng. Syst. Saf. 2024, 251, 110387. [Google Scholar] [CrossRef]
Khashei, M.; Hejazi, S.R.; Bijari, M. A new hybrid artificial neural networks and fuzzy regression model for time series forecasting. Fuzzy Sets Syst. 2008, 159, 769–786. [Google Scholar] [CrossRef]
Yang, J.-B.; Sen, P. A general multi-level evaluation process for hybrid MADM with uncertainty. IEEE Trans. Syst. Man Cybern. 1994, 24, 1458–1473. [Google Scholar] [CrossRef]
Yang, J.-B.; Xu, D.-L. On the Evidential Reasoning Algorithm for Multiple Attribute Decision Analysis Under Uncertainty. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 2002, 32, 289–304. [Google Scholar] [CrossRef]
Yang, J.B.; Wang, Y.M.; Xu, D.L.; Chin, K.S. The evidential reasoning approach for MADA under both probabilistic and fuzzy uncertainties. Eur. J. Oper. Res. 2006, 171, 309–343. [Google Scholar] [CrossRef]
Yang, J.-B.; Xu, D.-L. Evidential reasoning rule for evidence combination. Artif. Intell. 2013, 205, 1–29. [Google Scholar] [CrossRef]
Sun, H.J.; Yang, J.Y. A Method for Combining Correlated Evidence. Chin. J. Comput. 1999, 22, 1004–1007. [Google Scholar]
Ferson, S.; Hajagos, J.; Berleant, D.J.; Zhang, J.; Tucker, W.T.; Ginzburg, L.R.; Oberkampf, W.L. Dependence in Probabilistic Modeling, Dempster-Shafer Theory, and Probability Bounds Analysis; Sandia National Laboratories: Albuquerque, NM, USA, 2004; pp. 1–150. [Google Scholar]
Monney, P.A.; Chan, M. Modeling Dependence in Dempster-Shafer Theory. Int. J. Uncertain. Fuzziness Knowl.-Based Syst. 2007, 15, 93–114. [Google Scholar] [CrossRef]
Xiao, W.; Wang, Z.Y.; Wang, Y.D. A Combination Rule for Correlated Evidence. Control Decis. 2011, 26, 773–776. [Google Scholar]
Wu, Y.G.; Yang, J.Y.; Liu, K. On the Evidence Inference Theory. Inf. Sci. 1996, 89, 245–260. [Google Scholar] [CrossRef]
Luo, Z.Z.; Ye, M. Fusion of Correlated Information Using Evidence Theory. J. Electron. Inf. Technol. 2001, 23, 970–974. [Google Scholar]
Yager, R.R. On the Dempster-Shafer Framework and New Combination Rules. Inf. Sci. 1987, 41, 93–137. [Google Scholar] [CrossRef]
Sun, Y.; Zhou, Z.; Feng, Z.; Hu, C.; Lian, Z. An Evidential Reasoning Rule-Based Performance Evaluation Method for Complex Electronic System Under Off-Cycle Data. IEEE Trans. Aerosp. Electron. Syst. 2026, 62, 1526–1537. [Google Scholar] [CrossRef]
Sun, Y.; Feng, Z.; Zhou, Z. Performance Evaluation Method for Complex Systems Based on Outcome-Oriented Correlated Evidence Reasoning. IEEE Trans. Ind. Electron. 2025, 72, 7926–7936. [Google Scholar] [CrossRef]
Li, G.; Teng, Y.; Ding, S. Complex physical-model based dynamic system safety analysis of Aviation Piston Engine considering hybrid uncertainty of fault. Eng. Fail. Anal. 2023, 152, 107515. [Google Scholar] [CrossRef]
Fu, X.; Yang, M.; Liu, H.; Wang, L.; Li, Q. Risk Analysis and Simulation of Large Bridge Construction Based on System Dynamics. Buildings 2024, 14, 1488. [Google Scholar] [CrossRef]
Kamdem, Y.S. Integrating machine learning with causal inference for enhanced system dynamics modeling: A framework for predicting complex interactions. Int. J. Sci. Res. Arch. 2024, 13, 3160–3167. [Google Scholar] [CrossRef]
Chen, H.; Wang, J.-G.; Ding, P.; Ye, X.-Y.; Yao, Y.; Chen, H.-L. A Granger causality analysis method based on GRBF network. In Proceedings of the 2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS), Xiangtan, China, 12–14 May 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1871–1876. [Google Scholar] [CrossRef]
Zhang, Z.; Wu, L. Graph neural network-based bearing fault diagnosis using Granger causality test. Expert Syst. Appl. 2024, 242, 122827. [Google Scholar] [CrossRef]
Zhang, Y.; Qu, H.; Liu, Y.; Liu, H.; Wang, B. TF-Based Causal Inference for Industrial Alarm Overload Mitigation. Electronics 2025, 14, 4066. [Google Scholar] [CrossRef]
Liu, X.; Liu, J.; Yang, X.; Wu, Z.; Wei, Y.; Xu, Z.; Wen, J. Fault Root Cause Analysis Based on Liang–Kleeman Information Flow and Graphical Lasso. Entropy 2025, 27, 213. [Google Scholar] [CrossRef]
Sharma, S.; Lakshminarayanan, S.; Karimi, I.; Srinivasan, R. Convergent Cross-mapping based Fault Detection and Diagnosis for Non-linear Dynamic Systems. In Proceedings of the 2021 60th Annual Conference of the Society of Instrument and Control Engineers (SICE), Tokyo, Japan, 8–10 September 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 13296–13308. [Google Scholar]
Tian, C.; Zhao, C.; Fan, H.; Zhang, Z. Causal network construction based on convergent cross mapping (CCM) for alarm system root cause tracing of nonlinear industrial process. IFAC-PapersOnLine 2020, 53, 13619–13624. [Google Scholar] [CrossRef]
Liang, Q.; Lin, Q.; Guo, M.; Lu, Q.; Zhang, D. Forecasting Crude Oil Prices: A Gated Recurrent Unit-Based Nonlinear Granger Causality Model. Int. Rev. Financ. Anal. 2025, 102, 104124. [Google Scholar] [CrossRef]
Butler, K.; Feng, G.; Djurić, P.M. On Causal Discovery with Convergent Cross Mapping. IEEE Trans. Signal Process. 2023, 71, 2595–2607. [Google Scholar] [CrossRef]
Cummins, B.; Gedeon, T.; Spendlove, K. On the Efficacy of State Space Reconstruction Methods in Determining Causality. SIAM J. Appl. Dyn. Syst. 2015, 14, 335–381. [Google Scholar] [CrossRef]
Stark, J. Delay Embeddings for Forced Systems. I. Deterministic Forcing. J. Nonlinear Sci. 1999, 9, 255–289. [Google Scholar] [CrossRef]
Wallot, S.; Mønster, D. Calculation of Average Mutual Information (AMI) and False-Nearest Neighbors (FNN) for the Estimation of Embedding Parameters of Multidimensional Time Series in Matlab. Front. Psychol. 2018, 9, 1679. [Google Scholar] [CrossRef]
Zhang, L.; Lin, G.; Wei, L.; Kou, Y. Feature subset selection for multi-scale neighborhood decision information system via mutual information. Artif. Intell. Rev. 2024, 57, 15. [Google Scholar] [CrossRef]
Székely, G.J.; Rizzo, M.L.; Bakirov, N.K. Measuring and testing dependence by correlation of distances. Ann. Stat. 2007, 35, 2769–2794. [Google Scholar] [CrossRef]
He, W.; Liu, L.-C.; Yang, J.-P. Reliability analysis of stiffened tank-roof stability with multiple random variables using minimum distance and lagrange methods. Eng. Fail. Anal. 2013, 32, 304–311. [Google Scholar] [CrossRef]
Zhao, F.-J.; Zhou, Z.-J.; Hu, C.-H.; Chang, L.-L.; Zhou, Z.-G.; Li, G.-L. A new evidential reasoning-based method for online safety assessment of complex systems. IEEE Trans. Syst. Man Cybern. Syst. 2018, 48, 954–966. [Google Scholar] [CrossRef]
Tang, S.-W.; Zhou, Z.-J.; Hu, C.-H.; Zhao, F.-J.; Cao, Y. A New Evidential Reasoning Rule-Based Safety Assessment Method With Sensor Reliability for Complex Systems. IEEE Trans. Cybern. 2022, 52, 4027–4038. [Google Scholar] [CrossRef]
Ning, P.; Zhou, Z.; Cao, Y.; Tang, S.; Wang, J. A Concurrent Fault Diagnosis Model via the Evidential Reasoning Rule. IEEE Trans. Instrum. Meas. 2022, 71, 1–16. [Google Scholar] [CrossRef]
Yang, J.-B.; Xu, D.-L. Inferential modelling and decision making with data. In Proceedings of the 2017 23rd International Conference on Automation and Computing (ICAC), Huddersfield, UK, 7–8 September 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1–6. [Google Scholar] [CrossRef]
Quiroga, R.Q.; Arnhold, J.; Grassberger, P. Learning Driver-Response Relationships from Synchronization Patterns. Phys. Rev. E 2000, 61, 5142–5148. [Google Scholar] [CrossRef] [PubMed]
Wang, Z.; Yang, X.; Cai, H. Analysis of Influencing Factors of Soil Erosion Changes Based on Structural Equation Model. Land 2025, 14, 304. [Google Scholar] [CrossRef]
Liu, J. Measuring principle of electronic theodolite and its applications in spacecraft inspection. Spacecr. Environ. Eng. 2007, 24, 116–120. [Google Scholar]
Mu, Y.; Hou, N.; Wang, C.; Zhao, Y.; Chen, K.; Chi, Y. An Optoelectronic Detector with High Precision for Compact Grating Encoder Application. Electronics 2022, 11, 3486. [Google Scholar] [CrossRef]
Xu, F.; Guo, Y.; Yu, W.; Li, Z.-G.; Yuan, X.-Y. Simulation and Analysis of the Electromechanical Coupling Dynamic Model of Photoelectric Theodolite. Acta Photonica Sin. 2008, 37, 2076–2079. [Google Scholar]
Li, H.; Shen, X.H. Electromechanical Dynamics Modeling and Coupling of Photoelectric Theodolite. Opt. Precis. Eng. 2007, 15, 1577–1582. [Google Scholar]
Li, K.; Yuan, F.; Ding, Z.; Qiu, Z. Vision measurement error compensation research of double-theodolite based on neural network approaching. In Proceedings of the 30th Chinese Control Conference, Yantai, China, 22–24 July 2011; IEEE: Piscataway, NJ, USA, 2011; pp. 2815–2820. [Google Scholar]
Jarvis, J. Calibration of theodolites. In Proceedings of the 1988 IEEE International Conference on Robotics and Automation, Philadelphia, PA, USA, 24–29 April 1988; IEEE: Piscataway, NJ, USA, 1988; Volume 2, pp. 952–954. [Google Scholar]
GB/T 5080.1-2012; Reliability Testing—Part 1: Test Conditions and Statistical Test Principles. Standards Press of China: Beijing, China, 2012.
GB/T 36537-2018; Electronic Theodolite. Standards Press of China: Beijing, China, 2018.
GB/T 37084-2018; General Requirement for Photoelectric Detection Instruments Reliability. Standards Press of China: Beijing, China, 2018.
Junfeng, L.; Xiang, Y.; Haibo, W.; Xiao, L. Rolling bearing fault diagnosis method based on MFMD and Transformer-CNN. J. Aerosp. Power 2023, 38, 1446–1456. [Google Scholar]
Jiang, M.; Liu, G.; Su, Y.; Wu, X. Self-attention empowered graph convolutional network for structure learning and node embedding. Pattern Recognit. 2024, 153, 110537. [Google Scholar] [CrossRef]
Tao, L.; Zhichao, L.; Mingliang, S. Fault diagnosis based on fuzzy Bayesian risk and T-S fuzzy model. Meas. Control Technol. 2019, 38, 7–12. [Google Scholar]

Figure 1. ERr-CIC modeling process.

Figure 2. Causal relationship graph example.

Figure 3. The handling method for Condition ②.

Figure 4. Experimental procedure.

Figure 5. Calculation results of sensitive factors.

Figure 6. CHCC tolerance range analysis.

Figure 7. Causal relationship inference results.

Figure 8. Model training and optimization process. The dashed line in the figure represents the reference line for the variation of the loss reduction rate.

Figure 9. Model comparison.

Figure 10. Distribution of model results from 20 trials. The blue, orange, yellow, and purple dots represent the distribution of RMSE results for CNN Transformer, GCN, T-S fuzzy model, and ERr CIC, respectively.

Table 1. AF, RF metrics test results.

Reconstructed State Space	AF	RF
$M_{1}$	1	1
$M_{2}$	0.98	1
$M_{3}$	0.99	0.98

Table 2. Reference values of health indicators.

Health Indicator	$φ_{1}$	$φ_{2}$	$φ_{3}$
sdm	0.2	0.446	0.5
temp	20 °C	27.5 °C	30 °C
res	0.5 A/W	0.66 A/W	0.7 A/W
lv	100 uv	283.4 uv	325 uv
mp	100	107.171	110
oe	152.588 uv	271.727 uv	305.176 uv

Table 3. Model parameter optimization results.

Health Indicator	$φ_{1}$	$φ_{2}$	$φ_{3}$
sdm	0.215	0.450	0.680
temp	22.160 °C	27.520 °C	30.184 °C
res	0.498 A/W	0.667 A/W	0.741 A/W
lv	100.211 uv	283.423 uv	326.168 uv
mp	102.175	117.178	125.641
oe	152.591 uv	275.610 uv	308.200 uv
CHCC	0.983	0.506	0.361
utility value	1	0.505	0.169

Table 4. Comparison of computational efficiency and health-state assessment accuracy among causal inference methods.

Causal Inference Method	Average of Runtime	Variance of Runtime	Health-State Assessment Accuracy
CCM	0.84 s	0.05 s	0.032
TF	4.73 s	0.21 s	0.032
NG	1.92 s	0.14 s	0.078

Table 5. Model hyperparameters.

Model Hyperparameters	CNN-Transformer	GCN	T-S Fuzzy Model
input dimension	3	3	/
output dimension	1	1	/
number of attention heads	8	/	/
normalization method	/	symmetric normalization	/
batch size	16	16	/
number of rules	/	/	9
membership function	/	/	Gaussian membership function
de-blurring methods	/	/	weighted average

Table 6. Statistics of model results from 20 runs.

Model	Mean RMSE	Std	95% CI	Average Rank
CNN-Transformer	0.0304	0.0085	[0.0289, 0.0319]	1.9
GCN	0.0457	0.0098	[0.0447, 0.0467]	2.8
T-S fuzzy model	0.0692	0.0018	[0.0675, 0.0709]	3.1
ERr-CIC	0.0321	0.0045	[0.0314, 0.0328]	2.2

Note: Std represents the standard deviation of RMSE across 20 independent runs. The

95 %

CI denotes the confidence intervals of RMSE and was calculated based on the t-distribution.

Table 7. Comparison metrics result.

Comparison Metrics	CNN-Transformer	GCN	T-S Fuzzy Model	ERr-CIC
Accuracy	0.030	0.045	0.069	0.032
Stability	0.0085	0.0098	0.0018	0.0045
Efficiency	3.6 s	5.4 s	0.8 s	1.2 s
Interpretability	0.2	0.4	0.8	0.9

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, W.; Feng, Z.; Sun, Y.; Zhang, X. A Causality-Informed Correlation-Aware Health-State Assessment for Complex Equipment. Entropy 2026, 28, 533. https://doi.org/10.3390/e28050533

AMA Style

Li W, Feng Z, Sun Y, Zhang X. A Causality-Informed Correlation-Aware Health-State Assessment for Complex Equipment. Entropy. 2026; 28(5):533. https://doi.org/10.3390/e28050533

Chicago/Turabian Style

Li, Wenbo, Zhichao Feng, Yijie Sun, and Xinyi Zhang. 2026. "A Causality-Informed Correlation-Aware Health-State Assessment for Complex Equipment" Entropy 28, no. 5: 533. https://doi.org/10.3390/e28050533

APA Style

Li, W., Feng, Z., Sun, Y., & Zhang, X. (2026). A Causality-Informed Correlation-Aware Health-State Assessment for Complex Equipment. Entropy, 28(5), 533. https://doi.org/10.3390/e28050533

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Causality-Informed Correlation-Aware Health-State Assessment for Complex Equipment

Abstract

1. Introduction

2. Problem Formulation

3. The Modeling Process

3.1. Causality-Informed Correlation Among Subsystem-Level Health Indicators

3.2. Convergent Cross-Mapping for Causal Relationships Inference

3.3. Calculation of Conditionally Hybrid Correlation Coefficients

3.4. Calculation of Other Parameters

3.5. Evidence Fusion Process

3.6. Optimization of Model Parameters

4. Sensitivity and Robustness Analysis

4.1. Sensitivity Analysis

4.2. Robustness Analysis

5. Case Study

5.1. Experiment Setting

5.2. Causal Relationships Inference

5.3. Health-State Assessment

5.4. Numerical Validation of Model Performance

6. Comparison Experiment

6.1. Comparison of Causal Inference Methods

6.2. Comparison of Health-State Assessment Methods

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI