Simultaneous-Fault Diagnosis of Satellite Power System Based on Fuzzy Neighborhood ζ-Decision-Theoretic Rough Set

Laifa Tao; Chao Wang; Yuan Jia; Ruzhi Zhou; Tong Zhang; Yiling Chen; Chen Lu; Mingliang Suo

doi:10.3390/math10193414

¹ Institute of Reliability Engineering, Beihang University, Beijing 100191, China

² Science & Technology on Reliability & Environmental Engineering Laboratory, Beijing 100191, China

³ School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China

⁴ Beijing Institute of Radio Metrology and Measurement, China Aerospace Science and Industry Corporation Limited, Beijing 100039, China

Mathematics 2022, 10(19), 3414; https://doi.org/10.3390/math10193414

This article belongs to the Special Issue Mathematical Problems in Aerospace


Abstract

Due to the increasing complexity of the entire satellite system and the deteriorating orbital environment, multiple independent single faults may occur simultaneously in the satellite power system. However, two stumbling blocks hinder the effective diagnosis of simultaneous faults: the difficulty of obtaining simultaneous-fault data and the extremely complicated mapping from simultaneous-fault modes to sensor data. To tackle these challenges, a fault diagnosis strategy based on a novel rough set model is proposed. Specifically, a rough set model named FNζDTRS, which introduces a concise loss function matrix and a fuzzy neighborhood relationship, is proposed to accurately mine and characterize the relationship between faults and data. Furthermore, an attribute rule-based fault matching strategy is designed that does not use simultaneous-fault data as training samples. The numerical experiments demonstrate the effectiveness of the FNζDTRS model, and the diagnosis experiments performed on a satellite power system illustrate the superiority of the proposed approach.

Keywords: simultaneous-fault diagnosis; rough set; attribute reduction; satellite power system

MSC: 94C12

1. Introduction

The power system is regarded as the heart of a satellite, and its health management is critical to the on-orbit operation of the entire satellite. The satellite power system is mainly composed of a solar array and a battery pack. The solar array is exposed to the outer space environment for a long time and is very vulnerable to external environmental intrusion. The battery pack works frequently and over long periods with the periodic operation of the satellite. Therefore, with the increasing probability of space junk collisions, intense radiation from space particles, and striking temperature differences in space, the satellite power system may have multiple independent single faults occurring at the same time, which is called a simultaneous-fault [1]. Accurate fault diagnosis is the basis for the health management of a satellite. At present, research on single-fault diagnosis has achieved great success [2,3]. However, as satellites become more complex in their functional composition and longer in their mission time, the simultaneous-fault mode has become a key factor affecting the normal on-orbit operation of satellites, and the risk and influence of such a fault mode cannot be ignored, because it develops far faster and is far more destructive than a single-fault mode. Therefore, it is necessary to diagnose simultaneous-faults precisely in order to make sound decisions that enable satellites to perform their missions smoothly and safely. This is the core motivation of our work: we try to solve the diagnosis problem of simultaneous-fault, which is more complex and more harmful than that of single-fault.

With respect to the diagnosis of simultaneous-fault, there are two major challenges that can be listed as follows: (1) The historical simultaneous-fault data are scarce, which greatly limits the effectiveness of data-driven models; (2) The simultaneous-faults would involve multiple sensors, and the mapping between sensor data and fault modes is complicated, which leads to the uncertainty in the diagnosis process. Therefore, new cognitive methods and further research are needed for simultaneous-fault cognition and diagnosis. These can be considered as the technical motivation of our research.

Regarding the first challenge, the absence of historical simultaneous-fault data is a thorny problem that needs to be solved urgently. Unlike traditional fault diagnosis studies that require all kinds of samples during the training phase [4,5,6], some literature has shown that multi-label classification is expected to achieve simultaneous-fault diagnosis without historical simultaneous-fault data [1,7,8,9,10]. The multi-label classification task focuses on the problem where each training sample is represented by a single instance with a single label, and the task is to yield a model that can predict the proper label sets for unseen instances [11]. Multi-label classification methods can be divided into two categories: problem transformation methods, including Binary Relevance [12], Classifier Chains [13], Calibrated Label Ranking [14], and other classical methods; and algorithm adaptation methods, including multi-label K-nearest neighbor (ML-KNN) [15], multi-label decision tree (ML-DT) [16], etc. However, in the face of complex problems, the above methods cannot effectively deal with the problem of insufficient data, and there is still a need for long-term and in-depth research.

For the second challenge, some data mining methods are good solutions. In terms of the cognition of things, rough set theory provides a perspective of knowledge and data fusion. This is the main reason why this paper chooses the rough set model as the basic model. The setting of condition attributes and decision attributes can provide multiple kinds of information for the characterization of faults, which is conducive to extracting the mapping information between sensor data and fault modes. Rough set theory, initiated by Pawlak [17], provides an authoritative mathematical framework for analyzing and handling ambiguous and uncertain data, which can be used for attribute reduction [18,19,20,21,22], rule extraction [23,24,25,26], and uncertainty reasoning [22,27,28,29]. Among the various rough set models, the decision-theoretic rough set (DTRS) model has been proved to be a generalization of many other rough set models [30,31]. At present, there have been related studies on various decision-theoretic rough set models for fault diagnosis, which have shown that the models can effectively select the fault attributes when the pair of threshold parameters is set appropriately [30,31]. Nevertheless, how to determine the appropriate threshold parameters is the biggest difficulty in the research and application of DTRS. In our previous work [32], we presented a single-parameter decision-theoretic rough set (SPDTRS) model that sets only one parameter, named the compensation coefficient, rather than two or six, which facilitates the convenient application of the DTRS model. However, the setting of the compensation coefficient in this model is still not clear enough, and the setting of the loss function matrix is defective. In addition, this model does not consider uncertain information in the data description, which makes it unable to deal with continuous data directly. Therefore, in order to make the rough set model (i.e., SPDTRS) more effective in dealing with the simultaneous-fault problem, we need to carry out more targeted improvement work. The details are as follows.

Motivated by the analyses mentioned above, in this work, we propose a fault matching strategy for simultaneous-fault diagnosis based on a revised DTRS named fuzzy neighborhood ζ-decision-theoretic rough set model (FNζDTRS). Since there is a coupling relationship of fault characteristics between a single-fault and its associated simultaneous-fault, this paper proposes the fault matching strategy based on this principle. The main idea of the proposed strategy is that when an unknown simultaneous-fault occurs, its fault attributes are first selected by the FNζDTRS and then classified according to the correlation between the obtained fault attributes and the fault attributes of each single-fault selected by the FNζDTRS model beforehand. Therefore, the main novelties and contributions of this study can be listed as follows.

(1): A novel and concise data-driven loss function matrix is designed for DTRS.
(2): A fuzzy neighborhood ζ-decision-theoretic rough set model is proposed with the help of the fuzzy neighborhood relationship and the proposed loss function matrix, which can deal with hybrid data common in engineering.
(3): The proposed FNζDTRS model, used for attribute reduction, has a significant advantage in classification accuracy compared with other existing rough sets. This proves that it is more suitable for real fault diagnosis.
(4): A diagnosis strategy of simultaneous-fault is put forward based on a coupling mapping relationship between single-fault and its associated simultaneous-fault. This ensures that our strategy can handle both single-fault and simultaneous-fault.
(5): The proposed strategy is successfully applied to the simultaneous-fault diagnosis of the satellite power system and only requires single-fault samples in the training phase, which is highly feasible for practical applications.

The remainder of this paper is organized as follows. Section 2 reviews some preliminaries and related work. Section 3 presents the FNζDTRS model, and Section 4 presents the basic framework of simultaneous-fault diagnosis. The effectiveness and superiority of the FNζDTRS model are verified through numerical experiments in Section 5 and further demonstrated by a comparative analysis with several baseline algorithms for simultaneous-fault diagnosis in Section 6. The paper closes with the main conclusions in Section 7.

2. Preliminaries and Related Work

This section reviews some notions about rough sets that are relevant to the development of our theory.

Definition 1.

(Decision system) A decision system can be described by a binary group $DS = (U, C \cup D)$, where $U = \{x_{1}, x_{2}, \dots, x_{m}\}$ is called the universe, which is a finite and nonempty set; $D$ is the set of decision attributes, which is nonempty; and $C$ is the collection of conditional attributes, with $C \cap D = \emptyset$ and $D \neq \emptyset$ [31]. The relationship between the elements of a decision system can be represented as shown in Figure 1. To better understand the above definition, we describe these elements in terms of the fault diagnosis problem: $C$ represents the parameters output by the sensors or the extracted feature attributes, $D$ represents the category of the failure mode, and $U$ denotes the collected data.

Figure 1. The illustration of a decision system.

The decision-theoretic rough set (DTRS) presented by Yao et al. [33] provides a concise semantic interpretation through a loss function matrix. The loss function matrix is described in Table 1.

Table 1. The detailed information of a loss function matrix.

Consider that $X$ is the subset of samples with the same label $d_{k}$. The state $Q$ indicates that a sample $x$ is in $X$, and the state $Q^{c}$ indicates that $x$ is not in $X$. The actions $a_{P}$, $a_{B}$, and $a_{N}$ classify $x$ into one of three regions: $x \in \mathrm{POS}(X)$, $x \in \mathrm{BND}(X)$, or $x \in \mathrm{NEG}(X)$. $\mathrm{POS}(X)$ denotes the acceptance of the event $x \in X$; $\mathrm{BND}(X)$ denotes the deferment of, or non-commitment to, the event $x \in X$; and $\mathrm{NEG}(X)$ denotes the rejection of $x \in X$. Furthermore, $λ_{\bullet P}$ denotes the loss caused by taking the actions ($a_{P}$, $a_{B}$, $a_{N}$) when $x \in X$, and $λ_{\bullet N}$ is the loss caused by taking the actions ($a_{P}$, $a_{B}$, $a_{N}$) when $x \notin X$.

Consider the following scenario: the loss of deferring a decision is no less than that of taking the correct action, and both are less than the loss of taking the wrong action. The DTRS model therefore makes the reasonable assumption $0 \leq λ_{PP} \leq λ_{BP} < λ_{NP}$ and $0 \leq λ_{NN} \leq λ_{BN} < λ_{PN}$, which is the basis for generating this rough set model.

Thanks to the above assumption, a pair of threshold parameters is used to define the positive region $\mathrm{POS}(X)$, the boundary region $\mathrm{BND}(X)$, and the negative region $\mathrm{NEG}(X)$ to construct the DTRS model, which is guided by the Bayesian risk minimization principle and the three-way decision theory. Thus, we have the form of a DTRS model as follows [31]:

POS_{(α, β)} (X) = \{x \in U | P (X | [x]) \geq α\},

(1)

BND_{(α, β)} (X) = \{x \in U | β < P (X | [x]) < α\},

(2)

NEG_{(α, β)} (X) = \{x \in U | P (X | [x]) \leq β\} .

(3)
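
As an illustration of Equations (1)–(3), the following minimal sketch assigns samples to the three regions once their conditional probabilities and a threshold pair are known. The function name `three_way_regions` and the example values are assumptions made for illustration only.

```python
from typing import Dict, List, Sequence


def three_way_regions(
    probs: Sequence[float], alpha: float, beta: float
) -> Dict[str, List[int]]:
    """Split sample indices into POS, BND and NEG following Equations (1)-(3).

    probs[i] plays the role of P(X | [x_i]); alpha and beta are the thresholds.
    """
    if not 0.0 <= beta <= alpha <= 1.0:
        raise ValueError("expected 0 <= beta <= alpha <= 1")
    regions: Dict[str, List[int]] = {"POS": [], "BND": [], "NEG": []}
    for i, p in enumerate(probs):
        if p >= alpha:        # Equation (1): accept x in X
            regions["POS"].append(i)
        elif p <= beta:       # Equation (3): reject x in X
            regions["NEG"].append(i)
        else:                 # Equation (2): defer the decision
            regions["BND"].append(i)
    return regions


# Hypothetical probabilities for five samples.
example = three_way_regions([0.9, 0.5, 0.1, 0.6, 0.2], alpha=0.6, beta=0.2)
```

With these example values, samples 0 and 3 fall in the positive region, sample 1 in the boundary region, and samples 2 and 4 in the negative region.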

The following equations represent the relationship between the two threshold parameters and the six loss functions:

α = \frac{(λ_{P N} - λ_{B N})}{(λ_{P N} - λ_{B N}) + (λ_{B P} - λ_{P P})},

(4)

β = \frac{(λ_{B N} - λ_{N N})}{(λ_{B N} - λ_{N N}) + (λ_{N P} - λ_{B P})} .

(5)
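
Equations (4) and (5) can be evaluated directly from the six entries of the loss function matrix. The sketch below does this under the ordering assumption stated above; the function name `dtrs_thresholds` and the numeric losses are hypothetical and chosen only to make the arithmetic easy to follow.

```python
from typing import Tuple


def dtrs_thresholds(
    l_pp: float, l_bp: float, l_np: float,
    l_nn: float, l_bn: float, l_pn: float,
) -> Tuple[float, float]:
    """Return (alpha, beta) from the six losses via Equations (4) and (5)."""
    # Assumption of the DTRS model: correct <= deferred < wrong, in both states.
    if not (0 <= l_pp <= l_bp < l_np and 0 <= l_nn <= l_bn < l_pn):
        raise ValueError("losses violate the DTRS ordering assumption")
    alpha = (l_pn - l_bn) / ((l_pn - l_bn) + (l_bp - l_pp))  # Equation (4)
    beta = (l_bn - l_nn) / ((l_bn - l_nn) + (l_np - l_bp))   # Equation (5)
    return alpha, beta


# Hypothetical loss values.
alpha, beta = dtrs_thresholds(l_pp=0.0, l_bp=2.0, l_np=6.0,
                              l_nn=0.0, l_bn=1.0, l_pn=4.0)
```

For these values, alpha = 3 / (3 + 2) = 0.6 and beta = 1 / (1 + 4) = 0.2.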

The key parts of the DTRS model are the loss functions or the threshold parameters $(α, β)$. To study and employ the DTRS model, an important issue is how to determine these parameters. Inspired by the idea of being data-driven, our previous work proposed a single-parameter decision-theoretic rough set (SPDTRS) model [32] that simplifies the traditional DTRS model. Specifically, the model requires only one parameter to be preset rather than the pair $(α, β)$ or the six parameters in the loss function matrix. However, the two truncation functions employed in the model calculation make the model relatively complex. Moreover, the interpretability of the loss function matrix in the SPDTRS model is slightly insufficient. These two disadvantages are the focus of this paper in proposing a new rough set model.

3. Fuzzy Neighborhood ζ-Decision-Theoretic Rough Set

3.1. Granular Computing Based on Fuzzy Neighborhood Relationship

In order to be more applicable to practical problems, rough set models need to be able to handle a hybrid dataset, including continuous and discrete data. Fuzzy relationship and neighborhood relationship are two effective means to deal with the spatial relationship of samples. Their combined form is used by a variety of models [34]. The fuzzy neighborhood relationship can analyze the relationship between the entities in the decision system more precisely. Therefore, to overcome the inability of the SPDTRS model to handle the hybrid dataset, we introduce this fuzzy neighborhood relationship.

Definition 2.

(Fuzzy neighborhood relationship) Given a decision system $DS = (U, C \cup D)$, for an arbitrary sample $x \in U$, the fuzzy neighborhood subset of $x$ is defined as:

{[x]}^{δ} = \{y \in U | r (x, y) \geq δ\},

(6)

where $δ$ is the fuzzy neighborhood radius, with $0 \leq δ \leq 1$. If $x$ and $y$ are continuous data, we have $r(x, y) = 1 - \frac{1}{n} \sqrt{\sum_{i = 1}^{n} {(x_{i} - y_{i})}^{2}}$, whereas if $x$ and $y$ are discrete data, we have

r (x, y) = \begin{cases} 1, & \text{if } x_{i} = y_{i} \\ 0, & \text{if } x_{i} \neq y_{i} \end{cases}

(7)

Thus, the fuzzy neighborhood subset is also called an equivalence class. On this basis, the fuzzy conditional probability of $x$ can be described as:

\tilde{P} (X | {[x]}^{δ}) = \frac{\sum \{r (x, y) | y \in (X \cap {[x]}^{δ})\}}{\sum \{r (x, z) | z \in {[x]}^{δ}\}},

(8)

where $X$ is the subset of samples with the same label $d_{k}$, and $\sum \{\cdot\}$ represents the sum of all elements in its set. Under the assumption ${[x]}^{δ} \cap X \neq \emptyset$, we have $0 < \tilde{P}(X | {[x]}^{δ}) \leq 1$, with $\tilde{P}(X | {[x]}^{δ}) = 1$ if and only if ${[x]}^{δ} \subseteq X$.
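
Definition 2 and Equation (8) can be sketched for continuous attributes as follows. This is a minimal illustration, not the authors' implementation: the helper names, the toy data, and the choice δ = 0.85 are assumptions, and the attributes are taken to be already scaled to [0, 1]. Discrete attributes (Equation (7)) are not covered.

```python
import math
from typing import List, Sequence


def similarity(x: Sequence[float], y: Sequence[float]) -> float:
    """r(x, y) for continuous attributes, as given in Definition 2."""
    n = len(x)
    return 1.0 - math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y))) / n


def fuzzy_neighborhood(data: List[Sequence[float]], i: int, delta: float) -> List[int]:
    """Indices of samples y with r(x_i, y) >= delta (Equation (6))."""
    return [j for j in range(len(data)) if similarity(data[i], data[j]) >= delta]


def fuzzy_conditional_probability(
    data: List[Sequence[float]], labels: Sequence[int],
    i: int, target: int, delta: float,
) -> float:
    """Equation (8): similarity mass of neighbors inside X over all neighbors."""
    neigh = fuzzy_neighborhood(data, i, delta)
    total = sum(similarity(data[i], data[j]) for j in neigh)  # >= 1, x_i is its own neighbor
    inside = sum(similarity(data[i], data[j]) for j in neigh if labels[j] == target)
    return inside / total


# Toy data: four samples, two attributes, labels 0 or 1 (hypothetical values).
data = [[0.0, 0.0], [0.1, 0.0], [0.9, 1.0], [0.2, 0.0]]
labels = [0, 0, 1, 1]
neigh = fuzzy_neighborhood(data, 0, delta=0.85)
p = fuzzy_conditional_probability(data, labels, 0, target=0, delta=0.85)
```

Here the neighborhood of the first sample contains samples 0, 1 and 3 with similarities 1, 0.95 and 0.9, so the fuzzy conditional probability of label 0 is (1 + 0.95) / (1 + 0.95 + 0.9).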

3.2. Determination of the Two Threshold Parameters

Considering the disadvantage of the SPDTRS model, a new loss function matrix is proposed under the fuzzy neighborhood relationship. Its concise loss function relationship avoids introducing the truncation functions, so the new model avoids the discussion of multiple situations and reduces the computational complexity relative to the SPDTRS model.

In the new loss function matrix, the data-driven loss functions under the fuzzy neighborhood relationship are shown in Table 2. Here, $\tilde{P}(X | {[x]}^{δ})$ is the fuzzy neighborhood conditional probability, which can be calculated by Equation (8); $ζ$ is the compensation coefficient, with $0 \leq ζ < 1$; and $\tilde{S}(X | {[x]}^{δ})$ and ${\tilde{S}}^{c}(X | {[x]}^{δ})$ are the significance coefficients, which can be described as follows:

\tilde{S} (X | {[x]}^{δ}) = \frac{\sum \{\tilde{P} (X | {[y]}^{δ}) | y \in (X \cap {[x]}^{δ})\}}{\sum \{\tilde{P} (X | {[z]}^{δ}) | z \in X\}},

(9)

{\tilde{S}}^{c} (X | {[x]}^{δ}) = \frac{\sum \{\tilde{P} (X^{c} | {[y]}^{δ}) | y \in (X^{c} \cap {[x]}^{δ})\}}{\sum \{\tilde{P} (X | {[z]}^{δ}) | z \in X\}} .

(10)

where $\sum \{\cdot\}$ represents the sum of all elements in its set.

Table 2. The fuzzy neighborhood data-driven loss function matrix.
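
The body of Table 2 is not reproduced in this text. The entries below are a reconstruction from the loss values that appear in Equations (11) and (12) and in the proof of Theorem 1; the row and column layout is an assumption that mirrors the usual action-by-state arrangement of a DTRS loss matrix.

```latex
\begin{array}{c|cc}
 & Q \; (x \in X) & Q^{c} \; (x \notin X) \\ \hline
a_{P} & λ_{PP} = 0
      & λ_{PN} = {\tilde{S}}^{c}(X | {[x]}^{δ}) \\
a_{B} & λ_{BP} = \tilde{S}(X | {[x]}^{δ}) \, \tilde{P}(X | {[x]}^{δ}) \, ζ
      & λ_{BN} = {\tilde{S}}^{c}(X | {[x]}^{δ}) \, (1 - \tilde{P}(X | {[x]}^{δ})) \, ζ \\
a_{N} & λ_{NP} = \tilde{S}(X | {[x]}^{δ})
      & λ_{NN} = 0
\end{array}
```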

Under the assumption that the equivalence class satisfies ${[x]}^{δ} \cap X \neq \emptyset$, the relationships $\tilde{S}(X | {[x]}^{δ}) > 0$ and ${\tilde{S}}^{c}(X | {[x]}^{δ}) \geq 0$ hold, and ${\tilde{S}}^{c}(X | {[x]}^{δ}) = 0$ if and only if ${[x]}^{δ} \subseteq X$.

Subsequently, we can derive the pair of threshold parameters from the fuzzy neighborhood data-driven loss function matrix as follows. For the sake of convenience, we abbreviate $\tilde{S}(X | {[x]}^{δ}) = S$, ${\tilde{S}}^{c}(X | {[x]}^{δ}) = S^{c}$ (typeset as $S^{C}$ in the displayed equations), and $\tilde{P}(X | {[x]}^{δ}) = P$.

\begin{aligned} α^{fn} &= \frac{(λ_{PN} - λ_{BN})}{(λ_{PN} - λ_{BN}) + (λ_{BP} - λ_{PP})} \\ &= \frac{(S^{C} - S^{C} (1 - P) ζ)}{(S^{C} - S^{C} (1 - P) ζ) + (S P ζ - 0)} \\ &= \frac{S^{C} (1 - ζ + P ζ)}{S^{C} (1 - ζ + P ζ) + S P ζ} \end{aligned},

(11)

\begin{aligned} β^{fn} &= \frac{(λ_{BN} - λ_{NN})}{(λ_{BN} - λ_{NN}) + (λ_{NP} - λ_{BP})} \\ &= \frac{(S^{C} (1 - P) ζ - 0)}{(S^{C} (1 - P) ζ - 0) + (S - S P ζ)} \\ &= \frac{S^{C} (1 - P) ζ}{S^{C} (1 - P) ζ + S (1 - P ζ)} \end{aligned} .

(12)

Subsequently, we can set up three-way decision rules as follows:

Rule (P): Decide $x \in \mathrm{POS}(X)$ if $\tilde{P}(X | {[x]}^{δ}) \geq α^{fn}$;

Rule (B): Decide $x \in \mathrm{BND}(X)$ if $β^{fn} < \tilde{P}(X | {[x]}^{δ}) < α^{fn}$;

Rule (N): Decide $x \in \mathrm{NEG}(X)$ if $\tilde{P}(X | {[x]}^{δ}) \leq β^{fn}$.

Because the thresholds in Equations (11) and (12) themselves depend on $P$, substituting them into the decision rules gives quadratic inequalities in $P$. The roots that define $α^{fn}$ and $β^{fn}$ are described in the following two cases.

Case 1: $0 < ζ < 1$

From Rule (P), we obtain

P \leq \frac{(2 ζ S^{C} - S^{C}) - \sqrt{S^{C} (- 4 ζ^{2} S + 4 ζ S + S^{C})}}{2 ζ (S + S^{C})} \quad \text{or} \quad P \geq \frac{(2 ζ S^{C} - S^{C}) + \sqrt{S^{C} (- 4 ζ^{2} S + 4 ζ S + S^{C})}}{2 ζ (S + S^{C})} .

(13)

From Rule (N), we obtain

P \leq \frac{(2 ζ S^{C} + S) - \sqrt{S (- 4 ζ^{2} S^{C} + 4 ζ S^{C} + S)}}{2 ζ (S + S^{C})} \quad \text{or} \quad P \geq \frac{(2 ζ S^{C} + S) + \sqrt{S (- 4 ζ^{2} S^{C} + 4 ζ S^{C} + S)}}{2 ζ (S + S^{C})} .

(14)

From these results, only two roots can be accepted because of the relationship $0 < P \leq 1$. Thus, we can rewrite these two roots as:

α_{1}^{fn} = \frac{(2 ζ S^{C} - S^{C}) + \sqrt{S^{C} (- 4 ζ^{2} S + 4 ζ S + S^{C})}}{2 ζ (S + S^{C})},

(15)

β_{1}^{fn} = \frac{(2 ζ S^{C} + S) - \sqrt{S (- 4 ζ^{2} S^{C} + 4 ζ S^{C} + S)}}{2 ζ (S + S^{C})} .

(16)
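
The intermediate algebra behind Equations (13)–(16) is not spelled out above. A sketch of it, assuming $S > 0$, $P > 0$ and $0 < ζ < 1$ so that the denominators in Equations (11) and (12) are positive, is:

```latex
\begin{aligned}
P \geq α^{fn}
  &\iff P \left[ S^{C} (1 - ζ + P ζ) + S P ζ \right] \geq S^{C} (1 - ζ + P ζ) \\
  &\iff ζ (S + S^{C}) P^{2} + (1 - 2 ζ) S^{C} P - (1 - ζ) S^{C} \geq 0 , \\[4pt]
P \leq β^{fn}
  &\iff P \left[ S^{C} (1 - P) ζ + S (1 - P ζ) \right] \leq S^{C} (1 - P) ζ \\
  &\iff ζ (S + S^{C}) P^{2} - (2 ζ S^{C} + S) P + ζ S^{C} \geq 0 .
\end{aligned}
```

Applying the quadratic formula to the two final inequalities gives the discriminants $S^{C} (- 4 ζ^{2} S + 4 ζ S + S^{C})$ and $S (- 4 ζ^{2} S^{C} + 4 ζ S^{C} + S)$, which are the expressions under the square roots in Equations (13) and (14).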

Case 2: $ζ = 0$

The values of these two parameters are as follows:

α_{2}^{fn} = 1,

(17)

β_{2}^{fn} = 0 .

(18)

The model arising from this case corresponds to the Pawlak model. Both boundary loss functions are equal to 0, so the model clearly exhibits a two-way decision-making characteristic. Thus, the Pawlak model is a special case of our model, and $ζ = 0$ is a necessary but not sufficient condition for it.

Theorem 1.

For the fuzzy neighborhood data-driven loss function matrix, assume that the equivalence class satisfies ${[x]}^{δ} \cap X \neq \emptyset$. When ${[x]}^{δ} ⊈ X$ holds, namely, the concerned equivalence class is not a consistent class, we have:

(a₁) $λ_{PP} \leq λ_{BP} < λ_{NP}$,

(a₂) $λ_{NN} \leq λ_{BN} < λ_{PN}$.

When ${[x]}^{δ} \subseteq X$ holds, i.e., the concerned equivalence class is a consistent class, we have:

(b₁) $λ_{PP} \leq λ_{BP} < λ_{NP}$,

(b₂) $λ_{NN} = λ_{BN} = λ_{PN} = 0$.

Proof.

(a₁) If ${[x]}^{δ} ⊈ X$, then $\tilde{S}(X | {[x]}^{δ}) > 0$ and $0 < \tilde{P}(X | {[x]}^{δ}) < 1$. Since $0 \leq ζ < 1$ and $λ_{BP} = \tilde{S}(X | {[x]}^{δ}) \tilde{P}(X | {[x]}^{δ}) ζ$, we have $0 \leq λ_{BP} < \tilde{S}(X | {[x]}^{δ})$. Because $λ_{PP} = 0$ and $λ_{NP} = \tilde{S}(X | {[x]}^{δ})$, it follows that $λ_{PP} \leq λ_{BP} < λ_{NP}$.

(a₂) If ${[x]}^{δ} ⊈ X$, then ${\tilde{S}}^{c}(X | {[x]}^{δ}) > 0$ and $0 < \tilde{P}(X | {[x]}^{δ}) < 1$. Since $0 \leq ζ < 1$ and $λ_{BN} = {\tilde{S}}^{c}(X | {[x]}^{δ}) (1 - \tilde{P}(X | {[x]}^{δ})) ζ$, we have $0 \leq λ_{BN} < {\tilde{S}}^{c}(X | {[x]}^{δ})$. Because $λ_{NN} = 0$ and $λ_{PN} = {\tilde{S}}^{c}(X | {[x]}^{δ})$, it follows that $λ_{NN} \leq λ_{BN} < λ_{PN}$.

(b₁) If ${[x]}^{δ} \subseteq X$, then $\tilde{S}(X | {[x]}^{δ}) > 0$ and $\tilde{P}(X | {[x]}^{δ}) = 1$. Since $0 \leq ζ < 1$ and $λ_{BP} = \tilde{S}(X | {[x]}^{δ}) \tilde{P}(X | {[x]}^{δ}) ζ$, we have $0 \leq λ_{BP} < \tilde{S}(X | {[x]}^{δ})$. Because $λ_{PP} = 0$ and $λ_{NP} = \tilde{S}(X | {[x]}^{δ})$, it follows that $λ_{PP} \leq λ_{BP} < λ_{NP}$.

(b₂) If ${[x]}^{δ} \subseteq X$, then ${\tilde{S}}^{c}(X | {[x]}^{δ}) = 0$ and $\tilde{P}(X | {[x]}^{δ}) = 1$. Since $0 \leq ζ < 1$ and $λ_{BN} = {\tilde{S}}^{c}(X | {[x]}^{δ}) (1 - \tilde{P}(X | {[x]}^{δ})) ζ$, we have $λ_{BN} = 0$. Because $λ_{NN} = 0$ and $λ_{PN} = {\tilde{S}}^{c}(X | {[x]}^{δ})$, it follows that $λ_{NN} = λ_{BN} = λ_{PN} = 0$. □

3.3. Establishment of FNζDTRS

Following the reasoning in Section 3.2, the two threshold functions can be expressed as $α^{fn} = f(S, S^{c}, ζ)$ and $β^{fn} = f(S, S^{c}, ζ)$ under the above two cases, where $α^{fn} \in \{α_{1}^{fn}, α_{2}^{fn}\}$ and $β^{fn} \in \{β_{1}^{fn}, β_{2}^{fn}\}$. Based on Equations (9) and (10), it is easy to obtain $S$ and $S^{c}$ by fixing the parameter $δ$ and analyzing the distribution information of the original data. In summary, the two threshold parameters can be written as $α^{fn} = f(δ, ζ)$ and $β^{fn} = f(δ, ζ)$. The rough set model below, which includes the parameters $δ$ and $ζ$, is defined as the fuzzy neighborhood ζ-decision-theoretic rough set (FNζDTRS):

\widetilde{POS} = \{x \in U | \tilde{P} (X | {[x]}^{δ}) \geq α^{fn}\},

(19)

\widetilde{BND} = \{x \in U | β^{fn} < \tilde{P} (X | {[x]}^{δ}) < α^{fn}\},

(20)

\widetilde{NEG} = \{x \in U | \tilde{P} (X | {[x]}^{δ}) \leq β^{fn}\},

(21)

where both threshold parameters have different descriptions under the following two cases:

Case 1: $0 < ζ < 1$, then

α_{1}^{fn} = \frac{(2 ζ S^{C} - S^{C}) + \sqrt{S^{C} (- 4 ζ^{2} S + 4 ζ S + S^{C})}}{2 ζ (S + S^{C})},

(22)

β_{1}^{fn} = \frac{(2 ζ S^{C} + S) - \sqrt{S (- 4 ζ^{2} S^{C} + 4 ζ S^{C} + S)}}{2 ζ (S + S^{C})} .

(23)

Case 2: $ζ = 0$, then

α_{2}^{fn} = 1,

(24)

β_{2}^{fn} = 0 .

(25)

In the above FNζDTRS model, only two cases need to be discussed, in contrast to the SPDTRS model, which requires four. Owing to the concise setting of the loss functions, this greatly reduces the computational complexity relative to the SPDTRS model.
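
The closed-form thresholds in Equations (22)–(25) can be sketched in a few lines. The function names below are hypothetical, and the inputs S and Sc are assumed to have been computed beforehand from Equations (9) and (10). The second helper evaluates the last lines of Equations (11) and (12), which allows a quick check that each closed-form threshold is a fixed point of the corresponding expression.

```python
import math
from typing import Tuple


def fn_thresholds(S: float, Sc: float, zeta: float) -> Tuple[float, float]:
    """Return (alpha_fn, beta_fn) from Equations (22)-(25).

    S and Sc stand for the significance coefficients; S > 0 and Sc >= 0 are assumed.
    """
    if not 0.0 <= zeta < 1.0:
        raise ValueError("zeta must satisfy 0 <= zeta < 1")
    if zeta == 0.0:
        return 1.0, 0.0  # Case 2, Equations (24) and (25)
    denom = 2.0 * zeta * (S + Sc)
    alpha = ((2.0 * zeta * Sc - Sc)
             + math.sqrt(Sc * (-4.0 * zeta ** 2 * S + 4.0 * zeta * S + Sc))) / denom
    beta = ((2.0 * zeta * Sc + S)
            - math.sqrt(S * (-4.0 * zeta ** 2 * Sc + 4.0 * zeta * Sc + S))) / denom
    return alpha, beta


def thresholds_from_losses(S: float, Sc: float, P: float, zeta: float) -> Tuple[float, float]:
    """Last lines of Equations (11) and (12), evaluated at a given P."""
    a_num = Sc * (1.0 - zeta + P * zeta)
    b_num = Sc * (1.0 - P) * zeta
    return a_num / (a_num + S * P * zeta), b_num / (b_num + S * (1.0 - P * zeta))


# Hypothetical coefficients chosen so the result is easy to verify by hand.
alpha, beta = fn_thresholds(S=0.5, Sc=0.5, zeta=0.5)
alpha_check, _ = thresholds_from_losses(0.5, 0.5, alpha, 0.5)
_, beta_check = thresholds_from_losses(0.5, 0.5, beta, 0.5)
```

For S = Sc = ζ = 0.5 the thresholds reduce to α = √0.5 and β = 1 − √0.5, and substituting each value back for P in Equations (11) and (12) returns the same value.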

Theorem 2.

In the FNζDTRS model, given two compensation coefficients $ζ_{1}$ and $ζ_{2}$ with $0 \leq ζ_{1} < 1$ and $0 \leq ζ_{2} < 1$, and a fixed parameter $δ$, if $ζ_{1} \geq ζ_{2}$, then $α^{fn}(ζ_{1}) \leq α^{fn}(ζ_{2})$ and $β^{fn}(ζ_{1}) \geq β^{fn}(ζ_{2})$ hold.

Proof.

If the equivalence class satisfies ${[x]}^{δ} \subseteq X$, then $S^{C} = 0$. According to Equations (22)–(25), when $0 < ζ < 1$, $α^{fn}(ζ_{1}) = α^{fn}(ζ_{2}) = 0$ and $β^{fn}(ζ_{1}) = β^{fn}(ζ_{2}) = 0$; when $ζ = 0$, $α^{fn}(ζ_{1}) = α^{fn}(ζ_{2}) = 1$ and $β^{fn}(ζ_{1}) = β^{fn}(ζ_{2}) = 0$. If the equivalence class satisfies ${[x]}^{δ} ⊈ X$, the monotonicity relations are proved by the following derivations.

Part I: For $ζ_{1} \geq ζ_{2} \Rightarrow α^{fn}(ζ_{1}) \leq α^{fn}(ζ_{2})$, two cases need to be considered.

Case 1: $0 < ζ < 1$

Since

α_{1}^{fn} = \frac{(2 ζ S^{C} - S^{C}) + \sqrt{S^{C} (- 4 ζ^{2} S + 4 ζ S + S^{C})}}{2 ζ (S + S^{C})},

we set $η = 1 / ζ$. Because $0 < ζ < 1$, we have $η > 1$ and

α_{1}^{fn} = \frac{(2 S^{C} - η S^{C}) + \sqrt{η^{2} {(S^{C})}^{2} + 4 η S S^{C} - 4 S S^{C}}}{2 (S + S^{C})} .

Since $S > 0$ and $S^{C} > 0$, the denominator is a positive constant, so for simplicity of derivation we consider only the numerator

{\bar{α}}_{1}^{fn} = (2 S^{C} - η S^{C}) + \sqrt{η^{2} {(S^{C})}^{2} + 4 η S S^{C} - 4 S S^{C}} .

Its first-order partial derivative with respect to $η$, denoted as $f_{α}^{1}$, is

f_{α}^{1} = \frac{\partial {\bar{α}}_{1}^{fn}}{\partial η} = - S^{C} + \frac{η {(S^{C})}^{2} + 2 S S^{C}}{\sqrt{η^{2} {(S^{C})}^{2} + 4 η S S^{C} - 4 S S^{C}}} .

Its second-order partial derivative, denoted as $f_{α}^{2}$, is

f_{α}^{2} = \frac{\partial^{2} {\bar{α}}_{1}^{fn}}{\partial η^{2}} = \frac{- 4 S {(S^{C})}^{2} (S + S^{C})}{{(η^{2} {(S^{C})}^{2} + 4 η S S^{C} - 4 S S^{C})}^{3 / 2}} < 0 .

Because $f_{α}^{2} < 0$, $f_{α}^{1}$ is monotonically decreasing. Since $η > 1$, with $f_{α}^{1} \to 2 S > 0$ as $η \to 1$ and $f_{α}^{1} \to 0$ as $η \to + \infty$, we have $f_{α}^{1} > 0$. Therefore, $α_{1}^{fn}$ increases monotonically with respect to $η$ and hence decreases monotonically with respect to $ζ$, that is, $ζ_{1} \geq ζ_{2} \Rightarrow α_{1}^{fn}(ζ_{1}) \leq α_{1}^{fn}(ζ_{2})$.

Case 2: $ζ = 0$

In this case, we have $α_{2}^{fn} = 1$. Thus, for $ζ_{1} \geq ζ_{2}$, we have $α_{2}^{fn}(ζ_{1}) = α_{2}^{fn}(ζ_{2})$.

Part II: For $ζ_{1} \geq ζ_{2} \Rightarrow β^{fn}(ζ_{1}) \geq β^{fn}(ζ_{2})$, two cases need to be considered as well.

Case 1: $0 < ζ < 1$

Since

β_{1}^{fn} = \frac{(2 ζ S^{C} + S) - \sqrt{S (- 4 ζ^{2} S^{C} + 4 ζ S^{C} + S)}}{2 ζ (S + S^{C})},

we again set $η = 1 / ζ$. Because $0 < ζ < 1$, we have $η > 1$ and

β_{1}^{fn} = \frac{(2 S^{C} + η S) - \sqrt{η^{2} S^{2} + 4 η S S^{C} - 4 S S^{C}}}{2 (S + S^{C})} .

Since $S > 0$ and $S^{C} > 0$, we consider the simplified form

{\bar{β}}_{1}^{fn} = (2 S^{C} + η S) - \sqrt{η^{2} S^{2} + 4 η S S^{C} - 4 S S^{C}} .

The partial derivatives of ${\bar{β}}_{1}^{fn}$ are

f_{β}^{1} = \frac{\partial {\bar{β}}_{1}^{fn}}{\partial η} = S - \frac{η S^{2} + 2 S S^{C}}{\sqrt{η^{2} S^{2} + 4 η S S^{C} - 4 S S^{C}}},

f_{β}^{2} = \frac{\partial^{2} {\bar{β}}_{1}^{fn}}{\partial η^{2}} = \frac{4 S^{2} S^{C} (S + S^{C})}{{(η^{2} S^{2} + 4 η S S^{C} - 4 S S^{C})}^{3 / 2}} > 0 .

From $f_{β}^{2} > 0$, we know that $f_{β}^{1}$ increases monotonically with $η$. Since $η > 1$, with $f_{β}^{1} \to - 2 S^{C} < 0$ as $η \to 1$ and $f_{β}^{1} \to 0$ as $η \to + \infty$, we have $f_{β}^{1} < 0$, so $β_{1}^{fn}$ decreases monotonically with respect to $η$. Therefore, $β_{1}^{fn}$ increases monotonically with $ζ$, that is, $ζ_{1} \geq ζ_{2} \Rightarrow β_{1}^{fn}(ζ_{1}) \geq β_{1}^{fn}(ζ_{2})$. □

Theorem 3.

In the FNζDTRS model, the relationship $0 \leq β^{fn} \leq α^{fn} \leq 1$ holds.

Proof.

If the equivalence class satisfies ${[x]}^{δ} \subseteq X$, then $S^{C} = 0$. According to Equations (22)–(25), when $0 < ζ < 1$, $α^{fn} = β^{fn} = 0$; when $ζ = 0$, $α^{fn} = 1$ and $β^{fn} = 0$. Both satisfy the inequality $0 \leq β^{fn} \leq α^{fn} \leq 1$. If the equivalence class satisfies ${[x]}^{δ} ⊈ X$, the inequality is proved in the following three parts.

Part I: Proof of $0 \leq β^{fn}$ in two cases.

Case 1: $0 < ζ < 1$

According to Equation (23), we obtain

β_{1}^{fn} = \frac{(2 ζ S^{C} + S) - ψ}{2 ζ (S + S^{C})}, \quad ψ = \sqrt{{(S + 2 ζ S^{C})}^{2} - 4 ζ^{2} S^{C} (S + S^{C})},

so we only need to prove $4 ζ^{2} S^{C} (S + S^{C}) > 0$, which gives $ψ < S + 2 ζ S^{C}$. Since $S > 0$ and $S^{C} > 0$, we have $4 ζ^{2} S^{C} (S + S^{C}) > 0$, so $β_{1}^{fn} > 0$.

Case 2: $ζ = 0$

In this case, we have $β_{2}^{fn} = 0$.

Part II: Proof of the inequality $β^{fn} \leq α^{fn}$ in two cases.

Case 1: $0 < ζ < 1$

According to Equations (22) and (23), we obtain

α_{1}^{fn} - β_{1}^{fn} = \frac{- S^{C} - S + \sqrt{S^{C} (- 4 ζ^{2} S + 4 ζ S + S^{C})} + \sqrt{S (- 4 ζ^{2} S^{C} + 4 ζ S^{C} + S)}}{2 ζ (S + S^{C})} .

Since $S > 0$ and $S^{C} > 0$, we have

\sqrt{S^{C} (- 4 ζ^{2} S + 4 ζ S + S^{C})} = \sqrt{4 ζ S S^{C} (1 - ζ) + {(S^{C})}^{2}} > \sqrt{{(S^{C})}^{2}} = S^{C},

\sqrt{S (- 4 ζ^{2} S^{C} + 4 ζ S^{C} + S)} = \sqrt{4 ζ S S^{C} (1 - ζ) + S^{2}} > \sqrt{S^{2}} = S,

so $α_{1}^{fn} - β_{1}^{fn} > 0$, that is, $β_{1}^{fn} < α_{1}^{fn}$.

Case 2: $ζ = 0$

In this case, we have $α_{2}^{fn} = 1$ and $β_{2}^{fn} = 0$, so $β_{2}^{fn} < α_{2}^{fn}$.

Part III: Proof of the inequality $α^{fn} \leq 1$ in two cases.

Case 1: $0 < ζ < 1$

According to Equation (22), to prove $α_{1}^{fn} \leq 1$ we need to prove $\sqrt{S^{C} (- 4 ζ^{2} S + 4 ζ S + S^{C})} - (2 ζ S + S^{C}) < 0$, which means we need to prove $S^{C} (- 4 ζ^{2} S + 4 ζ S + S^{C}) - {(2 ζ S + S^{C})}^{2} < 0$. Since $S > 0$ and $S^{C} \geq 0$, we have

S^{C} (- 4 ζ^{2} S + 4 ζ S + S^{C}) - {(2 ζ S + S^{C})}^{2} = - 4 ζ^{2} S S^{C} - 4 ζ^{2} S^{2} = - 4 ζ^{2} S (S^{C} + S) < 0 .

Therefore, we have $α_{1}^{fn} < 1$.

Case 2: $ζ = 0$

In this case, we have $α_{2}^{fn} = 1$. □
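
The statements of Theorems 2 and 3 can also be spot-checked numerically. The sketch below evaluates Equations (22)–(25) on a small grid of hypothetical values for $S$, $S^{C}$ and $ζ$ and confirms the ordering and monotonicity on that grid; it is a sanity check under these assumed values, not a substitute for the proofs.

```python
import itertools
import math
from typing import Sequence, Tuple


def fn_thresholds(S: float, Sc: float, zeta: float) -> Tuple[float, float]:
    """(alpha_fn, beta_fn) from Equations (22)-(25); assumes S > 0, Sc > 0."""
    if zeta == 0.0:
        return 1.0, 0.0
    denom = 2.0 * zeta * (S + Sc)
    alpha = ((2.0 * zeta * Sc - Sc)
             + math.sqrt(Sc * (-4.0 * zeta ** 2 * S + 4.0 * zeta * S + Sc))) / denom
    beta = ((2.0 * zeta * Sc + S)
            - math.sqrt(S * (-4.0 * zeta ** 2 * Sc + 4.0 * zeta * Sc + S))) / denom
    return alpha, beta


def check_theorems(values: Sequence[float], zetas: Sequence[float], tol: float = 1e-9) -> bool:
    """Return True if Theorems 2 and 3 hold on the given grid (up to tol)."""
    for S, Sc in itertools.product(values, repeat=2):
        previous = None
        for zeta in sorted(zetas):
            alpha, beta = fn_thresholds(S, Sc, zeta)
            # Theorem 3: 0 <= beta <= alpha <= 1.
            if not (-tol <= beta <= alpha + tol and alpha <= 1.0 + tol):
                return False
            # Theorem 2: alpha is non-increasing and beta non-decreasing in zeta.
            if previous is not None:
                if alpha > previous[0] + tol or beta < previous[1] - tol:
                    return False
            previous = (alpha, beta)
    return True


# Hypothetical grid of coefficients and compensation values.
ok = check_theorems(values=[0.05, 0.3, 1.0, 2.5],
                    zetas=[0.0, 0.1, 0.3, 0.5, 0.7, 0.9, 0.99])
```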

Theorem 4.

For a decision system $DS = (U, C \cup D)$ with a fixed parameter $δ$ and two parameters $ζ_{1}$ and $ζ_{2}$ with $0 \leq ζ_{1} \leq ζ_{2} < 1$, we have ${\widetilde{POS}}_{1} \subseteq {\widetilde{POS}}_{2}$, ${\widetilde{NEG}}_{1} \subseteq {\widetilde{NEG}}_{2}$, and ${\widetilde{BND}}_{1} \supseteq {\widetilde{BND}}_{2}$, where the subscripts 1 and 2 denote the regions obtained with $ζ_{1}$ and $ζ_{2}$, respectively.

Proof.

Consider an arbitrary sample $y$. If $y \in {\widetilde{POS}}_{1}$, we have $\tilde{P}(X | {[y]}^{δ}) \geq α^{fn}(ζ_{1})$. Since $0 \leq ζ_{1} \leq ζ_{2} < 1$, the relation $α^{fn}(ζ_{1}) \geq α^{fn}(ζ_{2})$ holds according to Theorem 2. Thus, $\tilde{P}(X | {[y]}^{δ}) \geq α^{fn}(ζ_{2})$ and $y \in {\widetilde{POS}}_{2}$ hold. Hence, we conclude that ${\widetilde{POS}}_{1} \subseteq {\widetilde{POS}}_{2}$.

Likewise, ${\widetilde{NEG}}_{1} \subseteq {\widetilde{NEG}}_{2}$ and ${\widetilde{BND}}_{1} \supseteq {\widetilde{BND}}_{2}$ follow easily from Theorem 2.

From the above, we can find that $ζ$ is inversely correlated with the range of neutrality and positively correlated with the uncertainty of the decision. □

3.4. FNζDTRS-Based Attribute Reduction Algorithm

Jia et al. [35] presented a reduction principle for attribute reduction with the DTRS model, whose core idea is to minimize the risk of the reduction subset. Based on this principle, our previous work designed an attribute reduction algorithm for the SPDTRS model [32,34]. It is built on minimizing the risk of overall decisions and can also be used for attribute reduction in the proposed FNζDTRS model. Therefore, the detailed attribute reduction algorithm is not repeated in this paper; readers may refer to our previous work [32,34].

4. Strategy of Simultaneous-Fault Diagnosis

Under the assumption that the data in all single-fault modes are fully available, when an unknown fault occurs, we use the FNζDTRS model to mine the fault attributes of the unknown fault. If the fault attributes of the unknown fault are different from the fault attributes of the existing single-fault, then the unknown fault can be considered as a simultaneous-fault. Furthermore, we can use the attribute reduction results obtained from the FNζDTRS model to analyze and identify the corresponding fault modes of the simultaneous-fault. Finally, a strategy of simultaneous-fault diagnosis called fault matching strategy is formed, as shown in Figure 2.

Figure 2. The procedure of fault diagnosis strategy for simultaneous-fault.

The proposed fault matching strategy consists of two main parts: prior knowledge acquisition and rule matching. In the first part, single-fault data with abnormal labels and normal data with normal labels are fed into the FNζDTRS model as the training set, and the optimal fault attribute subset of each single fault is obtained by attribute reduction and used as prior knowledge for subsequent diagnosis. In the second part, simultaneous-fault data with abnormal labels and normal data with normal labels are combined to form the data to be diagnosed, which are fed into the FNζDTRS model to obtain the optimal fault attribute subset. The similarity between this subset and each of the optimal fault attribute subsets obtained in the first part is then measured with the Jaccard similarity coefficient. Note that some single faults may share the same fault attributes; we therefore set rules based on the differences between attribute data to subdivide such faults and complete the fault matching, as shown in the subsequent experiments on the satellite power system.

Based on the above description, the core pseudo code of this process is given in Table 3.

Table 3. The core pseudo code of the diagnosis strategy.

5. Numerical Experiment of Attribute Reduction

The effectiveness and advantages of the proposed FNζDTRS model are verified on several hybrid decision systems from the UCI (http://archive.ics.uci.edu/ml/index.php, accessed on 21 July 2021) and KEEL (https://sci2s.ugr.es/keel/datasets.php, accessed on 21 July 2021) repositories. As shown in Table 4, the test datasets include both discrete and continuous data. The comparative experiments on parameter testing and attribute reduction are conducted on the same hardware and software platforms. Ten baseline classifiers, including NaiveBayes, REPTree, LogitBoost, SMO, Filtered, Bagging, PART, IBk, J48 and JRip, are employed with 10-fold cross-validation in the Weka (https://waikato.github.io/weka-wiki/downloading_weka/, accessed on 21 July 2021) software to evaluate the accuracy of attribute selection. The input data are normalized into the range [0, 1] during preprocessing.
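The text does not state which normalization scheme is used. A common choice for mapping attributes into [0, 1] is min-max scaling, and the following is a minimal sketch under that assumption; the function name and the sample values are illustrative only.

```python
from typing import List


def min_max_normalize(column: List[float]) -> List[float]:
    """Rescale one attribute column into [0, 1] (assumed min-max scheme)."""
    lo, hi = min(column), max(column)
    if hi == lo:
        # A constant column has no spread; map every value to 0.0.
        return [0.0 for _ in column]
    return [(v - lo) / (hi - lo) for v in column]


# Illustrative values, not taken from any of the datasets in Table 4.
print(min_max_normalize([2.0, 3.0, 4.0]))  # [0.0, 0.5, 1.0]
```

Each attribute would be scaled independently, so that attributes with large numeric ranges do not dominate the similarity computation.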

Table 4. The information of the employed datasets.

5.1. Parameters Test for FNζDTRS

For the FNζDTRS model, two parameters

ζ

and

δ

need to be set in advance. As described in Section 2, the theoretical value range of

ζ

is

[0, 1)

, and that of

δ

is

[0, 1]

. Therefore,

ζ

is sampled at intervals of 0.05, ending at 0.99.

δ

is also sampled at intervals of 0.05, but ends at 1. Figure 3 shows the experimental results.

Figure 3. The accuracy results of

ζ

and

δ

with respect to FNζDTRS.
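The sampling scheme described above can be sketched as follows. The starting value of 0 for both parameters is an assumption (the text gives only the interval and the end points), and `sample_grid` is a hypothetical helper name.

```python
def sample_grid(start: float, step: float, last: float) -> list:
    """Return start, start + step, ... while below `last`, then `last` itself."""
    values = []
    k = 0
    # Rounding avoids floating-point drift such as 0.15000000000000002.
    while round(start + k * step, 10) < last:
        values.append(round(start + k * step, 10))
        k += 1
    values.append(last)
    return values


zeta_grid = sample_grid(0.0, 0.05, 0.99)   # 0.00, 0.05, ..., 0.95, 0.99
delta_grid = sample_grid(0.0, 0.05, 1.0)   # 0.00, 0.05, ..., 0.95, 1.00

# Every (zeta, delta) pair would be evaluated once in the parameter test.
pairs = [(z, d) for z in zeta_grid for d in delta_grid]
print(len(zeta_grid), len(delta_grid), len(pairs))  # 21 21 441
```

Under this assumption each parameter takes 21 values, giving 441 combinations per dataset.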

The results show that the appropriate settings of

ζ

and

δ

lie in [0.3, 0.99] and [0.85, 0.95], respectively. This can be explained by the fact that a smaller

ζ

results in a larger boundary region, marking more samples as uncertain. In other words, setting a smaller

ζ

for the FNζDTRS decision system leads to greater uncertainty. In real-world applications,

ζ

should be adjusted according to the risk of a wrong decision. When the danger of making a bad decision is low,

ζ

can be set larger, and vice versa. On the other hand, as the fuzzy neighborhood threshold

δ

approaches 1, the fuzzy neighborhood granules become finer, allowing the samples to be classified accurately into the appropriate regions. These two parameters are the core parameters of the proposed model, and their values directly affect the test results. They should therefore be tuned to the actual application following the above principles.

5.2. Comparison Experiments on Attribute Reduction

In this part, seven related models, DTRS-EF [36], DTRS-SMDNS [37], SPDTRS-EF [32], SPDTRS-SMDNS [32], NDTRS [38], FDTRS [39] and FN3WD [34], are compared with the proposed FNζDTRS on attribute reduction to demonstrate its superiority. The parameters of each comparison model follow the settings of the corresponding original model. The number of reduction attributes and the classification accuracy, two common evaluation indicators for attribute reduction [40,41], are used for the comparison. The standard deviations over ten trials are also calculated, and the results are shown in Table 5 and Table 6.

Table 5. The classification accuracy of the reduction subset.

Table 6. The number of reduction attributes.

According to the results in Table 5 and Table 6, the following analysis can be obtained:

(a): The classification accuracies in Table 5 indicate that the FNζDTRS model outperforms the other rough set models overall: it achieves the highest average accuracy (83.40%) and the best accuracy on eight of the ten datasets. The main reason may lie in how the models describe spatial granules. Traditional DTRS models commonly rely on discretization methods such as EF and SMDNS to process continuous data, which destroys the spatial structure of the granules. Special measures (such as the fuzzy relationship or the neighborhood relationship) avoid the distortion caused by discretization, but they have their own drawbacks, such as overly simple measurement and insufficient descriptive ability. The proposed FNζDTRS model uses fuzzy neighborhood relationships to overcome these shortcomings. Compared with other DTRS models, it describes spatial granules more precisely, which results in higher classification accuracy.
(b): With respect to the number of reduction attributes, the FDTRS model selects the fewest attributes on average (4.0) but fails to achieve the desired classification accuracy, whereas the FNζDTRS model maintains high classification accuracy while keeping the number of reduction attributes small (4.5 on average). The results show that the classification ability can be maintained or improved only when the reduction attributes are accurately selected. This conclusion also conforms to the basic principle of attribute reduction: the goal is a relatively concise attribute set that does not reduce the original classification accuracy and thereby improves computational efficiency.
(c): The standard deviation is used to measure the robustness of the models. On average, the FNζDTRS model has the smallest standard deviation in both the classification accuracy (0.69) and the number of reduced attributes (0.2), which indicates that it is the most robust of the compared models. This robustness also suggests that the two parameters can be chosen from a wide range, which favors the application of the model in practical projects.

6. Simultaneous-Fault Diagnosis of Satellite Power System

In-orbit faults of the satellite power system should be avoided as far as possible, so simulation is the most practical platform for mining fault diagnosis knowledge. In this section, the effectiveness of the proposed simultaneous-fault diagnosis scheme is verified with the simulation model of a geosynchronous (GEO) satellite power system [3]. As shown in Figure 4, the power system works in a direct energy transfer mode during the simulation, and ten telemetry parameters can be measured at the marked positions. The information on the telemetry parameters is shown in Table 7.

Figure 4. The schematic diagram of the power system.

Table 7. Information of the telemetry parameters.

The raw data used for simultaneous-fault diagnosis consist of the ten attributes mentioned above, and all data are selected from the stationary period. The dataset is stored in a time-series format, with each subset representing one of the scenarios shown in Table 8. There are 12 scenarios in total, where scenario 0 represents the system without any fault. F1 represents an open-circuit failure in the solar array, F2 a short-circuit failure in the battery, F3 a shunt regulator failure without shunt, and F4 a shunt regulator failure with constant shunt. The remaining 7 scenarios are simultaneous faults composed of the above 4 single faults occurring at the same time, where F3 and F4 cannot occur simultaneously.

Table 8. Different scenarios for faults in satellite power system.
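The scenario count can be checked by enumerating every combination of the four single faults and discarding those that contain both F3 and F4. The sketch below does this with the standard library; the function and variable names are illustrative.

```python
from itertools import combinations

SINGLE_FAULTS = ["F1", "F2", "F3", "F4"]


def enumerate_scenarios():
    """List fault combinations, skipping any that contain both F3 and F4."""
    scenarios = [()]  # scenario 0: no fault
    for size in range(1, len(SINGLE_FAULTS) + 1):
        for combo in combinations(SINGLE_FAULTS, size):
            if "F3" in combo and "F4" in combo:
                continue  # F3 and F4 are mutually exclusive
            scenarios.append(combo)
    return scenarios


scenarios = enumerate_scenarios()
simultaneous = [s for s in scenarios if len(s) >= 2]
print(len(scenarios), len(simultaneous))  # 12 7
```

The enumeration order also reproduces the scenario numbering of Table 8, for example index 5 is F1-F2 and index 11 is F1-F2-F4.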

It can be seen that this simulation-based approach effectively addresses the lack of data for simultaneous-fault diagnosis, which is one of the difficulties introduced at the beginning of this paper.

6.1. Simultaneous-Fault Diagnosis Based on the Fault Matching Strategy

The two main parts of the simultaneous-fault diagnosis strategy, namely prior knowledge acquisition and rule matching, correspond to the training and testing processes. We choose the 4 kinds of single-fault data as the training set and the remaining 7 kinds of simultaneous-fault data as the testing set. The results of the first step, prior knowledge acquisition, are shown in Table 9. The output attribute subsets of F3 and F4 are identical, so further information must be mined to distinguish them. For attribute a₃, the corresponding shunt current data can directly distinguish F3 from F4: the shunt current fluctuates between 6.42 and 8.67 A for F3 and between 12.45 and 14.77 A for F4. Therefore, F3 and F4 can be distinguished by setting a threshold on the average shunt current.

Table 9. The results of the training process.

The results obtained through the second step of rule matching are shown in Table 10. For a simultaneous-fault, the Jaccard similarity coefficient between its output attribute subset and the output attribute subset of each single-fault obtained in the training process can be calculated in turn. If the Jaccard similarity coefficient is 0, the corresponding single fault can be eliminated preliminarily. Since F3 and F4 cannot be distinguished by the Jaccard similarity coefficient, it is necessary to further distinguish F3 and F4 through the set shunt current threshold and to finally obtain the matching result. Through the final matching result, it can be found that the diagnostic accuracy of the fault matching strategy is 100%. The above fault matching process comprehensively utilizes the similarity of attributes and expert knowledge, which can ensure that the obtained diagnosis results are more accurate.

Table 10. The results of the testing process.
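The rule matching step can be illustrated with a short sketch. It uses the attribute subsets from Table 9, keeps every single fault whose Jaccard similarity with the diagnosed subset is non-zero, and then separates F3 from F4 with a shunt current threshold. The threshold value of 10.0 A is a hypothetical choice lying between the two reported bands, and the function names are illustrative rather than taken from the paper.

```python
from typing import AbstractSet, Dict, FrozenSet, Optional, Set

# Prior knowledge from Table 9: reduced attribute subset of each single fault.
PRIOR: Dict[str, FrozenSet[str]] = {
    "F1": frozenset({"a5"}),
    "F2": frozenset({"a7"}),
    "F3": frozenset({"a2", "a3", "a9"}),
    "F4": frozenset({"a2", "a3", "a9"}),
}
# Hypothetical cut-off (A) between the F3 band (6.42-8.67) and the F4 band (12.45-14.77).
SHUNT_THRESHOLD = 10.0


def jaccard(a: AbstractSet[str], b: AbstractSet[str]) -> float:
    """Jaccard similarity |a ∩ b| / |a ∪ b|; taken as 0 for two empty sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def match_faults(subset: AbstractSet[str], mean_shunt: Optional[float]) -> Set[str]:
    """Keep single faults with non-zero similarity, then split F3 from F4."""
    candidates = {f for f, attrs in PRIOR.items() if jaccard(subset, attrs) > 0}
    if {"F3", "F4"} <= candidates:
        if mean_shunt is None:
            raise ValueError("mean shunt current is needed to separate F3 and F4")
        # A low average shunt current points to F3, a high one to F4.
        candidates.discard("F4" if mean_shunt < SHUNT_THRESHOLD else "F3")
    return candidates


print(sorted(match_faults({"a2", "a3", "a5", "a9"}, 6.63)))  # ['F1', 'F3']
print(sorted(match_faults({"a5", "a7"}, None)))              # ['F1', 'F2']
```

With the subsets and average shunt currents listed in Table 10, this sketch returns the same matching results as the table, including the F1-F2-F4 case whose subset contains the extra attribute a₆.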

6.2. Comparison Experiment on Simultaneous-Fault Diagnosis

6.2.1. Experimental Setup

The superiority of the proposed fault matching strategy is demonstrated through comparison experiments of simultaneous-fault diagnosis with several multi-label classification algorithms (Binary Relevance, Classifier Chain, Calibrated Label Ranking, ML-KNN, and ML-DT). The base classifiers of the first three algorithms are all set to Random Forest, which performed best in a pretest, and the value of k for ML-KNN is 12. Subset accuracy, Hamming loss, precision, recall, and F1 are used as the metrics of multi-label classification performance [11].

Subset Accuracy = \frac{1}{n} \sum_{i = 1}^{n} I (y_{i} = {\hat{y}}_{i}),

(26)

Hamming Loss = \frac{1}{n L} \sum_{i = 1}^{n} \sum_{j = 1}^{L} I (y_{i}^{j} \neq {\hat{y}}_{i}^{j}),

(27)

Precision = \frac{1}{n} \sum_{i = 1}^{n} \frac{|y_{i}^{j} = 1 \cap {\hat{y}}_{i}^{j} = 1|}{|{\hat{y}}_{i}^{j} = 1|},

(28)

Recall = \frac{1}{n} \sum_{i = 1}^{n} \frac{|y_{i}^{j} = 1 \cap {\hat{y}}_{i}^{j} = 1|}{|y_{i}^{j} = 1|},

(29)

F_{1} = \frac{1}{n} \sum_{i = 1}^{n} \frac{2 |y_{i}^{j} = 1 \cap {\hat{y}}_{i}^{j} = 1|}{|y_{i}^{j} = 1| + |{\hat{y}}_{i}^{j} = 1|},

(30)

where

y_{i}

represents the ground-truth label vector of the i-th sample,

{\hat{y}}_{i}

represents the predicted label vector,

y_{i}^{j}

represents the ground-truth label at the j-th position of the i-th sample, and

{\hat{y}}_{i}^{j}

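represents the predicted label at the j-th position of the i-th sample. Here, n is the number of samples, L is the number of labels, and I(·) is the indicator function. Intuitively, subset accuracy, precision, recall, and F1 are the multi-label counterparts of traditional metrics, whereas Hamming loss is a metric specific to multi-label classification.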
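The five metrics can be computed directly from binary label matrices. The sketch below follows the standard example-based definitions, in which precision normalizes by the number of predicted labels and recall by the number of ground-truth labels; treating a ratio with an empty denominator as 0 is an assumed convention, and the function name and toy labels are illustrative.

```python
from typing import Dict, Sequence


def multilabel_metrics(y_true: Sequence[Sequence[int]],
                       y_pred: Sequence[Sequence[int]]) -> Dict[str, float]:
    """Example-based multi-label metrics for 0/1 label vectors."""
    n = len(y_true)
    n_labels = len(y_true[0])
    subset_hits = 0
    hamming_errors = 0
    precision = recall = f1 = 0.0
    for truth, pred in zip(y_true, y_pred):
        subset_hits += int(list(truth) == list(pred))
        hamming_errors += sum(t != p for t, p in zip(truth, pred))
        both = sum(1 for t, p in zip(truth, pred) if t == 1 and p == 1)
        n_true, n_pred = sum(truth), sum(pred)
        # Assumed convention: a ratio with an empty denominator counts as 0.
        precision += both / n_pred if n_pred else 0.0
        recall += both / n_true if n_true else 0.0
        f1 += 2 * both / (n_true + n_pred) if (n_true + n_pred) else 0.0
    return {
        "subset_accuracy": subset_hits / n,
        "hamming_loss": hamming_errors / (n * n_labels),
        "precision": precision / n,
        "recall": recall / n,
        "f1": f1 / n,
    }


# Toy labels over (F1, F2, F3, F4): the first prediction misses F2.
y_true = [[1, 1, 0, 0], [1, 0, 1, 0]]
y_pred = [[1, 0, 0, 0], [1, 0, 1, 0]]
metrics = multilabel_metrics(y_true, y_pred)
print(metrics)
```

In this toy case a single missed fault leaves precision at 1.0 but lowers recall to 0.75 and subset accuracy to 0.5, which shows how a method can have no false alarms and still miss faults.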

The training set of all algorithms uses single-fault data, while the test set uses simultaneous-fault data. Ten-fold cross validation is also employed in this part.

6.2.2. Experimental Results and Analysis

The results are shown in Table 11. It is obvious that the proposed method outperforms other methods in the simultaneous-faults diagnosis of a satellite power system, which benefits from both the FNζDTRS model and the fault matching strategy (FNζDTRS-FMS). Fundamentally, on the one hand, the attribute reduction results obtained from the FNζDTRS model can accurately represent the fault information, and on the other hand, the proposed matching rules are reasonable and reliable.

Table 11. The results of the simultaneous-fault diagnosis.

Among the multi-label classification methods, the false alarm rate is satisfactory (precision is close to 100% for all of them), while the missed diagnosis rate is poor. For Calibrated Label Ranking, ML-KNN, and ML-DT, whose results are nearly identical, the recall is only about 45%, which is unacceptable in engineering applications; their F1 scores are also low, only slightly above 60%, and their subset accuracy is 0. The results obtained by the Classifier Chain method are at a middle level. Binary Relevance performs best among the multi-label classification methods, which suggests that there are no significant dependencies between the single faults.

7. Conclusions

In this work, a novel DTRS model called FNζDTRS is proposed, and a fault matching strategy (FMS) based on it is designed to overcome the two fundamental hurdles faced by simultaneous-fault diagnosis. The effectiveness and superiority of our methodology are demonstrated by both numerical experiments on several standard datasets and a comparative analysis of simultaneous-fault diagnosis on a simulation model of a satellite power system. Two main conclusions can be drawn, as follows.

(1): The proposed FNζDTRS model performs attribute reduction more effectively compared with other models, and it has strong generalization ability. This benefits from the concise loss functions and the introduction of the fuzzy neighborhood relationships. The advantages of our model can greatly promote the smooth implementation of the model in simultaneous-fault diagnosis, which reflects the effectiveness and superiority of our selection of this model.
(2): The proposed FNζDTRS–FMS does not require simultaneous-fault samples to accomplish training and performs excellently in simultaneous-fault diagnosis compared to classic multi-label classification algorithms. This is completely consistent with the real situation, that is, the existing data cannot completely cover all the imagined failure modes. Therefore, the diagnostic strategy proposed in this paper has stronger application value.

Although the model we proposed has the above advantages, it still has the problem of low computational efficiency compared with the classical rough set because of the use of fuzzy neighborhood computing. This is one of our future research directions. Furthermore, our future work will also focus on fusing rough set models and multi-label learning algorithms to make the simultaneous-fault diagnosis framework more general.

Author Contributions

Data curation, R.Z.; Formal analysis, Y.J. and T.Z.; Funding acquisition, C.L. and M.S.; Methodology, L.T. and C.W.; Writing—original draft, Y.C., Y.J. and M.S.; Validation, Y.J.; Investigation, T.Z.; Supervision, M.S. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the National Natural Science Foundation of China (Grant Nos. 61903015 and 61973011), the Fundamental Research Funds for the Central Universities (Grant No. KG21003001), National key Laboratory of Science and Technology on Reliability and Environmental Engineering (Grant No. WDZC2019601A304), as well as the Capital Science & Technology Leading Talent Program (Grant No. Z191100006119029).

Data Availability Statement

The public data can be accessed through the following two websites: the UCI (http://archive.ics.uci.edu/ml/index.php, accessed on 21 July 2021) and KEEL (https://sci2s.ugr.es/keel/datasets.php, accessed on 21 July 2021). However, the dataset of the satellite power system is not publicly available; interested researchers can make further inquiries via email.

Acknowledgments

We thank the reviewers for their comments, which improved the quality of this paper, and the journal editors for their enthusiastic work.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

$U = \{x_{1}, x_{2}, \dots, x_{m}\}$	the universe, which is a finite and nonempty set.
$D$	the set of decision attributes that is a nonempty set.
$C$	the collection of conditional attributes.
$X$	the subset of samples with the same label $d_{k}$ .
$a_{P}$ , $a_{B}$ , and $a_{N}$	the classification of x into three regions, which are $x \in P O S (X)$ , $x \in B N D (X)$ , $x \in N E G (X)$ .
$P O S (X)$	the acceptance of the event $x \in X$ .
$B N D (X)$	the non-commitment of the event $x \in X$ , denotes the deferment of the event $x \in X$ .
$N E G (X)$	the rejection of $x \in X$ .
$λ_{• P}$	the loss caused by taking actions ( $a_{P}$ , $a_{B}$ , $a_{N}$ ) while $x \in X$ .
$λ_{• N}$	the loss caused by taking actions ( $a_{P}$ , $a_{B}$ , $a_{N}$ ) while $x \notin X$ .
$α, β$	the threshold parameters of the DTRS model.
$δ$	fuzzy neighborhood radius.
$ζ$	the compensation coefficient.

References

Li, S.; Cao, H.; Yang, Y. Data-driven simultaneous fault diagnosis for solid oxide fuel cell system using multi-label pattern identification. J. Power Sources 2018, 378, 646–659.
Suo, M.; Tao, L.; Zhu, B.; Chen, Y.; Lu, C.; Ding, Y. Soft decision-making based on decision-theoretic rough set and Takagi-Sugeno fuzzy model with application to the autonomous fault diagnosis of satellite power system. Aerosp. Sci. Technol. 2020, 106, 106108.
Suo, M.; Zhu, B.; An, R.; Sun, H.; Xu, S.; Yu, Z. Data-driven fault diagnosis of satellite power system using fuzzy Bayes risk and SVM. Aerosp. Sci. Technol. 2019, 84, 1092–1105.
Asgari, S.; Gupta, R.; Puri, I.K.; Zheng, R. A data-driven approach to simultaneous fault detection and diagnosis in data centers. Appl. Soft Comput. 2021, 110, 107638.
Liang, P.; Deng, C.; Wu, J.; Yang, Z.; Zhu, J.; Zhang, Z. Single and simultaneous fault diagnosis of gearbox via a semi-supervised and high-accuracy adversarial learning framework. Knowl.-Based Syst. 2020, 198, 105895.
Zhang, Z.; Li, S.; Xiao, Y.; Yang, Y. Intelligent simultaneous fault diagnosis for solid oxide fuel cell system based on deep learning. Appl. Energy 2019, 233, 930–942.
Pooyan, N.; Shahbazian, M.; Salahshoor, K.; Hadian, M. Simultaneous Fault Diagnosis using multi class support vector machine in a Dew Point process. J. Nat. Gas Sci. Eng. 2015, 23, 373–379.
Vong, C.; Wong, P.; Ip, W. A New Framework of Simultaneous-Fault Diagnosis Using Pairwise Probabilistic Multi-Label Classification for Time-Dependent Patterns. IEEE Trans. Ind. Electron. 2013, 60, 3372–3385.
Wong, P.K.; Zhong, J.; Yang, Z.; Vong, C.M. Sparse Bayesian extreme learning committee machine for engine simultaneous fault diagnosis. Neurocomputing 2016, 174, 331–343.
Wu, B.; Cai, W.; Chen, H.; Zhang, X. A hybrid data-driven simultaneous fault diagnosis model for air handling units. Energy Build. 2021, 245, 111069.
Zhang, M.; Zhou, Z. A Review on Multi-Label Learning Algorithms. IEEE Trans. Knowl. Data Eng. 2014, 26, 1819–1837.
Boutell, M.R.; Luo, J.B.; Shen, X.P.; Brown, C.M. Learning multi-label scene classification. Pattern Recognit. 2004, 37, 1757–1771.
Read, J.; Pfahringer, B.; Holmes, G.; Frank, E. Classifier chains for multi-label classification. Mach. Learn. 2011, 85, 333–359.
Fuernkranz, J.; Huellermeier, E.; Mencia, E.L.; Brinker, K. Multilabel classification via calibrated label ranking. Mach. Learn. 2008, 73, 133–153.
Zhang, M.; Zhou, Z. ML-KNN: A lazy learning approach to multi-label learning. Pattern Recognit. 2007, 40, 2038–2048.
Clare, A.; King, R.D. Knowledge Discovery in Multi-label Phenotype Data. In European Conference on Principles of Data Mining and Knowledge Discovery; Springer: Berlin/Heidelberg, Germany, 2001; pp. 42–53.
Pawlak, Z. Rough sets. Int. J. Comput. Inf. Sci. 1982, 11, 341–356.
Dong, L.; Chen, D.; Wang, N.; Lu, Z. Key energy-consumption feature selection of thermal power systems based on robust attribute reduction with rough sets. Inf. Sci. 2020, 532, 61–71.
Su, L.; Yu, F. Matrix approach to spanning matroids of rough sets and its application to attribute reduction. Theor. Comput. Sci. 2021, 893, 105–116.
Sahu, R.; Dash, S.R.; Das, S. Career selection of students using hybridized distance measure based on picture fuzzy set and rough set theory. Decis. Mak. Appl. Manag. Eng. 2021, 4, 104–126.
Dash, S.R.; Dehuri, S.; Sahoo, U.K. Interactions and Applications of Fuzzy, Rough, and Soft Set in Data Mining. Int. J. Fuzzy Syst. Appl. 2015, 3, 37–50.
Zhang, P.; Li, T.; Wang, G.; Luo, C.; Chen, H.; Zhang, J.; Wang, D.; Yu, Z. Multi-source information fusion based on rough set theory: A review. Inf. Fusion 2021, 68, 85–117.
Bai, H.; Ge, Y.; Wang, J.; Li, D.; Liao, Y.; Zheng, X. A method for extracting rules from spatial data based on rough fuzzy sets. Knowl.-Based Syst. 2014, 57, 28–40.
Landowski, M.; Landowska, A. Usage of the rough set theory for generating decision rules of number of traffic vehicles. Transp. Res. Procedia 2019, 39, 260–269.
Sharma, H.K.; Kumari, K.; Kar, S. A rough set theory application in forecasting models. Decis. Mak. Appl. Manag. Eng. 2020, 3, 1–21.
Guo, Y.; Tsang, E.C.C.; Xu, W.; Chen, D. Adaptive weighted generalized multi-granulation interval-valued decision-theoretic rough sets. Knowl.-Based Syst. 2020, 187, 104804.
Wang, T.; Liu, W.; Zhao, J.; Guo, X.; Terzija, V. A rough set-based bio-inspired fault diagnosis method for electrical substations. Int. J. Electr. Power Energy Syst. 2020, 119, 105961.
Sang, B.; Yang, L.; Chen, H.; Xu, W.; Guo, Y.; Yuan, Z. Generalized multi-granulation double-quantitative decision-theoretic rough set of multi-source information system. Int. J. Approx. Reason. 2019, 115, 157–179.
Zhang, P.F.; Li, T.R.; Yuan, Z.; Luo, C.; Liu, K.Y.; Yang, X.L. Heterogeneous Feature Selection Based on Neighborhood Combination Entropy. IEEE Trans. Neural Netw. Learn. Syst. 2022, 1–14.
Wang, L.; Shen, J.; Mei, X. Cost Sensitive Multi-Class Fuzzy Decision-theoretic Rough Set Based Fault Diagnosis. In Proceedings of the 2017 36th Chinese Control Conference (CCC), Dalian, China, 26–28 July 2017; pp. 6957–6961.
Yu, J.; Ding, B.; He, Y. Rolling bearing fault diagnosis based on mean multigranulation decision-theoretic rough set and non-naive Bayesian classifier. J. Mech. Sci. Technol. 2018, 32, 5201–5211.
Suo, M.; Tao, L.; Zhu, B.; Miao, X.; Liang, Z.; Ding, Y.; Zhang, X.; Zhang, T. Single-parameter decision-theoretic rough set. Inf. Sci. 2020, 539, 49–80.
Yao, Y.Y.; Wong, S.K.M. A decision theoretic framework for approximating concepts. Int. J. Man Mach. Stud. 1992, 37, 793–809.
Suo, M.; Cheng, Y.; Zhuang, C.; Ding, Y.; Lu, C.; Tao, L. Extension of labeled multiple attribute decision making based on fuzzy neighborhood three-way decision. Neural Comput. Appl. 2020, 32, 17731–17758.
Jia, X.; Liao, W.; Tang, Z.; Shang, L. Minimum cost attribute reduction in decision-theoretic rough set models. Inf. Sci. 2013, 219, 151–167.
Yao, Y. Three-way decisions with probabilistic rough sets. Inf. Sci. 2010, 180, 341–353.
Jiang, F.; Sui, Y. A novel approach for discretization of continuous attributes in rough set theory. Knowl.-Based Syst. 2015, 73, 324–334.
Li, W.; Huang, Z.; Jia, X.; Cai, X. Neighborhood based decision-theoretic rough set models. Int. J. Approx. Reason. 2016, 69, 1–17.
Song, J.; Tsang, E.C.C.; Chen, D.; Yang, X. Minimal decision cost reduct in fuzzy decision-theoretic rough set model. Knowl.-Based Syst. 2017, 126, 104–112.
Wang, C.; Qi, Y.; Shao, M.; Hu, Q.; Chen, D.; Qian, Y.; Lin, Y. A Fitting Model for Feature Selection with Fuzzy Rough Sets. IEEE Trans. Fuzzy Syst. 2017, 25, 741–753.
Wang, C.; Shao, M.; He, Q.; Qian, Y.; Qi, Y. Feature subset selection based on fuzzy neighborhood rough sets. Knowl.-Based Syst. 2016, 111, 173–179.

Figure 1. The illustration of a decision system.


Table 1. The detailed information of a loss function matrix.

	$Q$	$Q^{c}$
$a_{P}$	$λ_{P P}$	$λ_{P N}$
$a_{B}$	$λ_{B P}$	$λ_{B N}$
$a_{N}$	$λ_{N P}$	$λ_{N N}$

Table 2. The fuzzy neighborhood data-driven loss function matrix.

	$Q$	$Q^{c}$
$a_{p}$	$λ_{P P} = 0$	$λ_{P N} = {\tilde{S}}^{c} (X \| {[x]}^{δ})$
$a_{B}$	$λ_{B P} = \tilde{S} (X \| {[x]}^{δ}) \tilde{P} (X \| {[x]}^{δ}) ζ$	$λ_{B N} = {\tilde{S}}^{c} (X \| {[x]}^{δ}) (1 - \tilde{P} (X \| {[x]}^{δ})) ζ$
$a_{N}$	$λ_{N P} = \tilde{S} (X \| {[x]}^{δ})$	$λ_{N N} = 0$

Table 3. The core pseudo code of the diagnosis strategy.

Input:	Raw data of each single fault and normal state
Output:	Fault mode
Part I	Prior Knowledge Acquisition
	For each DS regarding a single fault or state
	Initialize: $red = \emptyset$, $Cl = C$, $R_{red} = H$, // H is a large positive number.
	While $Cl \neq \emptyset$
	For $c \in Cl$
	$a = c \cup red$, // $a$ is a temporary set.
	Compute the risk generated by $a$.
	End For
	Find the subset $a = c \cup red$ with the minimum risk, i.e., $R_{a}$.
	If $R_{a} < R_{red}$
	The subset $a$ is the selected set
	End If
	End While
	End For, return the reduction set $red$ of each state.
Part II	Rule Matching
	Utilize the above code to obtain the reduction set $r$ of the given fault data to be diagnosed.
	For each $red$
	Compute the similarity between $red$ and $r$.
	End For
	Find the $red$ with the maximum similarity; its fault is taken as the matched fault mode f.
	Return f
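Part I of the pseudo code is a forward greedy search. The sketch below shows one way such a search could be written. The update of the reduct, the removal of the chosen attribute from the candidate list, and the stopping rule are assumptions, since the pseudo code leaves them implicit and the full algorithm is given in [32,34]. The `toy_risk` function is a hypothetical stand-in and is not the FNζDTRS risk.

```python
from typing import Callable, FrozenSet, Iterable


def greedy_reduct(attributes: Iterable[str],
                  risk: Callable[[FrozenSet[str]], float]) -> FrozenSet[str]:
    """Add one attribute at a time while the decision risk keeps decreasing."""
    red: FrozenSet[str] = frozenset()
    remaining = set(attributes)
    best_risk = float("inf")  # plays the role of the large constant H
    while remaining:
        # Evaluate every one-attribute extension of the current reduct.
        candidate = min(sorted(remaining), key=lambda c: risk(red | {c}))
        candidate_risk = risk(red | {candidate})
        if candidate_risk >= best_risk:
            break  # assumed stopping rule: no extension lowers the risk
        red = red | {candidate}
        best_risk = candidate_risk
        remaining.discard(candidate)
    return red


# Hypothetical risk: missing an informative attribute costs 1, an extra one costs 0.5.
INFORMATIVE = frozenset({"a2", "a3", "a9"})


def toy_risk(subset: FrozenSet[str]) -> float:
    return len(INFORMATIVE - subset) + 0.5 * len(subset - INFORMATIVE)


reduct = greedy_reduct(["a1", "a2", "a3", "a5", "a9"], toy_risk)
print(sorted(reduct))  # ['a2', 'a3', 'a9']
```

In practice the `risk` argument would be replaced by the overall decision risk computed from the FNζDTRS model for the candidate attribute subset.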

Table 4. The information of the employed datasets.

ID	Full Name	Name	Samples	Attribute	Discrete	Continuous	Class	Source
1	Mutagenesis-Atoms	Atoms	1618	10	8	2	2	KEEL
2	Australian Credit Approval	Australian	690	14	8	6	2	UCI
3	Breast Cancer	Breast	277	9	6	3	2	UCI
4	Heart Disease Cleveland	Cleve	296	13	7	6	2	UCI
5	Statlog Heart	Heart	270	13	6	7	2	UCI
6	Iris	Iris	150	4	0	4	3	UCI
7	Website Phishing	Phishing	1353	10	10	0	3	UCI
8	South African Heart	Saheart	462	9	1	8	2	UCI
9	Seismic-Bumps	Seismic	2584	18	12	6	2	UCI
10	Congressional Voting Records	Vote	435	16	16	0	2	UCI

Table 5. The classification accuracy of the reduction subset.

ID	Name	DTRS-EF	DTRS-SMDNS	SPDTRS-EF	SPDTRS-SMDNS	NDTRS	FDTRS	FN3WD	FNζDTRS
1	Atoms	69.65 ± 2.38	71.08 ± 1.66	70.94 ± 1.20	71.27 ± 1.14	70.42 ± 1.61	70.87 ± 2.07	71.71 ± 1.00	72.08 ± 1.13
2	Australian	81.34 ± 5.09	82.39 ± 5.64	82.27 ± 2.87	83.46 ± 0.85	83.31 ± 2.76	72.06 ± 12.31	84.56 ± 0.39	84.97 ± 0.42
3	Breast	72.31 ± 1.06	72.53 ± 0.82	72.67 ± 0.70	72.65 ± 0.72	72.72 ± 0.67	70.38 ± 0.75	73.21 ± 0.29	74.01 ± 0.81
4	Cleve	79.13 ± 1.09	78.34 ± 5.02	78.99 ± 1.01	79.37 ± 0.61	77.69 ± 6.68	66.64 ± 11.29	80.13 ± 0.78	81.34 ± 0.37
5	Heart	78.34 ± 3.38	78.72 ± 5.20	79.99 ± 1.87	79.95 ± 0.88	76.31 ± 7.16	68.37 ± 10.42	80.27 ± 2.25	80.99 ± 1.23
6	Iris	94.85 ± 0.50	94.85 ± 0.62	94.93 ± 0.47	94.87 ± 0.55	94.95 ± 0.41	62.96 ± 17.98	94.82 ± 0.40	95.27 ± 0.33
7	Phishing	84.23 ± 9.62	86.05 ± 6.11	87.08 ± 1.10	87.08 ± 1.10	86.40 ± 5.08	69.93 ± 14.06	87.15 ± 1.06	87.14 ± 1.07
8	Saheart	69.23 ± 0.99	69.01 ± 1.29	69.49 ± 0.38	69.51 ± 0.43	69.35 ± 0.40	67.93 ± 1.99	69.63 ± 1.61	70.35 ± 1.06
9	Seismic	92.64 ± 0.80	92.33 ± 0.90	92.53 ± 0.46	91.94 ± 0.69	91.91 ± 0.70	92.01 ± 0.76	92.56 ± 0.48	93.21 ± 0.15
10	Vote	94.66 ± 0.38	94.67 ± 0.38	94.63 ± 0.36	94.63 ± 0.36	94.65 ± 0.38	83.86 ± 15.36	94.63 ± 0.36	94.65 ± 0.31
	Average	81.64 ± 2.53	82.00 ± 2.76	82.35 ± 1.04	82.47 ± 0.73	81.77 ± 2.59	72.50 ± 8.70	82.87 ± 0.86	83.40 ± 0.69

* Bolded indicates that the model achieves the best performance on this dataset.

Table 6. The number of reduction attributes.

ID	Name	DTRS-EF	DTRS-SMDNS	SPDTRS-EF	SPDTRS-SMDNS	NDTRS	FDTRS	FN3WD	FNζDTRS
1	Atoms	5.5 ± 1.8	6.4 ± 1.8	6.8 ± 0.4	7.8 ± 0.4	5.2 ± 1.2	4.5 ± 1.3	1.2 ± 0.4	2.0 ± 0.0
2	Australian	11.2 ± 1.9	12.2 ± 3.0	11.3 ± 0.6	13.0 ± 0.1	12.7 ± 1.4	4.7 ± 2.7	10.8 ± 0.6	8.4 ± 0.5
3	Breast	7.7 ± 2.7	8.5 ± 2.0	9.0 ± 0.0	9.0 ± 0.1	9.0 ± 0.2	3.6 ± 1.0	8.0 ± 0.0	6.9 ± 0.6
4	Cleve	9.8 ± 0.5	12.5 ± 2.4	9.9 ± 0.5	13.0 ± 0.2	10.2 ± 2.6	4.1 ± 2.7	7.1 ± 1.6	3.0 ± 0.0
5	Heart	7.1 ± 2.0	12.1 ± 2.6	7.9 ± 0.6	12.7 ± 0.4	10.0 ± 3.0	4.8 ± 3.1	6.8 ± 1.3	3.0 ± 0.0
6	Iris	3.8 ± 0.7	3.6 ± 0.8	3.6 ± 0.5	4.0 ± 0.1	4.0 ± 0.0	1.7 ± 1.3	2.6 ± 0.5	1.0 ± 0.0
7	Phishing	8.3 ± 2.3	8.8 ± 1.4	9.0 ± 0.0	9.0 ± 0.0	8.8 ± 1.4	1.1 ± 0.2	9.0 ± 0.0	9.0 ± 0.0
8	Saheart	8.7 ± 1.6	8.5 ± 1.9	9.0 ± 0.0	9.0 ± 0.0	9.0 ± 0.0	6.4 ± 2.8	4.8 ± 0.4	2.6 ± 0.3
9	Seismic	6.6 ± 4.8	10.9 ± 6.3	4.0 ± 0.2	13.4 ± 0.9	13.8 ± 0.5	8.3 ± 0.7	2.0 ± 0.0	1.0 ± 0.0
10	Vote	8.5 ± 0.6	8.5 ± 0.6	8.5 ± 0.5	8.5 ± 0.5	8.5 ± 0.8	1.1 ± 0.3	8.5 ± 0.5	8.1 ± 0.3
	Average	7.7 ± 1.9	9.2 ± 2.3	7.9 ± 0.3	9.9 ± 0.3	9.1 ± 1.1	4.0 ± 1.6	6.1 ± 0.5	4.5 ± 0.2

* Bolded indicates that the model achieves the best performance on this dataset.

Table 7. Information of the telemetry parameters.

ID	Attribute	Rate Range	Data Type	Unit
a₁	Duty cycle	0–1	Continuous	-
a₂	Bus current	13.5–17.3	Continuous	A
a₃	Shunt current	5.3–12.4	Continuous	A
a₄	Battery current	3.6–19.4	Continuous	A
a₅	Output power	1070–1090	Continuous	W
a₆	Battery pressure	2.0–5.4	Continuous	MPa
a₇	Battery quantity	54.3–71.2	Continuous	Ah
a₈	Status word	−1 0 1	Discrete	-
a₉	Bus voltage	40.5–43.1	Continuous	V
a₁₀	Battery voltage	33.0–40.5	Continuous	V

Table 8. Different scenarios for faults in satellite power system.

Scenario	Fault Name	Scenario	Fault Name
0	----	6	F1-F3
1	F1	7	F1-F4
2	F2	8	F2-F3
3	F3	9	F2-F4
4	F4	10	F1-F2-F3
5	F1-F2	11	F1-F2-F4

Table 9. The results of the training process.

Fault Name	The Output Attribute Subset	Average Value of the Data for Attribute a₃
F1	a₅	-
F2	a₇	-
F3	a₂, a₃, a₉	7.46
F4	a₂, a₃, a₉	13.47

Table 10. The results of the testing process.

Fault Name	The Output Attribute Subset	Jaccard Similarity with F1	Jaccard Similarity with F2	Jaccard Similarity with F3	Jaccard Similarity with F4	Average Value of the Data for Attribute a₃	Matching Result
F1-F2	a₅, a₇	0.50	0.50	0	0	-	F1-F2
F1-F3	a₂, a₃, a₅, a₉	0.25	0	0.75	0.75	6.63	F1-F3
F1-F4	a₂, a₃, a₅, a₉	0.25	0	0.75	0.75	12.62	F1-F4
F2-F3	a₂, a₃, a₇, a₉	0	0.25	0.75	0.75	7.47	F2-F3
F2-F4	a₂, a₃, a₇, a₉	0	0.25	0.75	0.75	13.46	F2-F4
F1-F2-F3	a₂, a₃, a₅, a₇, a₉	0.20	0.20	0.60	0.60	6.63	F1-F2-F3
F1-F2-F4	a₂, a₃, a₅, a₆, a₇, a₉	0.17	0.17	0.50	0.50	12.63	F1-F2-F4
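
The Jaccard coefficients in Table 10 can be reproduced from the single-fault attribute subsets in Table 9. The following is a minimal sketch, not the authors' implementation: the function names `jaccard` and `match_faults` are hypothetical, and the matching rule (keep every single fault with a nonzero coefficient, and break ties between faults that share an identical subset by the nearest a₃ mean) is an assumption made for illustration.

```python
def jaccard(a, b):
    """Jaccard similarity |a ∩ b| / |a ∪ b| of two attribute sets."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Single-fault attribute subsets and a3 means, copied from Table 9.
TRAINED = {
    "F1": ({"a5"}, None),
    "F2": ({"a7"}, None),
    "F3": ({"a2", "a3", "a9"}, 7.46),
    "F4": ({"a2", "a3", "a9"}, 13.47),
}


def match_faults(observed, a3_mean=None):
    """Hypothetical matching rule (an assumption, not the paper's exact procedure)."""
    scores = {f: jaccard(observed, subset) for f, (subset, _) in TRAINED.items()}
    candidates = [f for f, value in scores.items() if value > 0]

    # Faults whose trained subsets are identical cannot be separated by Jaccard alone.
    groups = {}
    for f in candidates:
        groups.setdefault(frozenset(TRAINED[f][0]), []).append(f)

    matched = []
    for members in groups.values():
        if len(members) == 1 or a3_mean is None:
            matched.extend(members)
        else:
            # Assumed tie-break: pick the fault whose trained a3 mean is closest.
            matched.append(min(members, key=lambda f: abs(TRAINED[f][1] - a3_mean)))
    return scores, sorted(matched)


# Row "F1-F3" of Table 10: subset {a2, a3, a5, a9}, a3 mean 6.63.
scores, matched = match_faults({"a2", "a3", "a5", "a9"}, a3_mean=6.63)
print({f: round(v, 2) for f, v in scores.items()}, matched)
```

Under these assumptions, the F3/F4 ambiguity (both score 0.75) is resolved by the a₃ average, which sits near 7.46 for F3 and near 13.47 for F4 in Table 9.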

Table 11. The results of the simultaneous-fault diagnosis.

Algorithm	Accuracy/Subset Accuracy	Hamming Loss	Precision	Recall	F1
FNζDTRS–FMS	100.0 ± 0.0	-	100.0 ± 0.0	100.0 ± 0.0	100.0 ± 0.0
Binary Relevance	82.34 ± 5.39	4.43 ± 1.35	100.0 ± 0.0	93.88 ± 1.87	96.28 ± 1.14
Classifier Chain	65.78 ± 8.93	9.30 ± 2.63	100.0 ± 0.0	86.42 ± 4.45	91.31 ± 3.07
Calibrated Label Ranking	0.0 ± 0.0	32.14 ± 7.11	100.0 ± 0.0	45.24 ± 0.0	61.90 ± 0.0
ML-KNN	0.0 ± 0.0	32.15 ± 0.0	99.98 ± 0.0	45.23 ± 7.11	61.89 ± 0.0
ML-DT	0.0 ± 0.0	32.35 ± 0.31	99.59 ± 0.62	45.03 ± 0.31	61.63 ± 0.42

* Bold indicates that the model achieves the best performance on this metric.
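
To make the metrics in Table 11 concrete, the sketch below computes subset accuracy, Hamming loss, and example-averaged precision and recall for multi-label fault predictions. It is illustrative only: the label sets are toy values rather than the paper's data, the function names are hypothetical, and the example-based averaging is an assumption, since this section does not state which averaging convention was used.

```python
LABELS = ["F1", "F2", "F3", "F4"]


def subset_accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label set matches exactly."""
    return sum(set(t) == set(p) for t, p in zip(y_true, y_pred)) / len(y_true)


def hamming_loss(y_true, y_pred, labels=LABELS):
    """Fraction of individual label decisions that are wrong."""
    wrong = sum(len(set(t) ^ set(p)) for t, p in zip(y_true, y_pred))
    return wrong / (len(y_true) * len(labels))


def example_precision(y_true, y_pred):
    """Mean over samples of |true ∩ pred| / |pred| (0 when nothing is predicted)."""
    vals = [len(set(t) & set(p)) / len(p) if p else 0.0 for t, p in zip(y_true, y_pred)]
    return sum(vals) / len(vals)


def example_recall(y_true, y_pred):
    """Mean over samples of |true ∩ pred| / |true| (0 when there is no true label)."""
    vals = [len(set(t) & set(p)) / len(t) if t else 0.0 for t, p in zip(y_true, y_pred)]
    return sum(vals) / len(vals)


# Toy data: the second prediction finds F1 but misses the simultaneous F3.
y_true = [{"F1", "F2"}, {"F1", "F3"}, {"F2", "F4"}]
y_pred = [{"F1", "F2"}, {"F1"}, {"F2", "F4"}]

print(subset_accuracy(y_true, y_pred))    # 2 of 3 sets match exactly
print(hamming_loss(y_true, y_pred))       # 1 wrong decision out of 12
print(example_precision(y_true, y_pred))  # every predicted label is correct
print(example_recall(y_true, y_pred))     # one true label is missed
```

The toy case shows how a method can keep precision at 100% while its subset accuracy drops: predicting only some of the simultaneous faults produces no false alarms but still fails the exact-match criterion.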

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
