1. Introduction
In recent years, MASs have become a hot research topic and are widely used in transportation, multi-robot control and other fields [
1,
2,
3]. Along with the complexity of the working environment and functionality of MAS increases, the safety and reliability of MASs have become more vulnerable, and they are prone to failure. Considering that the agents in a MAS are interacting with each other, when the agent fails, the output of the faulty agent may be deviated, then the neighbor agents of the fault agent and even the whole system will be affected. Therefore, the study of fault diagnosis of a MAS is of great importance.
In addition, the distributed structure of the MAS determines that the agent can only obtain relative information, such as the relative position and relative velocity of its neighbors, while in practice, the communication information between agents may show loss due to the influence of interference present in the real environment or the limitation of the communication range, which in turn leads to unreliability of the fault-diagnosis results of the agents. Therefore, more research is needed on how to solve the problem of decreased fault-diagnosis accuracy due to missing communication data between agents or other reasons.
In recent years, distributed fault-diagnosis methods have been the research focus of fault diagnosis for MASs [
4,
5,
6], among which the observer-based fault-diagnosis method is one of the most commonly used fault-diagnosis methods for MASs. The authors in [
7] constructed an adjustable parameter-based distributed fault estimation observer to obtain fault estimates in agents under directed graph topology. When the actuator of a MAS fails, the authors in [
8] presented a novel distributed-sliding-mode fault-diagnosis scheme based on the local relative output information. The authors in [
9] decomposed a MAS into three subsystems by coordinate transformation, then finite-time unknown input observers were constructed on specific agents by using the communication topology and local information, to achieve distributed fault diagnosis for MASs. What is more, using unknown input observers (UIO) to obtain relative output information between agents is also a commonly used fault-diagnosis scheme. The authors in [
10] constructed a bank of finite frequency
UIOs, and detected the fault in MAS by dividing the unknown input into the decoupled part and the non-decoupled part. The authors in [
11] set UIO on each agent, and the distributed fault diagnosis for MAS was achieved by using the relative information between an agent and its neighbors. Authors in [
12] studied the distributed cooperative fault-detection problems for a MAS, which constructed a robust
observer on each agent, then, a residual-based distributed cooperative FD strategy was presented by using the zonotope method. The authors in [
13] proposed a heterogeneous multi-agent fault-diagnosis method to realize the fault diagnosis through mutual information interaction and a given error signal threshold of a large aircraft actuation system.
Considering the complexity of MASs and the limitations of the actual environment, all the measurements may not be available to access sometimes. When the agent can only obtain relative information, such as the relative position and relative velocity, of its neighbors, the observer may not be observable [
14]. In this case, observer-based fault-diagnosis methods may not be applicable. Authors in [
15] presented a data-driven state prediction and fault classification method based on the BP neural network model by using the residual signal designed the threshold, and then the fault diagnosis is carried out. The knowledge-based methods analyze the system, do not require an accurate model of the system, instead using information about the system and expert knowledge in the relevant field to achieve fault diagnosis, and is applicable to many complex systems [
16,
17]. In this paper, a BRB-based distributed fault-diagnosis method is used in MAS. BRB is an expert system proposed by Yang et al. [
18]. The BRB model can deal with obscure, incomplete or uncertain observational data, and can also achieve good results for small sample data. Therefore, by constructing a fault-diagnosis model of a MAS based on BRB, the influence of external disturbances, such as the environment on fault diagnosis, can be well avoided. At present, BRB-based fault diagnosis has achieved many research results in many fields [
19,
20,
21]. Authors in [
22] constructed the fault-diagnosis model of the hierarchical BRB, realized more efficient fault diagnosis of WSN, and used P-CMA-ES to optimize the parameters of BRB, making the results of the fault diagnosis more reliable. Authors in [
23] considered both the performance of the sensor and the disturbance of the external environment, and quantified it as the property of the BRB model to realize the fault diagnosis of high-speed-train-running gear systems.
Through the analysis of existing research results, a BRB-based fault-diagnosis model for each agent is constructed by using the relative information between the agent and its neighbors. However, when the relative output information of the agent cannot be accurately obtained or the fault-diagnosis accuracy is too low, this causes the diagnosis results to no longer be reliable. Considering the case when there are multiple faulty agents in a MAS at the same time, when problems like this occur, the relative information used for fault diagnosis may be inaccurate, and the results of the fault diagnosis may be deviated. Therefore, in response to the above problems, this paper proposes a topology-switching strategy. According to the BRB-based fault-diagnosis model, the average distance between the obtained fault-diagnosis accuracy of each agent is calculated, and according to this average distance, the decrease in fault-diagnosis accuracy is divided into two cases, then the threshold is set according to the feedback. When the accuracy of the fault diagnosis meets the threshold condition, the topology structure is updated to improve the accuracy and reliability of the fault-diagnosis method.
In this work, based on BRB, a distributed fault-diagnosis model was constructed for MASs with one or multiple faulty agents. In addition, in view of the problem that the accuracy of the fault-diagnosis model decreases, a corresponding topology-switching strategy was proposed, and the main contributions of this paper are as follows:
Firstly, a distributed fault-diagnosis model for MASs based on BRB is constructed, then by combining expert knowledge, the fault diagnosis of MAS is achieved, and the situation when multiple agents fail at the same time is considered, which makes the proposed method more general;
Secondly, the strategy of setting the threshold based on the average distance between the accuracy of the fault diagnosis of each agent is proposed, and the feedback of the average distance of the obtained accuracy is used to analyze the fault-diagnosis accuracy of each agent and the system, then the threshold value is updated;
Compared with the existing BRB-based fault-diagnosis methods [
22,
24], the problem of unreliable fault-diagnosis results due to the system being influenced by the faults or its own factors is considered, and a topology-switching strategy based on the average distance of the output diagnosis accuracy is proposed to update the topology of the agent when the threshold condition is satisfied so that the fault-diagnosis results are more reliable.
The structure of this paper can be divided into the following sections.
Section 2 describes the preliminary knowledge of the MAS, and the problem to be solved in this paper is presented.
Section 3 provides a description of the construction of the fault-diagnosis model.
Section 4 describes the topology-switching strategy proposed in this paper.
Section 5 gives a simulation experiment to validate the effectiveness and advancement of the method. Finally, some conclusions are given in
Section 6.
2. Problem Formulation
Graph theory can intuitively represent the complex communication network of agents, with nodes representing each agent, and edges between nodes to describe the communication interaction between agents. Therefore, consider that an undirected graph expressed by = (, , ) denotes the undirected graph, where denotes the set of nodes and N is the number of nodes, represents the set of edges and ⊆ × , }. Node represents the agent in the MAS, and the neighbors of agent i are defined as . The communication topology of graph is denoted by the adjacency matrix , and if , , else . The Laplacian matrix of graph is denoted as , and if , then , else if .
When the communication topology of the MAS meets certain conditions and is thus adjusted, define the adjusted topology graph as = (, , ), where ⊆ × , } is the set of edges and is the adjacency matrix of the adjusted topology; then, the Laplacian matrix of is .
The dynamics model with fault signals of agent
i can be described by the following system:
where
represents the position of agent
represents the velocity of agent
is the control input, and
is the fault signal, and when there is no fault in agent
; otherwise,
. It should be noted that the number of faulty agents cannot exceed half of the total number of system agents.
Based on the relative position and velocity of agent
i, the following consistency control protocol is proposed for agent
i:
where
are controller parameters.
Assumption A1. Both the initial graph and the adjusted graph are undirected and connected.
Definition 1. For the MAS (2), if the following conditions are met,then, we can say that this multi-agent system achieves consensus. The issues to be addressed in this paper are as follows.
Problem 1. There may be multiple agents malfunctioning at the same time in MAS.
Considering that there are multiple agents that fail at the same time, since the distributed fault diagnosis uses the relative information of the neighbor nodes for fault diagnosis, if an agent and its neighbors fail at the same time, the relative information of the neighbor nodes will be no longer reliable. At this time, the fault-diagnosis accuracy of this agent may decrease. Therefore, to ensure the accuracy and universality of the fault-diagnosis model, it is necessary to consider the case of simultaneous faults of multiple agents.
Problem 2. The accuracy of fault diagnosis is low, and the results of fault diagnosis are unreliable.
Since MASs are limited by the actual industrial environment, and the relative information between an agent and its neighboring agents can be affected by various factors and thus have deviations, this will lead to a situation where the overall fault-diagnosis accuracy of the system is reduced or the local fault-diagnosis accuracy is reduced. The local accuracy is reduced, that is, under the premise of distributed fault diagnosis, the fault-diagnosis accuracy of individual agents is decreased, and the accuracy can be improved by adjusting the topology structure appropriately. Therefore, it is necessary to propose a topology switching strategy to improve the accuracy of fault diagnosis.
4. Inference of the Topology Switching Strategy
In this section, the inference of the topology switching strategy is given. In
Section 4.1, the calculation process of the topology switching threshold is given; next, for MAS satisfying the threshold condition, the topology switching operation is given in
Section 4.2.
4.1. Inference Process of the Threshold
The average distance is an indicator that reflects the central tendency of the data, which refers to the sum of all data in a set of data divided by the number of data in this set. The average distance is one of the important measures to describe the central tendency and dispersion of data samples.
For the BRB-based fault-diagnosis model set on each agent, calculate the average distance between the output fault-diagnosis accuracy, take the sum and multiply by
, where
n represents the total number of samples in the dataset, that is, the number of agents. The specific calculation process is as follows:
where
represents the fault-diagnosis accuracy of the BRB model set on agent
i, and
represents the average distance between the fault-diagnosis accuracy of the BRB output on each agent.
The obtained average distance of fault-diagnosis accuracy can be classified into two cases: the average distance is large and the average distance is small. Therefore, in these two cases, the factors that may lead to the decrease in fault-diagnosis accuracy are analyzed separately, and appropriate thresholds are set.
Step 1: Analyze the possible states of the fault-diagnosis system at this time.
When the average distance is large, it indicates that the gap between the fault-diagnosis accuracy of each agent is large, that is, the fault-diagnosis accuracy of individual agents is low, which may be caused by the fact that an agent has only one neighbor agent and the relative information is insufficient, or the neighbors of an agent are faulty, which may also lead to deviations in relative information, resulting in inaccurate fault diagnosis.
Step 2: When the fault-diagnosis accuracy decreases, analyze the feedback of the average distance of the obtained fault-diagnosis accuracy.
When the obtained average distance is large, it means that it is a local problem caused by the low fault-diagnosis accuracy of the individual agents. At this time, the accuracy of the agent must be sensitive to the threshold, which means that once the accuracy rate is lower than the set threshold, the topology structure should be adjusted in time to ensure the validity and reliability of fault diagnosis. Therefore, when the average distance is large, the set topology switching threshold should be appropriately larger.
Step 3: Set a suitable threshold.
The thresholds are set as follows:
where
represents the judgment standard of
,
and
respectively represent the threshold conditions to be met for topology switching, and note that
.
Step 1: Analyze the possible states of the fault-diagnosis system at this time.
When the average distance is small, there are two possibilities, the first is that the accuracy of the fault-diagnosis model on each agent is very high, indicating that the model is suitable for the distributed fault diagnosis of MAS and has a good effect.
The second is that the accuracy of the fault-diagnosis model on each agent is very low. At this time, the relative information between the agents may be deviated due to the overall problem of the system, and the accuracy of the fault diagnosis will be affected, or because the model is not suitable for fault diagnosing with this MAS.
Step 2: When the fault-diagnosis accuracy decreases, analyze the feedback of the average distance of the obtained fault-diagnosis accuracy.
When the average distance is small, the decrease in the fault-diagnosis accuracy may be a problem of the overall system. The topology-switching threshold should be low. When the accuracy rate is lower than this threshold, it means that the overall fault diagnosis of the system is no longer reliable, and other operations need to be considered.
Step 3: Set a suitable threshold.
The thresholds are set as follows:
Remark 1. The threshold is set according to the fault-diagnosis accuracy of the model. A large switching threshold is intended to monitor and switch the topology in time when the accuracy of the fault-diagnosis model on an agent decreases to ensure the reliability of the fault diagnosis.
Remark 2. After testing, for a MAS with 5 agents, when the fault-diagnosis accuracy of each agent is above 90%, the average distance will not exceed 5, and therefore, 5 can be used as a benchmark to judge whether the average distance is large or small, but the final specific value should be determined based on the experimental results.
4.2. Description of the Topology Update Operation
This paper mainly analyzes the situation that the accuracy of local fault diagnosis decreases, and proposes the corresponding topology-switching strategy. Taking the MAS with 5 agents as an example, as shown in
Figure 3, and assuming that agent 2 and agent 5 are faulty, for two common situations that may lead to low fault-diagnosis accuracy, the topology-switching strategy is as follows:
Case 1: The first is to consider the situation that there are multiple faulty agents in the MAS. At this time, if an agent and its neighbor agents fail at the same time, or all the neighbor nodes of an agent fail, it may lead to deviations in the fault-diagnosis results. It can be solved by adding or switching neighbor agents. For agent 3, it can be seen from
Figure 3 that both of its neighbor agents are faulty. Therefore, for the BRB fault-diagnosis model on agent 3, the specific operations are shown in
Figure 4.
Case 2: Secondly, there is a problem that an agent has insufficient relative information due to only one neighbor or other factors, which will also make the fault-diagnosis result inaccurate. At this time, the neighbor agents of the agent can be added to improve the BRB model data. The specific operation is shown in
Figure 5.
Remark 3. When adding neighbor agents or adjusting the topology, the agent with a small number of neighbor agents should be selected as the new neighbor agent of an agent so as to avoid excessive computational burden on the BRB model and avoid the problem of combinatorial explosion.
Remark 4. Compared with some existing fault-diagnosis methods, this paper gives more consideration to improving the reliability of fault-diagnosis results. This topology-switching strategy is proposed for the fault diagnosis of MAS, which can well cope with the problem of decreasing the accuracy of fault diagnosis, and improve the reliability of the fault-diagnosis methods.
5. Case Study
A. System description
Considering that the vehicles in a multi-vehicle system are interacting with each other, when one or multiple vehicles fail, the output of the faulty vehicle may be deviated, and the neighbor vehicles of the fault vehicles and even the whole system will be affected. Therefore, in this section, consider a multi-vehicle system with 6 vehicles shown in (
18).
where
and
represent the position and velocity of vehicle
i, respectively,
is the control input, and
is the fault signal of vehicle
i. The communication connection of each vehicle in the system is shown in
Figure 6.
This multi-vehicle system uses (2) as the control input, and the initial values of position and velocity of each vehicle are set as follows:
. The control parameters
and
in (
2) are set as 0.6. As shown in
Figure 7, the position and velocity of each vehicle under the control of the control law reach a consistent state over a period of time.
Assume that in this multi-vehicle system, both vehicle 2 and vehicle 6 have constant faults, and the faults are described as
.
Figure 8 shows the status of the vehicles when vehicle 2 and vehicle 6 are faulty, and from the figure, it can be clearly seen that after the system stabilizes, when vehicle 2 and 6 fail, their positions and velocities deviate from those of the vehicle that did not fail.
B. Construction of the fault-diagnosis model
By observing the state of each vehicle when vehicle 2 and vehicle 6 fail, select key factors as the prior attribute, according to
Section 3.1, according to the relative information between the vehicle and its neighbors, and calculate the consistent position error between this vehicle and its neighbors as follows:
Use the obtained consistent position error as the prior attribute of the BRB model.
Figure 9 shows the consistent position error of a vehicle and its neighbors if a vehicle also fails under the premise that both vehicle 2 and vehicle 6 have a constant fault. It can be seen through
Figure 9 that the difference in
between the vehicle that failed and the other vehicles is not significant; therefore, in order to make the characteristics of the premise attribute more obvious, the obtained consistent position error is normalized. According to the consistent position error, after normalization as shown in
Figure 10, set the reference points and reference values for the attributes and output results, and construct the belief rules of BRB, then the BRB-based fault-diagnosis model of this multi-vehicle system can be described as follows:
where
represents the consistent position error after normalization.
According to
Figure 10, take the fault-diagnosis model on vehicle 2 as an example. By combining data, the semantic values for
are set as
, the semantic values for
are set as
, the semantic values for
are set as
, and
Table 1,
Table 2 and
Table 3 show the reference values for each attribute. For the results of fault diagnosis, the semantic values and reference values are set as shown in
Table 4, and
Table 5 shows the belief rules that optimized by P-CMA-ES.
When training the model, the data within [13–15] s are selected for training, and the step size is set to 0.01, so there are 200 sets of data in total. For these 200 sets of data, 100 sets are derived from the data of the vehicle and its neighbors when the vehicle is normal, and the other 100 sets are from the data of the vehicle and its neighbors when the vehicle is faulty. In addition, the number of iterations of the optimized model is set to 50.
Figure 11 shows the fault-diagnosis results of the BRB model set on vehicle 2. Similarly, one can obtain the results of the fault-diagnosis model set on other vehicles;
Table 6 shows the accuracy of the fault-diagnosis model set on each vehicle.
C. Topology switching strategy
According to
Table 6, calculate the average distance of the accuracy of each vehicle’s fault-diagnosis model by the method described in
Section 4.1. When
, the average distance is calculated as follows:
Then, set
; therefore, the thresholds can be set as follows:
It can be seen from the fault-diagnosis results of each vehicle that when vehicle 2 and vehicle 6 fail at the same time, the model diagnosis accuracy of these two vehicles is low, and the set threshold conditions are met, so the topology structure needs to be adjusted. For vehicle 2, the topology adjustment strategy is shown in
Figure 12, and after topology adjustment, the fault-diagnosis accuracy of vehicle 2 reaches
.
Similarly, for vehicle 5, it can be seen that when vehicle 2 and vehicle 6 fail at the same time, since the neighbor nodes of vehicle 5 are vehicle 2, vehicle 6 and vehicle 3, it may be interfered by the faults of vehicle 2 and vehicle 6, resulting in a decrease in the fault-diagnosis accuracy of the BRB model on vehicle 5, which also satisfies the threshold condition and performs topology adjustment as shown in
Figure 13. The adjusted fault-diagnosis accuracy rate also reaches
.
Finally, for vehicle 6, topology adjustment is performed as shown in
Figure 14. The adjusted fault-diagnosis accuracy rate also finally reached
.
Based on the BRB-based fault-diagnosis model for MAS, this paper proposes a topology-switching strategy by considering several common situations that may lead to unreliable fault-diagnosis results. Compared with general fault-diagnosis methods, the reliability of the diagnosis results is improved. It can be seen from
Figure 15 that the accuracy of the fault-diagnosis results before and after adopting the topology-switching strategy is compared.
6. Conclusions
In this paper, a new topology-switching strategy for a BRB-based fault-diagnosis model of MASs is proposed to address the problem of the decreasing accuracy of the fault-diagnosis model. On the one hand, considering the limitations of model-based methods, such as observer-based methods in the actual environment, the distributed fault-diagnosis model based on BRB is constructed, and the model is made more general by taking into account the real environment where multiple fault agents may exist simultaneously in MAS. On the other hand, by analyzing the conditions that may lead to a decrease in the accuracy of the fault-diagnosis model, a strategy of using the feedback of the average distance of fault-diagnosis accuracy to set the threshold is proposed, and according to this set threshold, a topology-switching strategy is proposed to solve the problem of the decreasing model accuracy. The method proposed in this paper can well improve the reliability of the fault-diagnosis model, and the effectiveness of the proposed strategy is verified through simulation experiments applied to the multi-vehicle system.
It is worth noting that the topology-switching strategy proposed in this paper has the defect of poor adaptivity, so the subsequent research will continue to develop the topology adaptive strategy for this problem in order to further improve the fault-detection performance of the fault-diagnosis model.