1. Introduction
A UAV swarm comprises many UAVs equipped with specific autonomous capabilities that enable them to successfully carry out assigned missions through real-time information exchange and collaborative cooperation. A typical application scenario for a UAV swarm is surveillance coverage, which includes tasks such as forest fire monitoring, search-and-rescue operations, and military reconnaissance [
1]. However, during surveillance tasks, some UAVs may encounter disruptions coming from harsh environments or deliberate adversaries. These disruptions can lead to a decrease in surveillance coverage and hinder the overall completion of the assigned tasks. Highly reliable components, redundant designs, and other traditional methods have proven ineffective in effectively and promptly dealing with unpredictable threats and undesirable events [
2]. However, it is anticipated that UAV swarms could be able to withstand disruptive events, handle accidents, and restore their intended performance through self-adaptation [
3].
In a UAV swarm, resilience refers to the system’s ability to keep a desired performance despite incidents that may disrupt its normal operation [
4]. Resilience offers a new perspective on enhancing and improving systems’ ability to bear unforeseen events and recover from disruptive threats. Therefore, resilience is a suitable evaluating concept for a UAV swarm in situations where incidents inevitably occur. The evaluation of the resilience of a swarm primarily focuses on two key aspects: developing an improved network model that accurately describes disruption and recovery events and proposing an appropriate metric for evaluating resilience.
A well-designed topological structure for UAV swarms can greatly enhance their surveillance capabilities [
5]. However, most resilience studies published so far have only focused on the recovery ability of UAV network structures for inevitable threats and disruptive events [
6,
7]. Unfortunately, these studies have limited capability to determine whether the swarm can effectively carry out its assigned surveillance task. To address this gap, Dui et al. [
8] demonstrated that the reliability of UAVs in different locations has varying effects on swarm performance, and they discussed topology enhancements for UAV swarms based on importance metrics. Additionally, Xu [
9] identified node movement as a key factor impacting network performance and developed a novel model for information exchange networks in unmanned swarm systems. Furthermore, Bai et al. [
10] investigated the topology characteristics of UAV swarms, taking into consideration the influence of limited communication range, and proposed an improved UAV swarm model. They also presented an effective method to mitigate the adverse effects of unforeseen threats and disruptive events through topology self-adaptation in UAV swarms. Those studies attempted to utilize the structure self-adaptation method in UAV swarms, but self-adaptation was found to inevitably cause structure and state dynamics problems.
Some studies have focused on the dynamic nature of swarms. For instance, Liu et al. [
11] examined multistate systems, wherein their network and component status follow their probability distributions. Hu et al. [
12] introduced an innovative form of UAV network based on blockchain and a software-defined network architecture. This design enabled the rapid construction of swarm networks while meeting application requirements and communication circumstances. In [
13], a centralized training and decentralized enforcement mechanism was implemented. This approach utilized data gathered by all UAVs to train shared control programs. Each UAV made decisions according to the observational information it collected. Additionally, Wang et al. [
14] proposed a dynamic and distributed method for UAV control based on formation, specifically targeting the connectivity and coverage of spatially dispersed users. They utilized flocking dynamics to develop detailed plans for UAVs during mission execution. These studies have primarily focused on the dynamic characteristics of swarms, which can partially but not comprehensively meet resiliency requirements [
15].
Recent studies have highlighted the significance of interdependent interactions between dynamic characteristics and network topology in UAV swarms [
16]. There is a growing trend towards incorporating resilience considerations into the study of UAVs, particularly in relation to topology and dynamics. George et al. have put forward a groundbreaking framework that adopts a comprehensive approach to resilience design. This framework enables a system to effectively monitor both external and internal disturbances, detect and anticipate threats, and take appropriate actions to counteract any performance-related issues [
15]. In a similar vein, Hu et al. decoupled the data plane from the control plane of each UAV and proposed a practical approach [
12]. Additionally, Mou et al. developed a trajectory planning algorithm based on graph convolutional neural networks that enables the rapid reconstruction of communication links during the self-healing process [
17]. A UAV swarm’s structure and dynamics are two sides of the same coin; as such, it was found that structural self-adaptation and dynamic self-adaptation could not meet the requirements for swarm resilience independently. Considering the cooperation of structure adaptation and dynamic adaptation gives us a feasible direction.
The resilience metric has recently been proposed and applied to multi-UAV system-of-systems (SoS) [
18,
19]. Tran defined the total received information in an information exchange network at any given time as the system performance [
2]. Based on Tran’s work, Bai et al. [
10] proposed an improved resilience metric, where the standard performance of the swarm was calculated using different methods. The operation loop was described as the working process of an unmanned weapon, representing a specific instance of the primary operation loop. Sun et al. [
20] used the quantity of operation loops as a comprehensive metric to characterize the performance of unmanned weapon SoS. Based on information traffic load balancing, Zhang et al. [
21] proposed a resiliency evaluation framework for UAV swarms as well. Zhang et al. [
22] utilized complex network theory to analyze network topology, using topological metrics such as the clustering coefficient, average node degree, and network efficiency as indicators of system performance. Li expanded on the generalized baseline function of information exchange capacity in communication networks, and also considered the heterogeneous communication network topology of UAV swarms. Compared with Tran [
2] and Bai [
10,
23] incorporated functional factors into the exchange capacity model for network information. The majority of studies on surveillance missions predominantly concentrate on achieving optimal coverage area [
24,
25], the recognition and tracking of multiple targets [
26], and the effective distribution of UAVs in a surveillance area [
27].
The above resilience and performance metric may not be suitable for the UAV swarm performing surveillance task. In most cases, UAV swarms are required to carry out a mission successfully, so the minimum performance should be capable of accomplishing the designated surveillance mission [
28,
29]. The commander or administrator may wish to grasp the differences between the real-time performance and the standard performance of a UAV swarm after a disruption occurs. It should be noted that the total information quantity metric of a swarm system received at time
t, as a metric of swarm performance, is unable to provide insights into the characteristics of a surveillance mission [
30]. A significant portion of the information transmitted in the swarm is irrelevant to conducting the surveillance mission. The information metric cannot reflect the situation of overlapping surveillance areas, which is common in surveillance missions [
31,
32]. The validity of using the shortest path method [
9,
33] to calculate the total amount of information in the swarm as a performance indicator requires further proof in practical scenarios [
34].
It is imperative to propose a suitable performance and resilience metric to accurately measure and evaluate a UAV swarm’s ability to perform a surveillance mission [
35]. In order to compensate for the shortcomings in the performance and resilience metric [
9,
33] of the surveillance mission, we utilize the surveillance area of each individual UAV at time
t as a measure of its performance, and the total surveillance area is used as an indicator of the swarm’s overall performance [
32,
36]. By employing this metric, we aim to accurately assess the capacity of a UAV swarm to carry out surveillance missions. The main contributions of our research are summarized as follows:
- I.
We improve the adaptation models of ecosystems and propose a static model, structural adaptation model, dynamic adaptation model, and co-adaptation mode for UAV swarms to resist disruptive events and uncertain threats and ensure performance recovery.
- II.
We propose a new performance metric for UAV swarms executing an surveillance mission. Based on the surveillance mission, we also provide an algorithm to compute this performance metric.
- III.
We develop an improved resilience metric for UAV swarms based on the reported resilience metric.
The remainder of this paper is organized as follows.
Section 2 presents four different models of a UAV swarm. In
Section 3, we investigate an improved resilience metric.
Section 4 verifies the proposed model through several illustrative experiments. In
Section 5, we give our conclusions and future work.
2. Adaptable Model of the UAV Swarm
Figure 1 depicts a sketch map of a UAV swarm, each UAV is characterized as a node and distinguished by numbers. The nodes size denotes the surveillance scope of UAVs and the black lines are communication links between UAVs.
The interaction weight of UAVs (
i,
j) is determined by the number of
js connected neighboring UAVs. The weight between (
i,
j) becomes weaker as the neighboring UAVs number of
j increases. UAV
j contributes
to
i, where the denominator is the total number of
j’s neighbors.
represents the number of mutually connected relationships between
i and
j. For each pair (
i,
j), the contribution of the interaction weight
between
i and
j is calculated using the following expressions:
Adaptation is important for ecosystems to avoid coextinctions when confronted with natural disasters. Previous studies have assumed that link weight keeps unchanged, but this unchanged assumption has recently been questioned. Then, a structure-based approach has been proposed to compensate for the weight loss. Additionally, some studies suggested mechanisms to adjust link weights, and taking into consideration the dynamics of the systems. Recent studies emphasized the interactions importance between architecture and dynamic behaviors. Zhang et al. have put forward a co-adaptation model between structure and dynamics in mutualistic networks [
37]. The UAV swarm is frequently run in harsh conditions and vulnerable to various disruptions, such as severe weather or opponents’ defense. The objective of this research is to examine how a UAV swarm responds to disruptions and ensures its ability to withstand function loss or performance decline. Taking inspiration from the adaptive characteristics of ecosystems, we proposed four models: the static model, structural adaptation model, dynamic adaptation model, and co-adaptation model for the UAV swarm to withstand various disruptions. These four models are detailed presentation in
Section 2.1–
Section 2.4.
2.1. Static Model
In step 1 of static model, as depicted in
Figure 2, we introduce a disruption to the swarm by removing one UAV (Number 7) marked with a red cross, along with all the links connected to this UAV (
,
), which are also marked with a red cross. In static model, removing a node does not result in any change in the interaction weight between the remaining nodes. And, the surveillance area of the remaining nodes remains stable.
2.2. Structural Adaptation Model
The first two steps of structural adaptation model are the same as the static model, as shown in
Figure 3. After the attacked UAV is removed, in step 2, UAV 4 and 5 expand their surveillance area to account for the loss coverage of UAV 7. To simplify the process, when UAV
i is removed, the lost area is distributed evenly among its neighbors. Consequently, for the remaining UAVs, the renewable surveillance area
is calculated using the following equations:
We call this model the structural adaptation model, as it solely depends on the network’s structural information for adjusting the surveillance area. The model is not concerned with the dynamic nature of influence propagation. However, in real-world scenarios, the information exchange among the remaining UAVs also plays a crucial role in the surveillance mission, particularly during unexpected perturbations.
2.3. Dynamic Adaptation Model
Steps 0 and 1 of the dynamic adaptation model are still the same as static model. After removing UAV 7, UAVs 4 and 5, which are the neighbors of UAV 7, will first expand their surveillance areas to compensate for the loss surveillance areas. Then, UAVs 1, 2 and 6, the neighbors of UAVs 4 and 5 will narrow their surveillance areas to balance the overall surveillance mission. Similarly, the surveillance areas of the remaining UAVs will change based on their neighbors, showing a dynamic propagation of influence, as depicted in
Figure 4. We refer to this mechanism as the dynamic adaptation model because the surveillance area is influenced by the dynamic propagation. We calculate the renewable surveillance area of
j using Equation (
3).
is the initial surveillance area of j before disturbance occurs. i is one of the neighbors of j, which was influenced by disturbance earlier than j; is the surveillance area of i after disturbance influence take effect on i; is the number of i’s neighbors, which are affected by the disturbance after i.
2.4. Co-Adaptation Model
In the co-adaptation model, we first repeat steps 0–2 of the structural adaptation model. Then, we proceed to apply step 2 of the dynamic adaptation model. Finally, the remaining UAVs have an equal surveillance area. We call this improved model as the co-adaptation model because we prioritize the implementation of the structural adaptation model and then utilize dynamic adaptation, as shown in
Figure 5. To calculate the renewable surveillance area (
) of UAV
j with
n UAVs removed, we employ the following equation.
n denotes the quantity of removed UAVs; N denotes the total UAVs number.
The results of static model and co-adaptation model have a similar characteristics, that the surveillance area of each remaining UAV is equivalent in each model. However, the surveillance area of each remaining UAV between two model are different. The surveillance area of each remaining UAV in static model method does not change during the mission, which means a overall performance declining after perturbation occurrence. In other words, under the static model, the swarm is not resilient. Conversely, the surveillance area of each remaining UAV in co-adaptation model changes according to the structure and the dynamic of neighbor UAVs, the overall performance of swarm maintains stability. That is, under the proposed model, the resilient swarm can be achieved. Although the last equivalent surveillance area characteristic is same, the response of two models is different. The static model is passive and un-resilient, co-adaptation model is active and resilient, which is helpful for withstanding disruptions and performance recovery.
3. Performance and Resilience Metric of Mission-Oriented UAV Swarm
In this section, we propose an improved resilience metric for mission-oriented UAV swarms, based on the metrics in [
4,
33]. In
Section 3.1, we introduce the surveillance mission and choose the surveillance area as the system performance.
Section 3.2 analyzes the resilience metric provided in [
4,
33] along with its limitations. In
Section 3.3, we propose an improved resilience metric.
3.1. System Performance of Surveillance Missions
Consider the scenario of monitoring an unidentified battlefield zone using a UAV swarm [
34],
, (as depicted in
Figure 6).
performs surveillance mission independently,
represents the radius of the surveillance area [
31,
32]. The mission performance reflects how effectively the swarm fulfills its mission [
16]. In this study, the surveillance area of the targeted battlefield serves as the performance metric for each UAV. The system performance is the total surveillance area of the UAV swarm.
Each UAV diligently oversees the target area without interruption, constantly exchanging information with neighboring UAVs through communication links. The surveillance area for
at a given time
t can be observed in
Figure 7.
In
Figure 7,
r represents the radius of the surveillance target area for each UAV;
l represents the half distance between two adjacent surveillance areas;
i and
j are the centers of two surveillance areas;
denotes the included angle between
r and
l. The shade area scanned by
i and
j occurs simultaneously.
In this paper, the system performance metric refers to the total surveillance area covered by a swarm of UAVs in a battlefield at time
t. The total area covered by the UAV swarm during the surveillance mission is calculated by:
denotes the initial surveillance target area of each UAV before disruptions;
denotes the total UAVs quantity at time
t;
denotes half of the shaded area in
Figure 7, and is obtained through the following calculation:
,
,
,
represent the position of
i and
j. In this study, we compute the raw
by averaging multiple simulations. We subsequently employ the smooth algorithm [
38] to derive the smooth performance values of
.
3.2. Reported Resilience Metric
In [
4,
9,
33], the swarm performance is determined by the total received information number at time
t, which can be calculated by:
is the shortest path from j to i; is associated with the specific mission’s requirement, with a value ranging from 0 to 1; refers to the received information number of i at time t, and denotes the total number of UAVs at time t.
In
Figure 8, we present the performance value
of a single UAV during a threat event from
to
; the performance before the disruption is denoted as
, the minimum performance as
, and the performance after recovery as
; the time at the disruption occurs is
, the time at the system performance reaches its minimum as
, and the time at system recovery begins as
; the values of
,
and
are calculated by
,
,
.
Based on , , , , and , Tran et al. introduce performance factor , absorption factor , recovery factor , recovery time factor , volatility factor , and give the calculation formulas as: , , , , .
Furthermore, Tran et al. propose a resilience metric for each disruption-recovery event.
Finally, the resilience value of a system is calculated by [
3]:
represents the weight of the event; represents the weight factor of ; represents the number of disruption-recovery events; represents the total UAVs number at time t.
The assumption that each UAV is capable of autonomously selecting the shortest path to their targeted node in the swarm is impractical. Furthermore, a significant portion of the information transmitted within a swarm is often redundant for conducting surveillance missions. Using the shortest path to calculate the total amount of information in a UAV swarm network as the system performance may not be suitable. Additionally, the metric of total information received fails to provide insight into the occurrence of surveillance area overlap, which is common in surveillance missions.
3.3. The Improved Resilience Metric
The goal of a resilience plan for a UAV swarm is to achieve the assigned task while enhancing anti-interference capabilities and improving performance even after disruption. An optimal resilience metric should encompass the entire process, from pre-damage preparation to post-recovery evaluation. Based on the resilience metric proposed in [
4], we present an improved strategy for measuring resilience.
We use the MATLAB to obtain performance data through Monte Carlo simulation. The minimum performance is the most crucial factor in mission execution. However, the decline stage is not taken into account in previous studies when discussing
[
4,
9,
33], which is an oversight. In this paper, we obtain the minimum performance (
) using the following expressions
To assess the performance and resilience curve of each disruption-recovery event accurately, it is more suitable to consider the time of disruption occurring rather than
, when discussing the recovery time factor. This is due to the fact that the timeframe [
,
] has little impact on the performance of the disruption-recovery event.
Based on the
and
, the absorption factor can be obtained by:
Based on the renewal factors, the calculating formula for resilient value of each disruption-recovery event is as follows:
5. Conclusions
UAVs are expected to quickly respond to dynamic changes that may occur during missions. Major causes of mission failures are inevitable disruptive events and uncertain threats. To address this, we have developed an improved model for UAV swarms that considers the impact of network structure and dynamic adaptation within the swarm. This improved model enables the swarm to better cope with adverse events and restore its performance. Based on the co-adaptation model, we also proposed a renewed resilience metric. This metric measures the disparity between the minimum performance observed after a disruption event and the standard performance before the disruption. It provides insights into the swarm’s ability to withstand disruptions, recover its desired performance, and effectively accomplish its assigned task.
This model is particularly useful for simulating and evaluating the resilience of mission-based UAV swarms. It focuses on system designs that include surveillance performance and the ability to withstand uncertain threats, as well as adaptability to potential recovery actions. The resilience of the system, measured by , provides a general understanding of its performance by assessing its ability to maintain and restore surveillance capabilities. By analyzing , which considers stochasticity in simulated performance and system adaptation over time, direct comparisons of system resilience can be made. The results of this study indicate that co-adaptation improves both system performance and resilience. However, it is important to note that the resilience gained by co-adaptation is reduced when the surveillance area exceeds the capacity of the UAV swarm. These findings can be valuable in guiding future investment decisions and research agendas in the overall design process. For instance, when combined with capacity analysis, UAV swarm designers can use these assessments to determine if co-adaptation in the surveillance area is a worthwhile task for their system, based on knowledge of specific threat likelihoods.
In this study, we did not consider certain limitations, such as the surveillance capacity of each UAV and the increasing probability of attacks with the surveillance area expands. Furthermore, we recognize that communication distance and time delay can also impact the co-adaptation capacity of our resilience enhancement model for UAV swarms. In future research, we intend to study the influence of surveillance capacity, communication distance, and time delay on our proposed model.