Distributed Grouping Cooperative Dynamic Task Assignment Method of UAV Swarm

Qin, Boyu; Zhang, Dong; Tang, Shuo; Wang, Mengyang

doi:10.3390/app12062865

¹ School of Astronautics, Northwestern Polytechnical University, Xi’an 710072, China

² Shaanxi Aerospace Flight Vehicle Design Key Laboratory, Xi’an 710072, China

* Author to whom correspondence should be addressed.

Appl. Sci. 2022, 12(6), 2865; https://doi.org/10.3390/app12062865

Submission received: 18 January 2022 / Revised: 2 March 2022 / Accepted: 9 March 2022 / Published: 10 March 2022

(This article belongs to the Special Issue Intelligent Autonomous Decision-Making and Cooperative Control Technology of High-Speed Vehicle Swarms)


Abstract

To address the problem of UAV swarms with distributed subsets performing cooperative reconnaissance-and-attack tasks on multiple targets in complex and uncertain combat scenarios, a distributed grouping cooperative dynamic task assignment method is proposed based on an extended contract network protocol. A dynamic task assignment model for a UAV swarm with a distributed-subset topology is established, considering multiple constraints such as task cooperation, execution sequence, dynamic environment, communication topology, payload model, and UAV capability. Because task execution by the swarm involves multiple participants and multiple tasks, a cooperator determination mechanism and a sequential task selection mechanism are proposed, and the contract network protocol is extended accordingly. On this basis, an event-triggered assignment strategy for dynamic tasks is designed. Simulation results show that the proposed method achieves cooperative dynamic assignment of reconnaissance-and-attack tasks on multiple targets in complex and uncertain combat scenarios, improves the adaptiveness of the swarm to sudden events, and optimizes the task execution efficiency of the UAV swarm.

Keywords:

swarm control; distributed swarm; dynamic task planning; task assignment; event-trigger

1. Introduction

Advances in intelligent autonomous systems have made UAV swarm technology and its applications a current research hotspot [1,2]. Cooperative task assignment is an essential component of, and a precondition for, task accomplishment and autonomous control in UAV swarm systems [3].

Cooperative task assignment assigns a considerable number of subtasks of different types, together with their execution order, to each UAV in the swarm while matching the task requirements with UAV capabilities and satisfying the multiple constraints involved. Over the past few decades, several main families of algorithms have been used for UAV swarm cooperative task assignment, including heuristic algorithms and market-based algorithms [4]. Heuristic algorithms generally search a certain range of the solution space in an acceptable time by simulating natural phenomena to obtain feasible solutions to optimization problems. Common methods include the genetic algorithm [5,6,7,8], the particle swarm optimization algorithm [9], the ant colony algorithm [10,11], and the wolf swarm algorithm [12]. For UAV swarms with a distributed architecture, heuristic algorithms usually need global information. Obtaining it consumes considerable communication and computing resources, as well as time, to reach global consensus, so heuristic algorithms are poorly suited to such swarms. In contrast, market-based approaches (such as the contract network protocol), characterized by distributed computation across the swarm, require only local information and offer flexibility, robustness, and high operating speed [13]; hence, they are better suited to distributed UAV swarms. Moreover, because they scale well, market-based approaches can handle cooperative task assignment with complex constraints such as limited communication [14] and time windows [15] and have been applied to dynamic task assignment [16,17,18]. The algorithms adopted are only one part of an assignment approach; the complex and changeable combat scenarios themselves are usually modeled as constraints or objectives from different perspectives.

The crucial topics to be investigated in this field include cooperative task assignment with the trajectory coupled [19,20] as well as under dynamic resistant circumstances [21,22,23]. In [24], a scheme to assign tasks in UAV swarms based on the contract network protocol (CNP) is presented, in which an A* algorithm is applied in flight path planning and path length estimation with a no-fly zone and threat considered. Under this scheme, the coupling between task assignment and flight path planning can be solved. However, their work does not take into consideration either pop-up missions or UAV faults. A study [25] has proposed a novel model for UAV coalition and an algorithm derived from basic geometry that generates a path derived from the original Dubins curve for application in remote sensing missions of fixed-wing UAVs. Another study [26] proposed an unmanned air vehicle (UAV) swarm task and a resource dynamic assignment algorithm based on the task sequence mechanism. By establishing a task sequence, each UAV strictly separates the necessary task time and synchronization waiting time. For the newfound targets, each UAV quickly determines its available time period. According to the available time and task resources, an auction algorithm and a consensus algorithm are used to decompose the task assignment into the initial distributed assignment phase and the swarm consensus phase to develop real-time conflict-free task solutions for UAV swarms. However, their work does not take into account the communication topology and time constraints. In [27], a CNP-based approach to a multi-UAV task assignment is proposed, in which a flight path planning method based on PH curves is combined with cooperative particle swarm optimization (PSO), cooperative variables, and cooperative functions to achieve attack synchronization on certain targets. 
Nevertheless, it takes into account neither no-fly zones, threats, communication topology, and time constraints nor task reassignment in the case of UAV faults. Building on this, [27,28] introduced local communication constraints based on communication distance and the number of information hops to determine whether other UAVs participate in the local task assignment; however, they neglected the communication constraint imposed by the specific topology of the swarm, which is crucial for certain command structures and for improving operational effectiveness.

Compared with [21,22,23,24,25,26,27,28], the problem investigated in this paper is dynamic task assignment for a heterogeneous UAV swarm consisting of distributed subsets with a specific topology. Common constraints, such as time windows and the UAV capability model, are combined with new constraints, such as topology constraints, to build a complex model of dynamic task assignment. The key contribution of this paper is a solution to the problem of cooperative dynamic task assignment for heterogeneous UAV swarms with specific hierarchical communication topologies and multiple other constraints. An extended-CNP-based distributed assignment approach is proposed, with a distributed heterogeneous UAV swarm executing reconnaissance and attack tasks as the main scenario. The swarm consists of several subsets. When the swarm discovers new targets, it first assigns each target to a subset according to the communication topology, and then each subset assigns subtasks to the UAVs within the group. A modified artificial potential field method is adopted to preplan the threat-avoidance flight path, and battlefield survivability and fuel consumption are introduced to describe the flight path’s impact on the task assignment. Correspondingly, consumption penalty and threat penalty functions are designed to resolve the coupling between task assignment and path planning. Meanwhile, constraints on time and cooperation are introduced to adjust the task execution sequence.

The rest of the paper is organized as follows. Section 2 describes the distributed grouping UAV swarm task assignment problem with multiple constraints. Section 3 presents the distributed assignment algorithm based on the extended contract network protocol, combined with the hierarchical communication topologies of the UAV swarm. Section 4 proposes the dynamic cooperative task assignment scheme based on an event-trigger strategy. Section 5 demonstrates the effectiveness of the approach, both in uncertain environments and under UAV failure, through numerical examples, and Section 6 concludes the work.

2. Problem Description

2.1. Mission Scenario Analysis

There are heterogeneous UAV swarms consisting of distributed subsets in the mission area. Each subset is composed of several reconnaissance UAVs, attack UAVs, and reconnaissance-attack UAVs. The procedure of task assignment includes target allocation and subtask assignment. The swarm accomplishes the initial assignment, and then the UAVs cooperatively perform tasks. If a UAV discovers a new target, it relays the target information to others within a limited range according to hierarchical communication topologies, which triggers dynamic task assignment and then updates each UAV’s task sequence.

2.2. Hierarchical Communication Topology

The UAV swarm is divided into several distributed subsets. Communication exists within each subset and among subsets, hence hierarchical communication topology is established. In the application process, the swarm can cooperate to assign and complete tasks through the two-layer mechanism of inter-group cooperation and intra-group coordination based on hierarchical communication topology. The structure of the distributed subsets adopted integrates the advantages of both centralized structure and fully distributed structure to realize “global centralization and local autonomy”, which avoids the problems of low redundancy and the heavy central load of the fully centralized structure as well as the disadvantages of high individual capability requirements, communication complexity, and the command conflicts of the fully distributed structure. The structure conforms to the actual combat scenario as well as the development status of UAV swarm technology and will become more normalized and practical [29,30,31,32].

Algebraic graph theory is used to describe the internal interaction of the UAV swarm system. Assume that the UAV swarm has $N$ UAVs; each UAV is regarded as a node and each communication relationship as an edge. The swarm is then described by a directed graph $G = \{V, E, W\}$, which consists of the node set $V = \{v_1, v_2, \dots, v_N\}$, the edge set $E \subseteq \{(v_i, v_j) : v_i, v_j \in V, i \neq j\}$, and the adjacency matrix $W = [w^{ij}] \in \mathbb{R}^{N \times N}$ with non-negative entries $w^{ij}$. The entries of $W$ are defined as $w^{ij} = 1$ for $(v_i, v_j) \in E$ and $w^{ij} = 0$ otherwise. In addition, $w^{ii} = 0$ for all $i = 1, 2, \dots, N$. The neighbor set of node $v_i$ is $N_i = \{v_j : (v_i, v_j) \in E\}$.
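As a minimal sketch of these definitions (not code from the paper), the adjacency matrix and the neighbor set can be built from an edge list in plain Python. The four-node edge list is invented for illustration, and node indices are zero-based here, whereas the text counts from 1.

```python
# Illustrative 4-node swarm; the edge list is an assumption made for this example.
N = 4
edges = [(0, 1), (1, 0), (1, 2), (2, 1), (1, 3)]  # (i, j) means the edge (v_i, v_j)

# Adjacency matrix W: W[i][j] = 1 if (v_i, v_j) is in E, else 0; the diagonal stays 0.
W = [[0] * N for _ in range(N)]
for i, j in edges:
    if i != j:
        W[i][j] = 1


def neighbors(W, i):
    """Return the neighbor set N_i = {v_j : (v_i, v_j) in E} as a list of indices."""
    return [j for j, w in enumerate(W[i]) if w == 1]


print(neighbors(W, 1))
```

Because the graph is directed, node 3 has no neighbors in this example even though node 1 lists it as one.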

There are one top leader, $N_m$ group leaders, and $N_f$ followers in a UAV swarm with distributed subsets, as shown in Figure 1. Each subset $i$ has $N_{fi}$ followers. The top leader is the highest leader node and leads the initial task assignment. The group leader, which obtains the information of each UAV in the subset, is the leader node of the subset, participates in target allocation on behalf of the subset, and leads subtask assignment; the followers are members of the subset and participate in subtask assignment and execution.

Each node in the UAV swarm is numbered: the index of the top leader is 1, the indices of the group leaders are $i = 2, \dots, N_m + 1$, and the indices of the followers are $i = N_m + 2, \dots, N_m + N_f + 1$. The adjacency matrix $W \in \mathbb{R}^{N \times N}$ has the following form:

$$W = \begin{bmatrix} 0 & W^{TM} & 0 \\ W^{MT} & W^{M0} & W^{MF} \\ 0 & W^{FM} & W^{F0} \end{bmatrix}_{N \times N} \tag{1}$$

where $W^{MT} \in \mathbb{R}^{N_m \times 1}$ and $W^{TM} \in \mathbb{R}^{1 \times N_m}$ describe the communication topology between the top leader and the group leaders, $W^{M0} \in \mathbb{R}^{N_m \times N_m}$ describes the communication topology among group leaders, $W^{MF} \in \mathbb{R}^{N_m \times N_f}$ and $W^{FM} \in \mathbb{R}^{N_f \times N_m}$ describe the communication topology between group leaders and their followers, and $W^{F0} \in \mathbb{R}^{N_f \times N_f}$ describes the communication topology among followers. Assuming that no direct communication exists between any follower and the top leader and that the followers of each subset communicate only with UAVs in the same subset:

$$\begin{aligned} W^{FM} &= diag\{W^{FM_1}, W^{FM_2}, \dots, W^{FM_{N_m}}\} \\ W^{MF} &= diag\{W^{MF_1}, W^{MF_2}, \dots, W^{MF_{N_m}}\} \\ W^{F0} &= diag\{W^{F_1}, W^{F_2}, \dots, W^{F_{N_m}}\} \end{aligned} \tag{2}$$

where $W^{FM_i} \in \mathbb{R}^{N_{fi} \times 1}$ and $W^{MF_i} \in \mathbb{R}^{1 \times N_{fi}}$ express the communication topology between the group leader and its followers in subset $i$, and $W^{F_i} \in \mathbb{R}^{N_{fi} \times N_{fi}}$ is the topology among followers in subset $i$.
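The block structure of Equations (1) and (2) can be illustrated with a short sketch. The connectivity chosen inside each block is an assumption made only for this example: the top leader is linked to every group leader, group leaders are fully connected, each leader is linked to all of its followers, and followers are fully connected within their subset. The paper fixes only which blocks are zero or block-diagonal. Indices are zero-based.

```python
def build_hierarchical_w(followers_per_subset):
    """Build an adjacency matrix with the block layout of Eq. (1) and Eq. (2).

    followers_per_subset[i] is N_fi, the number of followers in subset i.
    Node 0 is the top leader, nodes 1..N_m are group leaders, the rest are followers.
    """
    n_m = len(followers_per_subset)
    n = 1 + n_m + sum(followers_per_subset)
    W = [[0] * n for _ in range(n)]

    def link(a, b):
        # Bidirectional link (assumed symmetric communication).
        W[a][b] = W[b][a] = 1

    leaders = list(range(1, n_m + 1))
    for m in leaders:                       # blocks W^{TM} and W^{MT}
        link(0, m)
    for a in leaders:                       # block W^{M0}
        for b in leaders:
            if a < b:
                link(a, b)
    start = n_m + 1
    for m, n_fi in zip(leaders, followers_per_subset):
        members = list(range(start, start + n_fi))
        for f in members:                   # blocks W^{MF_i} and W^{FM_i}
            link(m, f)
        for a in members:                   # block W^{F_i}
            for b in members:
                if a < b:
                    link(a, b)
        start += n_fi
    return W


# Two subsets with 2 and 1 followers: 1 + 2 + 3 = 6 nodes.
W = build_hierarchical_w([2, 1])
for row in W:
    print(row)
```

The zero blocks in the top-right and bottom-left corners correspond to the assumption that followers never talk to the top leader directly.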

2.3. UAV and Payload Model

Denote the UAV set as $V = \{V_i \mid i = 1, 2, \dots, N\}$, in which any UAV $V_i$ can be described by a seven-element combination $\langle X_i, h_i, C_i, P_i, F_i, D_{i,\max}, N_{vi,\max} \rangle$. $X_i = (x_i, y_i, z_i, v_i, \varphi_i, \psi_i)$ represents the kinematic parameters of UAV $V_i$, including position, velocity, flight path angle, and heading angle; $h_i \in [0, 1]$ represents the survivability of UAV $V_i$, where $h_i = 0$ indicates that $V_i$ has been destroyed or has encountered faults and has completely lost its mission capability; $C_i = \{C_{i1}, C_{i2}, \dots, C_{i n_{C_i}}\}$ contains the indices of the UAVs that communicate with $V_i$; $P_i = \{P_{i,scout}, P_{i,attack}\}$ is the executable task type of $V_i$, where $P_{i,scout} \in \{0, 1\}$ and $P_{i,attack} \in \{0, 1\}$ denote reconnaissance capability and attack capability, respectively (the value is 1 when the capability is available and 0 otherwise); $F_i$ is the fuel consumption rate per unit air-range of $V_i$; $D_{i,\max}$ is the maximum air-range; and $N_{vi,\max}$ is the maximum number of executable tasks of $V_i$.

The condition for a reconnaissance UAV to discover and confirm a threat or target is that the threat or target is located in the detection area of the reconnaissance payload, as shown in Figure 2a. Reference [33] gives the typical mathematical model of the detection area. The condition for an attack UAV to strike a target is that the target is located in the available area of the attack payload, as shown in Figure 2b. References [33,34] give the typical mathematical model of the available area.

2.4. Threat Model

Any threat can be described by a five-element combination $\langle q_{o,i}, R_{o,i}, R_{a,i}, p_{o,i}, F_{o,i} \rangle$, where $q_{o,i}$ is the position vector of the center of threat $O_i$, $R_{o,i}$ is the no-fly zone radius of threat $O_i$, and $R_{a,i}$ is the impact radius of threat $O_i$. When a UAV flies into the no-fly zone, it will be destroyed; when the UAV enters the impact area, it will be affected by the threat. $p_{o,i} \in [0, 1]$ expresses the estimation of threat impact. $F_{o,i} \in \{0, 1\}$ indicates the detected status flag of the target, where the value 0 means the target is undetected; otherwise, the value is 1.

The estimation $p_{o,i}(q)$ of the impact of threat $O_i$ on any point $q$ in space is denoted as

$$p_{o,i}(q) = \begin{cases} 1, & 0 < \| q - q_{o,i} \| < R_{o,i} \\ A_i(q, q_{o,i}), & R_{o,i} \leq \| q - q_{o,i} \| \leq R_{a,i} \\ 0, & \| q - q_{o,i} \| > R_{a,i} \end{cases} \tag{3}$$

where $A_i(q, q_{o,i}) \in (0, 1)$, a function of the distance between the point and the threat center, represents the effect evaluation within the threat range; for the convenience of the study, $A_i(q, q_{o,i})$ can be taken as a constant value. When a UAV approaches threat $O_i$ and enters the impact range, its survivability decreases. Assume that the survivability of UAV $V_j$ is $h_j$ and that it becomes $h'_j$ under the impact of threat $O_i$; the process can be expressed as

$$h'_j = h_j \cdot (1 - A_i) \tag{4}$$
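Equations (3) and (4) can be illustrated with a short sketch. It takes $A_i$ as a constant, as the text permits; the value 0.3 is an arbitrary assumption. The sketch also treats a point exactly at the threat center as lying inside the no-fly zone, which Equation (3) leaves unspecified, and it reuses the survivability update with the full impact value so that a UAV inside the no-fly zone ends with zero survivability.

```python
import math


def threat_impact(q, q_o, r_o, r_a, a_const=0.3):
    """Impact estimate p_{o,i}(q) of Eq. (3), with A_i taken as the constant a_const."""
    d = math.dist(q, q_o)          # Euclidean distance ||q - q_{o,i}||
    if d < r_o:                    # inside the no-fly zone (d = 0 included by assumption)
        return 1.0
    if d <= r_a:                   # inside the impact range
        return a_const
    return 0.0                     # outside the threat's influence


def update_survivability(h, impact):
    """Survivability update of Eq. (4): h' = h * (1 - A_i)."""
    return h * (1.0 - impact)


# A UAV with survivability 0.8 passes through the impact ring of a threat at the origin.
p = threat_impact((3.0, 0.0), (0.0, 0.0), r_o=2.0, r_a=5.0)
print(p, update_survivability(0.8, p))
```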

2.5. Dynamic Task Allocation Problem

We adopt a seven-element combination $\{V, S_g, T, O, M_t, R, C\}$ to describe the dynamic task assignment problem. $V = \{V_1, V_2, \dots, V_N\}$ is the set of UAVs, and $N$ is the number of UAVs in the swarm; $S_g = \{S_{g1}, S_{g2}, \dots, S_{gN_G}\}$ is the set of subsets, and $N_G$ is the number of UAV subsets in the swarm; $T = \{T_1, T_2, \dots, T_{N_T}\}$ is the set of targets, where $N_T$ is the number of targets; $O = \{O_1, O_2, \dots, O_{N_{obs}}\}$ is the set of obstacles, where $N_{obs}$ is the number of obstacles; $M_t = \{M_{t1}, M_{t2}, \dots, M_{t,N_{type}}\}$ is the task type set of each target, where $N_{type}$ is the number of types. For the “Reconnaissance-Attack” mission scenario, the task type set includes two elements, reconnaissance and attack, and can be expressed as $M_t = \{Scout, Attack\}$; $R = \{R_1, R_2, \dots, R_{N_{type}}\}$ is the set of maximum task values, which for the “Reconnaissance-Attack” mission scenario is $R = \{R_S, R_A\}$; $C$ is the set of multiple constraints, mainly including the UAV capability constraint, time window constraint, sequence constraint, and cooperation constraint. These constraints are described as follows:

Definition 1 (UAV capability constraints).

The UAV capability constraints are mainly reflected in three aspects: the maximum range, the executable task types, and the maximum number of executable tasks.

(a) (The maximum range constraint) Assume that the initial state of UAV $V_i$ is $s_{i0}$ and that its task sequence is $Seq_i = \{s_{i1}, s_{i2}, \dots, s_{i,N_{s,i}}\}$, where $N_{s,i}$ is the number of tasks to be executed. From Section 2.3, the maximum range of UAV $V_i$ is $D_{i,\max}$, and the maximum range constraint can be expressed as

$$\sum_{j=1}^{N_{s,i}} L(s_{ij}, s_{i,j-1}) \leq D_{i,\max} \tag{5}$$

where $L(s_{ij}, s_{i,j-1})$, which relates to task $s_{i,j-1}$ and task $s_{ij}$, represents the air-range from the position of task $s_{i,j-1}$ to the position of task $s_{ij}$. The total air-range is therefore coupled with the task sequence.

(b) (The executable task type constraint) In the process of cooperative task execution by the swarm, different sorts of UAVs perform different types of tasks. There is a mapping between the UAV capability $P_i = \{P_{i,scout}, P_{i,attack}\}$ and $M_t = \{Scout, Attack\}$, which can be expressed as

$$P_i = \{P_{i,scout}, P_{i,attack}\} = \begin{cases} \{1, *\} \to Scout \\ \{*, 1\} \to Attack \end{cases} \tag{6}$$

(c) (The task number constraint) The number of payloads and the energy carried by a UAV are limited; thus, it is necessary to restrict the maximum number of tasks performed by the UAV. Assuming that the number of tasks assigned to UAV $V_i$ is $N_{vt,i}$ and that the upper limit is $N_{vi,\max}$, the constraint is expressed as

$$N_{vt,i} \leq N_{vi,\max} \tag{7}$$
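The three capability checks of Definition 1 can be combined into a single feasibility test. The sketch below is illustrative only; the function name, argument names, and sample values are assumptions, and the leg lengths are taken as already computed by a path planner.

```python
def satisfies_capability(leg_lengths, d_max, task_types, p_scout, p_attack, n_max):
    """Check Eq. (5), Eq. (6), and Eq. (7) for one UAV and one candidate task sequence.

    leg_lengths[j] is the air-range L(s_ij, s_i,j-1) of leg j.
    task_types[j] is "Scout" or "Attack" for task j in the sequence.
    p_scout and p_attack are the capability flags P_{i,scout} and P_{i,attack}.
    """
    range_ok = sum(leg_lengths) <= d_max                  # Eq. (5): total air-range
    capability = {"Scout": p_scout, "Attack": p_attack}
    type_ok = all(capability[t] == 1 for t in task_types)  # Eq. (6): payload matches task
    count_ok = len(task_types) <= n_max                    # Eq. (7): task count limit
    return range_ok and type_ok and count_ok


# A reconnaissance-only UAV cannot accept a sequence that contains an attack task.
print(satisfies_capability([120.0, 80.0], 500.0, ["Scout", "Attack"], 1, 0, 3))
print(satisfies_capability([120.0, 80.0], 500.0, ["Scout", "Scout"], 1, 0, 3))
```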

Definition 2 (Sequence constraint).

If there is a specific execution order between subtasks $Task_i$ and $Task_j$, there is a sequence constraint between $Task_i$ and $Task_j$. Reference [9] gives the concrete model of the sequence constraint.

Definition 3 (Time window constraint).

The start time $t_{si,j}$ at which UAV $V_i$ performs task $s_{i,j}$ in $Seq_i = \{s_{i1}, s_{i2}, \dots, s_{i,N_{s,i}}\}$ must lie in the time window $t_{si,j} \in [t_{b,j}, t_{e,j}]$, where $t_{b,j}$ is the earliest start time and $t_{e,j}$ is the latest one. The start time $t_{si,j}$ depends on the previous task and the preplanned flight path of the UAV. Suppose that $t_{e,j-1}$ is the time when the UAV accomplishes the previous task and that $L_i$ is the preplanned air-range from $s_{i,j-1}$ to $s_{i,j}$; then the time window constraint can be expressed as

$$\begin{cases} t_{si,j} \leq t_{e,i} \\ t_{si,j} = \max\left\{t_{b,i}, t_{e,j-1} + \frac{L_i}{V}\right\} \end{cases} \tag{8}$$
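A small sketch of the start-time rule in Equation (8) follows. The argument `speed` stands for $V$, which the text does not define explicitly; treating it as the UAV's flight speed is an assumption, as are the function name and the sample numbers.

```python
def start_time(t_b, t_e, t_prev_done, leg_length, speed):
    """Return the task start time from Eq. (8), or None if the window is missed.

    t_b, t_e: earliest and latest allowed start times of the task.
    t_prev_done: time at which the UAV finishes its previous task.
    leg_length: preplanned air-range from the previous task to this one.
    speed: assumed flight speed of the UAV.
    """
    t_s = max(t_b, t_prev_done + leg_length / speed)  # wait for the window if early
    return t_s if t_s <= t_e else None                # infeasible if the UAV arrives late


print(start_time(10.0, 20.0, 0.0, 300.0, 50.0))   # arrives early, waits until t_b
print(start_time(10.0, 20.0, 5.0, 500.0, 50.0))   # arrives inside the window
print(start_time(10.0, 20.0, 15.0, 500.0, 50.0))  # arrives too late
```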

Definition 4 (Cooperation constraint).

When several UAVs cooperatively perform a reconnaissance or attack task $Task_j$, the expected number of participants $N_{pe,j}$ is introduced, which means that $Task_j$ can be accomplished by at most $N_{pe,j}$ UAVs. The actual number of participants $N_{p,j}$ satisfies

$$0 \leq N_{p,j} \leq N_{pe,j} \tag{9}$$

Definition 5 (The dynamic task allocation problem).

The objective is to find the best assignment, that is, to maximize the swarm’s reward $B_s$ during task execution:

$$\begin{aligned} \hat{B}_s &= \max B_s(V, S_g, T, O, M_t, R) \\ &= \max \sum_{i=1}^{N} \sum_{j=1}^{N_{s,i}} B_{ij}(h_{i,est}, L_{ij}, t_{task_j}) \end{aligned}$$

subject to the constraints:

$$\left\{ \begin{array}{l} \sum_{j=1}^{N_{s,i}} L(s_{ij}, s_{i,j-1}) \leq D_{i,\max} \\ P_i = \{P_{i,scout}, P_{i,attack}\} = \left\{ \begin{array}{l} \{1, *\} \to Scout \\ \{*, 1\} \to Attack \end{array} \right. \\ N_{vt,i} \leq N_{vi,\max} \\ t_{si,j} \leq t_{e,i} \\ t_{si,j} = \max\left\{t_{b,i}, t_{e,j-1} + \frac{L_i}{V}\right\} \\ 0 \leq N_{p,j} \leq N_{pe,j} \end{array} \right. , \quad \forall i \in V$$

where $B_{ij}$ is the reward of UAV $i$ performing task $j$; $h_{i,est}$ is the survivability estimation, which reflects the loss to the UAV performing the task; $L_{ij}$ is the air-range for performing $Task_j$; and $t_{task_j}$ is the start time of task $j$.

Because the objective function is coupled with the contract net protocol process, its detailed expression is given in Section 3.2, where the bidding function of the market mechanism is introduced.

3. Task Assignment Algorithm Based on Extended CNP

This section designs a distributed dynamic task assignment algorithm based on an extended contract network protocol. The core of the contract net protocol (CNP) is to simulate the “bid–win” market mechanism and to optimize the task assignment through the interaction of individuals. The classical CNP is not suitable for sequential task assignment with multiple constraints over multiple rounds, so it needs to be extended according to the complex task constraints.

3.1. Distributed Multi-Constraint Dynamic Task Assignment Algorithm

The algorithm includes four steps: target information release, bidding scheme generation, bid winning authorization, and task execution. The process is shown in Figure 3. The following describes the algorithm process in turn.

Step 1: Release the information of targets

Assume that a UAV detects a sudden target $T_k$ and reports the target information to its subset leader UAV $V_i$, which becomes the tendering UAV on behalf of the subset and sends a bidding invitation to each subset in the local communication network.

According to the hierarchical communication topology, the set of other subset leaders interacting with $V_i$ is $V_{p,i} = \{V_j \mid w_{ij} = w_{ji} = 1, j \in [2, 1 + N_m], j \neq i\}$. $V_{p,i}$ constitutes the set of potential bidders in the assignment of target $T_k$. The tendering UAV $V_i$ releases tendering information $I_i$ to each UAV in the set $V_{p,i}$, where $I_i$ is expressed as

$$I_i = (V_i, T_k, \{Task_k\}, t_{now}, H_b) \tag{10}$$

where $V_i$ is the index of the tendering UAV; $T_k$ is the index of the sudden target; $\{Task_k\}$ denotes the subtask set of target $T_k$; $t_{now}$ is the release time; and $H_b$ is the information state flag, whose value is 0 if the information has not been relayed and 1 otherwise. The information structure of $Task_k$ can be denoted as

$$Task_k = \{M_{t,k}, q_{t,k}, TimeBar_k, R_{x,k}, N_{pe,k}, Th_k\} \tag{11}$$

where $M_{t,k}$ is the task type; $q_{t,k}$ is the position coordinates; $TimeBar_k = [t_{b,k}, t_{e,k}]$ is the time window; $R_{x,k}$ is the maximum task value; $N_{pe,k}$ is the expected number of participants; and $Th_k$ is the negotiation threshold.

As information attached during the assignment of subtask $Task_k$, the negotiation threshold $Th_k$ is applied to preselect bidders from the potential bidder set $\{V_{p,i}\}$, which reduces the negotiation scale and the consumption of communication resources and improves assignment efficiency. To adjust the threshold adaptively, the reward obtained when the subset to which UAV $V_i$ belongs performs $Task_k$ is selected as $Th_k$. If a bidder’s bidding value is greater than $Th_k$, the swarm efficiency will be improved by assigning $Task_k$ to that bidder.

During information release, different UAV groups whose local communication topologies intersect may discover the same target. Multiple tendering UAVs then release information at the same time in their respective local communication topologies, resulting in system conflict and wasted resources. To avoid this problem, different UAV subsets must reach a consensus on the tendering UAV and the tendering information, which is realized by Algorithm 1.

Algorithm 1: Release tendering information and achieve information consensus.
1:	if $I_i$ is not empty
2:	    if $I_i.H_b = 0$
3:	        broadcast $I_i$ to $\{V_{p,i}\}$
4:	    endif
5:	endif
6:	for $j = 1$ to number of tendering invitations received for $Task_k$
7:	    if $I_i.Task_k.Th_k < I_j.Task_k.Th_k$
8:	        $I_i = I_j$
9:	        broadcast StopCmd to $\{V_{p,i}\} - \{V_{p,j}\}$
10:	    endif
11:	endfor

In Algorithm 1, “StopCmd” is the command to abort the negotiation, and the operator “.” refers to an element of the information structure. For any UAV $V_i$ in the local communication topology, the tendering information related to subtask $Task_k$ is released to its directly connected UAVs if it is not empty and has not been transmitted (Lines 1–4). Meanwhile, the UAV can also receive the tendering information related to $Task_k$ transmitted by other UAVs directly connected with it (Lines 6–11).

If UAV $V_i$ finds that the negotiation threshold of UAV $V_j$ for $Task_k$ is greater than its own, $V_i$ saves the tendering information $I_j$ and commands the other UAV groups that interact with $V_i$ but not with $V_j$ to stop the negotiation, so that they exit the assignment dominated by UAV $V_j$.

After Step 1, the tendering information released in the local communication network reaches consensus, that is, $I_i = I_j$.
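Lines 6–11 of Algorithm 1 can be sketched for a single leader as follows. This is an illustrative reading, not the authors' implementation: the `Tender` record keeps only the tenderer index and the threshold, the `peers` dictionary plays the role of the sets $V_{p,i}$, and removing $V_j$ itself from the StopCmd recipients is an assumption of the sketch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tender:
    tenderer: int     # index of the tendering UAV
    threshold: float  # negotiation threshold Th_k for the subtask


def reach_consensus(own, received, peers):
    """Adopt the received tender with the highest threshold (Algorithm 1, lines 6-11).

    own: the tender this leader released itself.
    received: tenders for the same subtask received from connected leaders.
    peers: maps a leader index to the set of leaders it communicates with (V_p).
    Returns the adopted tender and the set of leaders that must receive StopCmd.
    """
    current = own
    stop_targets = set()
    for other in received:
        if current.threshold < other.threshold:
            # Leaders reachable from this UAV but not from the new tenderer must stop.
            stop_targets |= peers[own.tenderer] - peers[other.tenderer] - {other.tenderer}
            current = other
    return current, stop_targets


# Leader 2 talks to leaders 3, 4, 5; leader 3 talks to leaders 2 and 4.
peers = {2: {3, 4, 5}, 3: {2, 4}}
info, stop = reach_consensus(Tender(2, 5.0), [Tender(3, 7.0)], peers)
print(info, stop)
```

In this example leader 2 yields to leader 3, and only leader 5, which leader 3 cannot reach, is told to stop negotiating.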

Step 2: Generate bidding scheme

On the basis of the target information and the various constraints, each potential bidder first determines its group’s bidding scheme for the target, including whether the subset participates in the bidding for target $T_k$, the subtasks that can be performed and the corresponding UAVs, the reward for performing each subtask, and so forth.

Through the contract network within each subset, each subtask of target $T_k$ is preassigned to form the bidding scheme for target $T_k$. The specific process is as follows:

① A subset leader UAV $V_j$ releases subtask information to each follower UAV. To improve the assignment efficiency and shorten the assignment time, a task concurrency mechanism is introduced to publish the information of all subtasks at the same time;

② For subtask $Task_k$, the follower UAV $V_m$ judges, according to the subtask information, whether it has the corresponding type of payload, whether its number of payloads meets the task execution requirements, and whether it can meet the time window $TimeBar_k$. If any constraint is not satisfied, UAV $V_m$ will not bid for subtask $Task_k$;

③ If the above constraints are met, UAV $V_m$ preplans the flight path for $Task_k$ based on the modified artificial potential field (MAP) method and substitutes the preplanned range $L_{mk}$ and the survivability estimation $h_{m,est}$ into the individual bidding function to calculate the bidding value $B_{mk}$;

④ The bidding value $B_{mk}$ is compared with the subtask negotiation threshold $Th_k$. If $B_{mk} < Th_k$, the system efficiency would not be improved by having UAV $V_m$ perform subtask $Task_k$, and $V_m$ actively abandons the pre-assignment bidding for $Task_k$; otherwise, $V_m$ participates in the bidding;

⑤ Each follower UAV performs steps ②–④ to complete the pre-assignment of each subtask. The subset leader UAV $V_j$ generates the subset’s bidding scheme for target $T_k$ according to the pre-assignment results.
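The follower-side decision in steps ②–④ can be sketched as a filter that either returns a bid or declines. The bidding function itself is defined later in the paper, so the sketch takes the bidding value as an input; the function name, parameter names, and sample numbers are assumptions for illustration.

```python
def follower_bid(has_payload, payloads_left, payloads_needed,
                 earliest_arrival, t_e, bid_value, threshold):
    """Return the bidding value if the follower bids for the subtask, else None.

    Step 2: payload type, payload count, and time window must all be satisfied.
    Step 3: bid_value is assumed to come from the individual bidding function.
    Step 4: the follower abandons the bid when bid_value is below the threshold.
    """
    if not has_payload or payloads_left < payloads_needed:
        return None               # capability constraint violated
    if earliest_arrival > t_e:
        return None               # cannot start before the time window closes
    if bid_value < threshold:
        return None               # performing the task would not improve efficiency
    return bid_value


print(follower_bid(True, 2, 1, 12.0, 20.0, 6.5, 5.0))  # bids
print(follower_bid(True, 2, 1, 12.0, 20.0, 4.0, 5.0))  # below threshold, declines
```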

The above is the basic process of bidding scheme generation. Due to the existence of multi-UAV cooperatively performing subtasks and the introduction of the task concurrency mechanism, the generation of the bidding scheme needs to solve a “multi-participants–multi-tasks” assignment optimization subproblem.

In order to solve the subproblem, an assignment mechanism based on the contract network within subsets is constructed. For the subtasks that need to be executed by multiple UAVs, a cooperator determination mechanism is introduced to complete the task allocation; for the sequential subtasks with time windows, a sequential task selection mechanism is introduced. Details are as follows:

(1) The determination mechanism on cooperators

The process of the determination mechanism on cooperators is shown in Figure 4. To simplify the assignment process, give the tenderee the initiative, expand the set of bidders, and provide more feasible pre-assignment schemes, a tenderee bidding mechanism is introduced: when the subtask tenderee (subset leader UAV $V_j$) meets conditions such as the capability constraints and time constraints, it can also participate in the bidding.

If $V_j$ meets the constraints for performing $Task_k$ and decides to bid with the bidding value $B_{jk}$, then $Th_k$ is set to $B_{jk}$. UAV $V_j$ sends the $Task_k$ information and the negotiation threshold $B_{jk}$ to each follower UAV. The bidding value of follower UAV $V_m$ for $Task_k$ is $B_{mk}$. If $B_{mk} > B_{jk}$, $V_m$ decides to bid and feeds $B_{mk}$ back to $V_j$. In this case, the set of all bidders participating in the $Task_k$ assignment is

$$V_{b,k} = \{V_m \mid w_{jm} = w_{mj} = 1, j \neq m; B_{mk} > B_{jk}\} \cup \{V_j\}$$

If tenderee $V_j$ does not participate in the bidding, i.e., $B_{jk} = \emptyset$,

$$V_{b,k} = \{V_m \mid w_{jm} = w_{mj} = 1, j \neq m; B_{mk} > 0\}$$

To sum up, after preselection by the negotiation threshold, all bidders participating in the assignment of $Task_k$ can be represented as

$$V_{b,k} = \begin{cases} \{V_m \mid w_{jm} = w_{mj} = 1, j \neq m; B_{mk} > 0\}, & B_{jk} = \emptyset \\ \{V_m \mid w_{jm} = w_{mj} = 1, j \neq m; B_{mk} > B_{jk}\} \cup \{V_j\}, & B_{jk} \neq \emptyset \end{cases} \tag{12}$$

Consider the cooperative constraints of subtask $Task_k$: assume that $Task_k$ requires the participation of $N_{p,k}$ UAVs, and sort the bidding value set $\{B_{mk}\}$ with the quick sort algorithm. The greedy algorithm is then used to select the $N_{p,k}$ largest bidding values in the set to form the winning bidding value set (reward set) $B_k$:

$$B_k = \{B_{(1),k}, B_{(2),k}, \dots, B_{(N_{p,k}),k}\}$$

(13)

where $B_{(x),k}$ represents the $x$-th element of $\{B_{mk}\}$ in descending order. The winner set $Winner_k$ is correspondingly

$$Winner_k = Index(B_{(1),k}, B_{(2),k}, \dots, B_{(N_{p,k}),k})$$

(14)
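As a minimal sketch of Equations (12)–(14), the following Python function applies the negotiation-threshold preselection and then keeps the $N_{p,k}$ largest bids. The function name `select_cooperators`, the dictionary-based interface, and the assumption that `follower_bids` already contains only followers with a bidirectional link to the leader ($w_{jm} = w_{mj} = 1$) are illustrative choices and are not taken from the original method description.

```python
from typing import Dict, List, Optional


def select_cooperators(
    follower_bids: Dict[str, float],
    leader_id: str,
    leader_bid: Optional[float],
    n_required: int,
) -> List[str]:
    """Return the winner set of one subtask, following Equations (12)-(14).

    follower_bids: bids B_mk of the followers linked to the leader (assumed input).
    leader_bid:    bid B_jk of the tenderee, or None if it does not bid.
    n_required:    number of UAVs N_p,k required by the subtask.
    """
    # Negotiation threshold: the leader's bid if it bids, otherwise 0.
    threshold = leader_bid if leader_bid is not None else 0.0

    # Equation (12): keep only followers whose bid exceeds the threshold.
    bidders = {uav: bid for uav, bid in follower_bids.items() if bid > threshold}
    if leader_bid is not None:
        bidders[leader_id] = leader_bid

    # Equations (13)-(14): rank in descending order and keep the top N_p,k.
    ranked = sorted(bidders, key=lambda uav: bidders[uav], reverse=True)
    return ranked[:n_required]
```

With a bidding leader, only followers that outbid the leader remain candidates, so the leader's threshold prunes the bidder set before any sorting takes place.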

(2) Sequential task selection mechanism

Take the sequential task assignment process of a single follower UAV as an example. After receiving the task information, the bidder determines the executable tasks according to the negotiation threshold, UAV capability constraints, and the execution sequence and combines them to generate an alternative sequence without a time window conflict. Based on the bid winning situation, the optimal sequence is selected as the final signing one. The operation process is shown in Figure 5.

Assume that the concurrent task set $Task^{new} = \{Task_1^{new}, \dots, Task_{N_{task}}^{new}\}$ is received by the follower UAV $V_m$. The set of tasks that $V_m$ can execute and decides to bid for is expressed as

$$Task^{a,m} = \{Task_k^{a,m} \mid k = 1, 2, \dots, N_{a,m}\} \subseteq Task^{new}$$

(15)

where $N_{a,m}$ is the number of elements of $Task^{a,m}$. Denote by $Comb_n(N_{a,m}, N_{vm,n})$ a combination consisting of any $N_{vm,n}$ elements of the set $\{1, 2, \dots, N_{a,m}\}$, where $N_{vm,n} \leq N_{a,m}$.

Definition 6 (Task Sequence Alternative).

If, for all $k \in Comb_n(N_{a,m}, N_{vm,n})$ and all $\bar{k} \in Comb_n(N_{a,m}, N_{vm,n}) - \{k\}$, there is $TimeBar_k \cap TimeBar_{\bar{k}} = \emptyset$, then $Comb_n(N_{a,m}, N_{vm,n})$ is called a sequence alternative, where $TimeBar_k$ is the time window of subtask $Task_k^{a,m}$. The corresponding task sequence alternative is defined as

$$Seq_n^m = \{Task_k^{a,m} \mid k \in Comb_n(N_{a,m}, N_{vm,n})\},\quad n \leq N_{sm},\ Seq_n^m \subseteq Task^{a,m}$$

(16)

where $N_{sm}$ is the number of task sequence alternatives.

After the bidding is completed, the set of losing bidding tasks is denoted by $Task^{de,m}$. After the sequences containing losing bidding tasks are eliminated, the set composed of the remaining sequence alternatives is

$$SeqSet_m = \{Seq_n^m \mid n \leq N_{sm};\ \forall Task_k^{a,m} \in Seq_n^m,\ Task_k^{a,m} \notin Task^{de,m}\}$$

(17)

Assuming that the bidding value of $V_m$ for any subtask $Task_k^{a,m}$ in $Seq_n^m$ is $B_{mk}$, where $Seq_n^m \in SeqSet_m$, the sum of the bidding values of the tasks in $Seq_n^m$ is

$$B_{m,n} = \sum_k B_{mk},\quad k \in Comb_n(N_{a,m}, N_{vm,n})$$

(18)

and the final signing sequence of follower UAV $V_m$ is

$$Seq^{win,m} = Seq_{n_{best}}^m,\quad n_{best} = \underset{n}{\arg\max}\,(B_{m,n})$$

(19)
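The sequential task selection of Definition 6 and Equations (16)–(19) can be sketched in Python as a brute-force enumeration. This is only an illustration under stated assumptions: time windows are treated as closed intervals, `max_tasks` stands for the per-UAV capability limit, and the names `windows_disjoint` and `select_sequence` are hypothetical.

```python
from itertools import combinations
from typing import Dict, List, Optional, Set, Tuple

Window = Tuple[float, float]  # (earliest start, latest end) of a subtask


def windows_disjoint(a: Window, b: Window) -> bool:
    """True when two closed time windows do not intersect."""
    return a[1] < b[0] or b[1] < a[0]


def select_sequence(
    windows: Dict[str, Window],
    bids: Dict[str, float],
    lost: Set[str],
    max_tasks: int,
) -> Tuple[List[str], Optional[float]]:
    """Return the signing sequence and its summed bid (None if none exists)."""
    best_seq: List[str] = []
    best_value: Optional[float] = None
    tasks = sorted(windows)
    for size in range(1, min(max_tasks, len(tasks)) + 1):
        for comb in combinations(tasks, size):
            # Definition 6: every pair of time windows must be disjoint.
            if any(not windows_disjoint(windows[a], windows[b])
                   for a, b in combinations(comb, 2)):
                continue
            # Equation (17): discard alternatives containing a lost task.
            if any(task in lost for task in comb):
                continue
            # Equation (18): sum of the bids of the tasks in the alternative.
            value = sum(bids[task] for task in comb)
            # Equation (19): keep the alternative with the largest sum.
            if best_value is None or value > best_value:
                best_seq = sorted(comb, key=lambda t: windows[t][0])
                best_value = value
    return best_seq, best_value
```

The enumeration is exponential in general; it stays cheap here only because each UAV is limited to a small number of subtasks.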

Through the above mechanism, the subtask pre-assignment scheme of the subset in which $V_j$ is located, that is, the bidding scheme of the subset, is generated:

$$BidSch_j = \{Seq^{win,m} \mid m \in G_j\}$$

(20)

After that, UAV $V_j$ calculates the bidding function of the subset according to the bidding scheme and bids to the tenderee $V_i$.

Step 3: Determine the winners

After the bidding scheme generation and bidding application in Step 2, the tendering UAV $V_i$ selects the scheme with the greatest efficiency according to the bidding value of each group, so as to determine which subset executes the subtasks of target $T_k$.

The set of bidding values of the subsets is $\{B_{g,jk}\}$. The greedy algorithm is applied to select the greatest value and the corresponding subset as the winning value (reward) $B_{g,k}$ and the winner $Winner_{g,k}$. The process can be expressed mathematically as

$$B_{g,k} = \max_j \{B_{g,jk}\},\quad Winner_{g,k} = \underset{j}{\arg\max}\,(B_{g,jk})$$

(21)

After determining the winner, UAV $V_i$ authorizes the subset $Winner_{g,k}$ to perform each subtask of target $T_k$.

Step 4: Perform the tasks

The winner subset formally authorizes each follower UAV in the subset to perform the subtasks, and the pre-assignment scheme in Step 2 becomes the formal assignment scheme. Follower UAV $V_m$ adds the signed subtask sequence $Seq^{win,m}$ to its task sequence to be executed, that is,

$$A_m = A_m \oplus Seq^{win,m}$$

(22)

where $A_m$ is the task sequence to be executed and $\oplus$ denotes the operation of adding the sequence $Seq^{win,m}$ to the sequence $A_m$. Each UAV performs its tasks according to the sequence $A_m$ and the preplanned flight path within the specified time windows.

The pseudo-code of the whole process described above is given in Algorithm 2:

Algorithm 2: The extended Contract Network Protocol.
Input: Task assignment combination {V, Sg, T, O, M_t, R, C}; topology G₀; time t_now
Output: Task execution sequence set {A_m}
/* Step 1: Release the information of targets */
for i from the first index of leader UAVs to the last one
    if V_i is a target tenderee UAV
        {V_p,i} = detTargetPotentialBidders(G₀, C) /* Determine the set of potential bidder subsets. */
        for k = 1 to number of targets
            if T_k is tendered by leader UAV i
                {Task_k} = generateSubtasks(T_k) /* Tenderee UAV i generates subtask information. */
            end if
        end for
        I_i = releaseTargetInfo({T_k}, {Task_k}, t_now, {V_p,i}) /* Tenderee UAV i releases target information. */
    end if
end for
/* Step 2: Generate bidding scheme */
for j from the first index of leader UAVs to the last one
    for i from the first index of tenderee leader UAVs to the last one
        if V_j belongs to {V_p,i}
            Receive I_i and broadcast it to the followers of V_j
        end if
    end for
    for m from the first index of followers of V_j to the last one
        {Task^a,m} = checkTaskConstraints({I_i.Task_k}) /* Select the executable subtasks from {Task_k}. */
        {$Seq_n^m$} = combineTasks({Task^a,m}) /* Generate the set of alternative sequences. */
        {B_m,n} = biddingFuncCal({SeqSet_m}) /* Calculate the bidding function of each sequence. */
        Seq^win,m = $Seq_{n_{best}}^m$, n_best = $\underset{n}{\arg\max}\,(B_{m,n})$ /* Determine the sequence to execute. */
    end for
    B_g,jk = $\sum_m \max_n (B_{m,n})$
end for
/* Step 3: Determine the winners */
for i = 1 to number of tenderee leader UAVs
    for k = 1 to number of targets tendered by leader UAV i
        B_g,k = max{B_g,jk}, Winner_g,k = $\underset{j}{\arg\max}\,(B_{g,jk})$ /* Determine the winner of the tendering for target k. */
    end for
end for
/* Step 4: Perform the tasks */
for n = 1 to number of subsets
    for m = 1 to number of follower UAVs of subset n
        A_m = A_m $\oplus$ Seq^win,m /* Add the signed subtask sequence to the task execution sequence A_m. */
    end for
end for

The worst-case time complexity of the algorithm is analyzed below, taking task assignment by the top leader as an example.

The time complexity of the top leader issuing the bidding information of $N_T$ targets and their subtasks to $N_G$ subset leaders is $O(N_T N_G)$; the complexity of the initial evaluation of the 4 types of constraints presented in Section 2.5 by the $N_G$ subset leaders is $4O(N_T N_G)$; and the complexity of the bidding of the potential bidder subsets $\{V_{p,i}\}$ is $O(K)$, where $K$ is the number of elements in $\{V_{p,i}\}$. After the $K$ potential bidder subsets are determined, each of them needs to pre-assign the subtasks and generate the bidding scheme of the subset.

Assume that each target has $N_t$ subtasks, each subtask needs at most $N_p$ participants, and each subset consists of $(N_f + 1)$ UAVs. Each subset generates the target scheme based on the CNP within the subset. The time complexity of the subset leaders issuing $N_t$ subtasks to their $N_f$ followers is $O(N_T N_t N_f)$. The sequential task selection is an arrangement–combination problem, the time complexity of which is $N_T \cdot O(N_t^3)$, and the time complexity of bidding to the subset leader in each subset is $O(N_T N_t N_f)$. The selection of cooperators by the subset leader can be regarded as a sorting problem, and the quick sort algorithm can be adopted so that the time complexity is $N_T N_t \cdot O(N_p \log_2 N_p)$. Therefore, the time complexity of the subtask pre-assignment is $2O(N_T N_t N_f) + N_T \cdot O(N_t^3) + N_T N_t \cdot O(N_p \log_2 N_p)$.

After the bidding schemes of the subsets are generated, the winning subset is determined. This is a quick sort process, so the time complexity is $N_T \cdot O(N_G \log_2 N_G)$.

To conclude, the overall worst-case time complexity is $2O(N_T N_t N_f) + N_T \cdot O(N_t^3) + N_T N_t \cdot O(N_p \log_2 N_p) + N_T \cdot O(N_G \log_2 N_G)$, which indicates that the algorithm runs in polynomial time.

3.2. Bidding Function Design

As the core of extended CNP, the bidding function, which needs to be designed on the basis of the concrete task assignment problem in CNP, is a sort of objective function. According to the method process described above, the whole task assignment is divided into the target assignment and the subtask assignment. Both the target assignment and the subtask assignment are based on the extended CNP. Hence, the bidding function of each subset is designed for the target assignment, while the individual bidding function within the group is designed for the subtask assignment.

3.2.1. Individual Bidding Function

The individual bidding function $B_{ij}$ consists of the value function $Re_{ij}$ and the penalty function $Pe_{ij}$. Based on the analysis of the UAV swarm task assignment model in Section 2, the individual bidding function mainly relates to: ① the maximum task value, which is given in Section 2.5; ② the flight path; ③ the time window and start time of each subtask.

Since the UAV task assignment is coupled with path planning, the bidding function needs to be designed in combination with the UAV flight path. Analysis of the scenario shows that the impact of the flight path on task assignment is mainly reflected in the UAV survivability estimation $h_{i,est}$ and the air-range $L_{ij}$ (fuel consumption), as shown in Figure 6.

Suppose that UAV $V_i$ participates in the bidding process of subtask $Task_j$ with the time window $t_{task_j} \in [t_{s,j}, t_{e,j}]$, where $t_{task_j}$ is the start time of task execution, the mathematical expression of which is given in Section 2.5, $t_{s,j}$ is the earliest start time, and $t_{e,j}$ is the latest one.

To evaluate the impact of the threats $\{O_k \mid k = 1, 2, \dots, N_{obs}\}$ on UAV $V_i$ performing tasks, the survivability estimation of $V_i$ is introduced, which can be expressed as

$$h_{i,est} = h_i \prod_{k=1}^{N_{app}} (1 - A_{k,app})$$

(23)

where $h_i$ is the current survivability of $V_i$, $A_{k,app} \in (0, 1)$ is the impact of the threat $O_{k,app} \in \{O_k\}$ approached by $V_i$, and $N_{app}$ is the number of threats approached.
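Equation (23) can be illustrated with a few lines of Python. The function name `estimate_survivability` and the list-based input are assumptions made for this sketch; how the impacts $A_{k,app}$ of the approached threats are obtained from the preplanned path is outside its scope.

```python
from math import prod
from typing import Iterable


def estimate_survivability(h_current: float, threat_impacts: Iterable[float]) -> float:
    """Survivability estimation h_est of Equation (23).

    h_current:      current survivability h_i of the UAV.
    threat_impacts: impacts A_k,app of the threats approached along the path,
                    each strictly between 0 and 1.
    """
    impacts = list(threat_impacts)
    if any(not 0.0 < a < 1.0 for a in impacts):
        raise ValueError("each threat impact must lie in the open interval (0, 1)")
    # Every approached threat scales the survivability by (1 - A_k,app).
    return h_current * prod(1.0 - a for a in impacts)
```

For example, a UAV with full survivability that passes two threats of impact 0.5 each is left with an estimate of 0.25, and a path that approaches no threat leaves the estimate unchanged.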

(1) Value function Re_ij

In the actual scenario, the value of a UAV performing a task is generally related to the time at which the UAV starts the task. The closer the start time $t_{task_j}$ is to the earliest start time $t_{s,j}$ of the time window, the greater the value for UAV $V_i$ to perform the task; the closer it is to the latest start time $t_{e,j}$, the smaller the value. Hence, a term related to the start time $t_{task_j}$ is introduced. The value function of $V_i$ performing $Task_j$ is designed as

$$Re_{ij}(h_{i,est}, t_{task_j}) = h_{i,est} \cdot R_{x,j} \cdot e^{-\lambda \frac{t_{task_j} - t_{s,j}}{t_{e,j} - t_{task_j}}}$$

(24)

where $R_{x,j}$ is the maximum value of $Task_j$ and $\lambda$ is the attenuation factor of the start time $t_{task_j}$ on the task value. $Re_{ij}$ is a monotonically increasing function of the battlefield survivability $h_{i,est}$ and decreases monotonically with the start time $t_{task_j}$. When $h_{i,est}: 1 \to 0$ and $t_{task_j}: t_{s,j} \to t_{e,j}$, there is $Re_{ij}: R_{x,j} \to 0$.

(2) Penalty function Pe_ij

The penalty function is composed of the consumption penalty and the threat penalty. Assuming that the per-unit fuel consumption of UAV $V_i$ is $F_i$, the air-range of performing $Task_j$ is $L_{ij}$, and the estimation of survivability is $h_{i,est}$, the penalty function is designed as

$$Pe_{ij}(h_{i,est}, L_{ij}) = \frac{1}{h_{i,est}} \cdot F_i L_{ij}$$

(25)

$Pe_{ij}$ is a monotonically decreasing function of the battlefield survivability $h_{i,est}$. When $h_{i,est}: 1 \to 0^{+}$, there is $Pe_{ij}: F_i L_{ij} \to +\infty$.

The penalty reflects the coupling impact of flight path planning on task assignment. On the one hand, the fuel consumption of performing subtask $Task_j$ is directly related to the preplanned air-range $L_{ij}$; on the other hand, the more threats UAV $V_i$ approaches during the task execution, the lower the survivability estimation $h_{i,est}$, which means more loss when the UAV performs the task.

Combining the value function and the penalty function, the individual bidding function $B_{ij}$ of UAV $V_i$ for $Task_j$ is given by

$$B_{ij}(h_{i,est}, L_{ij}, t_{task_j}) = Re_{ij} - Pe_{ij} = h_{i,est} \cdot R_{x,j} \cdot e^{-\lambda \frac{t_{task_j} - t_{s,j}}{t_{e,j} - t_{task_j}}} - \frac{1}{h_{i,est}} \cdot F_i L_{ij}$$

(26)

where a greater value of $B_{ij}$ indicates a greater reward for performing the task. $B_{ij}$ monotonically increases with $h_{i,est}$ and decreases with $L_{ij}$ and $t_{task_j}$, which agrees with the actual scenario and suggests that $B_{ij}$ has practical significance.
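A minimal Python sketch of Equations (24)–(26) is given below. The function names, the default $\lambda = 0.9$ (the value used later in the simulation), and the handling of the boundary case $t_{task_j} = t_{e,j}$ by returning the limit value 0 are assumptions of this sketch; the units of fuel consumption and air-range are left to the caller.

```python
import math


def value_function(h_est: float, r_max: float, t_task: float,
                   t_start: float, t_end: float, lam: float = 0.9) -> float:
    """Value Re_ij of Equation (24)."""
    if not t_start <= t_task <= t_end:
        raise ValueError("start time must lie inside the time window")
    if t_task == t_end:
        # The exponent tends to minus infinity, so the value tends to 0.
        return 0.0
    ratio = (t_task - t_start) / (t_end - t_task)
    return h_est * r_max * math.exp(-lam * ratio)


def penalty_function(h_est: float, fuel_rate: float, air_range: float) -> float:
    """Penalty Pe_ij of Equation (25)."""
    if h_est <= 0.0:
        raise ValueError("survivability estimation must be positive")
    return fuel_rate * air_range / h_est


def bidding_value(h_est: float, r_max: float, t_task: float, t_start: float,
                  t_end: float, fuel_rate: float, air_range: float,
                  lam: float = 0.9) -> float:
    """Individual bidding value B_ij = Re_ij - Pe_ij of Equation (26)."""
    value = value_function(h_est, r_max, t_task, t_start, t_end, lam)
    return value - penalty_function(h_est, fuel_rate, air_range)
```

Because the penalty grows as $1/h_{i,est}$ while the value shrinks linearly with $h_{i,est}$, a UAV whose path crosses several threats can end up with a negative bid, which is the situation that excludes it from the bidder set in Equation (12).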

3.2.2. Bidding Function of Each Subset

The bidding function of subsets is based on the individual bidding function. For target $T_j$, suppose that the bidding value set obtained after the subtask assignment in subset $G_i$ is $\{B_1, B_2\}$, where $B_1$ and $B_2$ denote the bidding values of subset $G_i$ for performing the reconnaissance subtask and the attack subtask, respectively. The bidding function of the subset is defined as the sum of the elements in $\{B_1, B_2\}$, that is,

$$B_{g,ij} = \sum B_1 + \sum B_2$$

(27)
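The subset bid of Equation (27) and the winner selection of Equation (21) reduce to a sum and an arg max. The sketch below assumes that the winning individual bids of the reconnaissance and attack subtasks are available as lists; the function names and the subset labels are hypothetical.

```python
from typing import Dict, List, Tuple


def subset_bidding_value(recon_bids: List[float], attack_bids: List[float]) -> float:
    """Subset bid B_g,ij of Equation (27): sum of the winning individual bids."""
    return sum(recon_bids) + sum(attack_bids)


def select_winning_subset(subset_bids: Dict[str, float]) -> Tuple[str, float]:
    """Winner and reward of Equation (21): the subset with the largest bid."""
    winner = max(subset_bids, key=lambda subset: subset_bids[subset])
    return winner, subset_bids[winner]
```

The tenderee only needs one scalar per subset, so the inter-group negotiation exchanges far less information than the intra-group one.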

4. Dynamic Assignment Strategy Based on Event Triggering

4.1. Event Trigger Conditions for Dynamic Tasks

Due to the incomplete perception of the situation at the beginning and the occurrence of emergencies during the task execution, the initial assignment scheme needs to be adjusted. Therefore, the event trigger conditions are introduced to judge whether or not to carry out the dynamic task assignment. Event trigger conditions are the key to realizing dynamic task assignments, which need to be selected in combination with different scenarios. The dynamic task assignment process based on event triggering can be expressed as:

$$E: \text{if } g(s(t), s_{t,j}(t)) \geq 0, \text{ then } Target_j \leftarrow TargetInfo_j^{*},\ Task_{j,k} \leftarrow TaskInfo_{j,k}^{*} \quad (k = 1, 2, \dots, N_f)$$

where $g(s(t), s_{t,j}(t)) \geq 0$ is the triggering condition, $s(t)$ is the state of the UAV (such as its position), and $s_{t,j}(t)$ is the status of target $T_j$. $Target_j \leftarrow TargetInfo_j^{*}$ expresses the update of the set of targets, and $Task_{j,k} \leftarrow TaskInfo_{j,k}^{*}$ expresses the update of the set of subtasks.

(1) The unknown targets appear

The initial target assignment is based on the initial situation awareness information, which covers only the known targets and is not guaranteed to cover all the targets in the mission area. When the reconnaissance UAV or reconnaissance-attack UAV $V_i$ flies near the unknown target $T_j$ during task execution, it detects the target, obtains the target information $TargetInfo_{new,j}$, and then updates the target set and the subtask set. The event trigger condition can be described as

$$g(s(t), s_{t,j}(t)) = DA(r_i, r_{nt,j}, R_{detect,i})$$

where $DA(r_i, r_{nt,j}, R_{detect,i})$ is defined as the reconnaissance payload constraint, which is the constraint on the relative position and angle relations between UAV $V_i$ and target $T_j$; the details are given in reference [34]. When target $T_j$ is located in the detection area of the reconnaissance-attack UAV or reconnaissance UAV, $DA(r_i, r_{nt,j}, R_{detect,i}) \geq 0$. Here, $r_i$ is the position vector of UAV $V_i$, $r_{nt,j}$ is the position vector of target $T_j$, and $R_{detect,i}$ is the detection range of UAV $V_i$. The event-triggering process can be expressed as

$$E_1: \text{if } DA(r_i, r_{nt,j}, R_{detect,i}) \geq 0, \text{ then } Target_{N_{target}+j} \leftarrow TargetInfo_{new,j},\ Task_{N_{target}+j,k} \leftarrow TaskInfo_{new,j,k} \quad (k = 1, 2, \dots, N_{newtask,j})$$

where $Target_{N_{target}+j} \leftarrow TargetInfo_{new,j}$ indicates the update of the target set, $N_{target}$ is the number of targets, $Task_{N_{target}+j,k} \leftarrow TaskInfo_{new,j,k}$ indicates the update of the subtask set, and $N_{newtask,j}$ is the number of subtasks of $T_j$.

(2) The UAV encounters faults and cannot perform tasks

During task execution, a UAV may encounter non-cooperative behaviors such as collision and attack and lose its mission capability. Its tasks that have not been executed are then reassigned. The event trigger condition can be described as

$$g(s(t), s_{t,j}(t)) = h_{i,b} - h_i$$

where $h_i$ is the survivability of $V_i$ and $h_{i,b}$ is the minimum survivability of $V_i$ with normal capability, which is selected according to the concrete scenario. The event-triggering process can be expressed as

$$E_2: \text{if } h_{i,b} - h_i \geq 0, \text{ then } Task_{j,k} \leftarrow TaskInfo_{i,j,k} \quad (k = 1, 2, \dots, N_{i,j})$$

where $Task_{j,k} \leftarrow TaskInfo_{i,j,k}$ is the reassignment of the subtasks that $V_i$ was to perform.
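The two trigger conditions can be sketched as simple predicates in Python. Note the loud assumption: the actual reconnaissance payload constraint $DA$ is defined in reference [34] and also involves angle relations; the stand-in `detection_margin` below checks the distance only and is not the constraint used in the paper. All function names are hypothetical.

```python
import math
from typing import Tuple

Position = Tuple[float, float]


def detection_margin(uav_pos: Position, target_pos: Position,
                     detect_range: float) -> float:
    """Simplified, range-only stand-in for DA (assumption, ignores angles).

    Non-negative when the target lies within the detection range of the UAV.
    """
    return detect_range - math.dist(uav_pos, target_pos)


def new_target_triggered(uav_pos: Position, target_pos: Position,
                         detect_range: float) -> bool:
    """Event E1: an unknown target enters the detection area."""
    return detection_margin(uav_pos, target_pos, detect_range) >= 0.0


def failure_triggered(h_current: float, h_min: float) -> bool:
    """Event E2: the survivability h_i drops to or below the minimum h_i,b."""
    return h_min - h_current >= 0.0
```

In a simulation loop, these predicates would be evaluated at every time step for each UAV, and a `True` result would start the corresponding reassignment process.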

4.2. The Basic Process of Dynamic Task Assignment

Before describing the basic process of cooperative dynamic task assignment, the following assumptions are given for the scenario:

1. Based on the early situation awareness, several targets and pieces of threat information have been obtained, including target location, number of UAVs required to perform each subtask, threat location, and impact range;

2. Due to incomplete situational awareness, there are unknown targets and threats. After the UAV detects an unknown target or threat, it broadcasts the threat information, transmits the target information to the group leader, and triggers the assignment process.

Based on the scenario and assumptions above, the basic process of cooperative dynamic task assignment is shown in Figure 7. The concrete steps are as follows:

1. Assign the known targets. The top leader assumes the role of the tenderee, dominates the initial assignment, and releases the target information to each group leader based on the inter-group contract network.
2. Each group leader that has direct communication with the top leader receives the target information, releases the subtask information to their followers, and determines the group’s bidding scheme, which is obtained by subtasks pre-assignment based on the group’s internal contract network.
3. The top leader selects the group with the largest bidding value as the bid winner according to the bidding schemes of the group submitted by the group leaders and assigns the target to the group.
4. According to the assigned target, each subset performs tasks according to the corresponding subtask assignment scheme.
5. If a UAV fails to perform tasks due to sudden failure, the tasks shall be assigned to other UAVs in this subset first; if no UAV in the subset can continue to perform, the group leader will release the tasks to other subsets, and then others will determine the pre-assignment scheme based on the contract network within the group. The tendering group shall determine the assistance according to each bidding scheme.
6. If a UAV (including the group leader) detects a sudden target, it shall be reported to the group leader. After receiving the target information, the leader of this subset will release the target to other subsets. Each subset determines the pre-assignment scheme based on the contract network within the group, carries out the bidding based on the inter-group contract network, and, finally, completes the target and subtask assignment.
7. Repeat steps 4–6 until there is no dynamic change.
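The reassignment rule for a failed UAV (own subset first, other subsets only if nobody in the own subset can take over) can be sketched as follows. The bid callback `bid_fn`, which returns `None` for an infeasible task, the rule that only positive bids are eligible, and all bid values in the usage example are assumptions made for illustration; the values are not taken from the simulation tables.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical interface: bid of a UAV for a task, or None if it cannot take it.
BidFn = Callable[[str, str], Optional[float]]


def best_bidder(candidates: List[str], task: str,
                bid_fn: BidFn) -> Optional[Tuple[str, float]]:
    """Return (UAV, bid) with the largest positive bid, or None if none exists."""
    bids: Dict[str, float] = {}
    for uav in candidates:
        value = bid_fn(uav, task)
        if value is not None and value > 0.0:
            bids[uav] = value
    if not bids:
        return None
    winner = max(bids, key=lambda uav: bids[uav])
    return winner, bids[winner]


def reassign_task(task: str, failed_uav: str, own_subset: List[str],
                  other_subsets: List[List[str]], bid_fn: BidFn) -> Optional[str]:
    """Reassign one subtask of a failed UAV; return the new executor or None."""
    # First try the remaining UAVs of the failed UAV's own subset.
    healthy = [uav for uav in own_subset if uav != failed_uav]
    result = best_bidder(healthy, task, bid_fn)
    if result is not None:
        return result[0]
    # Otherwise each other subset proposes its best member, and the
    # tendering subset accepts the proposal with the largest bid.
    proposals = [p for p in (best_bidder(s, task, bid_fn) for s in other_subsets)
                 if p is not None]
    if not proposals:
        return None
    return max(proposals, key=lambda p: p[1])[0]


# Illustrative bid table (made-up numbers).
TABLE = {("V4", "T2"): 40.0, ("V2", "T2"): -5.0, ("V2", "T6"): -5.0,
         ("V8", "T6"): 35.0, ("V3", "T6"): 20.0}


def demo_bid(uav: str, task: str) -> Optional[float]:
    return TABLE.get((uav, task))
```

With this table, `reassign_task("T2", "V5", ["V2", "V4", "V5"], [["V3", "V8"]], demo_bid)` keeps the task inside the subset, while the same call for `"T6"` falls through to the other subset.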

5. Numerical Simulation

It is assumed that the swarm consists of 1 top leader and 2 subsets, with a total of 11 UAVs, including 2 reconnaissance-attack UAVs, 4 reconnaissance UAVs, and 4 attack UAVs. Among them, reconnaissance-attack UAVs are the group leaders. Consider the UAV swarm communication topology G₀, as shown in Figure 8.

Each UAV can perform no more than two subtasks, and each subtask is performed jointly by, at most, two UAVs. For flight path preplanning, the UAV kinematics model in reference [35] and the modified artificial potential field method [36,37,38] are used to realize the air-range estimation considering threat avoidance. The capability parameters of each UAV under the scenario are shown in Table 1, and the parameters of reconnaissance payload and attack payload are shown in Table 2 and Table 3, respectively.

The feasible area of UAV reconnaissance and attack for a target on the ground, based on the above parameter configuration, is shown in Figure 9.

Consider a 5 km × 5 km mission scenario. The information of the known targets and the randomly generated time window constraints of the subtasks are shown in Table 4.

5.1. The Initial Task Assignment for Known Targets

Assume that the UAV swarm performs the reconnaissance-and-attack tasks at six enemy targets. The location of each UAV, threat, and target is shown in Figure 10.

Purple threats represent known threats, and blue threats represent unknown ones; the coordinates of all threats are randomly generated. The targets are marked with “T”. Each UAV is marked with “V”, where “S” represents the reconnaissance UAV, “A” represents the attack UAV, and “SA” represents the reconnaissance–attack UAV.

In the simulation, the maximum value of the reconnaissance subtask is set at 100, the maximum value of the attack subtask is set at 150, and the attenuation factor $\lambda$ is set at 0.9.

Adopting the designed UAV swarm cooperative dynamic task assignment approach, the sequential task assignment results for known targets are shown in Table 5. The task execution plan of each UAV is expressed in the format of (Target, Subtask type, Start time (s), Winning bidding value).

According to the known threat information and the unknown threat information detected by the reconnaissance UAVs during mission execution, the modified artificial potential field method, which avoids local optima, is adopted to preplan a flight path that accounts for threat avoidance during mission execution. Combining the UAV swarm task assignment results, the preplanned path, and the task sequence, the swarm task execution diagram is obtained, as shown in Figure 11.

As can be seen in Figure 11, the cooperative task assignment approach realizes the “reconnaissance–attack” coverage of the known targets while accounting for the coupling with flight path planning and the task time window constraints.

5.2. Dynamic Task Assignment for Sudden Targets

Considering two unknown targets, T7 and T8, the swarm discovers targets through cooperative detection and then assigns tasks. The information of T7 and T8 is shown in Table 6, and the initial operational situation is shown in Figure 12.

The time when unknown targets are discovered by the UAVs and the corresponding subsets that discover the targets are shown in Table 7. The generated subtask time window constraints are shown in Table 7 as well. The sequential task assignment results for the unknown targets are shown in Table 8.

The swarm realizes the coverage of the sudden targets through the distributed cooperative dynamic task allocation mechanism after the reconnaissance UAV V8, belonging to Subset 2, and the reconnaissance UAV V5, belonging to Subset 1, detect the sudden targets T7 and T8, respectively, during task execution.

The concrete analysis is as follows. The reconnaissance UAV V4 and the attack UAV V6, belonging to Subset 1, are assigned to perform the subtasks of sudden target T8 successively in the corresponding time windows. The reconnaissance UAV V8 and the reconnaissance-attack UAV V3, belonging to Subset 2, successively perform the subtasks of sudden target T7 in the corresponding time windows, so as to realize the “reconnaissance–attack” coverage of each sudden target.

We combine the assignment results with the preplanning flight path and task execution sequence based on the artificial potential field method. The diagrams of UAV swarm task execution and the execution sequence under sudden targets are shown in Figure 13.

Figure 13 indicates that through the dynamic task assignment approach based on the extended CNP, the swarm can mobilize the UAVs with task capability and the best execution effect obtained after bidding negotiation to complete tasks for each sudden target according to the time window.

5.3. Reassignment for Subtasks of Failed UAV

During the simulation, the reconnaissance UAV V5, belonging to Subset 1, fails and loses the capability to perform tasks. The failure time is 97.5 s.

According to the assignment results in Section 5.1, the reconnaissance UAV V5 prepares to perform the reconnaissance subtask. The fault of V5 triggers dynamic task assignment, and the tasks (T2, s, 117.5, 62.14)→(T6, s, 167.0, 59.20) of V5 become dynamic tasks. The bidding values of each group of potential bidders (including V2, V4, V3, and V8) are shown in Table 9. The reassignment results of failed UAV subtasks obtained by the designed assignment approach are shown in Table 10.

The task assignment results in Table 10 show that UAV V4 of Subset 1 and UAV V8 of Subset 2 replace UAV V5 and continue to perform its two subtasks, respectively.

Combined with the assignment results, the diagrams of UAV swarm task execution and the execution sequence under UAV V5 failure are shown in Figure 14.

The assignment result can be explained as follows. After the reconnaissance UAV V5 loses its mission capability, its subtasks shall be preferentially executed by other reconnaissance UAVs or reconnaissance-attack UAVs (i.e., V2 and V4) in Subset 1. It can be seen from Figure 14a that, after the initial assignment in Section 5.1, UAV V4 can perform one more task. Therefore, through the contract network within the group, it undertakes one reconnaissance subtask of V5. At this time, V2 is executing the attack subtask T1(a) according to the initial assignment results. During this execution, it crosses several threat areas, resulting in a low survivability estimation and a negative bidding value, so it is not suitable for taking over the remaining subtask. Therefore, the leader UAV V2 of Subset 1 releases the information to Subset 2 for assistance through the contract network among the groups. Subset 2 conducts bidding negotiation through the contract network within the group and finally assigns UAV V8 to assist Subset 1 by taking over the reconnaissance subtask T6(s).

5.4. The Analysis of the Real-Time Performance of the Assignment Approach

All the simulation experiments were implemented on a personal computer with an Intel Core i5-5350U CPU @ 1.80 GHz and 8 GB of RAM; the programming environment is MATLAB 2018b.

5.4.1. Real-Time Performance with Different Problem Scales

The real-time performance of the assignment approach is mainly influenced by three factors: ① the number of subsets of the UAV swarm; ② the number of UAVs in each subset; ③ the number of targets.

To illustrate the impact of the factors above, the variable-controlling method is adopted. The details of simulation cases to be compared are in Table 11. Correspondingly, the comparison of assignment times in different cases is shown in Figure 15.

According to the comparison among the simulation cases, on the one hand, the assignment time in Figure 15a,b increases slowly, from which it can be concluded that the real-time performance of the proposed approach has low sensitivity to the size of the UAV swarm. On the other hand, comparing (a) with (b), the task assignment time is more sensitive to the number of subsets than to the number of UAVs in each subset.

Moreover, according to Figure 15c, the task assignment time is sensitive to the number of targets, which indicates that the performance of the approach may become worse with the number of targets increasing.

5.4.2. Algorithm Comparison

In order to verify the effectiveness of the distributed approach based on the contract network protocol (CNP) in solving the UAV swarm task assignment problem, it is compared with the centralized task assignment approach based on ant colony optimization (ACO) used in reference [10]. Scenarios are designed with UAV swarms of different sizes performing multiple task assignments. The simulation results show that both the centralized approach (ACO) and the distributed approach (CNP) are able to obtain a solution to the UAV swarm task assignment problem. However, the distributed approach (CNP) has an obvious advantage in terms of solution efficiency. The solution times for assignment problems of different sizes are shown in Table 12.

Remark:

Generally, a global optimal solution to such problems can rarely be obtained. Therefore, feasible solutions to the task assignment problem obtained in the solving process are considered in general.

As a heuristic algorithm, ACO searches in the solution space, learns from the results of each iteration, and finally converges to a feasible solution, which means that ACO requires a lot of computational resources and a much longer time. Simulation results show that this problem becomes more pronounced when the problem size (e.g., size of the UAV swarm, the number of targets) increases.

In contrast to heuristic algorithms such as ACO, CNP merely performs the optimization by bidding on each subset of UAVs as well as UAV individuals without the iterative process of searching for feasible solutions in a large solution space.

The results illustrate that, as a distributed assignment approach, the proposed method in this paper has advantages in real-time performance compared with the ACO proposed in reference [10], which proves the effectiveness of the method to some extent.

6. Conclusions

Aiming at the problem of the cooperative reconnaissance–attack task assignment of UAV swarms in complex environments, this paper proposes a distributed grouping cooperative dynamic task assignment approach by considering multiple constraints, realizes the effective assignment of cooperative reconnaissance–attack tasks to multiple targets, and optimizes the combat effectiveness of the swarm. The main conclusions include:

(1) The proposed extended CNP algorithm, which is based on the determination mechanism on cooperators and the selection mechanism of sequential tasks, with a bidding function that considers the constraints of sequence, flight path, and threat, can realize the reasonable assignment of reconnaissance and attack tasks on multiple targets under multiple constraints and the optimization of UAV swarm task execution efficiency.

(2) The proposed dynamic task assignment strategy based on the event-trigger mechanism constructs the overall architecture of cooperative dynamic task assignment for the distributed grouping of the UAV swarm and improves the adaptability of the swarm to the dynamic environment and sudden targets during task execution.

(3) Three typical simulation scenarios are designed. The simulation results show that the task assignment approach in this paper can solve the problem of cooperative sequential dynamic task assignment when the UAV swarm with subsets performs reconnaissance–attack tasks in a complex environment with incomplete situational awareness and sudden targets and realize reconnaissance and attack coverage on each target. Moreover, the real-time performance of the assignment approach has been analyzed, which indicates that the proposed approach has low sensitivity to the size of the UAV swarm.

Author Contributions

Conceptualization, B.Q. and D.Z.; methodology, B.Q.; software, B.Q.; validation, B.Q.; formal analysis, B.Q.; investigation, B.Q. and D.Z.; resources, D.Z. and S.T.; data curation, B.Q. and M.W.; writing – original draft preparation, B.Q.; writing – review and editing, B.Q.; visualization, B.Q.; supervision, D.Z. and S.T.; project administration, D.Z. and S.T.; funding acquisition, D.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant numbers 61933010 and 61903301.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available upon request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Zhang, Y.F.; Lin, D.F.; Zheng, D.; Cheng, Z.H.; Tang, P. Task Allocation and Trajectory Optimization of UAV for Multi-target Time-space Synchronization Cooperative Attack. Acta Armamentarii 2021, 42, 1482–1495. (In Chinese)
2. Jiang, X.; Zeng, X.L.; Sun, J.; Chen, J. Research status and prospect of distributed optimization for multiple aircraft. Acta Aeronaut. Astronaut. Sin. 2021, 42, 90–105. (In Chinese)
3. Zhang, J.; Xing, J.H. Cooperative task assignment of multi-UAV system. Chin. J. Aeronaut. 2020, 33, 2825–2827.
4. Duan, H.B.; Zhao, J.X.; Deng, Y.M.; Shi, Y.H. Dynamic Discrete Pigeon-Inspired Optimization for Multi-UAV Cooperative Search-Attack Mission Planning. IEEE Trans. Aerosp. Electron. Syst. 2020, 57, 706–720.
5. Ye, F.; Chen, J.; Tian, Y.; Jiang, T. Cooperative Task Assignment of a Heterogeneous Multi-UAV System Using an Adaptive Genetic Algorithm. Electronics 2020, 9, 687.
6. Jia, Z.Y.; Yu, J.Q.; Ai, X.L.; Xu, X.; Yang, D. Cooperative multiple task assignment problem with stochastic velocities and time windows for heterogeneous unmanned aerial vehicles using a genetic algorithm. Aerosp. Sci. Technol. 2018, 76, 112–125.
7. Wu, W.N.; Wang, X.G.; Cui, N.G. Fast and coupled solution for cooperative mission planning of multiple heterogeneous unmanned aerial vehicles. Aerosp. Sci. Technol. 2018, 79, 131–144.
8. Wang, Z.; Liu, L.; Long, T.; Wen, Y.L. Multi-UAV reconnaissance task allocation for heterogeneous targets using an opposition-based genetic algorithm with double chromosome encoding. Chin. J. Aeronaut. 2018, 31, 339–350.
9. Wei, C.; Ji, Z.; Cai, B. Particle Swarm Optimization for Cooperative Multi-Robot Task Allocation: A Multi-Objective Approach. IEEE Robot. Autom. Lett. 2020, 5, 2530–2537.
10. Su, F.; Chen, Y.; Shen, L.C. UAV Cooperative Multi-task Assignment Based on Ant Colony Algorithm. Acta Aeronaut. Astronaut. Sin. 2008, 29, 184–191. (In Chinese)
11. Zhen, Z.; Xing, D.; Gao, C. Cooperative search-attack mission planning for multi-UAV based on intelligent self-organized algorithm. Aerosp. Sci. Technol. 2018, 76, 402–411.
12. Chen, Y.B.; Yang, D.; Yu, J.Q. Multi-UAV Task Assignment with Parameter and Time-Sensitive Uncertainties Using Modified Two-Part Wolf Pack Search Algorithm. IEEE Trans. Aerosp. Electron. Syst. 2018, 54, 2853–2872.
13. Gerkey, B.P.; Mataric, M.J. Sold!: Auction methods for multirobot coordination. IEEE Trans. Robot. Autom. 2002, 18, 758–768.
14. Choi, H.L.; Brunet, L.; How, J.P. Consensus-Based Decentralized Auctions for Robust Task Allocation. IEEE Trans. Robot. 2009, 25, 912–926.
15. Luo, L.Z.; Chakraborty, N.; Sycara, K. Distributed Algorithms for Multirobot Task Assignment with Task Deadline Constraints. IEEE Trans. Autom. Sci. Eng. 2015, 12, 876–888.
16. Farinelli, A.; Iocchi, L.; Nardi, D. Distributed on-line dynamic task assignment for multi-robot patrolling. Auton. Robot. 2016, 41, 1–25.
17. Oh, G.; Kim, Y.; Ahn, J.; Choi, H.L. Market-Based Task Assignment for Cooperative Timing Missions in Dynamic Environments. J. Intell. Robot. Syst. 2017, 87, 1–27.
18. Moore, B.J.; Passino, K.M. Distributed Task Assignment for Mobile Agents. IEEE Trans. Autom. Control 2007, 52, 749–753.
19. Yang, H.Z.; Wang, Q. A multi-AUV dynamic task allocation method based on ant colony labor division model. Control Decis. 2021, 36, 1911–1919. (In Chinese)
20. Li, X.M.; Tang, J.Y.; Dai, J.J.; Bo, N. Dynamic Coalition Task Allocation of Heterogeneous Multiple Agents. J. Northwestern Polytech. Univ. 2020, 38, 1094–1104. (In Chinese)
21. Ni, J.J.; Tang, M.; Chen, Y.N.; Cao, W.D. An Improved Cooperative Control Method for Hybrid Unmanned Aerial-Ground System in Multitasks. Int. J. Aerosp. Eng. 2020, 2020, 1–14.
22. Yao, W.R.; Qi, N.M.; Wan, N.; Liu, Y.B. An iterative strategy for task assignment and path planning of distributed multiple unmanned aerial vehicles. Aerosp. Sci. Technol. 2019, 86, 455–464.
23. Zhan, C.; Zeng, Y. Aerial–Ground Cost Tradeoff for Multi-UAV-Enabled Data Collection in Wireless Sensor Networks. IEEE Trans. Commun. 2020, 68, 1937–1950.
24. Wang, R.R.; Wei, W.L.; Yang, M.C.; Liu, W. Task allocation of multiple UAVs considering cooperative route planning. Acta Aeronaut. Astronaut. Sin. 2020, 41, 24–35. (In Chinese)
25. Ismail, A.; Bagula, B.A.; Tuyishimire, E. Internet-Of-Things in Motion: A UAV Coalition Model for Remote Sensing in Smart Cities. Sensors 2018, 18, 2184.
26. Fu, X.W.; Feng, P.; Gao, X. Swarm UAVs Task and Resource Dynamic Assignment Algorithm Based on Task Sequence Mechanism. IEEE Access 2019.
27. Yan, F.; Zhu, X.P.; Zhou, Z.; Tang, Y. Real-time task allocation for a heterogeneous multi-UAV simultaneous attack. Scientia Sinica Inf. 2019, 49, 555–569. (In Chinese)
28. Chen, P.; Yan, F.; Liu, Z.; Cheng, G.D. Heterogeneous unmanned aerial vehicles task allocation with communication constraints. Acta Aeronaut. Astronaut. Sin. 2021, 42, 525844. (In Chinese)
29. Jia, G.W.; Wang, J.F. Research review of UAV swarm mission planning method. Syst. Eng. Electron. 2021, 43, 99–111. (In Chinese)
30. Wu, Q.B.; Zhou, S.L.; Liu, W.; Yin, G.Y. Multi-unmanned aerial vehicles cooperative search based on central-distributed model predictive control. Control Theory Appl. 2015, 32, 1414–1421. (In Chinese)
31. Tian, L.; Wang, M.Y.; Zhao, Q.L.; Wang, X.D.; Song, X.; Ren, Z. Distributed time-varying group formation tracking for cluster systems under switching topologies. Scientia Sinica Inf. 2020, 50, 408–423. (In Chinese)
32. Dong, X.W.; Li, Q.D.; Zhao, Q.L.; Ren, Z. Time-varying group formation analysis and design for second-order multi-agent systems with directed topologies. Neurocomputing 2016, 205, 367–374.
33. Su, F. Research on Distributed Online Cooperative Mission Planning for Multiple Unmanned Combat Aerial Vehicles in Dynamic Environment. Ph.D. Thesis, National University of Defense Technology, Changsha, China, 2013. (In Chinese)
34. Ma, D.L.; Chen, Y. Assessments of air-to-surface operational effectiveness for aircraft during combat sortie. J. Beijing Univ. Aeronaut. Astronaut. 2000, 26, 1413–1417. (In Chinese)
35. Wang, Y.X.; Zhang, T.; Cai, Z.H.; Zhao, J.; Wu, K. Multi-UAV coordination control by chaotic grey wolf optimization based distributed MPC with event-triggered strategy. Chin. J. Aeronaut. 2020, 33, 2877–2897.
36. Zhu, Y.; Zhang, T.; Song, J.Y. Study on the Local Minima Problem of Path Planning Using Potential Field Method in Unknown Environments. Acta Autom. Sin. 2010, 36, 1122–1130.
37. Fan, S.P.; Qi, Q.; Lu, K.F.; Wu, G.; Li, L. Autonomous Collision Avoidance Technique of Cruise Missiles Based on Modified Artificial Potential Method. Trans. Beijing Inst. Technol. 2018, 38, 828–834.
38. Ge, S.S.; Cui, Y.J. New potential functions for mobile robot path planning. IEEE Trans. Robot. Autom. 2000, 16, 615–620.

Figure 1. The hierarchical communication topology of UAV swarms with a distributed group.

Figure 2. (a) The detection area of reconnaissance payload; (b) the available area of attack payload.

Figure 3. The process of task assignments based on extended CNP.

Figure 4. The determination mechanism on cooperators.

Figure 5. The process of task assignment based on extended CNP.

Figure 6. The coupling of the UAV mission assignment and path planning.

Figure 7. The procedure for cooperative dynamic task assignment.

Figure 8. The communication topology G₀ of the UAV swarm.

Figure 9. The feasible area of payload: (a) the available detection area; (b) the available attack zone.

Figure 10. The initial setting of the situation between both sides.

Figure 11. (a) The execution sequence of the UAV swarm; (b) the diagram of the UAV swarm performing tasks.

Figure 12. The initial setting of the situation between both sides (considering unknown targets).

Figure 13. (a) The execution sequence of the UAV swarm; (b) the diagram of the UAV swarm performing tasks (considering unknown targets).

Figure 14. (a) The execution sequence of the UAV swarm; (b) the diagram of the swarm performing tasks under V5 failure.

Figure 15. Assignment time varies with (a) the number of subsets, (b) the number of UAVs of each subset, and (c) the number of targets.

Table 1. The table of capability parameters of UAVs.

UAV	Initial Position (m)	Maximum Air-Range (m)	Fuel Consumption Rate (m⁻¹)	Velocity (m/s)	Maximum Number of Tasks	Mission Capability: Reconnaissance	Mission Capability: Attack
V1	(2500,20,200)	10,000	0.01	120	2	√
V2	(0,20,200)	10,000	0.03	80	2	√	√
V3	(5000,20,200)	10,000	0.03	80	2	√	√
V4	(500,20,200)	10,000	0.01	120	2	√
V5	(1000,20,200)	10,000	0.01	120	2	√
V6	(1500,20,200)	10,000	0.02	100	2		√
V7	(2000,20,200)	10,000	0.02	100	2		√
V8	(4500,20,200)	10,000	0.01	120	2	√
V9	(4000,20,200)	10,000	0.01	120	2	√
V10	(3500,20,200)	10,000	0.02	100	2		√
V11	(3000,20,200)	10,000	0.02	100	2		√

Table 2. The parameter table of UAV reconnaissance payload.

Flight Height H/m	Operating Distance R_s,max/m	Azimuth Search Angle φ_max/°	Pitch Search Angle ϕ_max/°	Mounting Angle α/°
200	500	45	30	30

Table 3. The parameter table of UAV attack load.

Flight Velocity V/(m/s)	Minimum Launch Distance R_a,min/m	Maximum Off-Boresight Angle φ_a,max/°	Maximum Operating Range of Guidance Equipment d_max/m	Maximum Horizontal Detection Angle of Guidance Equipment ±ϕ_a,max/°	Aiming Time of Guidance Equipment t_a/s
100	80	60	30	±60	0.2

Table 4. The table of information on known targets.

Target	Coordinate (m)	Reconnaissance S	Attack A	Time Window of S: Earliest (s)	Time Window of S: Latest (s)	Time Window of A: Earliest (s)	Time Window of A: Latest (s)
T1	(538.4, 3516.3, 0)	√	√	73.7	93.7	98.7	123.7
T2	(1497.9, 3788.2, 0)	√	√	117.5	137.5	142.5	167.5
T3	(2396.8, 4602.3, 0)	√	√	150.3	170.3	175.3	200.3
T4	(4005.0, 4287.9, 0)	√	√	193.8	213.8	218.8	243.8
T5	(3359.0, 4171.4, 0)	√	√	249.6	269.6	274.6	299.6
T6	(1949.3, 3865.2, 0)	√	√	167.0	187.0	192.0	217.0

Table 5. The task assignment of the UAV swarm.

UAV	Air-Range Estimation (m)	The Number of Tasks	The Execution Planning
V1	0	0	None
V2	3524.00	1	(T1, A, 98.7, 79.52)
V3	0	0	None
V4	3486.00	1	(T1, S, 73.7, 65.14)
V5	4230.00	2	(T2, S, 117.5, 62.14)→(T6, S, 167.0, 59.20)
V6	3756.52	1	(T2, A, 142.5, 74.87)
V7	3837.18	1	(T6, A, 192.0, 73.23)
V8	4298.74	1	(T5, S, 249.6, 57.04)
V9	6475.10	2	(T3, S, 150.3, 51.47)→(T4, S, 193.8, 57.34)
V10	4477.55	1	(T4, A, 218.8, 60.51)
V11	5702.98	2	(T3, A, 175.3, 56.75)→(T5, A, 274.6, 66.90)
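
The assignment in Table 5 can be cross-checked against the limits in Table 1 and the coverage claim in the conclusions. The following is a minimal sketch, not the authors' implementation: the names `check_plan`, `PLAN`, and `CAPABILITY` are hypothetical, the third element of each tuple is assumed to be the scheduled time of the subtask (it coincides with the first bound of the corresponding time window in Table 4), and the fourth element of each Table 5 tuple is omitted.

```python
# Data transcribed from Tables 1 and 5; the checks are illustrative assumptions.
MAX_TASKS = 2          # "Maximum Number of Tasks" in Table 1
MAX_RANGE_M = 10_000   # "Maximum Air-Range (m)" in Table 1
KNOWN_TARGETS = ["T1", "T2", "T3", "T4", "T5", "T6"]

# Capability per UAV from Table 1: "S" = reconnaissance, "A" = attack.
CAPABILITY = {
    "V1": {"S"}, "V2": {"S", "A"}, "V3": {"S", "A"}, "V4": {"S"},
    "V5": {"S"}, "V6": {"A"}, "V7": {"A"}, "V8": {"S"},
    "V9": {"S"}, "V10": {"A"}, "V11": {"A"},
}

# Table 5: UAV -> (air-range estimate in m, [(target, task type, time in s)]).
PLAN = {
    "V1": (0.0, []),
    "V2": (3524.00, [("T1", "A", 98.7)]),
    "V3": (0.0, []),
    "V4": (3486.00, [("T1", "S", 73.7)]),
    "V5": (4230.00, [("T2", "S", 117.5), ("T6", "S", 167.0)]),
    "V6": (3756.52, [("T2", "A", 142.5)]),
    "V7": (3837.18, [("T6", "A", 192.0)]),
    "V8": (4298.74, [("T5", "S", 249.6)]),
    "V9": (6475.10, [("T3", "S", 150.3), ("T4", "S", 193.8)]),
    "V10": (4477.55, [("T4", "A", 218.8)]),
    "V11": (5702.98, [("T3", "A", 175.3), ("T5", "A", 274.6)]),
}


def check_plan(plan, capability=CAPABILITY, targets=KNOWN_TARGETS):
    """Return a list of violated constraints (empty if the plan passes)."""
    problems = []
    when = {}  # (target, task type) -> scheduled time
    for uav, (air_range, tasks) in plan.items():
        if len(tasks) > MAX_TASKS:
            problems.append(f"{uav}: more than {MAX_TASKS} tasks")
        if air_range > MAX_RANGE_M:
            problems.append(f"{uav}: air range {air_range} m exceeds limit")
        for target, kind, t in tasks:
            if kind not in capability[uav]:
                problems.append(f"{uav}: lacks capability {kind} for {target}")
            when[(target, kind)] = t
    for target in targets:
        s, a = when.get((target, "S")), when.get((target, "A"))
        if s is None or a is None:
            problems.append(f"{target}: not covered by both S and A")
        elif s >= a:
            problems.append(f"{target}: attack does not follow reconnaissance")
    return problems


print(check_plan(PLAN))  # prints []
```

Under these assumptions the Table 5 plan passes every check: each known target receives one reconnaissance and one attack subtask in that order, no UAV exceeds two tasks or the 10,000 m air-range limit, and every subtask is given to a UAV with the matching payload.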

Table 6. The table of information on unknown targets.

Target	Coordinates (m)	Reconnaissance S	Attack A
T7	(3467.7, 2897.7, 0)	√	√
T8	(1307.7, 3057.7, 0)	√	√

Table 7. Information of unknown targets and corresponding subtasks.

Target	Time of Discovery (s)	Discoverer	Subset to Which the Discoverer Belongs	Time Window of S: Earliest (s)	Time Window of S: Latest (s)	Time Window of A: Earliest (s)	Time Window of A: Latest (s)
T7	237.9	V8 (S)	2	271.0	291.0	291.0	316.0
T8	109.1	V5 (S)	1	155.8	175.8	175.8	200.8

Table 8. The task assignment of the UAV swarm.

UAV	Air-Range Estimation (m)	The Number of Tasks	The Execution Planning
V1	0	0	None
V2	3524.00	1	(T1, A, 98.7, 79.52)
V3	3262.79	1	(T7, A, 291.0, 84.77)
V4	4366.55	2	(T1, S, 73.7, 65.14)→(T8, S, 155.8, 32.90)
V5	4230.00	2	(T2, S, 117.5, 62.14)→(T6, S, 167.0, 59.20)
V6	4502.80	2	(T2, A, 142.5, 74.87)→(T8, A, 175.8, 135.13)
V7	3837.18	1	(T6, A, 192.0, 73.23)
V8	5747.08	2	(T5, S, 249.6, 57.04)→(T7, S, 271.0, 27.42)
V9	6475.10	2	(T3, S, 150.3, 51.47)→(T4, S, 193.8, 57.34)
V10	4477.55	1	(T4, A, 218.8, 60.51)
V11	5702.98	2	(T3, A, 175.3, 56.75)→(T5, A, 274.6, 66.90)

Table 9. The bidding values of potential bidders for dynamic subtasks.

Subtask	Bidding Value of V2 (Subset 1)	Bidding Value of V4 (Subset 1)	Bidding Value of V3 (Subset 2)	Bidding Value of V8 (Subset 2)
T2 (S)	10.72	30.88	−106.49	−48.62
T6 (S)	−6.74	21.83	−2.77	52.89

Table 10. Task assignment under the circumstance of UAV breakdown.

Subtask	Bidding Winner	Winning Bidding Value	Start Time (s)
T2 (S)	V4 (Subset 1)	30.88	117.5
T6 (S)	V8 (Subset 2)	52.89	167.0
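
The winners in Table 10 are consistent with awarding each subtask to the bidder with the highest value in Table 9. The sketch below only reproduces that final award step; the names `BIDS`, `SUBSET_OF`, and `award` are hypothetical, and the full extended CNP in the paper also involves the cooperator-determination and sequential-task-selection mechanisms, which are not modelled here.

```python
# Bidding values transcribed from Table 9 (subtasks released after the V5 failure).
BIDS = {
    "T2 (S)": {"V2": 10.72, "V4": 30.88, "V3": -106.49, "V8": -48.62},
    "T6 (S)": {"V2": -6.74, "V4": 21.83, "V3": -2.77, "V8": 52.89},
}

# Subset membership of each potential bidder, as given in the Table 9 header.
SUBSET_OF = {"V2": 1, "V4": 1, "V3": 2, "V8": 2}


def award(bids, subset_of=SUBSET_OF):
    """Award each subtask to its highest bidder (assumed award rule)."""
    winners = {}
    for subtask, offers in bids.items():
        winner = max(offers, key=offers.get)  # UAV with the largest bid
        winners[subtask] = (winner, subset_of[winner], offers[winner])
    return winners


for subtask, (uav, subset, value) in award(BIDS).items():
    print(f"{subtask}: {uav} (Subset {subset}), bid {value}")
```

Run on the Table 9 values, this selects V4 (Subset 1, 30.88) for T2 (S) and V8 (Subset 2, 52.89) for T6 (S), matching Table 10.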

Table 11. The simulation cases to be compared.

Case	Number of Subsets	Number of UAVs of Each Subset	Number of Targets	The Size of the UAV Swarm
(a)	5, 10, 15, 20, 25	10	10	50, 100, 150, 200, 250
(b)	5	20, 40, 60, 80, 100	10	100, 200, 300, 400, 500
(c)	5	10	5, 10, 15, 20, 25	50

Table 12. The real-time performance between the proposed approach and ACO in reference [10].

The Size of the UAV Swarm	The Number of Targets	Total Solving Time of the CNP-Based Approach (s)	Total Solving Time of ACO, 50 Iterations (s)
50	10	1.66	41.5
100	10	1.70	59.8
150	10	1.85	74.9
200	10	1.93	96.2
250	10	1.98	112.5
150	15	2.03	88.2
150	20	3.50	124.1
150	25	5.30	179.0
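
To make the comparison in Table 12 easier to read, the solving times can be turned into ratios. This is an illustrative calculation on the tabulated values only; the names `ROWS`, `speedups`, and `growth` are hypothetical and no new measurements are introduced.

```python
# Rows of Table 12: (swarm size, targets, CNP time in s, ACO time in s).
ROWS = [
    (50, 10, 1.66, 41.5),
    (100, 10, 1.70, 59.8),
    (150, 10, 1.85, 74.9),
    (200, 10, 1.93, 96.2),
    (250, 10, 1.98, 112.5),
    (150, 15, 2.03, 88.2),
    (150, 20, 3.50, 124.1),
    (150, 25, 5.30, 179.0),
]


def speedups(rows):
    """Ratio of ACO solving time to CNP solving time for each row."""
    return [(size, targets, aco / cnp) for size, targets, cnp, aco in rows]


def growth(rows, column, targets=10):
    """Time at the largest swarm divided by time at the smallest swarm,
    for a fixed number of targets (column 2 = CNP, column 3 = ACO)."""
    subset = sorted(r for r in rows if r[1] == targets)
    return subset[-1][column] / subset[0][column]


for size, targets, ratio in speedups(ROWS):
    print(f"{size:>3} UAVs, {targets:>2} targets: ACO/CNP = {ratio:.1f}")
print(f"CNP growth, 50 to 250 UAVs: {growth(ROWS, 2):.2f}x")
print(f"ACO growth, 50 to 250 UAVs: {growth(ROWS, 3):.2f}x")
```

On these figures the CNP-based approach is between roughly 25 and 57 times faster than ACO in every row. With 10 targets, growing the swarm from 50 to 250 UAVs raises its solving time by about 1.19 times, against about 2.71 times for ACO, which is in line with the stated low sensitivity to swarm size.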

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

