Dynamic Task Allocation of Swarm Airdrop Based on Multi-Transport Aircraft Cooperation

Jiang, Bing; Qin, Kaiyu; Wu, Yu

doi:10.3390/sym18050720

Open AccessArticle

Dynamic Task Allocation of Swarm Airdrop Based on Multi-Transport Aircraft Cooperation

by

Bing Jiang

^1,2,

Kaiyu Qin

¹ and

Yu Wu

^1,*

¹

School of Aeronautics and Astronautics, University of Electronic Science and Technology of China, Chengdu 611731, China

²

AVIC Chengdu Aircraft Design and Research Institute, Chengdu 610041, China

^*

Author to whom correspondence should be addressed.

Symmetry 2026, 18(5), 720; https://doi.org/10.3390/sym18050720

Submission received: 22 March 2026 / Revised: 16 April 2026 / Accepted: 22 April 2026 / Published: 24 April 2026

(This article belongs to the Special Issue Cooperative Control of Multi-Agent Systems over Signed Networks: Symmetry and Asymmetry Perspectives)

Download

Browse Figures

Versions Notes

Abstract

The cooperative airdrop of UAV swarms by multiple transport aircraft creates a large-scale multi-agent planning problem. The mission involves heterogeneous aircraft, multi-visit airdrop areas, strict time windows, and threat-aware flight paths. To address these challenges, this work develops an integrated framework for both global task allocation and real-time replanning in complex three-dimensional operational environments. First, for the combinatorial optimization of task execution sequences across multiple aircraft, a static task assignment method is proposed. This method employs a Hybrid-encoding Constrained Black-winged Kite Algorithm (HCBKA), which incorporates optimization metrics such as mission execution time, completion rate, and load-balancing symmetry among aircraft. The HCBKA aims to find a task assignment scheme that achieves a comprehensive optimum across multiple objectives through efficient model solving. Second, to handle potential real-time dynamic changes during mission execution, a rapid-response and generalizable replanning mechanism is developed. This mechanism utilizes an event-triggered strategy based on a Time-window aware Dynamic Auction Algorithm (TDAA). It ensures that the system can promptly initiate and execute online task reallocation in response to contingencies such as changing mission requirements or losses within its own drone swarm, thus maintaining the adaptability and robustness of the overall plan. Simulation results show that the proposed framework produces high-quality global solutions and maintains strong robustness under dynamic changes. The approach provides an effective and scalable solution for coordinated multi-aircraft swarm airdrop missions.

Keywords:

swarm airdrop; task assignment; multi-transport aircraft cooperation; hybrid meta-heuristic algorithm; auction algorithm

1. Introduction

Unmanned aerial vehicles (UAVs) have rapidly evolved over the past decade, driven by their low cost, ease of deployment, and high operational flexibility. These advantages have enabled UAVs to play an increasingly important role in civilian domains such as logistics [1], environmental monitoring [2], emergency response [3], and infrastructure inspection [4]. As UAV technologies mature, the focus of research has gradually shifted from single-platform autonomy to the coordinated operation of UAV swarms. Compared with individual UAVs, swarm systems offer superior spatial coverage, enhanced robustness, and emergent collective intelligence, making them suitable for demanding applications such as wide-area surveillance [5], cooperative search and rescue [6], and distributed sensing [7]. In military contexts, UAV swarms have demonstrated transformative potential by enabling rapid situational awareness, resilient distributed strike capabilities, and high-density penetration of contested airspace [8].

Parallel to the development of UAV swarms, transport aircraft have long served as strategic assets for rapid delivery of personnel, equipment, and humanitarian supplies [9]. Airdrop operations have been widely adopted due to their ability to bypass damaged or inaccessible ground infrastructure [10]. Inspired by these advantages, this research began to explore the concept of deploying UAV swarms directly from transport aircraft. This operational paradigm integrates the long-range projection capability of transport aircraft with the distributed intelligence of UAV swarms, enabling rapid force insertion beyond enemy defensive perimeters, improved survivability through stand-off deployment, and flexible mission execution across large-scale, threat-intensive environments. In such missions, multiple transport aircraft must cooperatively visit dispersed airdrop areas while avoiding air-defense threats, satisfying strict time windows, and maximizing the overall mission effectiveness. These characteristics naturally lead to a complex multi-agent planning problem that couples heterogeneous aircraft performance, spatial–temporal constraints, and dynamic battlefield uncertainties.

Motivated by these challenges, this work investigates the coordinated airdrop of UAV swarms using multiple transport aircraft and develops a unified framework for both static global planning and dynamic online replanning. The main contributions of this paper are summarized as follows:

1.: A comprehensive mission modeling framework is established for multi-transport-aircraft swarm airdrop operations. The model captures heterogeneous aircraft capabilities, multi-visit mission areas, time-window constraints, and threat-aware flight paths, forming a realistic and high-fidelity representation of contested operational environments.
2.: A Hybrid-encoding Constrained Black-winged Kite Algorithm is proposed to solve the allocation problem of multi-objective airdrop transportation tasks. By jointly optimizing task allocation and execution sequences through mixed discrete-continuous encoding, constraint-driven repair, and hybrid attack–migration search dynamics, HCBKA achieves high-quality global planning under complex operational constraints.
3.: A Time-window aware Dynamic Auction Algorithm is developed to address real-time mission disturbances such as aircraft attrition, emergent tasks, and newly appearing mission areas. Through two-level aircraft screening and a composite bid function, TDAA enables rapid, minimally disruptive task reallocation while preserving time-window feasibility and mission continuity.

Beyond these specific contributions, the proposed framework is intrinsically connected to symmetry principles that permeate both the problem structure and the solution methodology. First, load-balancing symmetry is explicitly encoded in the objective function, where we seek to achieve a balanced distribution of tasks among heterogeneous transport aircraft—a form of equitable resource allocation that prevents any single asset from becoming a bottleneck or a point of failure. Second, structural symmetry appears in the hybrid encoding scheme: the discrete assignment variables and continuous sequencing variables form a complementary pair that together define a complete solution space, reflecting a duality between task-to-agent mapping and temporal ordering. Third, behavioral symmetry is embedded in the HCBKA algorithm through its attack–migration dual-mode search strategy, where exploitation and exploration are balanced in a symmetric manner to navigate complex constrained landscapes. Fourth, the TDAA algorithm exhibits operational symmetry through its two-level screening mechanism, which treats spatial proximity and temporal feasibility as symmetric filters for candidate aircraft selection. Together, these symmetry considerations not only enhance the interpretability of our framework but also contribute to its efficiency and robustness, making the work naturally aligned with the thematic scope of symmetry.

2. Related Works

The coordinated airdrop of swarm UAVs by multiple transport aircraft represents a complex multi-agent mission planning problem that integrates heterogeneous platform capabilities, spatial–temporal constraints, and dynamic operational environments. From a modeling perspective, this problem can be naturally abstracted as a variant of the Multiple Traveling Salesman Problem (MTSP), where multiple agents depart from a common base, visit a set of mission points under various constraints, and jointly optimize global mission performance. Consequently, existing research on MTSP and its variants provides an important theoretical foundation for multi-aircraft cooperative airdrop task allocation.

Early studies primarily focused on analytical and exact optimization methods. Mixed-integer programming (MIP/MILP) formulations have been widely adopted to capture the structural properties of MTSP with strict mathematical rigor. Murray et al. developed a MILP framework for multi-agent coordination and demonstrated its effectiveness on small-scale instances using commercial solvers [11]. Kitjacharoenchai et al. introduced the MTSPD model, a variant incorporating UAV operations, and solved it using IBM-CPLEX [12]. Bérczi et al. further advanced the theoretical frontier by proposing a unified approximation framework for multi-visit MTSP variants, achieving constant-factor guarantees through constrained forest construction and transportation-based lower bounds [13]. Park et al. extended MILP modeling to heterogeneous UAV systems, integrating platform performance parameters and multi-resource constraints into a unified optimization structure [14]. Although analytical methods provide globally optimal or theoretically bounded solutions, their computational cost grows exponentially with problem size, limiting their applicability in large-scale or real-time airdrop missions.

To address scalability limitations, research attention has shifted toward metaheuristic algorithms, which offer strong global search capabilities and flexibility in handling complex constraints. Murray proposed the Adaptive Insertion (ADI) heuristic, which iteratively improves initial solutions through removal-insertion operators combined with evolutionary updates [11]. Lim et al. introduced a hybrid search strategy with neighborhood reduction, significantly improving efficiency on large benchmark instances [15]. Zheng et al. developed an iterative two-stage heuristic combining fuzzy clustering, greedy initialization, and variable neighborhood search, achieving state-of-the-art performance on both minsum and minmax MTSP variants [16]. He and Hao proposed a memetic search algorithm capable of solving both single-depot and multi-depot minmax MTSP with high-quality solutions [17]. Ergüven et al. presented the Relative Distances Approach (RDA), which constructs solutions based on relative insertion costs and improves them through task exchange and path reversal [18]. Additional metaheuristic frameworks, such as two-stage iterative local search [19], heuristic MEC-aware routing [20], multi-start tabu search [21], and clustering-based multi-strategy search [22], further demonstrate the versatility of metaheuristics in solving large-scale MTSP variants with heterogeneous agents, multi-visit requirements, and time windows. Despite their strong performance, metaheuristics often rely on manually designed operators and require parameter tuning, which limits their adaptability to rapidly changing mission environments.

With the rise of data-driven optimization, learning-based MTSP solvers have emerged as a promising direction. Reinforcement learning (RL) and graph neural networks (GNNs) have been applied to learn task allocation and route construction strategies directly from data. Nayak and Rathinam proposed an RL-based solver for Dubins MinMax MTSP using distributed GNN policies [23]. Hu et al. modeled MTSP route construction as a sequential decision process and applied Q-learning to learn node selection strategies [24]. Liang et al. introduced a Learn-to-Branch framework using bipartite GNNs to accelerate MILP solving for MTSP and its variants [25]. Gao et al. proposed an attention-based multi-agent RL method with Transformer-based feature extraction and coordinated action selection, achieving strong generalization across different problem sizes [26]. Ma et al. developed an end-to-end DRL framework that jointly learns task assignment and route generation through modular agents and pointer-network-based solvers [27]. While learning-based methods offer high inference speed and potential generalization, they require extensive training data and still face challenges when constraints or mission scales deviate significantly from the training distribution.

In real-world airdrop operations, mission environments are highly dynamic, with evolving threat zones, changing task requirements, and potential platform failures. This has motivated research on dynamic MTSP and online replanning. Garn proposed an online balanced replanning method using incremental insertion and local exchange to maintain load balance under dynamic task updates [28]. Groba et al. integrated trajectory prediction into evolutionary search, enabling GA-based solvers to anticipate future target motion and reduce replanning oscillations [29]. Huang et al. introduced a hierarchical online planning framework combining block-level TSP decomposition with distributed multi-agent RL for large-scale dynamic coverage tasks [30]. Sariel Talay et al. developed a distributed dynamic task allocation framework using incremental task selection, heuristic evaluation, and contract-net-based coordination, enabling robust operation under uncertainty and agent failures [31]. These dynamic approaches highlight the importance of fast, incremental, and robust replanning mechanisms for time-critical missions such as coordinated airdrop.

Existing methods for MTSP and its variants fall into four categories: analytical optimization, metaheuristics, learning-based approaches, and dynamic replanning. Each category has notable limitations. Analytical methods are not scalable to large problems. Metaheuristics rely on manually designed operators and lack adaptability. Learning-based methods require extensive training and fail under unseen constraints. Dynamic replanning methods often sacrifice global optimality and ignore heterogeneous platform capabilities or strict time windows. None of these categories simultaneously address heterogeneous aircraft, multi-visit task areas, strict time windows, threat avoidance, and real-time contingency handling.

The proposed framework directly fills this gap. Its novelty and importance are fourfold. First, the problem formulation models a heterogeneous fleet of transport aircraft with different range, payload, altitude, and speed limits. It also captures multi-visit requirements for each mission area, differentiated task priorities, and hard time windows that become soft during static planning but hard during dynamic replanning. Second, the Hybrid-encoding Constrained Black-winged Kite Algorithm (HCBKA) introduces a mixed discrete-continuous encoding that decouples task assignment from task sequencing. A constraint-driven hierarchical repair mechanism ensures feasibility, while an attack–migration dual-mode search balances exploration and exploitation. This design outperforms classical metaheuristics in both solution quality and convergence, as shown in our simulations. Third, the Time-window aware Dynamic Auction Algorithm (TDAA) provides rapid online replanning under three types of disturbances: aircraft loss, new tasks in existing areas, and entirely new mission areas. Unlike most dynamic methods, TDAA enforces time windows as hard constraints during replanning, reflecting the tight time margins after a disturbance. Its two-level screening and composite bid function jointly optimize task priority, schedule disruption, and route cost. Fourth, the framework integrates offline global planning with online reactive replanning in a unified architecture, offering both high-quality initial solutions and robust adaptability to unforeseen events.

The importance of this research lies in its potential to enable real-world multi-transport-aircraft swarm airdrop missions. By providing a scalable, constraint-aware, and dynamically responsive solution, the proposed framework can help military operators plan and execute complex airdrop operations more effectively. Thus, this work not only advances the state of the art in MTSP research but also offers a practical tool for coordinated swarm deployment.

3. Modeling of the Airdrop Task Assignment Problem

3.1. Modeling of the Airdrop Mission Scenario

The multi-transport-aircraft-coordinated swarm airdrop mission investigated in this section occurs within a three-dimensional battlespace containing air defense threats. For the purpose of this study, the mission is confined to a three-dimensional airspace measuring X km

\times Y

km

\times Z

km. This airspace contains a base located at

(x_{b a s e}, y_{b a s e}, 0)

, multiple dispersed mission areas, and several enemy air defense threat zones, as illustrated in Figure 1. The dashed lines with different colors in the figure represent the routes of different aircraft. The transport aircraft, all departing from the base, aim to visit the various mission areas to deploy drone swarms while avoiding threat zones and minimizing cost.

Mathematically, the mission scenario is formally defined as a quintuple

E = (B, U, A, O, T)

. Here,

B

denotes the base location vector,

U

is the set of transport aircraft,

A

is the set of mission areas,

O

represents the set of enemy air defense threat areas, and

T

is the set of airdrop task instances. The base

B

acts as the hub for all transport aircraft operations. Its spatial coordinates are fixed at

(x_{b a s e}, y_{b a s e}, 0)

, serving as both the origin and terminus for all mission execution. The transport aircraft set is defined as

U = U_{1}, U_{2}, \dots, U_{N_{u}}

, consisting of

N_{u}

heterogeneous aircraft. Each aircraft

U_{i}

(for

i = 1, 2, \dots, N_{u}

) is characterized by specific performance constraints, including its maximum range

R_{i}^{m a x}

, maximum payload capacity

Q_{i}^{m a x}

, cruise speed

v_{i}

, and permissible flight altitude range. These parameters collectively define the operational capabilities of each transport aircraft. The set of mission areas is

A = A_{1}, A_{2}, \dots, A_{N_{a}}

, containing

N_{a}

discretely located points. Each mission point

A_{i}

(for

i = 1, 2, \dots, N_{a}

) is associated with a unique coordinate

(x_{i}, y_{i}, z_{i})

within the three-dimensional airspace. The enemy air defense threats are modeled as a set of regular cuboids, denoted by

O = O_{1}, O_{2}, \dots, O_{N_{m}}

. Each threat zone

O_{j}

is defined by the coordinates of its geometric center

(x_{j}, y_{j}, z_{j})

and its spatial dimensions

(l_{j}, w_{j}, h_{j})

, which represent the effective coverage volume of the air defense system. To account for the necessity of maintaining a safe stand-off distance in actual operations, each threat cuboid is inflated by a safety buffer distance

d_{s a f e}

around its original boundaries, forming an expanded threat set

O^{'}

. Mathematically, this inflation corresponds to increasing each dimension of the cuboid, resulting in new dimensions of

(l_{j} + 2 d_{s a f e}, w_{j} + 2 d_{s a f e}, h_{j} + 2 d_{s a f e})

. This ensures that a minimum safe separation is maintained between the transport aircraft’s flight path and the actual threat boundary.

Furthermore, to simulate complex operational requirements, each mission area

A_{i}

(for

i = 1, 2, \dots, N_{a}

) may require multiple airdrops. This is formalized through the task instance set

T = T_{1}, T_{2}, \dots, T_{N_{t}}

. Each instance

T_{k}

(for

k = 1, 2, \dots, N_{t}

) is associated with a specific mission point

A_{i}

and inherits its spatial coordinates. However, each

T_{k}

has its own required access time window

[t_{k}^{s t a r t}, t_{k}^{e n d}]

and is assigned a distinct priority weight

p_{k}

based on operational demands. This modeling approach accurately captures the practical combat paradigm where a single geographical area may necessitate multiple, sequential deployments of drone swarms, thereby enhancing the model’s fidelity to real-world military scenarios.

3.2. Design of Decision Variables for Airdrop Tasks

The design of decision variables for the coordinated swarm airdrop task assignment optimization model aims to fully capture both the assignment of task instances to transport aircraft and the resulting task execution schedule. To this end, two core types of decision variables are introduced: task assignment variables and task sequencing variables. Together, they define the solution space. The underlying rationale is to decompose the complex combinatorial optimization problem into two interconnected yet distinct sub-problems, which facilitates the design of efficient optimization algorithms.

The task assignment variables employ a discrete integer encoding scheme to establish the mapping between task instances and transport aircraft. Specifically, for each task instance

T_{k}

(where

k = 1, 2, \dots, N_{t}

) in the set

T

, we define an assignment variable

a_{k} \in 0, 1, 2, \dots, N_{u}

. Here,

a_{k} = i

denotes that instance

T_{k}

is assigned to transport aircraft

U_{i}

, while

a_{k} = 0

indicates that

T_{k}

is unassigned. This representation provides an intuitive mapping between tasks and aircraft. Crucially, it accommodates scenarios where not all tasks can be assigned due to resource limitations, reflecting real-world operational constraints where demand may exceed available capacity.

The task sequencing variables utilize a continuous real-number encoding to determine the execution order of tasks allocated to the same aircraft. For each task instance

T_{k}

, a sequencing variable

s_{k} \in [0, 1)

is defined. This variable does not represent an absolute execution time or position, but rather encodes the relative order. During decoding, for all tasks assigned to a specific aircraft

U_{i}

, their execution sequence is determined by sorting their corresponding

s_{k}

values in ascending order. This continuous encoding strategy circumvents the combinatorial explosion inherent in discrete sequencing representations, thereby significantly reducing the search complexity for the optimizer.

The two variable sets are strongly coupled: the assignment variable

a_{k}

determines which aircraft executes a task, while the sequencing variable

s_{k}

specifies its order within that aircraft’s schedule. The design ensures that the values of

s_{k}

are constrained to the half-open interval

[0, 1)

and must be distinct for all tasks assigned to the same aircraft. This ensures a unique, unambiguous execution sequence for each aircraft’s assigned tasks. Based on this design, the solution space of the optimization problem is formally defined as

S = (a, s) | a \in Z^{N_{t}}, s \in R^{N_{t}}

, where

a = (a_{1}, a_{2}, \dots, a_{N_{t}})

is the vector of assignment variables and

s = (s_{1}, s_{2}, \dots, s_{N_{t}})

is the vector of sequencing variables.

3.3. Modeling of Transport Aircraft Performance Constraints

In coordinated airdrop mission planning, transport aircraft act as the deployment platforms for drone swarms. Their performance characteristics directly influence the feasibility and efficiency of task assignment. The fleet of transport aircraft, denoted by the set

U

, comprises

N_{u}

heterogeneous units. This heterogeneity is reflected in the distinct configurations of key performance parameters across aircraft, including maximum range, payload capacity, flight speed, and operational altitude limits.

The performance constraints for each transport aircraft

U_{i}

(where

i = 1, 2, \dots, N_{u}

) are characterized by the tuple

(R_{i}^{m a x}, Q_{i}^{m a x}, H_{i}^{m i n}, H_{i}^{m a x}, v_{i})

. The parameter

R_{i}^{m a x}

defines the maximum range, limiting the total spatial coverage for aircraft

U_{i}

in a single mission. Therefore, the total distance traveled by aircraft

U_{i}

, denoted

R_{i}

, must satisfy the range constraint:

R_{i} \leq R_{i}^{\max}, i = 1, 2, \dots, N_{u} .

(1)

As each aircraft can carry a limited number of swarm drones, the number of task instances it can execute is bounded. This leads to a payload capacity constraint for aircraft

U_{i}

:

Q_{i} \leq Q_{i}^{\max}, i = 1, 2, \dots, N_{u},

(2)

where

Q_{i}

is the count of task instances assigned to aircraft

U_{i}

, and

Q_{i}^{m a x}

represents its maximum payload capacity.

To satisfy operational requirements at various drop zones, transport aircraft must release drones from specified altitudes. Each aircraft has a feasible flight altitude interval, defined by a minimum safe altitude

H_{i}^{m i n}

and a maximum service ceiling

H_{i}^{m a x}

. Since some aircraft may be incapable of reaching the altitude required for a specific task, the model must incorporate altitude constraints. The instantaneous flight altitude

h_{i} (t)

of aircraft

U_{i}

must remain within its permissible interval for the entire mission duration:

H_{i}^{\min} \leq h_{i} (t) \leq H_{i}^{\max}, \forall t \in [0, t_{end}],

(3)

where

t_{end}

denotes the total mission completion time.

For motion modeling, we adopt a simplified constant-velocity assumption, where each aircraft is assumed to travel at a constant cruise speed

v_{i}

between waypoints. This simplification is justified for the cruise phase within the mission airspace, where aircraft typically maintain a stable flight state with minimal speed variation. Furthermore, to accommodate potential time window constraints at task points, the model allows aircraft to enter a holding pattern upon arrival in the vicinity of a target. During holding, an aircraft is still assumed to fly at a constant speed, defined as a fraction

η

(where

η \in (0, 1)

) of its cruise speed

v_{i}

.

Through this modeling, the transport aircraft set

U

constitutes a heterogeneous fleet subject to multiple physical constraints. The variation in performance parameters reflects the diverse and complementary operational capabilities present in real-world scenarios, thereby expanding the solution space for complex task assignment.

3.4. Modeling of Task Constraints

In multi-transport aircraft airdrop missions, accurately modeling mission areas and task instances is crucial for capturing operational demands. The set of task instances,

T = T_{1}, T_{2}, \dots, T_{N_{t}}

, is linked to the set of mission areas,

A

, through a mapping that creates a hierarchical task requirement structure.

A single mission area

A_{i} \in A

(for

i = 1, 2, \dots, N_{a}

) may correspond to multiple airdrop task instances, reflecting the common operational need for multiple deployments of drone swarms to the same location. The required number of visits

V_{i}

to area

A_{i}

is modeled as a discrete variable with a range

1, 2, \dots, N_{m a x}^{A}

, where

N_{m a x}^{A}

is the maximum allowable visits per area. This captures tactical requirements such as sustained operations or multi-wave deployments.

Each task instance

T_{k}

(for

k = 1, 2, \dots, N_{t}

) inherits the spatial coordinates

(x_{i}, y_{i}, z_{i})

of its associated mission area

A_{i}

but is defined by its own independent parameters.

A key parameter is the task priority weight

p_{k} \in 1, 2, \dots, P_{m a x}

, an integer where a higher value denotes greater importance. This discrete scale aligns with military command practice and establishes a clear hierarchy for the optimizer.

The time window constraint

[t_{k}^{s t a r t}, t_{k}^{e n d}]

defines the preferred execution interval for

T_{k}

. It imposes a constraint on its actual start time

τ_{k}

:

t_{k}^{s t a r t} \leq τ_{k} \leq t_{k}^{e n d} .

(4)

In the static global planning phase, this constraint is treated as a soft constraint. A task can be scheduled outside its time window, but any deviation reduces its contribution to the overall objective through a penalty mechanism. For tasks that do not have a specified time window, no temporal preference is applied. In the dynamic replanning phase, however, time windows are enforced as hard constraints because the remaining mission time is limited and schedule flexibility is severely reduced. Tasks without a specified time window are treated as having a soft temporal constraint, allowing schedule flexibility as long as they are completed.

To ensure efficient resource allocation, we impose a single-visit constraint: no transport aircraft

U_{i} \in U

may be assigned more than one task instance associated with the same mission area

A_{j} \in A

. This prevents inefficient revisits, which would increase total travel distance and time, lower asset utilization, and potentially create route conflicts. If a mission area requires subsequent visits, they must be performed by different aircraft. Mathematically, this is expressed as:

\sum_{T_{k} \in T A_{j}} I (U_{i} \to T_{k}) \leq 1, \forall U_{i} \in U, \forall A_{j} \in A,

(5)

where

T A_{j}

is the set of all task instances for area

A_{j}

, and

I (U_{i} \to T_{k})

is an indicator function equal to 1 if aircraft

U_{i}

is assigned to

T_{k}

, and 0 otherwise. This constraint prevents resource wastage and promotes cooperative load balancing within the fleet.

Through this modeling, the task instance set

T

represents a complex demand network defined by spatial associations, temporal (hard and soft) constraints, and differentiated priorities. This provides a precise and rich basis for the subsequent cooperative task assignment algorithm, ensuring generated plans are both tactically sound and operationally feasible.

3.5. Modeling of Airdrop Task Assignment Optimization Problem

This work formulates the coordinated swarm airdrop task assignment as a multi-objective optimization problem. The objective function is designed to balance key, often conflicting, operational goals while adhering to physical and tactical constraints. Based on operational analysis, mission performance is evaluated along three primary dimensions: (1) minimizing the total flight distance to enhance resource efficiency and reduce risk; (2) maximizing the degree of task completion to ensure high-priority tasks are fulfilled, thereby boosting overall effectiveness; and (3) maximizing the satisfaction of time window requirements to guarantee precise timing and tactical coordination. These dimensions represent the core criteria of the optimization problem.

3.5.1. Minimization of Total Flight Distance

Minimizing the total flight distance directly improves resource efficiency. In contested airspace, shorter distances reduce exposure to enemy air defenses, increasing survivability while lowering fuel consumption and cost. The total distance

R_{t o t a l}

is the sum of distances flown by all aircraft, calculated using a model that incorporates both cruise and loitering segments:

R_{t o t a l} = \sum_{i = 1}^{N_{u}} R_{i},

(6)

where

R_{i}

is the total distance for aircraft

U_{i}

. It is computed as the sum of several components derived from A* path planning and loitering:

R_{i} = R_{i}^{base} + R_{i}^{task} + R_{i}^{return} + R_{i}^{loiter},

(7)

Here,

R_{i}^{base}

is the distance from the base to the first task point,

R_{i}^{task}

is the total distance between consecutive task points, and

R_{i}^{return}

is the distance from the last task point back to the base. These three components are collision-free paths calculated by the A* algorithm applied to a discretized map with grid size

G_{size}

. The term

R_{i}^{loiter}

represents additional distance flown while holding to meet a task’s time window, calculated as:

R_{i}^{loiter} = η \cdot v_{i} \cdot \sum_{j = 1}^{Q_{i}} max (0, t_{j}^{s t a r t} - τ_{j}^{arrival})

(8)

where

τ_{j}^{arrival}

is the arrival time at task point

T_{j}

,

t_{j}^{s t a r t}

is the task’s window start time, and

η

is the loitering speed factor.

3.5.2. Maximization of Task Completion

Maximizing task completion focuses on accomplishing high-priority tasks. This is quantified by a weighted sum of assigned tasks. The task completion metric

D_{C}

is defined as:

D_{C} = \sum_{k = 1}^{N_{t}} p_{k} \cdot I (a_{k} \neq 0),

(9)

where

I (a_{k} \neq 0)

equals 1 if task

T_{k}

is assigned. For tasks with time windows, a penalty reduces the priority weight

p_{k}

if the task starts late:

p_{k}^{'} = p_{k} \cdot max (0, 1 - \frac{τ_{k} - t_{k}^{e n d}}{2}), for τ_{k} > t_{k}^{e n d},

(10)

where

p_{k}^{'}

is the discounted weight,

τ_{k}

is the actual start time, and

t_{k}^{e n d}

is the window’s end time. Together, the late penalty in the task completion metric and the reward in the time-window satisfaction metric implement a soft constraint formulation for time windows. A task that starts after its window closes is not discarded; instead, its effective priority is reduced. Similarly, a task that starts early receives a bonus. This design allows the optimizer to explore solutions where minor time-window violations may be traded for significant gains in other objectives, such as higher overall task completion or lower flight distance. Such flexibility is essential in static planning, where the goal is to find a globally efficient schedule even when demand exceeds capacity.

3.5.3. Maximization of Time Window Satisfaction

Maximizing time window satisfaction ensures tasks are executed within their required time frames, which is critical for coordination. The satisfaction degree

D_{T W}

is defined as the fraction of time-constrained tasks that start within their window, with a bonus for early completion:

\begin{matrix} D_{T W} = \frac{1}{N_{T W}} \sum_{k = 1}^{N_{t}} [I (t_{k}^{s t a r t} \leq τ_{k} \leq t_{k}^{e n d}) + 0.1 \cdot I (τ_{k} \leq \frac{t_{k}^{s t a r t} + t_{k}^{e n d}}{2})] \cdot I (a_{k} \neq 0), \end{matrix}

(11)

where

N_{T W}

is the number of tasks with time windows. The second indicator function grants a 10% bonus for tasks completed in the first half of their window, incentivizing early execution.

3.5.4. Composite Objective Function

The three objectives are combined into a single weighted aggregate function for optimization:

\begin{matrix} max F = w_{1} (1 - \frac{R_{t o t a l}}{R_{u p p e r}}) + w_{2} \frac{D_{C}}{D_{C_{u p p e r}}} + w_{3} D_{T W} \\ s . t . \{\begin{matrix} a_{k} \in {0, 1, \dots, N_{u}}, k = 1, 2, \dots, N_{t}, \\ s_{k} \in [0, 1), k = 1, 2, \dots, N_{t}, \\ \sum_{k = 1}^{N_{t}} I (a_{k} = i) \leq Q_{i}^{m a x}, i = 1, 2, \dots, N_{u}, \\ \sum_{T_{k} \in T_{A_{j}}} I (a_{k} = i) \leq 1, \forall i = 1, \dots, N_{u}, \forall j = 1, \dots, N_{a}, \end{matrix} \end{matrix}

(12)

The weight coefficients

w_{1}, w_{2},

and

w_{3}

sum to 1. The terms

R_{u p p e r}

and

D_{C_{u p p e r}}

are normalization constants defined as:

\{\begin{matrix} \begin{matrix} R_{u p p e r} = \sum_{i = 1}^{N_{u}} R_{i}^{\max}, \\ D_{C_{u p p e r}} = \sum_{k = 1}^{N_{t}} p_{k} . \end{matrix} \end{matrix}

(13)

The normalization constants are chosen as the sum of the maximum ranges across all aircraft and the sum of all task priorities, respectively. This design ensures that each objective term is dimensionless and scaled to a value between 0 and 1. The weight coefficients are set according to mission priorities: task completion is the primary objective; flight distance and time-window satisfaction are secondary but still important. Because all terms are normalized to the same scale, the influence of each weight is consistent and transparent. Mission planners may adjust these weights for different operational scenarios without affecting the validity of the normalization.

The constraints in Equation (12) can be interpreted as follows. The first and second constraint define the domains of decision variables

a_{k}

and

s_{k}

. The third constraint enforces the payload-capacity limit of each aircraft, ensuring that the number of tasks assigned to aircraft i satisfies

\sum_{k = 1}^{N_{t}} I (a_{k} = i) = Q_{i} \leq Q_{i}^{max}

. The fourth constraint guarantees that each area is visited only once, where

T_{A_{j}}

denotes the set of tasks associated with area

A_{j}

. The composite objective function is designed to embody a clear symmetry principle. It combines three objectives that include minimizing total flight distance, maximizing task completion, and maximizing the degree of time-window satisfaction. These objectives are treated as complementary and symmetric components of the overall mission design. None of them is assigned inherent priority, and their relative importance can be adjusted through the weight coefficients

w_{1}

,

w_{2}

and

w_{3}

. This structure ensures that the formulation remains balanced and that efficiency, effectiveness, and punctuality are jointly emphasized in a coherent manner. The model also incorporates a distributive symmetry across all aircraft. This feature prevents any single platform from carrying a disproportionate workload or from repeatedly visiting the same area. Such balanced task allocation is crucial in practical operations because it enhances robustness against unexpected losses and reduces the risk of creating a single point of failure that may arise from uneven distribution of responsibilities.

4. Multi-Transport-Aircraft-Coordinated Airdrop Task Assignment Method

4.1. Population Initialization Mechanism

In the HCBKA algorithm, the population initialization mechanism critically influences overall performance. A well-designed, problem-specific initialization strategy can significantly enhance the quality and diversity of the initial population compared to purely random initialization, providing a better starting point for the optimizer. For the multi-transport-aircraft-coordinated airdrop task assignment problem, which involves numerous complex constraints, purely random initial solutions often exhibit very low feasibility, leading to slow convergence and a high risk of becoming trapped in local optima.

The algorithm initializes a population of

N_{p o p}

individuals. Each individual represents a complete candidate solution, encoded as a pair of vectors

(a, s)

, where

a = (a_{1}, a_{2}, \dots, a_{N_{t}})

is the task assignment vector and

s = (s_{1}, s_{2}, \dots, s_{N_{t}})

is the task sequencing vector. HCBKA employs a heuristic initialization strategy that incorporates prior knowledge about the problem, considering factors such as task spatial distribution, priority weights, and heterogeneous aircraft capabilities.

4.1.1. Initialization of Assignment Variables ( $a_{k}$ )

A probabilistic model guides the assignment. The probability

P_{a s s i g n} (T_{k})

of assigning task

T_{k}

is determined by its normalized distance from the base

d_{f}

, its normalized altitude

h_{f}

, and its normalized priority

p_{f}

:

P_{a s s i g n} (T_{k}) = α - β \cdot d_{f} - γ \cdot h_{f} + δ \cdot p_{f},

(14)

where

α, β, γ, δ

are tuning coefficients,

d_{f} = \frac{| T_{k} - B |}{{max}_{i} | T_{i} - B |}

,

h_{f} = \frac{h_{k}}{{max}_{i} h_{i}}

, and

p_{f} = \frac{p_{k}}{{max}_{i} p_{i}}

. This model increases the likelihood of assigning tasks that are closer to the base, at lower altitudes, and of higher priority, reflecting practical operational logic. Once a task

T_{k}

is selected for assignment, a specific aircraft

U_{i}

is chosen based on a matching score

M_{i, k}

:

M_{i, k} = I (h_{k} \leq H_{i}^{m a x}) \cdot \frac{Q_{i}^{m a x} - Q_{i}^{c u r r e n t}}{\sum_{j = 1}^{N_{u}} (Q_{j}^{m a x} - Q_{j}^{c u r r e n t})},

(15)

where

I (h_{k} \leq H_{i}^{m a x})

is an indicator ensuring the aircraft can reach the required altitude, and

Q_{i}^{c u r r e n t}

is its current assigned task load. This score favors aircraft that satisfy the altitude constraint and have greater remaining capacity, promoting load balance.

4.1.2. Initialization of Sequencing Variables ( $s_{k}$ )

The sequencing variables are initialized uniformly at random:

s_{k} \sim U [0, 1)

. This provides a broad, unbiased search space for the sequence optimization. To ensure a valid, unambiguous execution order for the tasks assigned to each aircraft, the

s_{k}

values for tasks belonging to the same aircraft are normalized post-initialization to be distinct and uniformly spread within

[0, 1)

.

Despite this heuristic approach, the complexity of the problem constraints means the initial population will still contain many infeasible individuals. Common violations include exceeding aircraft range (

R_{i}^{m a x}

) or payload capacity (

Q_{i}^{m a x}

), failing altitude constraints, and having the same aircraft assigned multiple tasks in the same geographic area. Therefore, a constraint-driven repair mechanism is applied after initialization to correct these violations, transforming infeasible solutions into feasible ones and ensuring the subsequent search operates within the feasible region.

4.2. Constraint-Driven Population Repair Mechanism

The constraint-driven repair mechanism is essential for ensuring the feasibility of solutions. Given the complex constraints in the multi-transport-aircraft-coordinated airdrop problem, even a heuristic initialization yields many infeasible individuals. This mechanism systematically corrects such individuals, transforming them into feasible solutions while preserving their beneficial traits as much as possible, thereby providing a high-quality initial population for optimization.

A key innovation is the hierarchical repair framework, which processes constraints sequentially according to their importance and repair complexity: the single-visit constraint, the payload capacity constraint, the range constraint, and the altitude constraint. Finally, a global lightweight backfilling mechanism reassigns high-value tasks removed during earlier steps. This layered approach ensures systematic and efficient repair, avoiding conflicts between different repair actions. The framework consists of five key steps, detailed below.

4.2.1. Single-Visit Constraint Repair

This first step enforces that no aircraft visits the same mission area more than once. For each aircraft

U_{i}

(for

i = 1, \dots, N_{u}

), if multiple assigned tasks belong to the same area

A_{j}

(for

j = 1, \dots, N_{a}

), only the task with the smallest sequencing variable

s_{k}

is kept. This preserves the intended order from the encoding.

To choose which tasks to remove among duplicates, we evaluate the geometric path impact. The distance increment

Δ D_{k}

for removing task

T_{k}

is:

Δ D_{k} = | P_{prev} - P_{k} | + | P_{k} - P_{next} | - | P_{prev} - P_{next} |,

(16)

where

P_{prev}

,

P_{k}

, and

P_{next}

are the positions of the predecessor, the task itself, and the successor. Tasks with smaller

Δ D_{k}

are removed first to minimize disruption to the path geometry.

4.2.2. Payload Capacity Constraint Repair

This step handles cases where an aircraft’s assigned task count

Q_{i}

exceeds its capacity

Q_{i}^{m a x}

. Tasks are removed based on their range contribution per unit priority. For task

T_{k}

on aircraft

U_{i}

, the contribution

C_{i, k}

is:

C_{i, k} = \frac{Δ R_{i, k}}{p_{k}},

(17)

where

Δ R_{i, k}

is the reduction in total distance if

T_{k}

is removed. A lower

C_{i, k}

indicates that removing the task saves little distance relative to its priority. Therefore, tasks with the smallest

C_{i, k}

are removed iteratively until the capacity constraint (Equation (2)) is met, thus preserving high-benefit, high-priority tasks.

4.2.3. Range Constraint Repair

This step corrects violations where an aircraft’s total range

R_{i}

exceeds

R_{i}^{m a x}

. An iterative removal strategy is used, where each iteration computes a removal benefit

B_{i, k}

for each task

T_{k}

on the overloaded aircraft:

B_{i, k} = \frac{R_{i} - R_{i}^{∖ k}}{p_{k}},

(18)

Here,

R_{i}^{∖ k}

is the range after removing

T_{k}

(calculated via Equation (1)). A higher

B_{i, k}

means greater range savings per priority unit. The task with the highest

B_{i, k}

is removed each iteration until the range constraint is satisfied, thereby efficiently reducing range while prioritizing high-priority tasks.

4.2.4. Altitude Constraint Repair

This step addresses mismatches between a task’s required altitude

h_{k}

and an aircraft’s maximum altitude

H_{i}^{m a x}

. If

h_{k} > H_{i}^{m a x}

for a task assigned to

U_{i}

, the task is reassigned. Reassignment uses the matching score

M_{i, k}

from Equation (15), selecting the feasible aircraft with the highest score.

4.2.5. Global Lightweight Backfilling Mechanism

The final step attempts to reassign high-value tasks removed during the above repairs. A backfilling value

V_{k}

for a removed task

T_{k}

is calculated:

V_{k} = λ_{1} \cdot p_{k} - λ_{2} \cdot \frac{| T_{k} - B |}{max | T_{i} - B |} - λ_{3} \cdot \frac{h_{k}}{max h_{i}},

(19)

where

λ_{1}, λ_{2}, λ_{3}

are weight priority, distance, and altitude factors. Tasks with higher

V_{k}

are considered first.

To maintain efficiency, a restricted candidate strategy is used. For each task, we consider at most

K_{\max}

candidate aircraft, chosen based on the highest matching scores

M_{d, k}

. For each candidate aircraft, we consider at most

P_{\max}

insertion positions: the start, the end, and the position that minimizes the resulting path length increase

Δ D (pos)

. Formally:

C drones = arg max d \in U M_{d, k}, | C drones | \leq K \max,

(20)

C pos = 0, N_{d}, arg min pos Δ D (pos), | C pos | \leq P \max,

(21)

where

N_{d}

is the current number of tasks assigned to the candidate aircraft. This strategy balances reassignment quality with computational cost.

The proposed two-stage approach—initialization followed by repair—ensures both population diversity and solution feasibility, furnishing a high-quality initial population for the subsequent hybrid iterative update process.

4.3. Hybrid Black-Winged Kite Population Update Mechanism

The HCBKA algorithm incorporates a hybrid population update mechanism. It retains the core dual-mode search strategy (attack and migration) of the original Black-winged Kite Algorithm (BKA) while introducing key enhancements to handle mixed-variable types and multiple constraints effectively. This mechanism decouples the update processes for continuous and discrete variables, employs constraint-aware perturbations, and utilizes elite-guided search to navigate the complex constrained solution space efficiently.

Specifically, for the continuous sequencing variables, HCBKA adopts the continuous update strategy of BKA but with adaptive parameter tuning tailored to the task sequencing context. For the discrete assignment variables, a novel probabilistic perturbation model is designed, driven by aircraft performance matching and task priorities, ensuring all updates respect the problem constraints. This hybrid approach preserves the original algorithm’s exploratory strengths while leveraging problem-specific structure, offering an effective method for multi-objective optimization under complex constraints.

4.3.1. Attack Behavior

For the sequential variable

s_{k}

, HCBKA adopts the same attack behavior update pattern as the original BKA. The update formula is:

s_{k}^{i + 1} = \{\begin{matrix} ω \cdot s_{k}^{i} + n \cdot (1 + sin (r)) \cdot s_{k}^{i} & if p < r, \\ s_{k}^{i} \cdot [n \cdot (2 r - 1) + ω] & otherwise, \end{matrix}

(22)

where

s_{k}^{i}

and

s_{k}^{i + 1}

are the sequencing values for task

T_{k}

in generations i and

i + 1

,

r \in [0, 1]

is a uniform random number, and p is the attack selection threshold. The parameter

ω

is a nonlinear decay weight:

ω = sin (\frac{π i}{2 I t e r} + π) + 1

, with i being the current iteration and

I t e r

the maximum iterations. This weight promotes global exploration early on and shifts focus to local exploitation later. The convergence control factor n is:

n = 0.05 \cdot exp (- 2 \cdot {(i / I t e r)}^{2})

. It creates strong initial perturbations for exploration, which diminish smoothly for refined local search.

A probabilistic perturbation model governs the attack updates for assignment variables

a_{k}

, involving two main operations: changing a task’s assignment state and reassigning it. The base probability for perturbing a task’s assignment is:

P_{disturb} = δ_{base} + δ_{rate} \cdot ρ,

(23)

where

δ_{base}

and

δ_{rate}

are constants, and

ρ

is the population’s exploration rate, adjusted adaptively.

If task

T_{k}

is unassigned and selected for perturbation, it is assigned to a candidate aircraft. The candidate set

C assign

includes aircraft that meet the altitude requirement, have available capacity, and have not visited

T_{k}

’s mission area:

\begin{matrix} C_{a s s i g n} = {U_{i} \in U | h_{k} \leq H_{i}^{m a x} \land Q_{i}^{c u r r e n t} < Q_{i}^{m a x} \land ∄ T_{j} \in T_{A_{j}} s . t . a_{j} = i}, \end{matrix}

(24)

An aircraft

U_{i}

is chosen with probability proportional to its remaining capacity:

P (U_{i}) = \frac{Q_{i}^{m a x} - Q_{i}^{current}}{\sum U_{j} \in C_{assign} (Q_{j}^{m a x} - Q_{j}^{current})} .

(25)

If task

T_{k}

is already assigned and selected for perturbation, it is reassigned. With probability

P_{leader}

, it adopts the assignment from the current elite solution; otherwise (with probability

1 - P_{leader}

), it is randomly reassigned to another feasible aircraft. This elite guidance accelerates the spread of beneficial assignment patterns.

Parameter adaptation in attack behavior represents a key innovation of HCBKA. The exploration rate

ρ

is dynamically adjusted throughout the iterative process:

ρ = ρ_{m a x} - (ρ_{m a x} - ρ_{m i n}) \cdot {(\frac{i}{I t e r})}^{γ},

(26)

where

ρ_{\max}

,

ρ_{\min}

, and

γ

are constants. This ensures a shift from broad exploration to intensive exploitation, balancing global and local search.

4.3.2. Migration Behavior

The migration behavior in HCBKA is inspired by the long-distance seasonal migration of black-winged kites, serving as the primary mechanism for global exploration and escaping local optima. To enhance its effectiveness for the airdrop task assignment problem, this behavior incorporates an elite-guided reference system and an adaptive mutation strategy, significantly improving search efficiency within the complex solution space.

The update of the continuous sequencing variables

s_{k}

employs an elite-guided hybrid mutation strategy. For each individual, a reference solution is selected: with probability

P_{e l i t e}

, the global best individual is chosen; otherwise, a random individual from an elite pool serves as the reference. The update formula is bifurcated based on the relationship between an individual’s fitness

f_{i}

and the population’s average fitness

\bar{f}

.

When

f_{i} < \bar{f}

, a Cauchy mutation-driven update promotes exploration:

s_{k}^{i + 1} = s_{k}^{i} + κ \cdot C (0, 1) \cdot (s_{k}^{r e f} - s_{k}^{i}),

(27)

where

s_{k}^{r e f}

is the sequencing variable from the reference individual,

κ

is the mutation intensity coefficient, and

C (0, 1)

is a random number from the standard Cauchy distribution. The heavy-tailed property of the Cauchy distribution provides a strong capability to escape local optima, particularly beneficial for global exploration when solution quality is low.

When

f_{i} \geq \bar{f}

, a Lévy flight-guided update facilitates refined local search with global potential:

s_{k}^{i + 1} = s_{k}^{i} + λ \cdot L (ψ) \cdot (s_{k}^{i} - s_{k}^{r e f}),

(28)

where

λ

is the step size scaling factor,

L (ψ)

is the random step length from a Lévy flight, and

ψ

is the Lévy exponent. The long-jump characteristic of Lévy flight allows high-quality solutions to be finely tuned while maintaining exploratory capacity. The factor

λ

decays exponentially to sharpen convergence:

λ = λ_{0} \cdot e^{- \frac{i}{I t e r}},

(29)

where

λ_{0}

is the initial step size. This decay ensures increasing convergence precision in later iterations.

For the discrete assignment variables

a_{k}

, an intelligent, constraint-aware perturbation strategy is designed. Updates are based on a reference individual but are constrained by aircraft performance and task matching. For each task instance

T_{k}

, its assignment is updated to the reference value

a_{k}^{r e f}

with probability

P_{m i g r a t e}

, conditional on the assigned aircraft’s altitude capability:

a_{k}^{i + 1} = \{\begin{matrix} a_{k}^{r e f} & if h_{k} \leq H_{a_{k}^{r e f}}^{m a x} and r a n d () < P_{m i g r a t e}, \\ a_{k}^{i} & otherwise, \end{matrix}

(30)

where

H_{a_{k}^{r e f}}^{m a x}

is the maximum flight altitude of the aircraft corresponding to

a_{k}^{r e f}

. If the altitude constraint is violated, the algorithm defaults to a probabilistic reassignment strategy based on aircraft matching degree, guaranteeing all operations satisfy problem constraints.

The elite reference mechanism is pivotal for convergence. Reference individuals are selected from an elite pool based on fitness ranking. The size of this pool,

N_{e l i t e}

, adapts during the iterative process:

N_{e l i t e} = max (1, ⌊N_{p o p} (η_{m i n} + (η_{m a x} - η_{m i n}) \frac{i}{I t e r})⌋),

(31)

where

N_{p o p}

is the population size, and

η_{m i n}

and

η_{m a x}

are the lower and upper bounds of the elite proportion. This adaptive mechanism maintains a larger pool early on to foster diversity and progressively shrinks it to focus selection pressure and accelerate convergence.

4.4. HCBKA Algorithm Flow

Integrating the population initialization mechanism, the constraint-driven repair mechanism, and the hybrid update mechanism, the HCBKA algorithm establishes a complete framework for solving the multi-transport-aircraft-coordinated airdrop task assignment problem. By simulating the foraging and migratory behaviors of black-winged kites, the framework efficiently navigates the complex, constrained solution space. The overall workflow is illustrated in the flowchart of Figure 2. A detailed step-by-step description is also provided in the pseudocode of Algorithm 1.

The HCBKA algorithm proceeds according to the following steps.

Step 1: Input and parameter initialization. The algorithm receives the task set

T

, aircraft set

U

, base location

B

, threat zones

O

, maximum iterations

I t e r

, population size

N_{p o p}

, and weight coefficients

w_{1}, w_{2}, w_{3}

.

Step 2: Population initialization. A population of

N_{p o p}

individuals is generated using the heuristic strategy defined in Equations (14) and (15). This strategy considers task spatial distribution, priority, and aircraft capabilities to produce diverse and high-quality candidate solutions.

Step 3: Constraint repair. Each individual is processed by the constraint-driven repair mechanism described in Section 4.2. This ensures that all hard constraints are satisfied before further optimization.

Step 4: Fitness evaluation and elite selection. The fitness of each individual is computed using the scalarized objective function in Equation (12), which combines the three normalized objectives. The individual with the highest fitness is stored as the global best solution

(a^{*}, s^{*})

with fitness

F^{*}

.

Step 5: Main optimization loop. For each generation

t = 1

to

I t e r

, the following substeps are performed for every individual i in the population.

Step 5.1: Update adaptive parameters. The exploration rate

ρ

(Equation (26)) and the elite pool size

N_{e l i t e}

(Equation (31)) are recalculated based on the current generation number.

Step 5.2: Attack behavior. The sequencing variables s are updated using Equation (22). The assignment variables a are perturbed probabilistically according to Equations (23)–(25), with elite guidance.

Step 5.3: Migration behavior. A reference solution is selected from the elite pool. If the individual’s fitness is below the population average, a Cauchy mutation (Equation (27)) is applied to the sequencing variables. Otherwise, a Lévy flight update (Equation (28)) is used. The assignment variables are updated by copying from the reference solution if altitude constraints permit (Equation (30)).

Step 5.4: Constraint repair after update. The newly generated candidate solution is passed through the same repair mechanism as in Step 3 to guarantee feasibility.

Step 5.5: Fitness evaluation and greedy selection. The fitness of the candidate solution is computed. If it exceeds the current individual’s fitness, the individual is replaced. After processing all individuals, the global best solution is updated if a better fitness is found.

Step 6: Output. After completing all generations, the algorithm returns the optimal task assignment

a^{*}

, optimal task sequence

s^{*}

, and the corresponding fitness

F^{*}

.

Algorithm 1 HCBKA Algorithm Pseudocode

Require:: Task instance set $T$ , transport aircraft set $U$ , base location $B$ , threat zones $O$ , max iterations $I t e r$ , population size $N_{p o p}$ , weight coefficients $w_{1}, w_{2}, w_{3}$
Ensure:: Optimal task assignment $a^{*}$ , optimal task sequence $s^{*}$ , optimal fitness $F^{*}$
1:: Initialize population: $(a^{(i)}, s^{(i)}) {i = 1}^{N p o p}$ via heuristic rules (Equations (14) and (15))
2:: for $i = 1$ to $N_{p o p}$ do
3:: $(a^{(i)}, s^{(i)}) \leftarrow$ Repair $(a^{(i)}, s^{(i)})$ (Section 4.2)
4:: $F^{(i)} \leftarrow$ Fitness $(a^{(i)}, s^{(i)})$ (Equation (12))
5:: end for
6:: Identify elite $(a^{*}, s^{*})$ and best fitness $F^{*}$
7:: for $t = 1$ to $I t e r$ do
8:: $ρ \leftarrow$ Update exploration rate (Equation (26))
9:: $N_{e l i t e} \leftarrow$ Update elite pool size (Equation (31))
10:: for $i = 1$ to $N_{p o p}$ do
11:: {Attack Behavior}
12:: $s^{n e w} \leftarrow$ Update s (Equation (22))
13:: $a^{n e w} \leftarrow$ Update a (Equation (23)–(25))
14:: {Migration Behavior}
15:: if $r a n d () < P_{e l i t e}$ then
16:: $(a^{r e f}, s^{r e f}) \leftarrow$ Elite individual
17:: else
18:: $(a^{r e f}, s^{r e f}) \leftarrow$ Random elite
19:: end if
20:: if $F^{(i)} < \bar{F}$ then
21:: $s^{n e w} \leftarrow$ Cauchy mutation (Equation (27))
22:: else
23:: $s^{n e w} \leftarrow$ Lévy flight (Equation (28))
24:: end if
25:: $a^{n e w} \leftarrow$ Perturb a (Equation (30))
26:: $(a^{n e w}, s^{n e w}) \leftarrow$ Repair $(a^{n e w}, s^{n e w})$
27:: $F^{n e w} \leftarrow$ Fitness $(a^{n e w}, s^{n e w})$
28:: {Selection}
29:: if $F^{n e w} > F^{(i)}$ then
30:: $(a^{(i)}, s^{(i)}, F^{(i)}) \leftarrow (a^{n e w}, s^{n e w}, F^{n e w})$
31:: end if
32:: end for
33:: Update global best $(a^{*}, s^{*}, F^{*})$
34:: Output iteration statistics
35:: end for
36:: return $(a^{*}, s^{*}), F^{*}$

5. Dynamic Reallocation Method for Coordinated Airdrop Missions

The static task allocation method based on HCBKA provides a high-quality initial plan for multi-transport-aircraft-coordinated airdrop operations. However, real-world combat environments are highly uncertain and dynamic. During execution, unforeseen events—such as emergent threats, equipment failures, or changing mission requirements—can degrade or even invalidate the pre-computed static plan. Such disruptions may not only interrupt specific tasks but also compromise the entire scheduled mission sequence, leading to cascading negative effects on overall effectiveness.

To address these challenges, this section proposes a Time-window aware Dynamic Auction Algorithm (TDAA). Building upon the efficient sequential auction framework, TDAA accurately models and solves the coordinated airdrop reallocation problem through a coordinated structure that integrates a two-level screening mechanism and a two-phase auction process. The algorithm’s core innovations are systematic: (1) a two-level transport aircraft screening mechanism combining spatial proximity and temporal feasibility to ensure solution quality while boosting computational efficiency; (2) an intelligent task ordering strategy that prioritizes tasks based on both their priority and the urgency of their time windows, ensuring critical tasks are allocated first; and (3) a composite bid function balancing task priority, time window satisfaction, and route resource cost, enabling multi-objective optimization. Through intelligent task ordering, precise aircraft filtering, and utility-optimal assignment, TDAA establishes a responsive and practical reallocation solution for dynamic settings.

The following discussion first systematically analyzes typical reallocation trigger scenarios and their impact on the mission system. It then details the core mechanics and complete workflow of the TDAA algorithm, providing both theoretical foundation and a practical methodology for sustaining effective coordinated airdrop operations in dynamic environments.

5.1. Analysis of Reallocation Scenarios for Coordinated Swarm Airdrop Missions

Dynamic reallocation is triggered by unpredictable changes that undermine the optimality or feasibility of the initial HCBKA-based plan. To preserve mission effectiveness, these trigger scenarios must be precisely identified and formalized. Based on operational analysis, we categorize dynamic triggers into three typical scenarios: sudden loss of a transport aircraft, emergence of new task instances within existing mission areas, and appearance of new mission areas. Each event is defined formally below.

5.1.1. Sudden Attrition of a Transport Aircraft

An aircraft

U_{f}

(with

f \in {1, \dots, N_{u}}

) becomes unable to continue its mission due to factors such as enemy fire, mechanical failure, or severe weather. The current time is denoted

t_{current}

. The set of tasks that are affected and require reassignment is

T_{f}^{pending} = {T_{k} ∣ a_{k} = f \land τ_{k} > t_{current}}

, where

τ_{k}

is the planned start time of task

T_{k}

. Figure 3 illustrates this scenario.

5.1.2. New Task Instances in an Existing Area

For an existing mission area

A_{j}

that originally required

V_{j}

visits, a command decision may add an extra airdrop, creating a new instance

T_{new}

. This task shares the spatial coordinates of

A_{j}

but has its own priority

p_{new}

and time window

[t_{new}^{start}, t_{new}^{end}]

. Figure 4 shows this scenario.

5.1.3. Emergence of a New Mission Area

A completely new area

A_{N_{a} + 1}

appears at coordinates

(x_{new}, y_{new}, z_{new})

with a set of task instances

T_{new}

. Each task in

T_{new}

has its own priority and may have a time window. This event typically corresponds to an emergent threat that demands rapid response. Figure 5 illustrates this case.

5.1.4. Formal Definition of Event-Trigger Conditions

The TDAA algorithm is invoked immediately upon the occurrence of any of the above events. The trigger conditions are event-based, not periodic or time-based, ensuring prompt response without unnecessary computations during steady operation.

1.: Event type 1 (aircraft attrition). Let $U_{f}$ be the failed aircraft and $t_{current}$ the current time. The event is triggered when $U_{f}$ becomes unable to continue. The affected tasks are $T_{f}^{pending} = {T_{k} ∣ a_{k} = f and τ_{k} > t_{current}}$ .
2.: Event type 2 (new task instance). For an existing area $A_{j}$ , a new instance $T_{new}$ is added by command decision. The event is triggered at the moment the decision is made. The new task inherits the location of $A_{j}$ and carries its own priority and time window.
3.: Event type 3 (new mission area). A new area $A_{N_{a} + 1}$ appears with a set of tasks $T_{new}$ . The event is triggered when the area is detected and deemed operationally urgent.

In all three cases, the reallocation process starts immediately after the event is recognized. The composition of the new task set

T_{new}

varies by scenario: for aircraft attrition,

T_{new} = T_{f}^{pending}

; for a new instance in an existing area,

T_{new} = {T_{new}}

; for a new mission area,

T_{new}

contains all tasks of that area. This unified formulation provides a consistent basis for the TDAA algorithm across different dynamic disturbances.

5.2. Coordinated Airdrop Task Reallocation Method Based on the TDAA Algorithm

Addressing the specific demands of the dynamic reallocation scenarios outlined above, this section presents a Time-window aware Dynamic Auction Algorithm (TDAA). TDAA employs a cooperative optimization framework featuring two-level screening and a two-phase auction. Its sequential approach—spatial proximity pre-screening followed by temporal feasibility assessment—ensures solution feasibility while minimizing disruption to existing task schedules. A key strength of TDAA is its efficient resource matching, which rapidly identifies the most suitable aircraft and optimal insertion points for new tasks, enabling swift reallocation in dynamic settings. Furthermore, through intelligent task prioritization and a composite bid function, TDAA prioritizes high-value tasks for the best available resources while maintaining overall mission efficiency.

Underlying this design is a critical distinction between the static planning and dynamic replanning phases in how time-window constraints are enforced. In static planning, time windows are treated as soft constraints, allowing the algorithm to trade off small schedule deviations for better overall mission performance. In dynamic replanning, however, the situation is different. When an unexpected event occurs, the remaining time margin is often tight, and any further delay may cause critical tasks to miss their windows. Therefore, in the TDAA algorithm, time windows are enforced as hard constraints. Only aircraft that can reach a new task before the window closes are considered feasible candidates. This two-phase design reflects practical operational priorities: global efficiency in planning and strict temporal compliance in reactive adjustments.

5.2.1. Transport Aircraft Screening Mechanism

Efficient reallocation in the TDAA algorithm relies critically on its transport aircraft screening mechanism. Designed to balance solution quality with computational efficiency, this mechanism employs a two-tiered hierarchical filter. The first tier pre-screens candidates based on spatial proximity, leveraging the operational efficiency gained from geographic continuity in task execution. The second tier then assesses temporal feasibility to ensure all time constraints can be met. This structured approach drastically narrows the search space. Furthermore, to maximize overall assignment success, tasks that remain unassigned after this primary process enter a secondary, more relaxed screening round involving the entire available fleet.

The spatial proximity screening identifies aircraft that are naturally positioned to incorporate a new task with minimal additional travel. The underlying principle is that assigning geographically proximate tasks to the same aircraft reduces wasteful deadhead distance. For a given new task

T_{k} \in T_{n e w}

, the algorithm computes its distance to every scheduled task point across all aircraft. The distance from

T_{k}

to the j-th task

T d, j

in the sequence of aircraft

U_{d}

is defined as:

d_{k, d, j} = | p o s_{k} - p o s_{d, j} | + Δ d_{tolerance},

(32)

where

Δ d_{tolerance}

accounts for potential detours required to avoid known threats. The spatial proximity index

S_{d, k}

for aircraft

U_{d}

is then the minimum of these distances:

S_{d, k} = {min}_{j = 1}^{N_{d}} d_{k, d, j},

(33)

with

N_{d}

being the number of tasks currently assigned to

U_{d}

. The M aircraft with the smallest

S_{d, k}

values form the initial candidate set

C_{spatial} (T_{k})

. This pre-filtering step significantly reduces the pool of contenders for subsequent, more expensive computations.

The subsequent temporal feasibility screening evaluates whether candidates in

C_{spatial} (T_{k})

can physically reach the new task within its required timeframe. For tasks possessing a time window

[t_{k}^{start}, t_{k}^{end}]

, the algorithm calculates for each candidate aircraft

U_{d}

the direct flight time

t_{direct}

from its current position

P_{current} [d]

to the task location

p o s_{k}

:

t_{direct} = \frac{| P_{current} [d] - p o s_{k} |}{v_{d}} .

(34)

Aircraft are retained only if they can arrive before the window closes, allowing for a small planning tolerance

Δ t_{tolerance}

:

t_{current} + t_{direct} \leq t_{k}^{end} + Δ t_{tolerance} .

(35)

The aircraft satisfying this inequality constitute the refined, feasible candidate set

C_{feasible} (T_{k})

. For tasks without a specified time window, all spatially proximate candidates are considered feasible, i.e.,

C_{feasible} (T_{k}) = C_{spatial} (T_{k})

.

This two-stage screening produces a shortlist of aircraft that are both geographically convenient and temporally capable of executing the new task. However, if

C_{feasible} (T_{k})

is empty—meaning no candidate from the initial spatial filter can meet the time constraint—the task is flagged for secondary allocation. In this fallback phase, the spatial screening criteria are relaxed, and the algorithm attempts to assign the task from the broader set of aircraft not initially considered, thereby striving to maximize the final allocation rate.

5.2.2. Task Auction Priority Sorting Mechanism

Before initiating the auction process, the TDAA algorithm intelligently orders the set of tasks awaiting assignment (

T_{n e w}

). This pre-auction sorting ensures that critical and time-sensitive tasks receive bidding priority. The ordering is determined by an auction priority index

I_{a u c t i o n} (T_{k})

, which balances a task’s inherent priority against the urgency of its time window:

I_{a u c t i o n} (T_{k}) = α \cdot p_{k} + β \cdot I t w (T_{k}) \cdot (1 - \frac{t_{c u r r e n t}}{t_{k}^{e n d}}) .

(36)

Here,

p_{k}

is the task’s priority weight,

I t w (T_{k})

is an indicator function (equal to 1 if

T_{k}

has a time window, or 0 otherwise), and

α

and

β

are configurable weighting coefficients. Tasks are then ranked in descending order of

I_{a u c t i o n} (T_{k})

to produce a sorted sequence

T_{s o r t e d}

for the auction. This prioritization scheme mirrors real-world operational decision-making by ensuring that the most important and most urgent tasks are addressed first in the resource allocation process.

5.2.3. Bid Function Design

A core innovation of the TDAA algorithm is its bid function, designed to accurately quantify the marginal cost for a transport aircraft to execute a newly assigned task. It provides an objective metric for auction decisions by synthesizing key factors: the additional route distance required, the disruption caused to the existing schedule, and the inherent value of the task itself.

For a candidate aircraft

U_{d}

and task

T_{k}

, the bid

B i d (d, k)

is defined as:

B i d (d, k) = \frac{p_{k}}{Δ R_{e x t r a} (d, k) + λ \cdot Δ T_{d i s r u p t i o n} (d, k)},

(37)

where

p_{k}

is the task’s priority weight;

Δ R_{e x t r a} (d, k)

is the additional flight distance required to insert

T_{k}

into the existing route of aircraft

U_{d}

;

Δ T_{d i s r u p t i o n} (d, k)

is the total delay imposed on other time-constrained tasks due to the insertion; and

λ

is a user-defined weight that balances the importance of flight distance against schedule disruption.

The bid function yields a higher value when the task has high priority and when the combined cost is low. The combined cost consists of the flight distance plus the weighted disruption. Therefore, the aircraft with the maximum bid is the most efficient choice.

Let the current task sequence of aircraft

U_{d}

be

S_{d} = (T_{d 1}, T_{d 2}, \dots, T_{d N})

. Here N is the number of tasks already assigned. The total route distance of this sequence is denoted by

R (S_{d})

. Consider each possible insertion position i with

i = 0, 1, \dots, N

. The position

i = 0

means inserting before the first task, and

i = N

means inserting after the last task. For a given position i, define a new sequence

S_{d}^{(i)}

that includes

T_{k}

at position i. The distance increase for this position is

Δ R_{i} (d, k) = R (S d^{n e w}) - R (S d) .

(38)

The incremental distance is the minimal increase over all positions. The position that achieves this minimum is selected for insertion. This is written as

Δ R e x t r a (d, k) = min_{i \in [0, N]} Δ R_{i} (d, k) .

(39)

The schedule disruption term measures how much the insertion delays other tasks that have time window constraints. Let

S_{d}^{tw}

be the subset of tasks in

S_{d}

that possess a time window. For each such task

T_{j}

, the time window is denoted by

[t_{j}^{start}, t_{j}^{end}]

. Let

τ_{j}

be its originally planned start time and

τ_{j}^{new}

be its new start time after inserting

T_{k}

. The disruption term is then defined as

Δ T_{d i s r u p t i o n} (d, k) = \sum_{T_{j} \in S_{d}^{t w}} max (0, τ_{j}^{n e w} - t_{j}^{e n d}) .

(40)

This sum penalizes any task that would miss its deadline due to the insertion. Tasks without time windows are not affected by this term.

5.2.4. TDAA Algorithm Flow

The TDAA algorithm employs a two-phase auction process to achieve rapid task reallocation. The first phase focuses on efficiency, performing preprocessing (task prioritization and aircraft screening) followed by a restricted auction that considers only spatially proximate and temporally feasible aircraft. The second phase ensures robustness by re-attempting the allocation of any tasks left unassigned in the first phase, this time considering a broader set of aircraft from across the entire fleet. This structured approach is detailed in Algorithm 2.

Algorithm 2 TDAA Algorithm Pseudocode

Require:: Current transport aircraft state $S_{d r o n e}$ , new task set $T_{n e w}$ , base location $B$ , threat areas $O$ , current time $t_{c u r r e n t}$
Ensure:: Updated task assignment scheme $a^{n e w}$ , updated task sequencing scheme $s^{n e w}$
1:: {Phase I: Preprocessing and Constrained Auction}
2:: $T_{s o r t e d} \leftarrow$ Priority sort $T_{n e w}$ based on Equation (36)
3:: Initialize task sequence state $a^{n e w}, s^{n e w}$
4:: Initialize set of unassigned tasks $T_{f a i l e d} \leftarrow \emptyset$
5:: {First-Round Auction}
6:: for $k = 1$ to $| T_{s o r t e d} |$ do
7:: $T_{k} \leftarrow T_{s o r t e d} [k]$
8:: $C_{s p a t i a l} \leftarrow$ Spatial proximity pre-screening (Equation (33))
9:: $C_{f e a s i b l e} \leftarrow$ Temporal feasibility screening (Equation (35))
10:: if $C_{f e a s i b l e} = \emptyset$ then
11:: $T_{f a i l e d} \leftarrow T_{f a i l e d} \cup {T_{k}}$
12:: continue
13:: end if
14:: $d^{*} \leftarrow arg {max}_{d \in C_{f e a s i b l e}} B i d (d, k)$ {Using Equation (37)}
15:: Insert $T_{k}$ at optimal position in sequence of $U_{d^{*}}$
16:: Update state $a^{n e w}, s^{n e w}$ for $U_{d^{*}}$
17:: end for
18:: {Phase II: Secondary Auction for Unassigned Tasks}
19:: for $T_{k} \in T_{f a i l e d}$ do
20:: $C_{g l o b a l} \leftarrow$ Screen feasible aircraft from entire fleet (Equation (35))
21:: $C_{g l o b a l} \leftarrow C_{g l o b a l} ∖ C_{s p a t i a l}$ {Exclude first-round candidates}
22:: if $C_{g l o b a l} = \emptyset$ then
23:: Mark $T_{k}$ as permanently unassigned
24:: continue
25:: end if
26:: $d^{*} \leftarrow arg {max}_{d \in C_{g l o b a l}} B i d (d, k)$ {Using Equation (37)}
27:: Insert $T_{k}$ at optimal position in sequence of $U_{d^{*}}$
28:: Update state $a^{n e w}, s^{n e w}$ for $U_{d^{*}}$
29:: end for
30:: return $a^{n e w}, s^{n e w}$

The TDAA algorithm can be summarized in the following steps.

Step 1: Task prioritization. All new tasks in

T_{n e w}

are sorted in descending order of the auction priority index

I_{a u c t i o n} (T_{k})

defined in Equation (36).

Step 2: First-round auction. For each task in the sorted list, the algorithm performs spatial proximity pre-screening using Equation (33) to obtain a candidate set

C_{s p a t i a l}

. This set is then filtered by temporal feasibility using Equation (35) to obtain

C_{f e a s i b l e}

. If

C_{f e a s i b l e}

is empty, the task is added to a failed set

T_{f a i l e d}

; otherwise, the aircraft

d^{*}

that maximizes the bid function

B i d (d, k)

(Equation (37)) is selected, and the task is inserted at the position that minimizes additional flight distance.

Step 3: Second-round auction. For each task in

T_{f a i l e d}

, the algorithm searches the entire fleet for aircraft that can meet the time window constraint, excluding those already considered in the first round. The aircraft with the highest bid is chosen, and the task is inserted similarly.

The updated task assignment

a^{n e w}

and sequencing

s^{n e w}

are returned.

6. Simulation Result

6.1. Comparative Evaluation of the HCBKA Algorithm via Simulation

To thoroughly assess the performance of the HCBKA algorithm in solving the task assignment problem for cooperative airdrop missions involving multiple transport aircraft, this subsection presents a series of systematic comparative simulations. The experiments are conducted using a fixed random seed to ensure reproducibility. By comparing the HCBKA algorithm with several classical optimization algorithms under identical task scenarios, we quantitatively evaluate their overall performance in terms of solution quality, convergence speed, and robustness.

A three-dimensional operational airspace measuring 100 km × 100 km × 50 km is first constructed. Within this space, 35 task areas are defined, encompassing a total of 80 task instances to be executed collaboratively by five heterogeneous transport aircraft. Fifteen enemy air defense threat zones are present, each expanded according to the method described in Section 3.5 to incorporate a safety buffer of 2.0 km. After discretization, each grid cell measures 2.0 km. The base is located at the center of the airspace (50, 50, 0) to enable omnidirectional deployment of the aircraft.

The heterogeneous configuration of the transport aircraft fleet is detailed in Table 1. The aircraft differ in key performance parameters such as maximum range, task capacity, operational altitude range, and cruising speed. This configuration emulates realistic cooperative operations involving diverse aircraft types in combat scenarios.

The parameter settings of the HCBKA algorithm are summarized in Table 2. The detailed attributes of the task instances, including their spatiotemporal distribution and priority levels, are listed in Table 3.

Figure 6 compares the fitness convergence performance of the proposed HCBKA algorithm with three representative metaheuristic algorithms: Particle Swarm Optimization (PSO), Grey Wolf Optimizer (GWO), and the Whale Optimization Algorithm (WOA). The convergence trajectories and final fitness values reveal distinct optimization behaviors among the four methods. PSO exhibits the slowest convergence, stabilizing only after approximately 400 iterations and yielding the lowest final fitness, which remains within the range of 0.72 to 0.73. Although GWO demonstrates a relatively fast initial convergence, its fitness value plateaus around 0.74 midway through the iterations, with minimal improvement thereafter. This indicates that the algorithm has limited ability to escape local optima. WOA performs comparably to HCBKA in the early stages but lacks sufficient convergence depth in later iterations, resulting in a final fitness inferior to that of HCBKA. In contrast, the proposed HCBKA algorithm, despite a slower convergence rate during the initial tens of generations compared to GWO and WOA, surpasses all competitors by achieving the best solution after approximately 60 generations. It enters a stable convergence phase around generation 200, ultimately reaching a fitness value consistently above 0.80. These results highlight HCBKA’s robust global search capability and stable convergence behavior. The comparative results provide strong evidence that the HCBKA algorithm outperforms the benchmark metaheuristics in both solution quality and convergence efficiency, validating its effectiveness and superiority for solving this class of optimization problems.

A detailed examination of the task assignment results generated by the HCBKA algorithm is presented in the Gantt chart in Figure 7, which clearly depicts the temporal execution sequences of tasks for each transport aircraft. In the chart, red bars denote tasks with the highest priority, while gray bars indicate those with the lowest. As shown in Figure 7, all top-priority tasks are completed within the first two hours. Although most medium- and low-priority tasks are scheduled in the early stages of the mission timeline, a few are deferred to later periods. Notably, a significant number of low-priority tasks are deliberately scheduled toward the end of the execution sequence. Importantly, all task schedules strictly adhere to their respective time window constraints. These results demonstrate that the algorithm not only ensures temporal feasibility but also effectively prioritizes tasks based on their urgency and time sensitivity.

Figure 8 illustrates the performance load distribution across the transport aircraft fleet. The task loads are relatively balanced among the aircraft. Aircraft 1, possessing the highest endurance and task capacity, undertakes the largest number of tasks and covers the longest flight distance. Interestingly, despite its limited range, Aircraft 5 assumes a substantial portion of medium and low priority tasks located near the base, leveraging its superior task capacity. This highlights the effectiveness of capability complementarity within the heterogeneous fleet.

The two-dimensional scenario projection in Figure 9 visually reveals the spatial characteristics of the final route planning. The circular markers sharing the same color as the corresponding flight path in the Figure denote the airdrop points of each aircraft. The task visitation sequences exhibit clear spatial clustering, with neighboring task areas assigned to the same aircraft in contiguous sequences, thereby minimizing redundant flight distances. Tasks located near the base are primarily assigned to Aircraft 4 and 5, which have shorter operational ranges, while long-range tasks are allocated to Aircraft 1, which offers superior endurance. Such a capability-aware task allocation strategy significantly enhances overall mission efficiency.

A comprehensive analysis of the offline comparative simulation results confirms that the HCBKA algorithm exhibits notable advantages in solution quality, convergence behavior, and constraint-handling capability, thereby validating the effectiveness of its overall design framework. These strengths primarily arise from the algorithm’s problem-specific initialization strategy and its exploration-enhanced update mechanism—features absent in the baseline algorithms. The hybrid update strategy of HCBKA achieves a balanced trade-off between global exploration and local exploitation, while its constraint-driven repair mechanism ensures that all solutions remain within the feasible region. Together, these mechanisms provide a reliable and efficient solution for multi-transport aircraft cooperative airdrop missions in complex operational environments.

6.2. Ablation Study of the HCBKA Algorithm

To gain deeper insights into the individual contributions and collaborative mechanisms of the core components within the HCBKA algorithm, this subsection presents a systematic ablation study. By conducting controlled experiments, we decouple and independently analyze the heuristic initialization strategy and the constraint repair mechanism employed during population initialization. This quantitative evaluation of each module’s impact on overall performance provides empirical support for algorithmic design decisions.

Figure 9. Scenario visualization generated by the HCBKA algorithm: (a) 3D view. (b) Top-down view.

To assess the standalone effects and joint benefits of the heuristic initialization strategy and the constraint repair mechanism, four algorithm variants are constructed for comparative analysis: Random Initialization without Repair (RI-NR), Random Initialization with Repair (RI-R), Heuristic Initialization without Repair (HI-NR), and the full version combining both components (HCBKA). Specifically, random initialization generates the initial population using a uniform distribution, resulting in entirely random initial solutions—a standard practice in many heuristic algorithms. In contrast, the heuristic initialization leverages the probabilistic models defined in Equations (14) and (15), incorporating prior knowledge such as task spatial distribution, priority weights, and transport aircraft performance compatibility. The repair mechanism refers to the hierarchical constraint repair process described in Section 4.2, whereas in the absence of this mechanism, infeasible task sequences are simply discarded.

Figure 10 presents the fitness convergence curves of the four algorithm variants. The results show that HCBKA achieves superior performance in both convergence speed and final solution quality. It demonstrates rapid early convergence, stabilizing around generation 300, and ultimately reaches a fitness value of 0.85. In comparison, although HI-NR benefits from a high-quality initial population—achieving an initial best fitness of approximately 0.64, higher than the 0.60 of RI-NR and 0.62 of RI-R—its lack of a repair mechanism hampers search efficiency in the early stages, resulting in a final fitness of 0.81. RI-R, despite starting with a lower-quality population, benefits from the repair mechanism during iterations and outperforms both variants without repair. However, due to its inferior initial population, its final fitness remains slightly below that of HCBKA, reaching just above 0.80.

In summary, the ablation results confirm that both the heuristic initialization and the constraint repair mechanisms are critical components contributing to the superior performance of the HCBKA algorithm. The heuristic initialization offers a high-quality starting point for the search, while the constraint repair mechanism ensures sustained feasibility and solution quality throughout the optimization process. Their combined effect enables the algorithm to efficiently solve complex, multi-constraint optimization problems.

6.3. Simulation of the TDAA Algorithm in Dynamic Environments

To comprehensively evaluate the replanning capability and robustness of the Time-window aware Dynamic Auction Algorithm (TDAA) in dynamic environments, this subsection presents simulation experiments across three representative dynamic replanning scenarios: (i) sudden loss of a transport aircraft, (ii) emergence of new task instances within existing task areas, and (iii) addition of new task areas. The experiments are conducted within the simulation environment established in Section 3.5, using the same parameter settings as in Section 6.1. By introducing different types of dynamic disturbances during task execution, the TDAA algorithm is assessed in terms of its ability to maintain task continuity, respond promptly to unexpected events, and optimize resource reallocation.

6.3.1. Sudden Aircraft Loss Scenario

In this scenario, a sudden failure forces transport aircraft

U_{5}

to withdraw from the mission at time

t = 1.5

h. At the moment of failure,

U_{5}

has five pending tasks that are interrupted: task T10 in area A4, T15 in A5, T39 in A15, T47 in A20, and T52 in A21. Details of these tasks are provided in Table 4.

Figure 11 compares the task allocation before and after replanning. The red cross indicates the location where

U_{5}

withdrew from the mission. Dashed lines represent the original flight paths of the transport aircraft, while solid lines depict the updated task sequences following reallocation. Leveraging the decision-making capabilities of the TDAA algorithm, the five interrupted tasks are reassigned to the remaining four aircraft:

U_{1}

,

U_{3}

,

U_{6}

, and

U_{8}

take over 2, 1, 1, and 1 tasks, respectively. The replanning process accounts for each aircraft’s remaining capacity, spatial proximity to the affected tasks, and task priority levels. High-priority tasks are preferentially assigned to nearby aircraft with lower current loads. Spatially, most of the newly inserted tasks are located adjacent to the original flight paths, effectively minimizing additional travel distance induced by the reallocation.

The Gantt chart in Figure 12 depicts the temporal impact of task reallocation. The red cross marks the moment when replanning is triggered, while tasks annotated with vertical lines indicate newly assigned tasks relative to the original schedule, and the graphical markers denote the number of times each aircraft arrives at a task area, with their colors representing the priority level of the corresponding tasks. The results show that the task sequences of all aircraft preserve temporal continuity, with only minor adjustments made to the execution times of certain tasks. Notably, all tasks with strict time window constraints are completed within their designated intervals, highlighting TDAA’s effectiveness in maintaining temporal feasibility. Although

U_{1}

assumes a larger share of the reassigned tasks, its total workload remains within acceptable limits, without introducing notable load imbalance.

6.3.2. Emerging Tasks Within Existing Areas

In this scenario, at

t = 1.5

h, the command center, responding to evolving battlefield conditions, issues an additional swarm UAV airdrop in task areas A6, A7, A13, A16, and A18. The details of the newly introduced task instances are provided in Table 5.

The Gantt chart in Figure 13 illustrates the impact of task reallocation on the timeline of transport aircraft

U_{3}

. The red dashed line marks the replanning trigger time, while yellow stars denote the newly added tasks. The results show that all five tasks are effectively integrated into the existing schedule, with their time window constraints fully satisfied. This low-disruption reallocation strategy highlights the TDAA algorithm’s strength in maintaining system stability, ensuring that localized task updates do not propagate into widespread sequence disruptions.

Figure 14 presents the spatial layout after reallocation. Utilizing a two-stage selection mechanism, the TDAA algorithm identifies transport aircraft

U_{1}

as the optimal platform to execute the new task in area A16. According to its original schedule,

U_{1}

was en route from area A20 to A22. As shown in the figure, this trajectory passes near A16, and a feasible time window exists in its schedule, rendering it a suitable candidate for task insertion.

6.3.3. Scenario Involving New Task Areas

In this scenario, at

t = 1.5

h, two new task areas emerge:

A_{26}

near the northwest coordinate

(21, 91)

and

A_{27}

near the southeast coordinate

(71, 11)

. Each area contains two task instances, all with the highest priority (

p = 5

), and one task in each area is constrained by a strict time window of [1.5 h, 2.0 h]. Detailed parameters are listed in Table 6. This scenario simulates the sudden appearance of tactical opportunities or threats, posing a demand for rapid integration of new operational zones.

The Gantt chart in Figure 15 illustrates the scheduling impact of the newly introduced tasks. The red dashed line marks the replanning trigger time, and yellow stars indicate the new tasks. The task sequences of

U_{1}

and

U_{4}

are adjusted to accommodate the high-priority insertions. Tasks with time window constraints are precisely scheduled within their valid intervals, demonstrating the TDAA algorithm’s strict compliance with temporal requirements. Although the insertion of new tasks causes slight delays in some subsequent executions, all adjustments remain within acceptable bounds, with no violations of deadlines or time windows.

Figure 16 presents the full scenario view incorporating the new task areas. Yellow dashed boxes highlight the spatial extent of the newly introduced areas, yellow stars mark the additional airdrop points, and solid lines indicate the new trajectory segments added to accommodate the tasks.

The TDAA algorithm, through global optimization, assigns the two task instances in

A_{26}

to transport aircraft

U_{1}

and

U_{4}

. This decision is based on multiple factors:

U_{1}

, being closer to the new area and lightly loaded, is selected for the time-constrained urgent task;

U_{4}

, with greater flexibility in route adjustment, is assigned the task without a time window constraint. From a path planning perspective, both newly added trajectory segments are seamlessly integrated into the original flight paths, avoiding unnecessary detours and minimizing resource consumption. Notably, the algorithm successfully avoids nearby enemy air defense threats during route planning, demonstrating its capability to ensure operational safety in complex environments.

A comprehensive analysis of the simulation results across the three dynamic scenarios confirms the TDAA algorithm’s strong responsiveness and robustness. In the sudden aircraft loss scenario, the algorithm effectively mitigates mission disruption through adaptive task reallocation. In the emerging task scenario, it achieves seamless integration of new tasks into the existing system. In the new task area scenario, it demonstrates high adaptability to novel operational demands.

6.4. Multi-Run Statistical Analysis of Dynamic Replanning

To provide a rigorous quantitative evaluation of the TDAA algorithm under different initial conditions, we conducted multi-run experiments for each of the three dynamic replanning scenarios. For every scenario we performed twenty independent simulation runs with different random seeds. This setup allows us to assess the consistency and the robustness of the TDAA algorithm across varying initial conditions.

Table 7 summarizes the statistical results for all three scenarios.

S_{1}

,

S_{2}

, and

S_{3}

respectively refer to three scenarios: Aircraft loss, New task instances, and New mission areas. Each entry reports the mean value and the maximum absolute deviation from that mean across the twenty runs. The key performance metrics include the task completion rate, the time window satisfaction rate, the total flight distance, the total mission time, and the average replanning time per task. The latter metric measures the computational efficiency of the TDAA algorithm during online reallocation.

The statistical results lead to three main conclusions. First, the TDAA algorithm achieves high task completion rates in all scenarios, with average values above 94.6 percent and a maximum absolute deviation below 3.4 percentage points. The time window satisfaction rate remains 100 percent across every run, which demonstrates that the algorithm strictly respects temporal constraints during dynamic replanning. Second, the total flight distance and the total mission time exhibit moderate variations across seeds. These variations are expected because different initial schedules affect the spatial and temporal opportunities for inserting new tasks. However, the algorithm never leaves any task unassigned, and all three scenarios show zero unassigned tasks in every run. Third, the average replanning time is consistently below 0.03 s and often below 0.02 s. This low computational overhead confirms that the TDAA algorithm is suitable for real time contingency response.

7. Conclusions

This paper presents a systematic investigation into the dynamic task allocation problem for coordinated swarm airdrop missions involving multiple transport aircraft in contested airspace with air defense threats. A formal quintuple scenario model is first constructed, encompassing the base, aircraft, mission areas, threat zones, and task instances. This underpins a multi-objective optimization model that accurately captures the collaborative decision-making challenge for a heterogeneous fleet, with objectives including total flight distance, task completion rate and time window satisfaction. To solve this complex problem, an offline task allocation method based on the HCBKA is proposed. It employs a hybrid encoding scheme to separately manage discrete assignment and continuous sequencing variables. Solution feasibility is enforced through a constraint-driven, hierarchical repair strategy, while the algorithm’s search capability is enhanced by an attack–migration dual-mode mechanism that balances global exploration and local exploitation. Furthermore, to maintain operational effectiveness amidst dynamic disruptions during execution, an online reallocation mechanism utilizing the Time-window aware Dynamic Auction Algorithm is developed. TDAA achieves rapid response to contingencies through a two-stage aircraft screening process and a composite bid function that intelligently evaluates allocation costs. Comprehensive simulation experiments demonstrate the superiority of the proposed integrated approach in terms of solution quality, convergence behavior, and dynamic adaptability. The results confirm that the method effectively addresses the mission planning challenges in complex combat environments, thereby providing a robust technical foundation for multi-transport aircraft coordinated swarm operations. Throughout this study, symmetry principles have played a foundational role: from the load-balancing symmetry embedded in the objective function and constraints, to the structural symmetry between discrete assignment and continuous sequencing variables, to the behavioral symmetry of attack–migration dual-mode search in HCBKA, and finally to the operational symmetry of the two-stage screening mechanism in TDAA. These symmetry-aware design choices are not merely incidental but are central to the efficiency, scalability, and robustness of the proposed framework. Thus, beyond its engineering contributions, this work also illustrates how symmetry concepts can guide the design of optimization algorithms for complex multi-agent coordination problems.

Author Contributions

Conceptualization, B.J. and K.Q.; methodology, B.J., K.Q. and Y.W.; validation, B.J.; formal analysis, K.Q.; investigation, B.J.; writing—original draft preparation, B.J.; writing—review and editing, K.Q. and Y.W.; visualization, B.J.; and project administration, K.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are available upon request from the author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Lee, H.W.; Lee, C.S. Research on logistics of intelligent unmanned aerial vehicle integration system. J. Ind. Inf. Integr. 2023, 36, 100534. [Google Scholar] [CrossRef]
Liu, K.; Zheng, J. UAV Trajectory Optimization for Time-Constrained Data Collection in UAV-Enabled Environmental Monitoring Systems. IEEE Internet Things J. 2022, 9, 24300–24314. [Google Scholar] [CrossRef]
Akter, S.; Duong, D.V.A.; Yoon, S. Joint optimization of UAV trajectory, task offloading, and resource allocation in UAV-aided emergency response operations. IEEE Internet Things J. 2025, 12, 21944–21959. [Google Scholar] [CrossRef]
Pan, Y.; Li, L.; Qin, J.; Chen, J.J.; Gardoni, P. Unmanned aerial vehicle–human collaboration route planning for intelligent infrastructure inspection. Comput.-Aided Civ. Infrastruct. Eng. 2024, 39, 2074–2104. [Google Scholar] [CrossRef]
Ahmad, I. Intelligent task offloading decisions for enhanced swarm-based UAV surveillance. IEEE Trans. Aerosp. Electron. Syst. 2025. early access. [Google Scholar] [CrossRef]
Yang, L.; Zhang, X.; Li, Z.; Li, L.; Shi, Y. A LODBO algorithm for multi-UAV search and rescue path planning in disaster areas. Chin. J. Aeronaut. 2025, 38, 103301. [Google Scholar] [CrossRef]
Zhou, L.; Leng, S.; Wang, Q.; Liu, Q. Integrated sensing and communication in UAV swarms for cooperative multiple targets tracking. IEEE Trans. Mob. Comput. 2022, 22, 6526–6542. [Google Scholar] [CrossRef]
Cao, P.; Lei, L.; Cai, S.; Shen, G.; Liu, X.; Wang, X.; Zhang, L.; Zhou, L.; Guizani, M. Computational intelligence algorithms for UAV swarm networking and collaboration: A comprehensive survey and future directions. IEEE Commun. Surv. Tutor. 2024, 26, 2684–2728. [Google Scholar] [CrossRef]
Rao, J.; Xiang, C.; Xi, J.; Chen, J.; Lei, J.; Giernacki, W.; Liu, M. Path planning for dual UAVs cooperative suspension transport based on artificial potential field-A* algorithm. Knowl.-Based Syst. 2023, 277, 110797. [Google Scholar] [CrossRef]
Zhang, A.; Xu, H.; Bi, W.; Xu, S. Adaptive mutant particle swarm optimization based precise cargo airdrop of unmanned aerial vehicles. Appl. Soft Comput. 2022, 130, 109657. [Google Scholar] [CrossRef]
Murray, C.C.; Raj, R. The multiple flying sidekicks traveling salesman problem: Parcel delivery with multiple drones. Transp. Res. Part C Emerg. Technol. 2020, 110, 368–398. [Google Scholar] [CrossRef]
Kitjacharoenchai, P.; Ventresca, M.; Moshref-Javadi, M.; Lee, S.; Tanchoco, J.M.; Brunese, P.A. Multiple traveling salesman problem with drones: Mathematical model and heuristic approach. Comput. Ind. Eng. 2019, 129, 14–30. [Google Scholar] [CrossRef]
Bérczi, K.; Mnich, M.; Vincze, R. Approximations for many-visits multiple traveling salesman problems. Omega 2023, 116, 102816. [Google Scholar] [CrossRef]
Park, J.; Noh, G.; Park, C.; Kim, J.; Kim, J.; Lee, D.; Cho, D. Development of mission allocation based on MILP for multi-UAVs with limited resources. Aerosp. Sci. Technol. 2025, 166, 110598. [Google Scholar] [CrossRef]
He, P.; Hao, J.K. Hybrid search with neighborhood reduction for the multiple traveling salesman problem. Comput. Oper. Res. 2022, 142, 105726. [Google Scholar] [CrossRef]
Zheng, J.; Hong, Y.; Xu, W.; Li, W.; Chen, Y. An effective iterated two-stage heuristic algorithm for the multiple Traveling Salesmen Problem. Comput. Oper. Res. 2022, 143, 105772. [Google Scholar] [CrossRef]
He, P.; Hao, J.K. Memetic search for the minmax multiple traveling salesman problem with single and multiple depots. Eur. J. Oper. Res. 2023, 307, 1055–1070. [Google Scholar] [CrossRef]
Ergüven, E.; Polat, F. Relative distances approach for multi-traveling salesmen problem. Knowl.-Based Syst. 2024, 300, 112160. [Google Scholar] [CrossRef]
Kloster, K.; Moeini, M.; Vigo, D.; Wendt, O. The multiple traveling salesman problem in presence of drone-and robot-supported packet stations. Eur. J. Oper. Res. 2023, 305, 630–643. [Google Scholar] [CrossRef]
Chen, L.; Liu, G.; Zhu, X.; Li, X. A heuristic routing algorithm for heterogeneous UAVs in time-constrained MEC systems. Drones 2024, 8, 379. [Google Scholar] [CrossRef]
Luo, Z.; Poon, M.; Zhang, Z.; Liu, Z.; Lim, A. The multi-visit traveling salesman problem with multi-drones. Transp. Res. Part C Emerg. Technol. 2021, 128, 103172. [Google Scholar] [CrossRef]
Dutta, P.; Khan, I.; Basuli, K.; Maiti, M.K. A heuristic with clustering-routing approach for the Multi-depot Multiple Traveling Salesman Problems in different fields. Eng. Appl. Artif. Intell. 2025, 160, 112013. [Google Scholar] [CrossRef]
Nayak, A.; Rathinam, S. Heuristics and learning models for dubins minmax traveling salesman problem. Sensors 2023, 23, 6432. [Google Scholar] [CrossRef]
Hu, Y.; Yao, Y.; Lee, W.S. A reinforcement learning approach for optimizing multiple traveling salesman problems over graphs. Knowl.-Based Syst. 2020, 204, 106244. [Google Scholar] [CrossRef]
Liang, H.; Wang, S.; Li, H.; Zhou, L.; Zhang, X.; Wang, S. BiGNN: Bipartite graph neural network with attention mechanism for solving multiple traveling salesman problems in urban logistics. Int. J. Appl. Earth Obs. Geoinf. 2024, 129, 103863. [Google Scholar] [CrossRef]
Gao, H.; Zhou, X.; Xu, X.; Lan, Y.; Xiao, Y. AMARL: An attention-based multiagent reinforcement learning approach to the min-max multiple traveling salesmen problem. IEEE Trans. Neural Netw. Learn. Syst. 2023, 35, 9758–9772. [Google Scholar] [CrossRef] [PubMed]
Ma, S.; Ruan, J.; Du, Y.; Bucknall, R.; Liu, Y. An end-to-end deep reinforcement learning based modular task allocation framework for autonomous mobile systems. IEEE Trans. Autom. Sci. Eng. 2024, 22, 1519–1533. [Google Scholar] [CrossRef]
Garn, W. Balanced dynamic multiple travelling salesmen: Algorithms and continuous approximations. Comput. Oper. Res. 2021, 136, 105509. [Google Scholar] [CrossRef]
Groba, C.; Sartal, A.; Vázquez, X.H. Integrating forecasting in metaheuristic methods to solve dynamic routing problems: Evidence from the logistic processes of tuna vessels. Eng. Appl. Artif. Intell. 2018, 76, 55–66. [Google Scholar] [CrossRef]
Huang, Y.; Wang, Y.; Li, Z.; Zhang, H.; Zhang, C. A hierarchical multi robot coverage strategy for large maps with reinforcement learning and dense segmented siamese network. IEEE Robot. Autom. Lett. 2024, 10, 444–451. [Google Scholar] [CrossRef]
Sariel-Talay, S.; Balch, T.R.; Erdogan, N. Multiple traveling robot problem: A solution based on dynamic task selection and robust execution. IEEE/ASME Trans. Mechatron. 2009, 14, 198–206. [Google Scholar] [CrossRef]

Figure 1. Schematic of the multi-transport-aircraft-coordinated swarm airdrop mission scenario.

Figure 2. Flowchart of the HCBKA algorithm.

Figure 3. Reallocation scenario due to aircraft attrition.

Figure 4. Reallocation scenario due to new task instances.

Figure 5. Reallocation scenario due to a new mission area.

Figure 6. Comparison of fitness convergence curves across different algorithms.

Figure 7. Gantt chart of task scheduling results generated by the HCBKA algorithm.

Figure 8. Bar charts of transport aircraft performance constraints: (a) Range constraints. (b) Task load distribution.

Figure 10. Comparison of fitness convergence curves for different combinations of initialization strategies and repair mechanisms.

Figure 11. Task reallocation results under the sudden aircraft loss scenario: (a) 3D view. (b) Top-down view.

Figure 12. Task Gantt chart under the sudden aircraft loss scenario.

Figure 13. Task Gantt chart under the scenario of emerging tasks within existing areas.

Figure 14. Task reallocation results under the scenario of emerging tasks within existing areas: (a) 3D view. (b) Top-down view.

Figure 15. Task Gantt chart under the scenario of new task areas.

Figure 16. Task reallocation results under the scenario of new task areas: (a) 3D view. (b) Top-down view.

Table 1. Transport Aircraft Parameters in the Fixed Scenario.

Aircraft	1	2	3	4	5
Max Range (km)	500	450	480	400	350
Max Tasks	18	15	21	12	24
Min Altitude (km)	0.5	0.5	0.5	0.5	0.5
Max Altitude (km)	40	35	45	30	48
Cruising Speed (km/h)	100	100	100	100	100

Table 2. HCBKA Algorithm Parameters.

Parameter	Description	Value
$N_{p o p}$	Population size	50
$I t e r$	Maximum number of iterations	1000
p	Threshold for selecting attack behavior	0.5
$w_{1}$	Weight of total flight distance	0.2
$w_{2}$	Weight of task completion rate	0.6
$w_{3}$	Weight of time window satisfaction	0.2

Table 3. Task Attributes.

Task ID	Location (km)	Number of Visits	Priority	Time Windows (h)
1	(37.00, 95.00, 0.35)	4	4	[2.05, 3.50] [1.57, 2.74] [2.50, 3.86] [2.30, 3.74]
2	(15.00, 15.00, 0.25)	2	1	No time window
3	(61.00, 71.00, 0.25)	2	1	[0.58, 4.38] [0.00, 2.02]
4	(31.00, 53.00, 0.15)	2	1	No time window
5	(61.00, 13.00, 0.15)	2	2	[0.78, 3.88] [0.00, 3.30]
6	(45.00, 79.00, 0.25)	1	3	No time window
7	(59.00, 5.00, 0.15)	3	1	No time window
8	(7.00, 95.00, 0.45)	3	2	No time window
9	(31.00, 9.00, 0.15)	1	1	No time window
10	(13.00, 49.00, 0.25)	2	3	No time window
11	(55.00, 19.00, 0.45)	4	1	No time window
12	(93.00, 89.00, 0.25)	1	3	No time window
13	(9.00, 19.00, 0.15)	1	2	No time window
14	(39.00, 27.00, 0.25)	2	5	[0.49, 1.12] [0.64, 1.63]
15	(77.00, 7.00, 0.15)	4	1	No time window
16	(87.00, 63.00, 0.15)	1	1	No time window
17	(89.00, 47.00, 0.25)	3	1	No time window
18	(77.00, 57.00, 0.25)	4	5	[0.86, 1.61] [0.95, 1.81] [1.32, 1.83] [1.62, 2.21]
19	(53.00, 43.00, 0.15)	1	4	No time window
20	(91.00, 25.00, 0.25)	3	1	No time window
21	(23.00, 7.00, 0.15)	2	3	[2.68, 3.89] [2.49, 3.97] [1.69, 2.84]
22	(93.00, 81.00, 0.25)	3	4	No time window
23	(81.00, 89.00, 0.15)	3	2	No time window
24	(11.00, 33.00, 0.25)	3	2	No time window
25	(51.00, 71.00, 0.25)	1	1	No time window

Table 4. Details of the Sudden Aircraft Loss Event.

Task Area	Time Window (h)
T10 (A4)	No time window
T15 (A5)	[1.15, 3.19]
T39 (A15)	No time window
T47 (A20)	No time window
T52 (A21)	[1.61, 4.67]

Table 5. Parameters of Newly Introduced Task Instances.

Task Area	Coordinates (km)	Time Window (h)	Priority
A18	(77.00, 57.00, 0.25)	[2.0, 2.5]	5
A7	(59.00, 5.00, 0.15)	No time window	4
A6	(45.00, 79.00, 0.25)	No time window	4
A13	(9.00, 19.00, 0.15)	No time window	4
A16	(87.00, 63.00, 0.15)	[2.5, 3.0]	5

Table 6. Parameters of New Task Areas.

Task ID	Task Area	Coordinates (km)	Time Window (h)	Priority
T63	A26	(21, 91, 0.45)	[1.5, 2.0]	5
T64	A26	(21, 91, 0.45)	No time window	4
T65	A27	(71, 11, 0.25)	[2.0, 2.5]	5
T66	A27	(71, 11, 0.25)	No time window	4

Table 7. Multi-run statistical results for three dynamic replanning scenarios.

Scenario	Completion Rate (%)	TW Satisfaction (%)	Total Distance (km)	Total Time (h)	Replan Time (s)
$S_{1}$	95.99 ± 1.77	100 ± 0	2492.26 ± 215.16	4.03 ± 0.50	0.0178 ± 0.0250
$S_{2}$	94.62 ± 3.32	100 ± 0	2428.48 ± 281.58	3.96 ± 0.45	0.0156 ± 0.0179
$S_{3}$	95.22 ± 3.13	100 ± 0	2554.54 ± 285.44	3.87 ± 0.46	0.0173 ± 0.0159

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Jiang, B.; Qin, K.; Wu, Y. Dynamic Task Allocation of Swarm Airdrop Based on Multi-Transport Aircraft Cooperation. Symmetry 2026, 18, 720. https://doi.org/10.3390/sym18050720

AMA Style

Jiang B, Qin K, Wu Y. Dynamic Task Allocation of Swarm Airdrop Based on Multi-Transport Aircraft Cooperation. Symmetry. 2026; 18(5):720. https://doi.org/10.3390/sym18050720

Chicago/Turabian Style

Jiang, Bing, Kaiyu Qin, and Yu Wu. 2026. "Dynamic Task Allocation of Swarm Airdrop Based on Multi-Transport Aircraft Cooperation" Symmetry 18, no. 5: 720. https://doi.org/10.3390/sym18050720

APA Style

Jiang, B., Qin, K., & Wu, Y. (2026). Dynamic Task Allocation of Swarm Airdrop Based on Multi-Transport Aircraft Cooperation. Symmetry, 18(5), 720. https://doi.org/10.3390/sym18050720

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Dynamic Task Allocation of Swarm Airdrop Based on Multi-Transport Aircraft Cooperation

Abstract

1. Introduction

2. Related Works

3. Modeling of the Airdrop Task Assignment Problem

3.1. Modeling of the Airdrop Mission Scenario

3.2. Design of Decision Variables for Airdrop Tasks

3.3. Modeling of Transport Aircraft Performance Constraints

3.4. Modeling of Task Constraints

3.5. Modeling of Airdrop Task Assignment Optimization Problem

3.5.1. Minimization of Total Flight Distance

3.5.2. Maximization of Task Completion

3.5.3. Maximization of Time Window Satisfaction

3.5.4. Composite Objective Function

4. Multi-Transport-Aircraft-Coordinated Airdrop Task Assignment Method

4.1. Population Initialization Mechanism

4.1.1. Initialization of Assignment Variables ( a k )

4.1.2. Initialization of Sequencing Variables ( s k )

4.2. Constraint-Driven Population Repair Mechanism

4.2.1. Single-Visit Constraint Repair

4.2.2. Payload Capacity Constraint Repair

4.2.3. Range Constraint Repair

4.2.4. Altitude Constraint Repair

4.2.5. Global Lightweight Backfilling Mechanism

4.3. Hybrid Black-Winged Kite Population Update Mechanism

4.3.1. Attack Behavior

4.3.2. Migration Behavior

4.4. HCBKA Algorithm Flow

5. Dynamic Reallocation Method for Coordinated Airdrop Missions

5.1. Analysis of Reallocation Scenarios for Coordinated Swarm Airdrop Missions

5.1.1. Sudden Attrition of a Transport Aircraft

5.1.2. New Task Instances in an Existing Area

5.1.3. Emergence of a New Mission Area

5.1.4. Formal Definition of Event-Trigger Conditions

5.2. Coordinated Airdrop Task Reallocation Method Based on the TDAA Algorithm

5.2.1. Transport Aircraft Screening Mechanism

5.2.2. Task Auction Priority Sorting Mechanism

5.2.3. Bid Function Design

5.2.4. TDAA Algorithm Flow

6. Simulation Result

6.1. Comparative Evaluation of the HCBKA Algorithm via Simulation

6.2. Ablation Study of the HCBKA Algorithm

6.3. Simulation of the TDAA Algorithm in Dynamic Environments

6.3.1. Sudden Aircraft Loss Scenario

6.3.2. Emerging Tasks Within Existing Areas

6.3.3. Scenario Involving New Task Areas

6.4. Multi-Run Statistical Analysis of Dynamic Replanning

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.1.1. Initialization of Assignment Variables ( $a_{k}$ )

4.1.2. Initialization of Sequencing Variables ( $s_{k}$ )