Next Article in Journal
Model-in-the-Loop Design and Flight Test Validation of Flight Control Laws for a Small Fixed-Wing UAV
Previous Article in Journal
An Assessment of the Effectiveness of RGB-Camera Drones to Monitor Arboreal Mammals in Tropical Forests
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Resilient Task Allocation for UAV Swarms: A Bilevel PSO-ILP Optimization Approach

School of Systems Sciences and Engineering, Sun Yat-Sen University, Guangzhou 510006, China
*
Authors to whom correspondence should be addressed.
Drones 2025, 9(9), 623; https://doi.org/10.3390/drones9090623
Submission received: 18 July 2025 / Revised: 22 August 2025 / Accepted: 2 September 2025 / Published: 4 September 2025
(This article belongs to the Section Artificial Intelligence in Drones (AID))

Abstract

To address the severe challenges of task allocation for UAV swarms in uncertain complex environments, this paper introduces the concept of equivalent load, constructs the load capability matrix of a single UAV and the task required load matrix of the task area, and designs a new task resilience capability indicator accordingly to conduct research on a resilience-based optimization framework. Aiming at this multi-objective optimization problem, the “Problem Decomposability Theorem” is proposed, which theoretically proves the feasibility of decomposing the UAV swarm problem into “lower-level Integer Linear Programming (ILP) cost optimization” and “upper-level Particle Swarm Optimization (PSO) resilience optimization”. Based on this, a Particle Swarm Optimization–Integer Linear Programming (PSO-ILP) two-layer nested optimization algorithm is designed. Simulation experiments covering three task areas, five payload types and multiple UAV types are carried out, and the results show that the proposed method has outstanding performance in multi-objective optimization, especially in terms of algorithm convergence and the comprehensive efficiency of swarm load cost and task resilience. In particular, when the interruption probability is in the range of 0.2 to 0.6, it can not only maintain high task resilience but also achieve cost minimization, with a significant improvement in resilience performance. These results not only enrich the theoretical research on UAV swarm resilience but also provide a universal solution for UAV swarm task optimization in multiple fields.

1. Introduction

UAV swarms are widely used in fields such as delivery relief [1,2], agricultural irrigation [3], cooperative hunting [4] and others. In these complex task scenarios [5,6,7,8,9], various factors affect the task performance of UAV swarms. Environmental factors such as adverse weather and complex terrain can affect the flight stability and sensor performance of UAVs to a certain extent. Sudden interruptions such as physical collisions and signal interference can cause varying degrees of damage to the UAV’s body and modules, thereby affecting the UAV’s task performance. As these systems increasingly operate in dynamic, uncertain environments, resilience has become a critical design consideration for maintaining operational effectiveness during disturbances [10,11,12,13].
The theoretical foundations of system resilience were established by Vugrin et al. as early as 2010 [14]. Over subsequent decades, resilience research has expanded from natural world to diverse domains including economics, sociology and industrial studies. Following the development of resilience frameworks for critical infrastructure and economic systems, the concept has been widely applied to various complex networks. A significant advancement in the field of engineering applications emerged in 2016, when Tran et al. [15,16] developed a general resilience-assessment framework, taking into account the typical cases of System-of-Systems (SOS) networks. Their approach quantifies system performance changes during disruptions as a core resilience metric, offering valuable methodological support for assessing the resilience of the rapidly developing UAV swarm systems.
UAV swarm resilience evaluation [17] quantifies a system’s capacity to maintain operational performance during external disruptions. As highlighted by Ordoukhanian et al. [18], resilient multi-UAV operations face three fundamental challenges: real-time disruption identification, dynamic resource allocation and performance-recovery latency. Based on Tran’s resilience-assessment framework, recent studies have introduced methodological refinements: Bai et al. [19] established a standardized performance metric, Li et al. [20] developed assessment criteria for heterogeneous networks, Zhou et al. [13] addressed the handling of partial failure scenarios and Guo et al. [21] proposed a resilience-enhancement approach-based cooperative task. These contributions are primarily focused on resilience research related to system performance in the field of UAV communication, with the research focus having gradually shifted from assessment to improvement strategies. Notable approaches include Kong et al.’s [22] resource-supplementation method for adversarial environments, Sun et al.’s [23] multi-swarm reconfiguration model, Wubben et al.’s [24] leader-loss-mitigation technique and Li et al.’s [25] continuous-attack adaptation through soft resource optimization and task reallocation method under continuous attack conditions to enhance the resilience of UAV swarms. Although these methods have shown effectiveness in post-disruption recovery, their implementation requires non-negligible time intervals. In addition, there are many other types of research related to UAV swarm resilience-improvement methods, mainly focusing on designing recovery strategies after interruptions occur. Although these methods effectively address post-disruption recovery, they mainly focus on reactive strategies rather than proactive resilience design during the task-allocation phase.
As illustrated in Figure 1, sudden disruptions can trigger a rapid decline in system performance, with the solid and dashed lines depicting the performance trajectories under different scenarios. The system operates at a “normal level” in its normal state until the initial disruption occurs at time  t 1 . After that, the performance enters an “absorption” phase, continuing to decline and reaching a nadir at  t 2 ; it then enters a “maintenance” phase until recovery actions are initiated at  t 3 . Different final states (A, B, C) correspond to varying degrees of recovery, such as full recovery or partial recovery, which constitute a typical resilience trajectory (as shown by the solid line). In severe cases (dashed line D), performance may drop below the “break level” before recovery strategies can be deployed. The system may even collapse directly, especially when encountering consecutive disruptions. This temporal constraint underscores two core points: first, the critical significance of adopting proactive resilience measures during the task-allocation phase; second, considering redundant UAVs at this stage is a necessary prerequisite for UAV swarms to implement scheduling strategies and ensure the recoverability of task performance. However, the fundamental challenge of UAV swarm task allocation lies in optimizing task performance under strict resource constraints. Phadke et al. [11] critically identified that current allocation methodologies frequently neglect essential swarm system attributes that directly influence operational resilience. Traditional task-allocation problems, categorized by distinct constraints, include the vehicle routing problem (VRP), multiple traveling salesman problem (MTSP) and multiple processor resource-allocation problem (MPRAP). In UAV swarm operations, however, the swarm must simultaneously address two competing objectives: maximizing task effectiveness and minimizing operational expenditures.
The essence of the aforementioned problem is a combinatorial optimization problem between an integer number of UAVs and an integer number of tasks. Traditionally, Integer Linear Programming (ILP) has often been employed for its solution. For instance, Mirza et al. successfully implemented multi-objective optimization for drone coordination and resource allocation using ILP techniques [26]. In logistics applications, Nishira et al. minimized optimal delivery time through ILP-based drone route planning [27], while Gao et al. effectively solved task-allocation problems in medium-scale multi-robot systems via similar methodologies [28]. However, as the scale of UAV swarms and task complexity continue to expand, conventional ILP approaches are increasingly plagued by combinatorial explosion. Recent studies have focused on hybrid techniques integrating Branch and Bound with Cutting Plane methods [29,30,31,32], which exhibit superior performance in reducing solution spaces and identifying optimal solutions. The development of open-source tools—such as COIN-OR’s PuLP modeling framework and the CBC (COIN-OR Branch and Cut) solver—has substantially lowered the barrier to research in linear and integer programming [33].
Despite these advancements, ILP-based methods show limited scalability in dynamic task-allocation scenarios for drone swarms. In contrast, swarm intelligence algorithms have emerged as particularly effective alternatives in practical applications. Guo et al. [34] proposed an optimized discrete particle swarm optimization algorithm to address the requirement of multi-UAV cooperative task execution. Xiong et al. [35] put forward an improved multi-objective grey wolf optimization algorithm, which is applied to task allocation in heterogeneous target reconnaissance by multiple UAVs. Qin et al. [36] presented a seagull optimization algorithm driven by deep reinforcement learning to solve the problem of multi-UAV task allocation in plateau ecological restoration. Other approaches focus on fully decentralized optimization frameworks, such as those based on the Alternating Direction Method of Multipliers (ADMM), which enhance system scalability and resilience against single points of failure by removing the need for a central controller [37,38,39]. While these methods demonstrate strong performance in deterministic environments, they suffer from notable limitations when addressing stochastic interruption scenarios.
A critical limitation emerges from the inherent uncertainty in payload loss during disruptive events. Existing allocation strategies that account for maximum expected loss often lead to substantially increased operational costs and excessive resource redundancy, as documented in prior studies [7,40,41,42]. This necessitates proactive resilience-integrated allocation frameworks that intrinsically embed robustness at the planning stage without compromising operational efficiency.
In view of the limitations in existing UAV swarm task-allocation methods, this paper presents a comprehensive study based on task resilience. By leveraging a bilevel nested structure, we design an innovative task-allocation-optimization algorithm. This algorithm is specifically tailored to address the multi-objective optimization challenges in UAV swarms, with a particular emphasis on task resilience. This study presents three key innovations:
(1)
By introducing the concept of equivalent load, this paper deeply integrates UAV capabilities (types, payloads) with task requirements, constructs a multi-dimensional quantitative indicator system related to UAV swarm tasks and further establishes a quantitative assessment model for task resilience. This research effectively addresses the deficiency of insufficient attention to task resilience in the existing field of task allocation.
(2)
The “problem decomposability theorem” is proposed, and a two-layer PSO-ILP optimization algorithm is designed to jointly optimize swarm load costs and task resilience. Specifically, the lower layer focuses on minimizing load costs, while the upper layer optimizes the weighted fitness value of load costs and resilience, thereby effectively solving the complex multi-objective optimization problem in UAV swarm task allocation.
(3)
The uncertain and stochastic disturbance characteristics of task resilience during task execution under different interference probabilities (0.2–0.6) are investigated, fully verifying the effectiveness of the PSO-ILP algorithm in coping with sudden interruptions and maintaining high task capacity performance in various disturbed scenarios.
This paper is organized as follows: Section 2 explores the related Work in UAV swarm optimization, task-allocation framework and resilience model. Section 3 constructs the mathematical model of the UAV swarm task scenario. Section 4 elaborates on the theoretical basis of the two-layer nested optimization method and conducts algorithm design. Section 5 carry out simulation experiments and result analysis based on typical scenarios. Section 6 summarizes the research achievements and future research directions.

2. Preliminaries

2.1. UAV Swarm Task-Allocation Framework

The task-allocation framework for the UAV swarm is illustrated in Figure 2 [10]. In this framework, we introduce the concept of equivalent load to quantify the different types of tasks in the UAV swarm’s task list as well as the task capabilities of individual UAVs. The process begins with defining task scenarios, from which a “Task Required Load” matrix is derived. Concurrently, the available UAV fleet is defined by its “UAV Loads Configuration” and corresponding “Load Capacity Matrix” for each UAV type. The central “Task-Allocation Optimization” module processes these inputs. It performs “Task Splitting” to break the overall mission into subtasks and allocates UAVs to form “Subtask Alliances”. Crucially, the optimization provides “Extra” (redundant) loads for each alliance to enhance resilience against disruptions. A “Task completion test” provides feedback to the allocation and analysis stages. For example, during task execution, surveying UAVs carry more surveying payloads than other types of UAVs, and communication UAVs are equipped with more communication payloads. Based on the changes in parameters such as the swarm’s payload capacity, task capabilities and interruption probability, we can evaluate and optimize the task coalitions obtained from UAV swarm task allocation in multi-task scenarios.
The UAV swarm task-allocation problem studied herein is characterized by several key features:
A basic task is defined as an indivisible unit, and all such tasks executable by UAV swarms require corresponding task payloads—collectively referred to as “task load.” For instance, surveying tasks demand surveying payloads, communication tasks require communication payloads and rescue tasks necessitate transport payloads.
The load capacity matrix for different UAV types is defined according to the variety of UAV models, their task-performing abilities and the quantity of “carrying” task loads.
Furthermore, the entire task can be decomposed into a sequence of subtasks, which are assigned to individual tasks within each task zone. The load-requirement matrix for each subtask is derived from the load demand matrix of each task in the corresponding task domain.
Focusing on the types, quantities and load capacity matrices of UAVs, along with considerations of the task zones environment, task goals and various constraints, this study explores an optimal task-allocation strategy. It involves the systematic assignment of subtasks and the allocation of specific numbers of different UAV types to each subtask, forming task alliances to accomplish individual subtasks and thus enhancing the overall operational efficiency of the UAVs.

2.2. Basic Concepts and Definitions

Within the UAV swarm task-allocation framework, the concept of payload features prominently across various matrices—including the load capability matrix of different UAV types, the load required matrix for operational units in each task zone and the task load required matrix for individual subtasks. A unified quantification of these payloads is foundational to research on optimizing UAV swarm task-allocation strategies. To this end, this paper introduces the concept of “equivalent payload” to theoretically resolve the issue of load-measurement metrics in UAV swarms.
Definition 1.
Equivalent load: The equivalent load serves as the fundamental unit for various types of task payloads. By setting a unified basic measurement unit for each type of task to quantify its task capability, it can form a core quantitative indicator for measuring the installed payload capacity of UAVs, matching task payload requirements and optimizing scheduling. Specifically, its calculation logic is to divide the actual effectiveness of the UAV in a specific task by the benchmark unit effectiveness set for that task type to obtain the corresponding equivalent load value. This concept has dual functions: first, it can be used to define the installed task payload capacity of a single UAV, clarify the capability boundary of each UAV in performing specified tasks, lay the foundation for constructing the capability matrix of each component in the UAV swarm and provide a comparable standard for the task capabilities of different UAVs; second, it can quantify the payload required for each task area and task execution, which not only helps formulate the corresponding task payload-requirement matrix, but also provides data support for subsequent task scheduling and task alliance reconstruction, making task arrangement more scientific and operable.
For example, if “an effective search range of 100 square kilometers per unit time” is set as 1 basic unit of search-type payload: when the installed search payload of a UAV is 2, it means that the UAV can complete a search task of 200 square kilometers per unit time; and when a task requires 10 search-type payloads, it means that a search task of 100 square kilometers can be completed within 10 unit times, or a search task of 1000 square kilometers can be completed within 1 unit time, etc.
This framework is specifically designed to optimize the configuration of task alliances composed of diverse UAVs for individual subtasks and to assess the effectiveness of task capacity. The introduction of the concept of “task capacity” within this framework provides a quantitative measure for evaluating the extent of task fulfillment, which serves as a foundational basis for the subsequent calculation of task resilience metrics.
Definition 2.
Task capacity: Task capacity ( T C ( t ) ) represents the ratio of the total payload carried by the UAV swarms to the total task payload demand at time t. The calculation formula is presented in Equation (1), which ε i ( t ) represents the equivalent loads carried by the UAV at time t, N U is the number of UAVs and  U ( t ) represents the total number of task-required loads at time t.
T C ( t ) = 1 U ( t ) · i = 1 N U ε i ( t )

2.3. Task Resilience Model

A portion of the redundant task payload is added to the target tasks assigned to the UAV swarms during the task pre-allocation stage to counter abrupt interruption during the execution of tasks. By considering redundancy of the task payload demands, the UAV swarms can maintain its task capability above the baseline level, even experiencing payload losses caused by an abrupt interruption during the task-execution period.
Existing research on UAV swarm resilience has primarily focused on the dynamic variation of UAV swarm performance. However, as an inherent attribute of a system, resilience measurement and analysis during the task pre-allocation phase ensure that the UAV swarms can continue to carry out operational tasks and implement recovery strategies under abrupt interruption. This paper presents a task resilience model for the pre-allocation phase, introducing a probabilistic interruption factor into the UAV task capability model, thereby transforming a time-varying metric into a probability-varying metric.
Task resilience can be applied to not only early-stage static task allocation but also dynamic task allocation during task execution. It represents a metric with greater potential for engineering applications than traditional system performance indicators, such as “complex network information exchange volume.” The calculation of UAV swarm task resilience is presented in Equation (2). It is influenced by both the interruption probability and task capacity, being positively correlated with task capacity and negatively correlated with interruption probability. This calculation applies to performance measurement across various levels of tasks.
R = [ a + t = t 0 t f i n a l ( T C ( t ) T C b r e a k ) ( T C 0 T C b r e a k ) · ( t f i n a l t 0 ) ] · e b · ( P 1 ) a + 1 T C ( t ) T C b r e a k 0 T C ( t ) < T C b r e a k
where T C ( t ) represents the task capability under the current interruption probability. T C 0 represents the required baseline task capacity. t 0 represents the start time of the task. t f i n a l represents the moment corresponding to the end of the task. T C b r e a k denotes the break level of task capability, which is the task capability required by the UAV swarms to maintain a base level during the task. P is the interruption probability of the UAV swarms during task execution. In the simulation experiments, the specific values for these variables are determined based on the required duration of the simulation.
To adjust for the impact of different UAVs or tasks on the measurement of task resilience, this paper introduces a task baseline deviation factor a and an interruption probability deviation factor b into the formula, ensuring consistent measurement of task resilience-calculation results.
The task resilience is illustrated as a surface plot and a contour plot with task capacity and interruption probability as independent variables, as shown in Figure 3a,b, respectively. These figures demonstrate the variation in task resilience for UAV swarms at different levels of task capability and interruption probability, given a task baseline of 0.7. In particular, the specific values of a and b can be adjusted according to the characteristics of different tasks to make them conform to the physical laws shown in Figure 3: Specifically, when the UAV swarm’s task capability falls below the task collapse threshold, the task resilience value will remain at 0, not changing with an increase in interruption probability. Conversely, when the UAV swarm’s task capability exceeds the task collapse threshold, the task resilience value is greater for a given interruption probability as the task capability strengthens. Similarly, as the interruption probability increases, the task resilience value that the UAV swarms maintained with the same level of task capability is also greater. Both plots illustrate that Task Resilience is positively correlated with Task Capacity; for a given threat level (P), having more capacity leads to higher resilience. The plots also reflect the model’s characteristic where, above the break level, the calculated resilience value increases with the interruption probability. The dashed line in (b) represents a sample task capacity baseline ( T C 0 = 0.7 ). In summary, the task resilience capability index constructs an evaluation framework with both forward-looking and practical value by integrating the quantitative relationships between task capability, interruption probability and task completion degree. Its core value lies in transforming resilience planning during the pre-allocation phase from qualitative judgment to quantitative calculation. It not only retains the rigid constraints on the task baseline but also adapts to the resilience requirements of different operational scenarios through deviation factors. This index not only provides an optimizable quantitative target for UAV swarm task allocation but also offers a systematic measurement tool for enhancing the robustness of UAV swarm applications in complex environments.

3. Problem Modeling

3.1. Scenario Configuration Definition

The task scenario consists of separate targets of the task and single UAVs of the swarm. Table 1 shows the notations for variables and parameters used throughout this paper.

3.1.1. UAV Configuration Parameters

UAV swarm types: The set of UAV swarm types is denoted as V = v i i [ 0 , N U ] , where v i represents the i t h type of UAV, and  N U represents the total number of UAV types. For example, when set V = v 1 , v 2 , v 3 , it represents that there are three types of UAVs ( v 1 , v 2 and v 3 ) in the swarms.
UAV load types: A single UAV has the capacity to carry a variety of N L loads; the set of UAV load types is denoted as L = l j j [ 0 , N L ] , and the load cost set is denoted as L C = l c j j [ 0 , N L ] , where l j represents the j t h type of load, N L represents the number of load types and l c j represents the cost (USD) of l j ; for example, when set to L = l 1 , l 2 , l 3 , it represents that there are three types of loads ( l 1 , l 2 and l 3 ).
Load capacity matrix: the equivalent load number matrix of various loads designed for each UAV, E = e i j N u × N L , where e i j represents the number of l j carried by v i .

3.1.2. Configuration Parameters of Target Units

Task area set: The area where target units are concentrated, which is denoted as B = b k k [ 0 , N B ] , where N B represents the task area number, b k represents the task area.
Target unit set: The set of target units existing in task areas, denoted as S = s m m [ 0 , N S ] , where N S represents the number of target unit types.
Interruption probability set: The set of interruption probabilities existing in task areas, denoted as P = p k k [ 0 , N B ] , where p k represents the interruption probability in b k .
Target unit number matrix: The matrix represents the number of each type of target unit that existed in task areas, which is denoted as Q = [ q k m ] N B × N S , where q k m represents the number of targets unit s m that existed in b k .
Target damage matrix: The matrix represents the required loads to complete a single target task, which is denoted as F = [ f m j ] N S × N L , where f m j represents the number of required loads in a single target task.
The task required load matrix: This is the required equivalent load for a swarm to finish all tasks, which is denoted as U = [ u k j ] N B × N L , where u k j represents the number of required loads. The calculation of the task required load matrix is presented in Equation (3):
U = Q × F

3.1.3. Configuration Parameters of Task

Task set: The whole UAV swarm’s task can be divided into N B subtasks according to the number of task areas. This set is denoted as M = m k k [ 0 , N B ] .
Task alliance: This is a collection of individual UAVs capable of meeting the task payload requirements; the simplest task alliance refers to a collection of individual UAVs that can meet the task payload requirements with the minimum number of types, which is denoted as A = a 1 , a 2 , , a N B T , where a k = [ a k i ] N B × N U , a k i represents the number of UAVs v i allocated by task m k .

3.2. Mathematical Model

This paper raises a problem based on optimization with task-allocation background, where the objective function considers both the cost of the UAV swarms and the task resilience of the UAV swarms. By adopting a reasonable task-allocation method, under given constraints, the task alliance that minimizes the cost of the UAV swarms and maximizes the task resilience is solved.
Let c i be the total cost of UAV v i , which consists of two parts: UAV cost g U ( v i ) and load cost g L ( v i ) , where g U ( v i ) is positively correlated with the total number of loads carried by UAVs, and the proportionality constant is γ i . The calculation of UAV cost and load cost are presented in Equations (4) and (5) below.
g U ( v i ) = γ i · j [ 0 , N L ] e i j , i [ 0 , N U ]
g L ( v i ) = j [ 0 , N L ] e i j · l c j , i [ 0 , N U ]
Based on Equations (4) and (5), the total cost of UAV v i is presented in Equation (6):
c i = g L ( v i ) + g U ( v i ) , i [ 0 , N U ]
According to the calculation equations above, the objective function is to minimize the total cost of task alliance and maximize the task resilience as shown in Equation (7), where R k represents the task resilience of sub-task m k , λ 1 and λ 2 are normalized parameters.
min J = λ 1 · i = 1 N U k = 1 N B a k i · c i λ 2 · k = 1 N B R k
Equation (7) represents the objective function for a fundamentally multi-objective optimization problem, which aims to satisfy two conflicting goals simultaneously:
Cost Minimization: To minimize the total operational cost of the UAV swarm, represented by the term i = 1 N U k = 1 N B a k i · c i .
Resilience Maximization: To maximize the overall task resilience of the swarm across all subtasks, represented by the term k = 1 N B R k .
These objectives are inherently in conflict, as achieving higher resilience typically requires the allocation of redundant resources, which in turn increases the total cost.
To solve this multi-objective problem, we employ the widely used weighted sum method. This technique converts the vector of objectives into a single scalar objective function by assigning a weight to each objective. The normalized parameters λ 1 and λ 2 serve as these weighting factors, controlling the relative importance of cost minimization versus resilience maximization in the final solution. By adjusting these weights, a decision-maker can explore the trade-off between the two competing objectives and select a solution that best fits the specific operational priorities. This formulation allows our PSO-ILP algorithm to efficiently search for a single, optimal solution that represents the desired balance between cost and resilience.
There are the following constraints that Equation (7) should obey:
0 i = 1 N U k = 1 N B a k i · c i C 0
i = 1 N U a k i · e i j q k m · f m j , m [ 0 , N S ] , k [ 0 , N B ] , j [ 0 , N L ]
a k i Z + , R k 0 , k [ 0 , N B ] , i [ 0 , N U ]
where Constraint (8) requires that the total cost of task alliance does not exceed the expected cost C 0 , Constraint (9) requires that the number of each type of load carried by a swarm should exceed the task required load and Constraint (10) requires that the value of task resilience is positive and the UAV number should be a positive integer.

4. Design of Bilevel Nested Optimization

4.1. Problem Analysis

In adversarial environments, the UAV swarm task-allocation problem formulated in Section 3.2 constitutes a non-convex integer nonlinear programming problem, presenting two inherent challenges: (1) The calculation of task resilience relies on a feasible task alliance, and the task capacity needs to be obtained via simulation. This renders the direct solution of the problem intractable. (2) Integer programming encounters efficiency bottlenecks when addressing large-scale problems [43,44]. The nonlinear characteristics of the task resilience model make it difficult to apply algorithms effectively.
To overcome the aforementioned difficulties, this paper proposes a novel bilevel-optimization framework that decomposes the problem into two hierarchically organized phases. In the initial phase, a feasible task alliance configuration is generated using conventional integer linear programming (ILP), deliberately excluding resilience considerations to ensure computational tractability. The subsequent phase implements Monte Carlo simulations to evaluate the alliance’s resilience performance under various disturbance scenarios, followed by gradient-free optimization iterations to enhance robustness. Within this structure, the lower-level problem serves as an integer linear solver based on the branch-and-bound algorithm, and the selection of optimization variables, model and optimization algorithm design for the upper-level optimizer is crucial.
The upper- and lower-optimization problems are linked via the task resilience capability indicator. An analysis of this indicator reveals a straightforward method to guarantee enhanced task resilience. Specifically, this is achieved by augmenting the constraints of the original payload capability matrix to include the payload quantity for each UAV type. Consequently, it is feasible to introduce a slack variable within the inner loop to equivalently compensate for the “task resilience capability penalty term” that is removed from the objective function.
The constraint in Equation (9) ensures that the N L types of load quantities supplied by the UAV alliance for N B subtasks are sufficient to meet the task requirements. However, when the UAV swarms are subjected to external probabilistic interruptions, the load values in the load capacity matrix E will decrease. Similarly, during the interruptions, the load of the UAVs will also be consumed, leading to a reduction in the task-requirement matrix U . During the task period, the load consumption by UAVs and target units is illustrated in Equations (11) and (12).
e l o s s i j ( t ) = t = t D t e i j · p k · r a n d , i [ 0 , N U ] , k [ 0 , N B ] , j [ 0 , N L ]
u l o s s m j ( t ) = e l o s s m j ( t ) · c o e f f , m [ 0 , N B ] , j [ 0 , N L ]
where e i j is the existing loads l j of UAV v i , t D is the beginning time of the interruption, t f i n a l is the end time of the task. p k is the interruption probability of area b k . e l o s s i j ( t ) represents the consumption number of loads l j of UAV v i during the interruptions at time t. u l o s s m j ( t ) represents the consumption number of loads l j of target unit s m at time t. r a n d is a random number following r a n d U ( 0 , 1 ) . c o e f f is a coefficient constant number.
We introduce a compensation load matrix to ensure the task capacity value above the baseline level during the probability interruptions, denoted in Equation (13).
EX = [ e x k j ] N B × N L
The compensation load value is inversely proportional to task resilience. It will adjust according to the changes of task resilience to ensure that the UAV swarms maintain optimal task resilience while completing the task. The performance of the UAV swarms is defined as a weighted function of its resilience capability and compensation load. Consequently, optimizing this function to its maximum value corresponds to optimal overall swarm performance.

4.2. Bilevel-Optimization Modeling: Theoretical Decomposition and Hierarchical Implementation

4.2.1. Original Problem Abstract: Difficulty Analysis from a Solving Perspective

From the perspective of algorithm design, the task-allocation problem defined in Section 3.2 (Equations (4)–(10)) is classified as a mixed-integer nonlinear programming (MINLP) problems. To clarify the computational complexity and justify the bilevel decomposition, this paper focuses on rephrasing the problem from two aspects: computational coupling and scale complexity.
Let X = x b k i Z N U × N B denote the UAV-task-allocation matrix, where x b k i is a binary decision variable that determines whether the i-th UAV is assigned to the k-th subtask. For resilience optimization, the compensation load matrix EX = e x k j R N B × N L is introduce as continuous auxiliary variables. The original objective function (Equation (7)) and constraints (Equations (8)–(10)) can be rephrased as Equation (14), where R k is the resilience of subtask m k .
min X , EX J X , EX = λ 1 i , k x b k i · a k i · c i λ 2 k R k s . t . R X , EX R 0 ( Resilence baseline , derived from Section 3.2 ) i , k x b k i · a k i · c i C 0 ( Cost contraint , rephrased from Equation ( 8 ) ) i a k i · e i j e x k j q s m · f m j ( Feaxibity with compensation , extended from Equation ( 9 ) ) a k i Z + , e x k j R + ( Variable types )
Through the analysis of the above problem, it has the following difficulties:
(1)
Computational Coupling: The resilience R ( X , EX ) (corresponding to R k in Section 3.2) needs to be evaluated through Monte Carlo simulation, lacks an analytical expression and is incompatible with traditional MINLP solvers.
(2)
Scale Complexity: When the number of UAVs N U > 20 , the combinatorial explosion of discrete variables x leads to an exponential growth in computation time [45].

4.2.2. Theoretical Basis of Bilevel Decomposition: Problem Decomposability Theorem

As previously analyzed, the original task-allocation problem constitutes an intractable mixed-integer nonlinear programming (MINLP) problem due to variable coupling and scale-related complexity, rendering direct solution infeasible. To surmount this challenge, the core approach of this research lies in achieving mathematical decomposition of the problem by introducing the compensation load as a slack variable.
The feasibility of such decomposition is predicated on a key assumption: during the optimization process, decision-making regarding the compensation load (slack variable) can be treated as independent of that concerning UAV configurations. It is crucial to clarify that this independence does not imply a physical separation; rather, leveraging the mathematical properties of slack variables, it enables the disentanglement of originally coupled decision variables into two hierarchical levels. Specifically, the upper level determines the resilience buffer by optimizing the compensation load, while the lower level solves for UAV configurations based on the fixed compensation load. This gives rise to a bilevel-optimization structure that aligns with the logical hierarchy inherent in mission planning. The rationale underlying this assumption is elaborated as follows.
Therefore, separating the optimization of the compensation load from the UAV configuration is a reasonable mathematical abstraction of a realistic planning process, enabling the transformation of a complex MINLP problem into two more tractable subproblems. This assumption may be less applicable when the task interruption probability is extremely high and the swarm configuration requires dynamic real-time adjustments, which calls for more in-depth theoretical research and exploration in the future.
By constructing a theorem, it is proven that the original problem can be decomposed into “lower-level integer linear programming (ILP) + upper-level resilience optimization”, providing a mathematical basis for the bilevel model.
Assumption 1 (Compensation-Configuration Independence).
The decision-making of the compensation load matrix EX has no direct real-time dependence on the UAV configuration X , that is, the compensation strategy can be independently optimized as a “resilience safeguard” and then fed back to adjust the configuration.
Theorem 1 (Problem Decomposability).
If Assumption 1 holds, the original problem (4.1) can be equivalently decomposed into the following two levels of optimization, and the two levels form an iterative optimization relationship through variable transmission:
Lower-Level Problem (ILP): When EX is fixed, solve for the cost-optimal UAV configuration X , in the form of Equation (15). At this point, EX serves as a known constraint parameter, determining the feasible boundary after load compensation and directly influencing the optimal solution space of X :
min x C X = i , k x b k i · a k i · c i s . t . i a k i · e i j q s m · f n j e x k j 0 k , j C X C 0 a k i Z +
Upper-Level Problem (Resilience Optimization): Based on the lower-level optimal solution X * ( EX ) , solve for the resilience-optimal compensation load EX , in the form of Equation (16). At this point, X * becomes the benchmark configuration for resilience assessment, and adjustments to EX will form new parameters that are fed back to the lower level, driving the iterative optimization of the configuration scheme:
min EX R X * EX , EX s . t . R X * EX , EX 0 e x k j EX max k , j EX R +
The correctness of the decomposability of the aforementioned problem can be established through the following derivation and proof of necessity and sufficiency.
(1)
Proof of Necessity
Proposition 1.
If ( X , EX ) is the optimal solution to the original problem, then X = x * ( EX ) and EX is the optimal solution to the upper-level problem.
Proof. 
Assumption for contradiction: Suppose X x * ( EX ) , i.e., there exists x that satisfies the constraints of the lower-level problem and C ( X ) < C ( X ) .
Derivation of contradiction: Since ( X , EX ) satisfies the constraints of the original problem, X also satisfies all constraints of the original problem (as the lower-level constraints are a subset of the original problem’s constraints). In this case, J ( X , EX ) = λ 1 C ( X ) λ 2 R k < λ 1 C ( X ) λ 2 R k = J ( X , EX ) , which contradicts the optimality of ( X , EX ) for the original problem.
Conclusion: X must be the optimal solution to the lower-level problem under EX , i.e.,  X = X * ( EX ) .
Furthermore, if  EX is not the optimal solution to the upper-level problem, there exists EX such that R ( X * ( EX ) , EX ) > R ( X , EX ) while satisfying the upper-level constraints. In this case, ( X * ( EX ) , EX ) would yield a better objective function value J for the original problem (since λ 2 > 0 ), contradicting the optimality of ( X , EX ) . Thus, EX must be the optimal solution to the upper-level problem.    □
(2)
Proof of Sufficiency
Proposition 2.
If EX is the optimal solution to the upper-level problem and X = X * ( EX ) , then ( X , EX ) is the optimal solution to the original problem.
Proof. 
Feasibility verification:
-
The constraints of the lower-level problem are a subset of the original problem’s constraints, so X satisfies all constraints on X in the original problem.
-
The constraints of the upper-level problem include the core constraints on EX in the original problem (e.g., resilience baselines, non-negativity), so EX satisfies the constraints of the original problem.
-
Therefore, ( X , EX ) is a feasible solution to the original problem.
Optimality verification:
-
Assume there exists a feasible solution ( X , EX ) to the original problem such that J ( X , EX ) < J ( X , EX ) .
-
By definition of the lower-level problem, X * ( EX ) is the cost-optimal solution under EX , so C ( X * ( EX ) ) C ( X ) .
-
Substituting into the objective function gives:
J ( X * ( EX ) , EX ) J ( X , EX ) < J ( X , EX ) .
-
However, since EX is the optimal solution to the upper-level problem,
R ( X * ( EX ) , EX ) R ( X , EX ) .
-
Combined with J definition(including λ 2 R k ), this contradicts the above inequality. Therefore, ( X , EX ) is the optimal solution to the original problem.
   □
(3)
Conclusion.
Through the proofs of necessity and sufficiency, we establish that the original problem is fully equivalent to the decomposed “lower-level ILP + upper-level resilience optimization” framework under Assumption 1, thereby completing the proof of Theorem 1.

4.2.3. Lower-Level-Optimization Model: Task-Feasible Configuration via ILP

Based on the lower-level problem of Theorem 1, a lower-level-optimization model is constructed, focusing on the economical allocation of UAVs under a given compensation strategy.
Optimization Variables: UAV sub-task-allocation matrix X = x b k i (discrete integer variables).
Objective Function: Minimize the swarm task cost, consistent with problem (4.2), as shown in Equation (17), where N U is the number of UAV types, N B is the number of task areas, c i is the cost of a single UAV v i and  a k i is the number of UAV v i allocated by task m k .
min J l o w e r = i = 1 N U k = 1 N B a k i · c i
Constraints:
  • Task Feasibility:
    Ensure task requirements are met after compensation, i.e., as Equation (18):
    i = 1 N U a k i · e i j q k m · f m j e x k j 0 , m [ 0 , N S ] , k [ 0 , N B ] , j [ 0 , N L ]
  • Cost Upper Bound:
    Consistent with the original problem, i.e., in the form of Equation (19):
    0 i = 1 N U k = 1 N B a k i · c i C 0
  • Integer Constraint:
    UAV configurations are discrete allocations, i.e., in the form of Equation (20):
    a k i Z + , k [ 0 , N B ] , i [ 0 , N U ]
Solution Method: Use an integer linear programming solver (such as the branch-and-bound method) to quickly generate a “task-feasible, cost-optimal” UAV configuration, providing a basis for upper-level resilience optimization.

4.2.4. Upper-Level-Optimization Model: Resilience-Oriented Compensation Loads

Based on the upper-level problem of Theorem 1, an upper-level-optimization model is constructed, focusing on improving resilience by adjusting compensation loads under a given configuration strategy.
Optimization Variables:
Compensation load matrix EX = e x k j (continuous variables).
Objective Function:
Balance cost and resilience, consistent with the original problem’s objective, adjusted by weights λ 1 , λ 2 , as shown in Equation (21):
min J upper = λ 1 · i = 1 N U k = 1 N B α k i · c i λ 2 · k = 1 N B R k
Constraints:
  • Resilience Baseline:
Ensure task resilience meets the threshold, as in Equation (22):
R k R 0 k [ 0 , N B ]
  • Cost Upper Bound:
This is consistent with the lower-level constraints to ensure total cost control, as shown in Equation (23):
0 i = 1 N U k = 1 N B a k i · c i C 0 k [ 0 , N B ]
  • Compensation Load Upper Bound:
Avoid excess resource use from overcompensation, per Equation (24):
e x k j E X M a x k [ 0 , N B ] , j [ 0 , N L ]
Solution Method: Evaluate the resilience R under different EX via Monte Carlo simulation, adjust compensation loads with heuristic algorithms (e.g., PSO) to achieve the “resilience-optimal” goal, then feedback to update the lower level.

4.3. Design of Bilevel Nested Optimization Based on PSO-ILP

From the model of the layer-optimization problem, the traditional ILP algorithm can be used for the lower-layer optimization, while the upper-layer-optimization algorithm needs to use a swarm intelligence-optimization algorithm. There are many swarm intelligence-optimization algorithms, including genetic algorithm [46,47], particle swarm algorithm [48,49], grey wolf algorithm [50], tuna algorithm [51], whale algorithm [52] and ant colony algorithm [53]. The particle swarm algorithm is easy to implement and has relatively low complexity, especially suitable for the rapid solution of low-dimensional, continuous variable nonlinear processing and other optimization problems; it belongs to the basic algorithm of the population intelligence algorithm, like the genetic algorithm, and is very suitable for use in combination with other algorithms. The dimension of the compensation load matrix of the upper-layer-optimization problem is the number of load species, and the dimensionality is certainly not too high, while the nonlinear processing term in the upper-layer model is mainly the resilience index calculation, which is a typical continuous variable. Therefore, we choose the particle swarm algorithm as the upper-layer-optimization algorithm.
Figure 4 illustrates the overall framework of the optimization process of the PSO-ILP (particle swarm optimization–integer linear programming) algorithm. The workflow begins with the initial target requirements for the entire mission. This total required load is then divided into subtasks based on predefined geographical task areas, where each area contains a specific set of targets. The lower layer (ILP) then allocates a cost-optimal task alliance for each subtask. These alliances are subjected to simulated random interruptions, and their performance is evaluated. The results are passed to the upper layer (PSO), which updates the compensation load matrix (EX) to improve resilience. The optimized EX matrix is then returned to the lower layer to refine the allocation in an iterative feedback loop. Figure 5 is the flowchart of the PSO-ILP bilevel nested optimization process. The process starts with initial parameters and an initial task alliance from the ILP. The core of the algorithm is the interaction loop between the two layers. The lower layer (ILP) is responsible for finding a cost-optimal, feasible task alliance (A) for a given compensation load matrix (EX). The upper layer (PSO) evaluates the fitness of this solution (considering both cost and resilience) and updates the EX matrix to guide the search towards a more robust solution. This iterative process continues until a maximum number of iterations is reached or the solution converges. Algorithm 1 shows the pseudocode of the optimization process.
Algorithm 1 PSO-ILP Bilevel Nested Optimization
  • Require: Load capacities matrix E , target units matrix Q , target unit damage matrix F , interruption probability P, task capacity baseline T C 0 , number of UAV types N U , number of load types N L , number of task areas N B , number of target unit types N S , and max iterate number N i t e r .
  • Ensure: List of fitness values F , task capacity T C , task resilience R, optimal task alliance A , and final compensation load matrix EX .
 1:
Calculate the required load for every single task according to Equation (3).
 2:
Initialize compensation load matrix EX and a solution of task alliance A according to Equation (15).
 3:
Calculate the initial fitness value Fitness 0 with PSO according to Equation (26).
 4:
bestFitness Fitness 0 .
 5:
for   i t e r 1 to N i t e r  do
 6:
       for  i 1 to N B  do
 7:
             Simulate probabilistic interruption with probability P.
 8:
             Calculate task capacity T C according to Equation (1).
 9:
             if  T C > T C 0  then
10:
                  Calculate the i-th fitness value Fitness i with PSO according to Equation (26).
11:
                  if  Fitness i < bestFitness  then
12:
                         bestFitness Fitness i .
13:
                        Update the best known solution for task alliance A .
14:
                  end if
15:
             end if
16:
       end for
17:
       Update compensation load matrix EX according to Equation (16).
18:
end for
19:
Record the final results: list of fitness values F , task capacity T C , task resilience R, task alliance A and compensation load matrix EX .

4.3.1. Basic PSO Algorithm Principle

Particle swarm optimization (PSO), introduced in 1995 by Eberhart and Kennedy, is an algorithm inspired by bird predication behavior, which seeks the optimal solution to optimization problems by simulating cooperation and information sharing among group individuals in the process of foraging.
Within the population, every particle is regarded as an ideal moving entity and expanded to an n-dimensional space. The position of the particle in the n-dimensional space is denoted as a vector z i = ( z i 1 , z i 2 , , z i N ) , and the flight speed is represented as a vector V i = ( V i 1 , V i 2 , , V i N ) . Each particle moves at a specific speed in a given space, which defines its historical optimal position p i = ( p i 1 , p i 2 p i N ) and the optimal position of the entire population p g = ( p g 1 , p g 2 p g N ) . Particles continuously adjust their moving velocities and gradually converge toward the global optimal position by comparing discrepancies between their current positions and both their historical optimal positions and the global optimal position. The performance of each particle is evaluated based on its fitness value. In each iteration, particles update themselves by tracking their individual optimal positions and the global optimal position of the swarm. The position and velocity update mode of the particle swarm is as Equation (25):
V i j k + 1 = w · V i j k + c 1 · r a n d · ( p i j k z i j k ) + c 2 · r a n d · ( p g j k z i j k ) V i j k + 1 = V max V i j k + 1 > V max V max V i j k + 1 < V max z i j k + 1 = z i j k + V i j k + 1
where k is the number of particle iterations; w is the inertia weight; c 1 is the weight coefficient for the particle to track its historical optimal value; c 2 is the weight coefficient for the optimal value of the particle-tracking population. r a n d ( ) is a random number that follows r a n d U ( 0 , 1 ) . V i j k and z i j k are the velocity and position of the j t h dimension of particle i in the k t h iterations. p i j k and p g j k are the individual best position and global best position of the j t h dimension of a particle i in k t h iterations, respectively.

4.3.2. Upper-Layer Optimization of PSO Algorithm Implementation

The interior optimization problem, a classic integer linear program (ILP), is tackled by adopting a standard ILP model. The “compensation load matrix” is established for the task alliance and functions as an input to the upper-optimization model. A notable difficulty is that its compensation values for current and future periods are stochastic in nature and lack an identifiable pattern. This work aims to apply the PSO algorithm to solve the upper-optimization problem since it can effectively handle local optimization and global optimization problems. The upper-layer-optimization problem model, which is described as follows, facilitates the easy development of the PSO implementation algorithm for this problem.
(1)
The compensation load matrix EX serves as the particle in the PSO algorithm;
(2)
Fitness function construction: The model constraints of the upper-optimization problem are integrated into the objective function of optimization in the form of penalty terms, as presented in Equation (26):
F i t n e s s = λ 1 · i = 1 N U k = 1 N B a k i · c i λ 2 · k = 1 N B R k + i = 1 N B e ( T C 0 T C i ) · β
where β is the weight factor of the penalty item, which needs to be determined based on the quantity values of the other two items, and the default value of β is 100.
(3)
Numerous studies have demonstrated that adjusting inertia weights significantly impacts the performance of PSO. Given that the primary function of the optimization variable EX is to compensate for resilience factors unassessed by the lower-level optimizer, the rationality of its value depends heavily on resilience indices, which directly influence the algorithm’s global convergence. To address this challenge, inertia weights are dynamically adjusted based on evaluations of the task resilience capability index. Let the integrated value W of the resilience capability for completing all current subtasks be defined as k = 1 N B R k . Assuming the baseline integrated value for completing all subtasks is W basic and the average integrated value of the particle swarm is W ¯ , the following rules apply: if the integrated value exceeds W basic , their average is taken as W ¯ max ; if it is less than W basic , their average is denoted as W ¯ min .
The inertia weight coefficient is computed as shown in Equation (27). When the value of W is superior to W b a s i c , it implies that the resilience is stronger, and EX is near the global optimum. At this point, the inertia weight coefficient w can be assigned a smaller value to speed up the convergence to the global optimum, with the minimum value w m i n typically being equal to 0.1. When the value of W is inferior to W b a s i c , it indicates that the resilience is rather weak, and EX has optimized the difference.
w = w min × W ¯ max W basic | W W basic | W W basic 1.5 1 1 + e 0.01 · ( W ¯ min W basic ) W < W basic

4.3.3. PSO-ILP Bilevel Nested Optimization Process

In the present study, the methodology for determining optimal parameters in UAV swarm task planning bears similarity to a nested optimization framework, where PSO is integrated with an integer programming approach.
Specifically, the first parameter to be optimized is the number of unmanned aerial vehicles (UAVs), that is, the appropriate deployment of UAVs with different loads such that the total number of loads in the UAV swarms is not less than the number of loads needed for the task. This optimization problem can be mathematically depicted and solved by using an integer programming algorithm. The number of UAVs is the optimal object, and the total load of UAVs is the constraint. The resilience capability, which matches the compensating load matrix, is the second parameter to be enhanced in this research.
As the resilience capability varies with the compensatory load matrix, the optimal value solution problem is comparable to an optimization problem involving a particle swarm technique. Where e x i j represents the optimal local solution and R represents the ideal global solution. Meanwhile, the value of the resilience capability is decided by the task capacity and interruption probability as estimated by an algorithm for integer programming. Therefore, the resilience capability, the number of UAVs and the compensatory load form a cascade framework for the particle swarm algorithm and integer programming method.
In conclusion, the resilience performance of the UAV swarms that needs to be optimized is the weighted value of the number of UAVs and the resilience capability. Thus, the UAV swarms will have a certain rate of resilience capability before it encounters interruptions.

4.3.4. Convergence Criteria

The termination of the PSO-ILP algorithm is determined by a hybrid stopping condition, ensuring both a finite runtime and termination upon solution convergence.
For the simulation experiments presented in this paper, the primary stopping criterion was a fixed maximum number of iterations ( N i t e r = 30 ). This approach ensures a consistent computational budget across all experimental runs, which is crucial for a fair comparison of the final solution quality under different scenarios.
In addition to the maximum iteration limit, the algorithm incorporates a convergence-based early stopping criterion to terminate if the search process stagnates. As depicted in the flowchart (Figure 5), the optimization process can be concluded if the absolute improvement in the global best fitness value (J) between two consecutive iterations, k 1 and k, falls below a predefined threshold, τ . This condition is expressed as:
| J ( k ) J ( k 1 ) | < τ
For our implementation, the convergence threshold was set to τ = 10 4 . This value was chosen empirically to signify that the algorithm has reached a stable solution, and further iterations are unlikely to yield significant improvements to the objective function.

5. Experimental Simulation, Results and Analysis

To verify the effectiveness of PSO-ILP bilevel nested optimization under an abrupt interruption, this study conducts two sets of simulation experiments. First, we set up the simulation scenario of the following experiments. And then in experiment 1, the process of optimization of fitness value and task resilience is analyzed to prove the effectiveness of PSO-ILP bilevel nested optimization. In experiment 2, the total number and the total cost of UAVs’ task alliances of three algorithms are selected to analyze the priority of PSO-ILP. Finally, in experiment 3, a scenario based on the Lanchester model is created, and the task complete performance of PSO-ILP with task resilience under this scenario is analyzed in comparison with two other algorithms.

5.1. Task Scenario Setting

To validate the effectiveness of the proposed PSO-ILP bilevel nested optimization algorithm and to robustly evaluate its performance under various disturbances, this section constructs a representative task scenario. The detailed configuration of this scenario and the rationale for its design are described below.

5.1.1. Basic Scenario Configuration

Figure 6 presents a diagram of the task zones for the UAV swarm mission. This figure illustrates a heterogeneous UAV swarm comprising multiple UAV types (e.g., UAVs v 1 through v 5 ), which is tasked with mitigating threats across three geographically distinct mission scenarios. Each scenario contains primary targets, serving as the objective for a dedicated task alliance. The mission scenario for the UAV swarm encompasses three task areas, within each of which there exist five distinct target types that may pose threats to the UAV swarm.
We constructed a task scenario using a network approach (as shown in Figure 7), which illustrates the target configuration in our simulation. This configuration includes the specific distribution of 33 target nodes (8 nodes for target s1, 4 nodes for target s2, 6 nodes for target s3, 7 nodes for target s4 and 8 nodes for target s5) across three task areas. This setup is designed to create heterogeneous subtasks with different load requirements, thereby providing a more challenging and realistic test case with an asymmetric node distribution for the allocation algorithm. The central purple squares (A1, A2, A3) represent the optimized task alliances, the colored circles (s1–s5) represent target types, the dashed lines indicate the allocation relationship between targets and alliances, and the solid lines represent communication links between alliances.
The specific distribution of the 33 total target nodes (e.g., 8 of s1, 4 of s2) among the three task areas was intentionally designed to be asymmetric. This was done to create heterogeneous subtasks with distinct load requirements, thereby providing a more challenging and realistic test case for the allocation algorithm. The central purple squares (A1, A2, A3) represent the optimized task alliances, colored circles (s1–s5) represent the target types, dashed lines indicate the allocation of targets to alliances and solid lines represent inter-alliance communication links.
Table 2 presentes some parameters of UAVs according to the definitions in the problem-modeling section, including the capacity configuration of v 1 v 5 . γ is the cost coefficient of a single UAV. Table 3 presents some parameters of target units, including the target requirement configuration of s 1 s 5 . l c is the cost coefficient of a single load. The algorithms are requested to give the best solution set of task alliances A to satisfy the fitness value given by Equation (26).

5.1.2. Rationale and Representativeness of Scenario Design

The parameter settings for this simulation (three task areas, five UAV types, five load types) are intended to establish a representative, medium-scale complex scenario rather than to simulate an extreme, large-scale problem. The rationale for this design is mainly reflected in the following aspects:
Non-trivial decision space: This scenario is designed to pose a substantial challenge to the algorithm’s solution-finding capabilities. In the upper-level optimization, the dimensions of the compensation load matrix EX are N B × N L = 3 × 5 = 15 , which means the PSO algorithm must search within a 15-dimensional continuous space. In the lower-level optimization, the number of variables in the task-allocation matrix A is N B × N U = 3 × 5 = 15 . For a bilevel nested optimization problem, handling 15 coupled decision variables is a computationally non-trivial task, sufficient to validate the effectiveness and convergence of the PSO-ILP algorithm when dealing with multivariable, nonlinear problems.
High heterogeneity to reflect complexity: The complexity of the scenario is manifested not only in its scale but also in its inherent heterogeneity.
UAV heterogeneity: Table 2 clearly shows that different UAVs possess distinct load specializations and costs. For instance, v 1 serves as an efficient platform for the l 1 load, whereas v 5 specializes in the l 5 load. This forces the algorithm to make trade-offs between using low-cost, specialized UAVs and employing high-cost, general-purpose UAVs to enhance resilience.
Task heterogeneity: Table 3 indicates that different targets have varying combinations of load requirements. Meanwhile, the distribution of targets across the three areas is non-uniform, as shown in Figure 7. This dual heterogeneity ensures that the three subtasks have distinct demand profiles, thereby effectively testing the algorithm’s ability to allocate resources in a customized manner for subtasks with different characteristics.
Balance between computational feasibility and statistical validity: As stated in Experiments 1 and 2, a reliable statistical evaluation of the algorithm’s performance requires hundreds of repeated simulation runs (100 and 150 times, respectively). The current medium-scale setting allows for the completion of these extensive Monte Carlo simulations within an acceptable timeframe, enabling the acquisition of statistically significant average results. If the scenario scale were excessively large, the prolonged computation time for a single run would make it difficult to conduct sufficient repeated experiments to verify the algorithm’s stability and robustness.
In summary, the design of the current experimental scenario strikes a deliberate balance among problem representativeness, decision-making complexity and computational feasibility. It is sufficient to simulate the core challenges in UAV swarm task allocation, such as resource heterogeneity, diverse demands and the trade-off between cost and resilience, thereby providing a solid and reasonable testbed for validating the effectiveness of the PSO-ILP algorithm.

5.2. Task Resilience-Simulation-Evaluation Model

During the task execution, the UAV swarms are vulnerable to sudden interferences from themselves or the environment. As a result, the task capacity will decline due to the loss of the equivalent load during the task process. To dynamically assess the task load losses caused by the interaction between the UAV swarms and the environment, the interaction between the UAV swarms and the environment can be equated to the adversarial relationship between red and blue sides. Thus, the Lanchester model [54] can be applied to conduct an equivalent analysis of its uncertain disturbances.
Based on the Lanchester model given by Equations (11) and (12), this paper examines the total load of both sides in the task process and utilizes the total load to simulate the initial scale of both sides. According to the load-consumption model, for each task area, the total effective loads L o a d U ( t ) carried by UAV swarms and the required loads L o a d S ( t ) for the targets at time t can be presented as in Equation (29):
L o a d U ( t ) = L o a d U ( 0 ) i = 1 N U j = 1 N L e l o s s i j ( t ) L o a d S ( t ) = L o a d S ( 0 ) m = 1 N S j = 1 N L e l o s s m j ( t )

5.3. Experiment and Analysis

5.3.1. Experiment 1: PSO-ILP Optimization Process and Multi-Objective Effects

Table 4 shows the relevant parameters of the PSO-ILP algorithm, where dim is set to 15 to represent the number of dimensions of PSO, the particle size is set to 20 to represent the number of PSO particles and the number of iterations is set to 30; these settings were found to provide a sufficient balance between thorough exploration of the 15-dimensional search space and computational feasibility. The particle speed is 40, which prevents particles from moving too rapidly out of the effective search area. The search range is 100, ensuring they remain within a plausible range. Both the cognitive coefficient and the social coefficient are 2. This is a standard setting recommended in the canonical PSO literature, which is widely demonstrated to provide a good balance between individual particle exploration and swarm-wide information exploitation.
Under the task settings described in the simulation experiment, the PSO-ILP algorithm was used to carry out 100 task-allocation-optimization experiments, and the average value of the experimental results was taken as the final optimization result. The specific results are shown in Figure 8. The green curve shows the fitness values decline when the PSO-ILP algorithm updates its best solutions. The best solutions of this task scenario update four times when the iterate number equals 20, 25, 45 and 58. Meanwhile, the blue curve illustrates the decline in the UAV number; when the normalized coefficient λ 1 grows from 0.4 to 0.5, the total UAV number declines from 1376 to 1350. Finally, the value of task resilience is also affected by λ 1 . When λ 1 = 0.8 , the task resilience reaches the peak. The red curve in the main graph reflects the change of the target fitness function in the PSO-ILP algorithm with the increase in the number of iterations. In the initial stage, the model was solved by the CBC solver [55] in the PuLP module, and a set of feasible task alliances for the UAV swarm problem was obtained. Based on this alliance, the initial value of the target fitness was calculated. As an upper-level algorithm, PSO optimizes the compensation load matrix through iteration, making the target fitness value show an overall downward trend, and converging to a value close to 1.102 after 80 iterations. The inset in the upper right corner of the main graph synchronously shows the changing trends of the two optimization objectives during the convergence of the fitness value. It can be seen that with the convergence of the fitness value, the number of UAV task alliances decreases, while the resilience of the UAV swarm task shows an overall upward trend with the increase in the number of iterations. Whenever the PSO-ILP algorithm obtains a new fitness value, the optimal solutions of the two optimization objectives will be updated accordingly. The curve in the upper right subgraph clearly proves the effectiveness of the PSO-ILP algorithm in dealing with multi-objective problems during the optimization process.

5.3.2. Experiment 2: Task Capacity and Algorithm Effectiveness Under Interruptions

(1)
Task Capability and Algorithm Performance Under Different Interruption Intensities
This experiment focuses on the task capability performance of UAVs in dynamic confrontation processes, and delves into the actual effectiveness of the PSO-ILP algorithm under different interruption intensities. The simulation time is measured by program-execution intervals, with a total of 150 experiments conducted. It focuses on analyzing the variation patterns of task capabilities of the three subtasks within the swarm when the interruption intensities are 0.2, 0.4 and 0.6.
Focusing on the scenario with interruption level 1 as the specific analytical target, Figure 9a–c depict the capability curves of the UAV swarm in executing three subtasks under interruption probabilities of 0.2, 0.4 and 0.6, respectively. This 3 × 3 matrix of plots illustrates the algorithm’s performance across diverse conditions, where each row corresponds to a distinct interruption level (with Level 1 being the lowest and Level 3 the highest) and each column represents a specific interruption probability (P = 0.2, 0.4, 0.6). The red dashed line in each plot denotes the task baseline ( T C 0 = 0.9 ), below which task performance is deemed degraded. Owing to the compensation loads allocated to each task by the PSO-ILP algorithm during the task-allocation phase, the execution capabilities of the three subtasks all exceed 1 at the initial stage of task execution. As the confrontation proceeds, task capabilities gradually degrade over time due to load losses; however, the optimization mechanism inherent in the PSO-ILP algorithm remains effective.
When the interruption probability is 0.2, the capabilities of the three subtasks oscillate around their respective equilibrium points (close to 0.9) (occurring at t = 30, t = 60 and t = 100, respectively), reflecting the algorithm’s initial effect in maintaining task stability. When the simulation reaches t = 120, although the performance of subtask 1 has experienced three significant declines, it reaches a new balance around 0.4 under the regulation of the algorithm. This is because the remaining effective loads and the dynamic adjustment of the algorithm work together to prevent the task from completely collapsing. When the interruption probability rises to 0.4, the performance of subtask 1 declines four times at t = 65, t = 85, t = 117 and t = 142, the performance of subtask 2 only declines for the first time at t = 140, while the performance of subtask 3 remains above the task baseline throughout the process—demonstrating the PSO-ILP algorithm’s ability to provide differentiated protection for different subtasks. When the interruption probability reaches 0.6, although subtasks 1 and 2 both experience more than two declines (the performance of subtask 1 drops to zero when the simulation is close to 100), subtask 3 still remains stable above the baseline. This fully proves that the PSO-ILP algorithm can ensure the continuous execution of core tasks in a high-interruption environment and maintain the relative stability of task performance in a short period.
(2)
Task Resilience Comparison and PSO-ILP Effectiveness Validation
To further verify the effectiveness of the PSO-ILP algorithm under different interruption probabilities, we solved the problem using the PSO-ILP algorithm and the CBC solver provided by the PuLP module, respectively, in the same experimental scenario described above, and conducted a comparative analysis of the task resilience values of the UAV task alliances obtained by the two algorithms. Figure 10 compares the task resilience values of UAV task alliances solved by the CBC-ILP algorithm and the PSO-ILP algorithm for different interruption probabilities while also illustrating the task resilience values for both task alliances when the interruption probabilities are 0.2, 0.3, 0.4, 0.5 and 0.6. The bar chart compares the task resilience of the three subtasks (T1, T2, T3) for the baseline CBC-ILP algorithm (Algo 1) and the proposed PSO-ILP algorithm (Algo 2). The comparison is performed across five different interruption probabilities. The results show that the resilience values achieved by PSO-ILP are consistently higher than those of the baseline method, with the performance gap widening as the interruption probability increases. When the interruption probability is 0.2, the task capabilities of both task alliances remain at a level close to task requirements due to the lower UAV payload loss caused by interruptions. At this point, the task resilience of both task alliances for all three subtasks exceeds 0.95. Meanwhile, the resilience values of subtask 2 and subtask 3 exceed 0.98, slightly higher than subtask 1’s value. This is because subtask 2 and subtask 3 have higher task objectives and require more payloads than subtask 1. To ensure the UAV swarm’s task capability after an interruption, subtasks 2 and 3 are allocated a larger number of redundant UAVs and redundant payloads than subtask 1. When the interruption probability is 0.3, the average task resilience of each subtask under the CBC-ILP algorithm falls below 0.9, while the average task resilience of the PSO-ILP algorithm can still exceed 0.9. This is because the compensation payloads calculated by the CBC-ILP algorithm are a fixed value. When the interruption probability increases, the quantity of redundant payloads in subtasks with fewer task objectives is significantly lower than the amount of payload lost. In contrast, the compensation payloads calculated by the PSO-ILP algorithm are an adaptive value that adjusts itself according to the actual payload loss, allowing the task alliance to maintain higher task capabilities. From an interruption probability of 0.4 to 0.6, both algorithms show a sharp decline in task resilience values. This is due to the constraint on the quantity of compensation payloads limiting the UAVs’ ability to maintain their task payloads. When payload losses caused by interruptions exceed the maximum value of compensation payload, the UAVs’ task resilience drops sharply. In these three interruption probability scenarios, the task resilience values for all three subtasks of PSO-ILP consistently outperform those of CBC-ILP, demonstrating the superiority of the PSO-ILP algorithm in extreme interference scenarios.
In the context of actual task operations, the parameters such as the task baseline and the task collapse threshold for different tasks involving UAV swarms are adjusted according to the specific circumstances. Additionally, the interruption probability of the UAV swarms may be influenced by other uncertain factors during the task. To mitigate the impact of these uncertainties on the measurement of the UAV swarm’s task resilience, this paper introduces the task baseline offset factor a and the interruption probability offset factor b to ensure the non-negativity of the task resilience-calculation results. Figure 11. illustrates the trend in task resilience as the task baseline offset factor a and the interruption probability offset factor b take on different values. The plot shows the change in the calculated task resilience value as the interruption probability offset factor (b, on the x-axis) varies. The parameter b is a unitless adjustment factor used to scale the impact of the raw interruption probability in the resilience formula (Equation (2)). The range of [1.0, 5.0] was selected for this sensitivity analysis to demonstrate the full dynamic effect of this parameter on the resilience score. Each of the three curves corresponds to a different value of the task baseline offset factor ( a = 0.5 , 1.5 , 4.5 ). The graph illustrates that resilience decreases exponentially with an increasing b, while a higher value of a consistently yields a higher resilience score. It can be observed that as the task baseline offset factor increases, the task resilience values show an overall upward trend. This is because an increase in the offset factor a counteracts the impact of the decrease in task capability on the task resilience values. Simultaneously, when the interruption probability offset factor b increases, the task resilience exhibits an exponential decline. Moreover, as the value of b increases, the influence of the task baseline offset factor a on the task resilience values gradually diminishes. When the value of b rises to 5, the impact of the task baseline offset factor a on task resilience becomes negligible. In summary, by introducing the offset factors, a more realistic measurement of task resilience can be achieved to accommodate different operational requirements.
It is evident that the UAV swarm task-allocation scheme based on the PSO-ILP bilevel nested optimization algorithm that takes task resilience into account possesses a superior ability to sustain task capacity performance when faced with sudden interruptions from operational areas.

5.3.3. Experiment 3: Task-Allocation-Optimization Results by Different Algorithms

Based on the cost parameters of UAVs, game units and task scenarios in Table 5, we applied these parameters to three different algorithms, respectively, to obtain optimized solutions.
Considering the existence of interruption probability, we conducted the experiment under the same scenario 100 times and took the average values as the final results; Table 5 presents a comparison of three different task alliances among three different methods along with the results of these optimizations in detail, with the average optimal task alliances obtained from 100 simulation runs for each algorithm. The table itemizes the number of UAVs of each type ( v 1 v 5 ) allocated to the three task areas ( b 1 , b 2 , b 3 ) and lists the resulting total cost for each configuration. These detailed allocation results in Table 5 provide the underlying data for the aggregated comparison of total UAV numbers and costs presented subsequently, quantitatively demonstrating the superiority of the proposed PSO-ILP method in balancing cost and resilience.
Figure 12 presents a comparison of the ILP algorithm without considering task resilience (method 1), the CBC-ILP algorithm with task resilience consideration (method 2) and the PSO-ILP algorithm with task resilience consideration (method 3) in terms of the number of drones, the cost of drones and the resilience of the UAV swarms for the optimal task-allocation scheme. The ILP algorithm without considering task resilience (method 1) requires a total of 1531 UAVs with a total cost of USD 2,299,481. The ILP algorithm with task resilience consideration, specifically the CBC-ILP(method 2), requires a total of 1794 UAVs with a total cost of USD 2,891,355. The PSO-ILP algorithm with task resilience consideration (method 3) requires a total of 1360 UAVs with a total cost of USD 2,669,066. The results show that the task-allocation scheme using the PSO-ILP algorithm needs fewer drones than the other two systems. When compared to the CBC-ILP algorithm, the overall cost of the PSO-ILP-based scheme is lower. It is feasible to achieve the lowest overall cost while maintaining task resilience.
To sum up, the optimized task alliances simulated with the PSO-ILP algorithm have the lowest total UAV numbers among the three algorithms. Also, when considering task resilience, the total cost of the PSO-ILP algorithm is much better than the ILP algorithm. Compared with the other two algorithms, the results of the PSO-ILP algorithm are superior.

5.4. Discussions

Experimental results demonstrate that the proposed PSO-ILP bilevel nested optimization algorithm exhibits superior performance in multi-objective optimization of cost-effectiveness and task resilience, particularly in interference scenarios involving sudden interruptions. On the basis of appropriately increasing task costs, it achieves a significant improvement in task resilience capacity. Results from Experiment 1 reveal that the PSO-ILP algorithm possesses effective multi-objective optimization capabilities: as iterations proceed, the target fitness function converges stably, accompanied by a reduction in the number of task alliances and an enhancement in task resilience, reflecting its ability to balance multiple optimization objectives. Experiment 2 highlights the advantages of incorporating task resilience capacity into multi-objective optimization. Under varying interruption intensities, UAV swarms employing the PSO-ILP algorithm can maintain stable task-execution capabilities. Notably, when the interruption probability reaches 0.5 or higher, their task resilience is significantly superior to that of the CBC-ILP algorithm, demonstrating strong adaptability to sudden interruptions. Experiment 3 further validates the practical application advantages of the algorithm. Compared with the ILP and CBC-ILP algorithms, while maintaining task resilience, it requires fewer UAVs and achieves lower total costs, underscoring its cost-effectiveness edge. In contrast to previous studies that mostly focused on single objectives or static scenarios, this research comprehensively considers multiple factors such as cost, resilience and dynamic interruptions, thereby addressing the limitation of incomplete considerations in existing studies.

6. Conclusions

The proposed PSO-ILP bilevel-optimization framework provides an innovative solution for resilient task allocation in UAV swarms operating in disturbed environments. Unlike traditional passive recovery strategies, this framework directly embeds resilience considerations into the task-allocation phase through a functionally layered architecture. It adopts a bilevel nested design where the lower ILP layer generates feasible task alliances and the upper PSO layer optimizes the compensation load matrix, achieving a collaborative optimization of cost and task resilience. Simulation results demonstrate that the framework exhibits excellent performance in multiple dimensions. Compared with the other algorithms, it has advantages in terms of convergence, the number of drones required to complete the same tasks and total cost. In particular, it consistently maintains higher task resilience in scenarios with interruption probabilities ranging from 0.2 to 0.6, with the advantages becoming more prominent in high-interruption environments. This study breaks through the limitations of single-objective optimization and provides a technical path that balances robustness and economy for task allocation in UAV swarms under complex environments. Future research will first advance the scalability of the PSO-ILP algorithm through theoretical analysis and experimental verification regarding its performance in handling large-scale tasks and UAVs. It will then extend to dynamic scenarios, exploring task reconfiguration under real-time disturbances like sudden target changes or UAV damage, and integrate intelligent algorithms to boost instant environmental adaptability—thus further enhancing practical value.

Author Contributions

Conceptualization, Y.Z. and L.W.; methodology, Y.Z.; software, Y.Z. and J.L.; validation, Y.Z. and L.W.; formal analysis, Y.Z.; investigation, Y.Z.; resources, Y.Z.; data curation, Y.Z.; writing—original draft preparation, Y.Z.; writing—review and editing, Y.Z. and X.Z.; visualization, Y.Z. and C.W.; supervision, X.Z.; project administration, C.W.; funding acquisition, X.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Science and Technology on Information System Engineering Laboratory (No. 05202007).

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Acknowledgments

The authors would like to sincerely appreciate the careful guidance and selfless assistance provided by Yingchao Zhang in the early stages of this work. Additionally, The authors would like to thank the valuable comments and suggestions offered by the reviewers for this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
PSO-ILPParticle Swarm Optimization–Integer Linear Programming
CBC-ILPCOIN-OR Branch and Cut–Integer Linear Programming

References

  1. Wen, J.; Wang, F.; Su, Y. A bi-layer collaborative planning framework for multi-UAV delivery tasks in multi-depot urban logistics. Drones 2025, 9, 512. [Google Scholar] [CrossRef]
  2. Yang, Y.; Fu, Y.; Xin, R.; Feng, W.; Xu, K. Multi-UAV Trajectory Planning Based on a Two-Layer Algorithm Under Four-Dimensional Constraints. Drones, 2025; prints. [Google Scholar] [CrossRef]
  3. Fan, Y.; Xu, H.; Li, T.; Zhao, W.; Bai, T.; Zhuo, Q.; Zhang, T. Optimized Task Allocation for Unmanned Aerial Vehicle Swarms in Smart Agriculture. In Proceedings of the International Conference on Guidance, Navigation and Control, Changsha, China, 9–11 August 2024; Springer: Berlin/Heidelberg, Germany, 2024; pp. 349–359. [Google Scholar]
  4. Li, B.; Wang, J.; Song, C.; Yang, Z.; Wan, K.; Zhang, Q. Multi-UAV roundup strategy method based on deep reinforcement learning CEL-MADDPG algorithm. Expert Syst. Appl. 2024, 245, 123018. [Google Scholar] [CrossRef]
  5. Xia, C.; Yongtai, L.; Liyuan, Y.; Lijie, Q. Cooperative task assignment and track planning for multi-UAV attack mobile targets. J. Intell. Robot. Syst. 2020, 100, 1383–1400. [Google Scholar] [CrossRef]
  6. Neville, G.; Chernova, S.; Ravichandar, H. D-ITAGS: A dynamic interleaved approach to resilient task allocation, scheduling, and motion planning. IEEE Robot. Autom. Lett. 2023, 8, 1037–1044. [Google Scholar] [CrossRef]
  7. Duan, H.; Zhao, J.; Deng, Y.; Shi, Y.; Ding, X. Dynamic discrete pigeon-inspired optimization for multi-UAV cooperative search-attack mission planning. IEEE Trans. Aerosp. Electron. Syst. 2020, 57, 706–720. [Google Scholar] [CrossRef]
  8. Henchey, M.; Rosen, S. Emerging approaches to support dynamic mission planning: Survey and recommendations for future research. J. Def. Model. Simul. 2021, 18, 453–468. [Google Scholar] [CrossRef]
  9. Wang, W.; Chen, Y.; Zhang, Y.; Chen, Y.; Du, Y. Collaborative Search Algorithm for Multi-UAVs Under Interference Conditions: A Multi-Agent Deep Reinforcement Learning Approach. Drones 2025, 9, 445. [Google Scholar] [CrossRef]
  10. Zeng, Y.; Ren, K.; Sun, Q.; Zhang, Y. A Resilient Task Allocation Method for UAV Swarm Based on PSO-ILP Bilevel Nested Optimization. Preprint 2023. [Google Scholar] [CrossRef]
  11. Phadke, A.; Medrano, F.A. Towards resilient UAV swarms—A breakdown of resiliency requirements in UAV swarms. Drones 2022, 6, 340. [Google Scholar] [CrossRef]
  12. Alqefari, S.; Menai, M.E.B. Multi-UAV Task Assignment in Dynamic Environments: Current Trends and Future Directions. Drones 2025, 9, 75. [Google Scholar] [CrossRef]
  13. Zhou, X.; Huang, Y.; Bai, G.; Xu, B.; Tao, J. The resilience evaluation of unmanned autonomous swarm with informed agents under partial failure. Reliab. Eng. Syst. Saf. 2024, 244, 109920. [Google Scholar] [CrossRef]
  14. Vugrin, E.D.; Warren, D.E.; Ehlen, M.A.; Camphouse, R.C. A framework for assessing the resilience of infrastructure and economic systems. In Sustainable and Resilient Critical Infrastructure Systems: Simulation, Modeling, and Intelligent Engineering; Springer: Berlin/Heidelberg, Germany, 2010; pp. 77–116. [Google Scholar]
  15. Tran, H.T.; Domerçant, J.C.; Mavris, D.N. A network-based cost comparison of resilient and robust system-of-systems. Procedia Comput. Sci. 2016, 95, 126–133. [Google Scholar] [CrossRef]
  16. Tran, H.T.; Balchanos, M.; Domerçant, J.C.; Mavris, D.N. A framework for the quantitative assessment of performance-based system resilience. Reliab. Eng. Syst. Saf. 2017, 158, 73–84. [Google Scholar] [CrossRef]
  17. Bai, C.; Yan, P.; Yu, X.; Guo, J. Learning-based resilience guarantee for multi-UAV collaborative QoS management. Pattern Recognit. 2022, 122, 108166. [Google Scholar] [CrossRef]
  18. Ordoukhanian, E.; Madni, A.M. Resilient multi-UAV operation: Key concepts and challenges. In Proceedings of the 54th AIAA Aerospace Sciences Meeting, San Diego, CA, USA, 4–8 January 2016; p. 0475. [Google Scholar]
  19. Bai, G.; Li, Y.; Fang, Y.; Zhang, Y.A.; Tao, J. Network approach for resilience evaluation of a UAV swarm by considering communication limits. Reliab. Eng. Syst. Saf. 2020, 193, 106602. [Google Scholar] [CrossRef]
  20. Li, H.; Sun, Q.; Wang, M.; Liu, C.; Xie, Y.; Zhang, Y. A baseline-resilience assessment method for UAV swarms under heterogeneous communication networks. IEEE Syst. J. 2022, 16, 6107–6118. [Google Scholar] [CrossRef]
  21. Guo, Z.; Wang, Y.; Sun, Y.; Li, J.; Fu, C.; Zhong, J. Resilience analysis of cooperative mission based on spatio–temporal network dynamics for flying ad hoc network. IEEE Trans. Reliab. 2024, 73, 1034–1043. [Google Scholar] [CrossRef]
  22. Kong, L.; Wang, L.; Cao, Z.; Wang, X. Resilience evaluation of UAV swarm considering resource supplementation. Reliab. Eng. Syst. Saf. 2024, 241, 109673. [Google Scholar] [CrossRef]
  23. Sun, Q.; Li, H.; Wang, Y.; Zhang, Y. Multi-swarm-based cooperative reconfiguration model for resilient unmanned weapon system-of-systems. Reliab. Eng. Syst. Saf. 2022, 222, 108426. [Google Scholar] [CrossRef]
  24. Clerigues, D.; Wubben, J.; Calafate, C.T.; Cano, J.C.; Manzoni, P. Enabling resilient UAV swarms through multi-hop wireless communications. EURASIP J. Wirel. Commun. Netw. 2024, 2024, 39. [Google Scholar] [CrossRef]
  25. Li, H.; Zhong, Y.; Zhuang, X. A soft resource optimization method based on autonomous coordination of unmanned swarms system driven by resilience. Reliab. Eng. Syst. Saf. 2024, 249, 110227. [Google Scholar] [CrossRef]
  26. Mirza, I.S.; Shah, S.; Siddiqi, M.Z.; Wuttisittikukij, L.; Sasithong, P. Task assignment and path planning of multiple unmanned aerial vehicles using integer linear programming. In Proceedings of the TENCON 2023—2023 IEEE Region 10 Conference (TENCON), Chiang Mai, Thailand, 31 October–3 November 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 547–551. [Google Scholar] [CrossRef]
  27. Nishira, M.; Ito, S.; Nishikawa, H.; Kong, X.; Tomiyama, H. An ILP-based approach to delivery drone routing under load-dependent flight speed. In Proceedings of the 2022 International Conference on Electronics, Information, and Communication (ICEIC), Jeju, Republic of Korea, 6–9 February 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 1–4. [Google Scholar] [CrossRef]
  28. Gao, J.; Li, Y.; Xu, Y.; Lv, S. A two-objective ILP model of OP-MATSP for the multi-robot task assignment in an intelligent warehouse. Appl. Sci. 2022, 12, 4843. [Google Scholar] [CrossRef]
  29. Vo, T.Q.T.; Baiou, M.; Nguyen, V.H. A branch-and-cut algorithm for the balanced traveling salesman problem. J. Comb. Optim. 2024, 47, 4. [Google Scholar] [CrossRef]
  30. Maachou, N.; Moulaï, M. Branch-and-cut method for solving the integer linear multiplicative bilevel problem. Pesqui. Oper. 2024, 44, e278191. [Google Scholar] [CrossRef]
  31. Sawaluddin, u.; Beruh, M. Analysis of branch and cut and branch and bound methods to solve mixed integer linear programming problems. J. Math. Technol. Educ. 2025, 3, 401–411. [Google Scholar] [CrossRef]
  32. Kazachkov, A.M.; Le Bodic, P.; Sankaranarayanan, S. An abstract model for branch and cut. Math. Program. 2024, 206, 175–202. [Google Scholar] [CrossRef]
  33. COIN-OR. Optimization with PuLP—PuLP 3.0.2 Documentation. 2023. Available online: https://www.coin-or.org/PuLP/ (accessed on 17 August 2025).
  34. Guo, H.; Zhou, Y.; Wu, J.; Weng, F. Automated Task Allocation for Multi-Drone Reconnaissance Missions Based on Optimized Discrete Particle Swarm Optimization. In Proceedings of the 2024 4th International Conference on Artificial Intelligence, Robotics, and Communication (ICAIRC), Xiamen, China, 27–29 December 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 555–559. [Google Scholar]
  35. Xiong, H.; Yao, C.; Liu, J.; Shi, X. Improved multi-objective gray wolf optimization for task allocation in multi-UAV heterogeneous targets reconnaissance. Clust. Comput. 2025, 28, 397. [Google Scholar] [CrossRef]
  36. Qin, L.; Zhou, Z.; Liu, H.; Yan, Z.; Dai, Y. A Deep Reinforcement Learning-Driven Seagull Optimization Algorithm for Solving Multi-UAV Task Allocation Problem in Plateau Ecological Restoration. Drones 2025, 9, 436. [Google Scholar] [CrossRef]
  37. Song, Y.; Yuan, X.; Yue, H. The ADMM-PINNs algorithmic framework for nonsmooth PDE-constrained optimization: A deep learning approach. SIAM J. Sci. Comput. 2024, 46, C659–C687. [Google Scholar] [CrossRef]
  38. Bai, W.; Wong, Y.; Qiao, X.; Ho, C.P. Distributionally Robust Federated Learning: An ADMM Algorithm. arXiv 2025, arXiv:2503.18436. [Google Scholar] [CrossRef]
  39. He, J.; Xiao, M.; Skoglund, M.; Poor, H.V. Straggler-Resilient Asynchronous ADMM for Distributed Consensus Optimization. IEEE Trans. Signal Process. 2025, 73, 2496–2510. [Google Scholar] [CrossRef]
  40. Kurdi, H.; AlDaood, M.F.; Al-Megren, S.; Aloboud, E.; Aldawood, A.S.; Youcef-Toumi, K. Adaptive task allocation for multi-UAV systems based on bacteria foraging behaviour. Appl. Soft Comput. 2019, 83, 105643. [Google Scholar] [CrossRef]
  41. Wang, Z.; Zhang, J. A task allocation algorithm for a swarm of unmanned aerial vehicles based on bionic wolf pack method. Knowl.-Based Syst. 2022, 250, 109072. [Google Scholar] [CrossRef]
  42. Wan, Y.; Zhong, Y.; Ma, A.; Zhang, L. An accurate UAV 3-D path planning method for disaster emergency response based on an improved multiobjective swarm intelligence algorithm. IEEE Trans. Cybern. 2022, 53, 2658–2671. [Google Scholar] [CrossRef]
  43. Zemkohoo, A.; Dempe, S. Bilevel Optimization Advances and next Challenges; Springer: Geneva, Switzerland, 2020. [Google Scholar]
  44. Dyro, R.; Schmerling, E.; Arechiga, N.; Pavone, M. Second-order sensitivity analysis for bilevel optimization. In Proceedings of the International Conference on Artificial Intelligence and Statistics, Valencia, Spain, 28–30 March 2022; PMLR: Cambridge, MA, USA, 2022; pp. 9166–9181. [Google Scholar]
  45. Mohsan, S.A.H.; Othman, N.Q.H.; Li, Y.; Alsharif, M.H.; Khan, M.A. Unmanned aerial vehicles (UAVs): Practical aspects, applications, open challenges, security issues, and future trends. Intell. Serv. Robot. 2023, 16, 109–137. [Google Scholar] [CrossRef]
  46. Bello-Orgaz, G.; Ramirez-Atencia, C.; Fradera-Gil, J.; Camacho, D. GAMPP: Genetic algorithm for UAV mission planning problems. In Proceedings of the Intelligent Distributed Computing IX: Proceedings of the 9th International Symposium on Intelligent Distributed Computing–IDC’2015, Guimarães, Portugal, 7–9 October 2015; Springer: Berlin/Heidelberg, Germany, 2015; pp. 167–176. [Google Scholar]
  47. Miloradović, B.; Cürüklü, B.; Ekström, M.; Papadopoulos, A.V. A genetic algorithm approach to multi-agent mission planning problems. In Proceedings of the International Conference on Operations Research and Enterprise Systems, Prague, Czech Republic, 19–21 February 2019; Springer: Berlin/Heidelberg, Germany, 2019; pp. 109–134. [Google Scholar]
  48. Jamshidi, V.; Nekoukar, V.; Refan, M.H. Analysis of parallel genetic algorithm and parallel particle swarm optimization algorithm UAV path planning on controller area network. J. Control. Autom. Electr. Syst. 2020, 31, 129–140. [Google Scholar] [CrossRef]
  49. Ghamry, K.A.; Kamel, M.A.; Zhang, Y. Multiple UAVs in forest fire fighting mission using particle swarm optimization. In Proceedings of the 2017 International conference on unmanned aircraft systems (ICUAS), Miami, FL, USA, 13–16 June 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1404–1409. [Google Scholar]
  50. Nadimi-Shahraki, M.H.; Taghian, S.; Mirjalili, S. An improved grey wolf optimizer for solving engineering problems. Expert Syst. Appl. 2021, 166, 113917. [Google Scholar] [CrossRef]
  51. Yan, Z.; Yan, J.; Wu, Y.; Cai, S.; Wang, H. A novel reinforcement learning based tuna swarm optimization algorithm for autonomous underwater vehicle path planning. Math. Comput. Simul. 2023, 209, 55–86. [Google Scholar] [CrossRef]
  52. Pham, Q.V.; Mirjalili, S.; Kumar, N.; Alazab, M.; Hwang, W.J. Whale optimization algorithm with applications to resource allocation in wireless networks. IEEE Trans. Veh. Technol. 2020, 69, 4285–4297. [Google Scholar] [CrossRef]
  53. Zaza, T.; Richards, A. Ant colony optimization for routing and tasking problems for teams of UAVs. In Proceedings of the 2014 UKACC International Conference on Control (CONTROL), Loughborough, UK, 9–11 July 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 652–655. [Google Scholar]
  54. MacKay, N.J. Lanchester combat models. arXiv 2006, arXiv:math/0606300. [Google Scholar]
  55. Zulian, É.F.; Haugou, G.; Weis, C.; Jung, M.; Wehn, N. System simulation with PULP virtual platform and SystemC. In Proceedings of the Conference on Rapid Simulation and Performance Evaluation: Methods and Tools, Bologna, Italy, 21 January 2020; pp. 1–7. [Google Scholar]
Figure 1. Conceptual model of swarm performance and resilience over time under disruptions.
Figure 1. Conceptual model of swarm performance and resilience over time under disruptions.
Drones 09 00623 g001
Figure 2. The proposed UAV swarm task-allocation framework.
Figure 2. The proposed UAV swarm task-allocation framework.
Drones 09 00623 g002
Figure 3. Visualization of the task resilience (R) model. (a) A 3D surface plot showing R as a function of task capacity ( T C ) and interruption probability (P). (b) A corresponding 2D contour map.
Figure 3. Visualization of the task resilience (R) model. (a) A 3D surface plot showing R as a function of task capacity ( T C ) and interruption probability (P). (b) A corresponding 2D contour map.
Drones 09 00623 g003
Figure 4. The overall framework of the PSO-ILP optimization process.
Figure 4. The overall framework of the PSO-ILP optimization process.
Drones 09 00623 g004
Figure 5. Flowchart of the PSO-ILP bilevel nested optimization process.
Figure 5. Flowchart of the PSO-ILP bilevel nested optimization process.
Drones 09 00623 g005
Figure 6. Schematic diagram of the multi-area task environment for the simulation.
Figure 6. Schematic diagram of the multi-area task environment for the simulation.
Drones 09 00623 g006
Figure 7. Network representation of the target configuration and task allocation in the simulation scenario.
Figure 7. Network representation of the target configuration and task allocation in the simulation scenario.
Drones 09 00623 g007
Figure 8. The overall decline in total UAV number in the task alliance and task resilience gain during the PSO-ILP optimization process. The dashed line shows the task resilience value under different λ 1 value.
Figure 8. The overall decline in total UAV number in the task alliance and task resilience gain during the PSO-ILP optimization process. The dashed line shows the task resilience value under different λ 1 value.
Drones 09 00623 g008
Figure 9. Change of task capacity over time for the three task alliances under various interruption scenarios. (ai) show the task capacity over time with interruption probability from 0.2 to 0.6 under interruption level 1,2 and 3.
Figure 9. Change of task capacity over time for the three task alliances under various interruption scenarios. (ai) show the task capacity over time with interruption probability from 0.2 to 0.6 under interruption level 1,2 and 3.
Drones 09 00623 g009
Figure 10. Comparative analysis of sub-task resilience under increasing interruption probability.
Figure 10. Comparative analysis of sub-task resilience under increasing interruption probability.
Drones 09 00623 g010
Figure 11. Sensitivity analysis of the task resilience (R) model with respect to the bias parameters a and b.
Figure 11. Sensitivity analysis of the task resilience (R) model with respect to the bias parameters a and b.
Drones 09 00623 g011
Figure 12. Comparison of UAV number and cost among three different methods.
Figure 12. Comparison of UAV number and cost among three different methods.
Drones 09 00623 g012
Table 1. Major notations for variables and parameters.
Table 1. Major notations for variables and parameters.
NotationExplanation
VThe set of UAVs in swarm
LThe set of loads in UAV
BThe set of task areas
MThe set of UAV task
PThe set of interruption probability
L C The set of cost by loads
N U Number of UAV types
N L Number of load types
N B Number of operational task areas
A Task alliance configuration matrix
E Load capacity matrix
Q Target unit number matrix
F Target damage matrix
U Task required load matrix
EX Resilience enhancing (extra) load matrix
Table 2. Some parameters of UAV configuration.
Table 2. Some parameters of UAV configuration.
UAV l 1 l 2 l 3 l 4 l 5 γ i
v 1 12211080
v 2 21010170
v 3 22111180
v 4 43232100
v 5 0111660
Table 3. Some parameters of target unit configuration.
Table 3. Some parameters of target unit configuration.
Target Unit l 1 l 2 l 3 l 4 l 5
s 1 2001601208080
s 2 2101501007580
s 3 1901651058085
s 4 2051551108575
s 5 1951701309090
l c 1013172550
Table 4. Experiment 1: PSO-ILP algorithm parameters.
Table 4. Experiment 1: PSO-ILP algorithm parameters.
Parameter NamedimParticle SizeIterate NumberParticle SpeedSearch RangeCognitive CoeffSocial Coeff
Parameter value1520304010022
Table 5. The optimal task alliances of three subtasks with three algorithms.
Table 5. The optimal task alliances of three subtasks with three algorithms.
AlgorithmsField No. v 1 v 2 v 3 v 4 v 5 Total Cost (Dollars)
ILP (without resilience) b 1 665844210402,299,481
b 2 90806026958
b 3 86805827755
CBC-ILP (with resilience) b 1 776951247462,891,355
b 2 104947031767
b 3 101946832465
PSO-ILP (with resilience) b 1 875643193412,669,066
b 2 89805926156
b 3 86815726853
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zeng, Y.; Wu, L.; Li, J.; Zhuang, X.; Wu, C. Resilient Task Allocation for UAV Swarms: A Bilevel PSO-ILP Optimization Approach. Drones 2025, 9, 623. https://doi.org/10.3390/drones9090623

AMA Style

Zeng Y, Wu L, Li J, Zhuang X, Wu C. Resilient Task Allocation for UAV Swarms: A Bilevel PSO-ILP Optimization Approach. Drones. 2025; 9(9):623. https://doi.org/10.3390/drones9090623

Chicago/Turabian Style

Zeng, Yifan, Linghua Wu, Jinning Li, Xuebin Zhuang, and Cailun Wu. 2025. "Resilient Task Allocation for UAV Swarms: A Bilevel PSO-ILP Optimization Approach" Drones 9, no. 9: 623. https://doi.org/10.3390/drones9090623

APA Style

Zeng, Y., Wu, L., Li, J., Zhuang, X., & Wu, C. (2025). Resilient Task Allocation for UAV Swarms: A Bilevel PSO-ILP Optimization Approach. Drones, 9(9), 623. https://doi.org/10.3390/drones9090623

Article Metrics

Back to TopTop