2.3.1. Mathematical Model
(1) Decision variables.
In the upper-layer task assignment model, let the set of all delivery tasks be denoted as
I, where
. The set of distribution centers involved in scheduling is denoted as
J, where
. The available UAV types form the UAV model set
K, where
. To describe the assignment and scheduling status in the multi-depot multi-UAV system, the following key decision variables are defined to capture the matching relationship among tasks, distribution centers, and UAVs:
(2) Objective functions.
This section develops a multi-objective optimization model incorporating economic costs, delay times, and UAV resource utilization, aiming to optimize overall operational costs and resource consumption while ensuring timely task completion. Specifically, the objective function comprises the following three sub-objectives:
(1) Economic cost:
The economic cost function consists of three components: the scheduling cost induced via the task waiting time, the fixed cost incurred through UAV deployment, and the variable cost associated with the flight path length, formulated as follows:
where
denotes the unit waiting cost of task
i (CNY/min),
and
represent the delivery start time and request time of task
i (min), respectively,
and
denote the fixed scheduling cost (CNY) and unit distance cost (CNY/km) of UAV type
k, respectively, and
(km) is the actual flight distance between depot
j and task point
i optimized via the lower-layer path planning model, accounting for obstacle avoidance and route optimization.
(2) Total delay time:
To ensure service timeliness, delivery tasks must be completed within the time windows specified by customers. Each task point,
i, is assigned a service time window,
, where
denotes the earliest allowable service time, and
denotes the latest permissible service time. If the actual arrival time of the UAV at task point
i exceeds
, a delay is incurred. The delay time is defined as the difference between the actual arrival time and the latest allowable service time:
where
denotes the actual arrival time at task point
i.
(3) Number of UAVs:
By minimizing the number of deployed UAVs, the system can effectively allocate platform resources while still meeting task requirements:
Combining the above three objectives yields a vector-based, multi-objective optimization problem:
(3) Constraints.
To ensure the feasibility of the model solution and the rationality of the scheduling results, the following constraints are imposed:
(1) Task uniqueness constraint:
This constraint guarantees that each task is performed by exactly one UAV from a specific distribution center, thus maintaining task integrity:
(2) Delay time constraint:
This constraint ensures that a delay occurs only if the actual arrival time at task
i exceeds the latest allowable service time in its time window:
where
denotes the delay time for task
i.
(3) Distribution center time window constraint:
To align the schedule with the operating hours of each distribution center, it is required that all UAVs return to their corresponding centers before the center’s latest closing time after completing their assigned tasks:
where
denotes the time at which the UAV returns to its distribution center after completing task
i, and
represents the closing time of distribution center
j.
(4) UAV load constraint:
This constraint ensures that the total weight of goods assigned to a single UAV does not exceed its maximum carrying capacity:
where
denotes the weight of goods required to be transported for task
i, and
represents the maximum payload capacity of a UAV of type
k.
(5) UAV quantity constraint:
The maximum number of UAVs dispatched is limited by the available fleet size:
where
denotes the total number of UAVs currently available.
(6) Single-task path length constraint:
To ensure flight safety and power sustainability, the flight path for each task must not exceed the maximum allowable distance:
where
(km) denotes the optimized flight distance from the lower-layer path planning, which may include detours for obstacle avoidance, and
(km) represents the maximum flight distance of UAV type k, determined by its endurance.
As mentioned above, the multi-depot multi-UAV task allocation problem in the upper-layer model is described as follows:
where
(CNY) denotes total economic cost,
(min) represents cumulative delay time beyond latest service windows,
(unitless) is the number of deployed UAVs,
(min) is the total time for UAV
k from depot
j to complete task
i (including flight time from previous location and on-site service time),
(min) indicates the operating time window limit of depot
j (maximum allowed return time), and the constraint
ensures the cumulative mission time for all tasks assigned to UAVs departing from depot
j does not exceed its operating window.
This multi-objective model represents a typical multi-objective mixed-integer nonlinear programming (MINLP) problem, characterized by strong coupling between objective functions, a non-convex variable space, and a large number of discrete variables. To address these complexities, a Pareto-optimal multi-objective solution strategy is required to obtain a set of balanced scheduling schemes that effectively trade off between cost efficiency and service quality.
2.3.2. Nsga-Ii Algorithm
The NSGA-II algorithm [
33] is a classical multi-objective evolutionary optimization method. Its core lies in constructing a solution set that approximates the true Pareto front through fast non-dominated sorting and a crowding distance control mechanism, thereby achieving a well-balanced trade-off between solution diversity and convergence. The standard procedure of NSGA-II is as follows:
(1) Initialization.
Randomly generate an initial population, , of size N, and compute the multi-objective fitness value, , for each individual.
(2) Non-dominated sorting.
Sort all individuals in the population based on dominance relations. For any two individuals, and , if and , then is said to dominate . The first layer of non-dominated individuals forms the approximate Pareto front .
(3) Crowding distance calculation.
Within each non-dominated front, calculate the crowding distance,
, of each individual,
, based on the normalized distance between its neighboring solutions in each objective:
where
denotes the position of individual
in the sorted list for the
k-th objective. The crowding distance reflects the density of surrounding solutions in the objective space.
(4) Selection.
Use a binary tournament selection strategy based on rank (non-dominated layer) and crowding distance to generate a parent population, .
(5) Crossover and mutation.
Apply simulated binary crossover and polynomial mutation operators to the selected parents to generate the offspring population .
(6) Population update.
Combine the current generation’s parents, and offspring , to form a temporary population, . Perform non-dominated sorting, and select the top N individuals to form the next generation, .
(7) Iteration.
Repeat the process until a predefined termination condition is satisfied.
While NSGA-II demonstrates advantages in non-dominated sorting and diversity preservation, it still faces limitations when handling high-dimensional objective spaces or mixed problems with strong nonlinear coupling and integer constraints. These challenges include a weak local search capability, a tendency to converge to suboptimal solutions, the diminished resolution of the Pareto front in high-dimensional objectives, leading to poor coverage, and a lack of problem-specific operator design, reducing generalizability [
31]. To address these issues, this study develops an enhanced NSGA-II algorithm to improve overall solving efficiency and solution quality, thereby enabling efficient resolution and decision support for complex multi-objective scheduling problems.
2.3.3. Ensga-Ii Algorithm
The proposed ENSGA-II introduces a series of targeted improvements over the classical NSGA-II framework, including heuristic insertion-based initial population construction, goal-oriented local search operators, and stage-wise evolutionary control mechanisms.
(1) Improved strategy I: heuristic insertion-based initial population construction.
To enhance the initialization efficiency and solution feasibility of NSGA-II, this study proposes a heuristic insertion-based method for initial population construction. This approach addresses the limitations of traditional random initialization in terms of constraint satisfaction and distribution coverage within the solution space. The core idea is to incrementally construct high-quality initial solutions under the constraint of path feasibility. The improved initialization process is described as follows:
(1) A task,
, is randomly selected to form the initial path,
. Each path,
R, represents a scheduling sequence executed by a specific UAV departing from a designated distribution center. An appropriate combination of distribution center
and UAV model
is assigned to the path to ensure the following constraint is satisfied:
where
denotes the maximum flight range of UAV model
k deployed at distribution center
j, and
is the Euclidean distance between nodes
i and
j.
(2) For the remaining tasks
, each task is tentatively inserted into any legal position,
l, in the existing paths
. After insertion, the feasibility of the updated path must be verified, particularly with respect to the task deadline
. The updated path must satisfy the following:
where
is the estimated completion time of task
under the current path configuration, and
denotes its maximum allowable completion time. If the condition is met, the task
is inserted at position
l, and the path is updated as follows:
(3) If task
cannot be inserted into any existing path without violating constraints, a new distribution center,
, and UAV model,
, are selected, under the condition that the distance between the center and the task does not exceed the maximum flight range:
A new path is then constructed as follows:
(4) The above process is iteratively executed until all tasks are successfully assigned, resulting in a set of feasible solutions,
. To ensure structural diversity within the initial population, a uniqueness check is performed after each new solution is generated. A solution is added to the initial population,
, only if it satisfies the following:
The resulting initial population thus consists of p structurally diverse and constraint-compliant scheduling solutions. By incorporating constraint-aware evaluation and iterative path insertion strategies, this initialization method significantly reduces the proportion of infeasible solutions and enhances structural diversity within the feasible solution space, thereby providing a high-quality basis for the subsequent multi-objective evolutionary process.
(2) Improved strategy II: goal-oriented top-down and bottom-up search operators.
To enhance both the global exploration ability and local convergence performance in the multi-objective optimization process, a goal-oriented, staged top-down and bottom-up search mechanism is proposed to improve the performance of genetic operators. This mechanism comprises two customized operators: a goal-oriented crossover operator and a goal-oriented mutation operator, which collectively enhance solution quality and diversity through a two-phase construction strategy.
(1) Goal-oriented crossover operator.
This operator constructs high-quality offspring solutions through a two-stage path recombination mechanism. The first stage focuses on identifying and extracting promising path segments from the parent individuals to build a partial, high-quality offspring; the second stage is responsible for embedding remaining unassigned tasks into the existing path structure to ensure solution completeness and feasibility.
First stage: The construction of partial offspring (path selection and information coordination). In this stage, a partial solution is constructed based on information from two parent solutions, with priority given to extracting potentially valuable delivery paths while coordinating task information across paths to form an initial offspring solution
. Initially, the offspring solution is set to an empty set, i.e.,
. Let the two parent solutions be
and
, with corresponding numbers of delivery paths
and
. Define the minimum and maximum number of paths as follows:
Then, set the number of path extraction iterations,
k, based on the current optimization objective. If the objective is to minimize UAV usage (
), set
to encourage path merging and reduce total UAV count. If the objective is minimizing economic cost, (
), or the delay time, (
), set
. In each iteration, a parent solution,
, is randomly selected. For the selected parent, identify the most promising path,
, according to the objective function
. The selection strategy is defined as follows:
where
denotes the currently selected parent solution, and
represents the set of delivery paths within that solution,
indicates the number of task points contained in path
. A lower average delay per task point in a path implies a higher preference under objective function
. Once the optimal path
is identified, it is copied and added to the offspring solution
and simultaneously removed from the current parent solution.
In the other parent solution, all task points, , included in this selected path, , undergo a coordinated operation, as follows:
The task point i is removed from its original path, after which its predecessor and successor nodes are directly reconnected to form a continuous path structure. Relevant attributes, such as path length, payload, and time windows, are then updated to ensure the feasibility and consistency of the modified path. This procedure is iteratively executed for k rounds. If, at any point during the iterations, either parent solution no longer contains feasible paths, the phase is terminated prematurely. Ultimately, the number of paths in the partial offspring solution satisfies . However, some task points may remain unassigned and will be handled in the second phase.
Second stage: The insertion of remaining customer tasks. In the partial offspring solution constructed during the first stage, only a subset of delivery routes inherited from the parent solutions is retained, leaving some task points uncovered. Let the set of these unassigned task points be denoted as U. The aim of this phase is to insert all task points in U into the existing routes in a feasible and efficient manner, or to create new delivery routes when necessary, thereby forming a complete and feasible offspring solution.
Specifically, this stage operator processes the task points in
U one by one. In each iteration, a randomly selected unassigned task point,
, is chosen, and its optimal insertion position within the current solution,
, is sought. The evaluation criterion for determining the best insertion position is based on the current optimization objective function,
. If the current objective is to minimize the number of UAVs used (
), the primary principle is to avoid adding new routes. Among all existing routes, the one with the largest number of task points is given priority:
Then, within the selected route , the operator sequentially searches for the first insertion position that satisfies all constraint conditions. Once a feasible position is found, the task point is inserted accordingly.
If the current objective is to minimize economic cost (
), the algorithm evaluates all routes and feasible insertion positions to calculate the incremental cost:
The task is inserted at the position with the smallest cost increase:
To minimize delay time (
), the delay increment is calculated as:
Insertion occurs at the position minimizing the delay increment:
If no feasible insertion exists, a new route,
, is created. First, assign the closest distribution center:
Select a UAV type,
, satisfying range constraints:
This new route is then added to the current solution, . The above steps are repeated iteratively until all unassigned customer task points, , have been successfully inserted, ultimately forming a complete and feasible offspring solution.
(2) Goal-guided mutation operator.
To enhance the population’s adaptability and responsiveness during the multi-objective evolutionary process, this study designs two types of mutation operators with objective-awareness mechanisms. These operators apply perturbations and structured repairs tailored to different path characteristics and optimization objectives, such as the number of drones used, economic cost, and service delay time, thereby strengthening the algorithm’s capability to balance multiple objectives.
Mutation operator I: The reconstruction and reinsertion of delayed paths (trigger probability ).
This mutation operator aims to identify and adjust paths containing delayed tasks in the current solution to reduce path-layer delay risks, thereby improving the timeliness and feasibility of the scheduling scheme. The specific procedure is as follows:
First, traverse all delivery paths,
R, in the current solution and extract those containing at least one delayed task point to form a set of delayed paths:
Secondly, for each path,
, in the set
, remove all task points,
, where delays have occurred (i.e., tasks satisfying the corresponding delay condition), and denote these removed task points as the set
. It is noteworthy that the original distribution center and drone model of the path remain unchanged. For each removed task point,
, an attempt is made to reinsert it into the remaining routes of the current solution. The insertion position must satisfy the following conditions:
Equation (
36) (now without numbering) ensures that, after insertion, the service completion time of all tasks on the path does not exceed the upper bound of their respective time windows. If no feasible insertion position can be found in the existing routes for the task point, a new path,
, is created for task
i, starting from its original distribution center. The algorithm then searches for the optimal insertion position within the new path or its neighboring routes. The criterion for selecting the insertion position is to minimize the total service delay of the path, ideally achieving zero while satisfying all constraints. If the insertion conditions remain unmet, additional backup paths are constructed, and insertion attempts continue iteratively until all delayed task points are successfully reassigned. This strategy effectively reduces the overall delay layer of the solution, providing a target-oriented proactive intervention mechanism that facilitates precise regulation of delay metrics during the multi-objective evolutionary process.
Mutation operator II: Destruction–reconstruction mechanism (trigger probability p = 0.75).
This operator integrates the dynamic preferences of multiple objective functions to achieve targeted perturbation and reconstruction of path structures, thereby enhancing the local feasibility repair capability and global exploration diversity within the solution space. The mechanism primarily consists of two stages:
In the destruction stage, first, the set U of unassigned task points is constructed. According to the dynamic preference of the current optimization objective function , different destruction strategies are selected:
When the optimization objective is to minimize the number of drones used (
), priority is given to selecting the path with the fewest task points, and all tasks on this path are moved into the set
U. If, after tasks are removed from the path, its structure no longer satisfies feasibility constraints, it must be divided and reconstructed into several feasible sub-paths:
When the optimization objective is to minimize economic costs (), two mutually exclusive strategies are employed and executed with probabilities of 70% and 30% respectively: The first is “random path partial removal”, in which a path is randomly selected from the set of paths, and n task points are randomly removed from it (). The second is “iterative random removal”, through which a maximum iteration number, , is set, and in each iteration, a feasible path is randomly selected, and one task point is removed.
When the optimization objective is to minimize time delays (
), two mutually exclusive strategies are also used: The first is “high-delay path partial removal”, which selects the path with the largest total delay:
Then, several task points are randomly removed from it. The second is “iterative random removal”, through which removal operations are randomly performed only on customer points in paths that contain service delays.
After the destruction phase is completed, the reconstruction phase begins, where task points in the set U are inserted one by one. In each iteration, a task point, , is randomly selected from U, and its optimal insertion position in the existing or new routes is determined based on the current optimization objective function.
If the objective function is to minimize the number of drones used (
), priority is given to the earliest feasible position where the insertion does not cause service delays for existing task points. If no such position exists, the last feasible position is chosen to maximize the insertion flexibility of the route. If the objective function is to minimize economic cost (
), the position causing the smallest increase in cost after insertion is selected. If the objective function is to minimize delay time (
), the position resulting in the minimum increase in total route delay is selected. If the current task point cannot be inserted into any existing route, a new route is created according to the following rule: the nearest distribution center in terms of Euclidean distance to task point
is selected as the starting point:
Next, assign a UAV model,
, that meets the current task’s service range requirements, which must satisfy the following range constraint:
Finally, during the population update process, this study adopts a “parent plus offspring” merging strategy to construct an intermediate population of size , and it introduces an elitism-based nondominated sorting mechanism to ensure the continuous preservation of high-quality solutions. The proposed goal-oriented bi-directional search improvement operator strategy integrates structured crossover and mutation operations, enabling directional guidance and local fine-tuning reconstruction for different optimization objectives while maintaining solution feasibility. This strategy, combined with a staged evolutionary mechanism, achieves a balance between global exploration and local exploitation, significantly enhancing solution diversity and the capability to obtain high-quality solutions. It provides efficient and stable optimization support and an algorithmic foundation for multi-UAV task allocation problems. The pseudocode of the proposed ENSGA-II algorithm is presented in Algorithm 1.