Next Article in Journal
A General Method for Pre-Flight Preparation in Data Collection for Unmanned Aerial Vehicle-Based Bridge Inspection
Previous Article in Journal
Multi-Type Task Assignment Algorithm for Heterogeneous UAV Cluster Based on Improved NSGA-Ⅱ
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Mission Planning Method for Long-Endurance Unmanned Aerial Vehicles: Integrating Heterogeneous Ground Control Resource Allocation

National Key Laboratory of Information Systems Engineering, National University of Defense Technology, Changsha 410003, China
*
Author to whom correspondence should be addressed.
Drones 2024, 8(8), 385; https://doi.org/10.3390/drones8080385
Submission received: 14 July 2024 / Revised: 6 August 2024 / Accepted: 7 August 2024 / Published: 8 August 2024

Abstract

:
Long-endurance unmanned aerial vehicles (LE-UAVs) are extensively used due to their vast coverage and significant payload capacities. However, their limited autonomous intelligence necessitates the intervention of ground control resources (GCRs), which include one or more operators, during mission execution. The performance of these missions is notably affected by the varying effectiveness of different GCRs and their fatigue levels. Current research on multi-UAV mission planning inadequately addresses these critical factors. To tackle this practical issue, we present an integrated optimization problem for multi-LE-UAV mission planning combined with heterogeneous GCR allocation. This problem extends traditional multi-UAV cooperative mission planning by incorporating GCR allocation decisions. The coupling of mission planning decisions with GCR allocation decisions increases the dimensionality of the decision space, rendering the problem more complex. By analyzing the problem’s characteristics, we develop a mixed-integer linear programming model. To effectively solve this problem, we propose a bilevel programming algorithm based on a hybrid genetic algorithm framework. Numerical experiments demonstrate that our proposed algorithm effectively solves the problem, outperforming the advanced optimization toolkit CPLEX. Remarkably, for larger-scale instances, our algorithm achieves superior solutions within 10 s compared with CPLEX’s 2 h runtime.

1. Introduction

In recent years, unmanned aerial vehicles (UAVs) have emerged as advanced aircraft that are widely utilized across various fields, including reconnaissance and surveillance [1,2], communication relay [3,4], disaster management [5,6], logistics delivery [7,8], and remote sensing [9,10]. Based on their endurance capabilities, UAVs are typically categorized into short-endurance and long-endurance types. Due to limitations in payload and energy configuration, short-endurance UAVs (SE-UAVs) exhibit slower flight speeds and shorter ranges [11]. In contrast, long-endurance UAVs (LE-UAVs), with their extended flight durations and greater payload capacities, are capable of executing a more diverse array of extensive missions. Consequently, various countries have developed multiple models of LE-UAVs, such as the United States’s “MQ-1 Predator” and “RQ-4 Global Hawk” and China’s “Twin-Tailed Scorpion” and “Wing Loong” UAVs.
Currently, LE-UAVs possess a certain degree of autonomous intelligence, enabling them to autonomously perform operations such as cruising along a predetermined route and obstacle avoidance [12]. However, LE-UAVs are still incapable of completely relying on onboard computers to autonomously execute complex tasks, such as strikes, reconnaissance, surveillance, and payload delivery. These tasks still require operators to send commands from ground control stations, a mode referred to as “human-in-the-loop” command and control [13]. For instance, the RQ-4 Global Hawk UAV, produced by Northrop Grumman, is operated by both a pilot and an image sensor operator at the ground control station to complete its mission after takeoff [14]. In this study, UAV operators are defined as ground control resources (GCRs).
The multi-UAV mission planning problem has been extensively studied. To enhance the efficiency and intelligent coordination of multiple UAVs in complex environments, researchers have proposed various mission planning algorithms, including mathematical programming algorithms, heuristic algorithms, swarm intelligence algorithms, and contract network algorithms [15,16,17]. However, to the best of our knowledge, most current research is based on the assumption that UAVs possess complete autonomous intelligence and do not require GCR for control. In practical applications, LE-UAVs lack full autonomy and still require GCR control during the operational phase. Although a few studies [18,19,20] have considered the need for GCR control, they have overlooked the heterogeneity of GCRs (differences in task execution capabilities) and the significant impact of fatigue on the effectiveness of UAV mission execution. In fact, the existing literature has demonstrated the crucial influence of GCR capabilities and fatigue levels on UAV operations [21]. Furthermore, to the best of our knowledge, current research typically assumes that the communication link between UAVs and GCRs is a fixed match, meaning one or more UAVs are permanently connected to a specific GCR. In reality, there exists a flexible matching mechanism between LE-UAVs and GCRs. This allows for the selection of the most suitable GCR to control the UAV based on the current task information.
Assume that GCR1 controls tasks N1–N6 with the following profits: {10, 20, 10, 20, 10, 20}, and GCR2 controls tasks N1–N6 with the following profits: {20, 10, 20, 10, 20, 10}. If the influence of the GCRs on UAV task execution is not considered, the mission planning scheme is likely to be suboptimal, as shown in Figure 1a, with an actual profit of only 80. In reality, the capabilities and fatigue levels of GCR significantly impact the profits of UAV task execution. Taking these factors into account, a fixed matching mechanism can yield a slightly better scheme, as shown in Figure 1b, with a profit value of 100. By employing a flexible matching mechanism on this basis, an even better scheme can be obtained, as illustrated in Figure 1c, with a profit value of 120, which is the highest among the three strategies.
Therefore, it is crucial to investigate how to optimize UAV mission planning in conjunction with GCR allocation. Based on customer information and UAV capabilities, each UAV should be assigned a rational sequence of tasks. At the same time, considering the varying capabilities and fatigue levels of each GCR, the most suitable GCR should be allocated for each task assigned to each UAV. This integrated approach aims to maximize the overall profit.
When the heterogeneity of GCR is not considered, the UAV mission planning problem can be simplified to the classic vehicle routing problem (VRP). As indicated in the reference, the VRP is classified as an NP-hard problem [22]. However, when the heterogeneity of GCR is taken into account, the mission planning problem becomes significantly more complex, thus it also remains an NP-hard problem. In this scenario, the decision making for mission planning is coupled with the allocation of heterogeneous GCRs, resulting in an increased dimensionality of the decision space and a more complex problem structure. This introduces substantial challenges in modeling the problem and developing solution algorithms.
The main contributions of this paper can be summarized as follows: (1) Based on the relationship between LE-UAVs and GCR, as well as the practical applications of LE-UAVs, we propose the integrated optimization problem of multi-long-endurance UAV mission planning and multi-heterogeneous GCR assignment, termed H-G&U IOP. This problem is formulated as a mixed-integer linear programming (MILP) model, with the objective of maximizing overall profit. (2) To efficiently solve the MILP, we present a bi-level programming algorithm (BPA) based on a hybrid genetic algorithm (HGA) framework. The upper level of the BPA addresses mission planning, where we determine the task sequence for each UAV. We design three initial solution generation methods according to the problem characteristics and employ a local search–variable neighborhood descent (LS-VND) to improve solution quality. The lower level of the BPA focuses on heterogeneous GCR assignment, deciding the GCR for each task of each UAV. We apply a greedy algorithm combined with local search to calculate the optimal GCR assignment strategy for each chromosome (mission planning solution) in the HGA. Finally, the fitness function of the HGA aggregates the profit values from both levels, iterating until convergence is achieved. (3) Numerical experiments demonstrate that the BPA effectively solves the H-G&U IOP and outperforms the advanced optimization package CPLEX in terms of solution quality. Notably, for larger-scale instances, our algorithm obtains better solutions in 10 s than CPLEX does in 2 h.
The remainder of this paper is structured as follows. Section 2 describes related work on UAV mission planning. Section 3 introduces the problem description and mathematical model of the H-G&U IOP. Section 4 presents the bilevel programming algorithm (BPA) developed to solve this problem. Section 5 details the datasets used, the settings adopted in the BPA, and a comprehensive experimental evaluation. Section 6 discusses the results and provides insights into future research directions. Finally, Section 7 concludes this study.

2. Related Work

The multi-UAV mission planning problem can be formulated as a complex combinatorial optimization problem, incorporating factors such as time constraints, task decomposition, and dynamic reassignment, making it an NP-hard problem [23]. UAV mission planning algorithms can generally be categorized into mathematical programming algorithms, swarm intelligence algorithms, and heuristic algorithms [15].
Mathematical programming algorithms aim to obtain the optimal solution for an objective function under given constraints. Commonly used mathematical programming algorithms include integer programming algorithms [24,25] and dynamic programming algorithms [26,27]. Integer programming algorithms are a class of algorithms designed to solve integer programming problems, encompassing methods such as branch and bound, branch and cut, column generation, and row generation. Dynamic programming algorithms employ a bottom-up approach, solving a problem by breaking it down into a series of subproblems and incrementally solving these subproblems to address the overall problem. Due to their effective management of subproblems and utilization of overlapping subproblems, dynamic programming algorithms excel in handling complex issues.
Swarm intelligence algorithms are a class of optimization algorithms inspired by the collective behavior of natural systems. These algorithms solve complex problems by simulating the collaboration and information-sharing mechanisms observed in biological communities. Common swarm intelligence algorithms include ant colony optimization [28], particle swarm optimization [29], artificial bee colony [30], and the multi-swarm fruit fly optimization algorithm [31].
Heuristic algorithms are intuition- or experience-based methods designed to find feasible solutions to complex problems within a limited timeframe. These methods primarily focus on the objective function of the model, requiring less complexity. This allows for the attainment of good feasible solutions in a short period, making them practically viable for large-scale complex problems. Heuristic algorithms include genetic algorithm (GA) [32], tabu search [33], simulated annealing [34], and fireworks algorithm [16], among others. Notably, genetic algorithms, due to their superior computational robustness and ability to handle complex constraints, have seen various improved versions widely applied to address UAV mission planning problems.
Tal Shima et al. [32] were the pioneers in applying genetic algorithms to solve UAV mission planning problems. They viewed the allocation of multiple UAVs executing multiple missions with multiple objectives as a novel combinatorial optimization problem and proposed a corresponding genetic algorithm to address it.
Wu et al. [35] investigated the problem of finding optimal flyable trajectories for heterogeneous fixed-wing UAVs in multi-objective missions. They proposed a coupled distributed planning approach that integrates task allocation with trajectory generation. To search for the optimal solution, they introduced a distributed genetic algorithm and modified the chromosomal genes to adapt to the heterogeneous characteristics of the UAVs. Ye et al. [36] addressed the cooperative multiple task assignment problem for heterogeneous fixed-wing UAVs performing the Suppression of Enemy Air Defense mission against multiple stationary ground targets. They proposed an improved GA utilizing a multitype gene chromosome encoding strategy to tackle this issue. Yu et al. [37] investigated the cooperative mission planning problem for multiple heterogeneous UAVs in cross-regional joint operations. To address this challenge, they developed an improved GA and introduced a novel chromosome encoding format. Gao et al. [38] addressed the issue of cooperative mission assignment for heterogeneous UAVs by developing a multi-objective optimization model that balances mission profit and loss, taking into account the probabilities of mission success and UAV loss. To tackle this problem, they proposed an improved multi-objective genetic algorithm (MOGA) that employs a natural chromosome encoding format and specially designed genetic operators. This approach effectively handles constraints such as munitions loading capacity, time, and priority.
The existing research literature on UAV mission planning generally assumes that UAVs possess a high degree of autonomous intelligence, thereby overlooking the factor of GCRs. However, in practical military applications, LE-UAVs have limited autonomous intelligence and still require control from GCRs. To the best of our knowledge, the following literature has considered the relationship between GCRs and multiple UAVs in mission planning.
Ramirez et al. [18] conducted an in-depth study on the cooperative multiple-task assignment problem and proposed a new MOGA to address complex mission planning problems involving multiple UAVs and GCRs. This mission planning system takes into account the communication distance constraints between UAVs and GCRs, as well as the resource constraints of GCRs. It not only allocates each task to a specific UAV but also ensures that each UAV is assigned to a specific GCR for control. In reference [19], they model the multi-UAV mission planning problem as a constraint satisfaction problem (CSP) and propose a hybrid multi-objective evolutionary algorithm combined with a constraint satisfaction problem model (MOEA-CSP) to solve it. They extend non-dominated sorting genetic algorithm-II to handle constraints in the fitness function and certain settings, as well as genetic operators, aiming to reduce the search space of the problem. This study also involves a set of heterogeneous UAVs and GCRs and considers communication constraints between UAVs and GCRs, as well as the limited control resources of the GCR. In reference [20], they also addressed the problem of multi-UAV mission planning, with a focus on improving the efficiency and effectiveness of MOEA. They proposed a weighted random generator to enhance MOEA. This generator employs three strategies—arithmetic, harmonic, and geometric—to create new individuals, thereby improving the diversity and convergence performance of the algorithm. The study also considered the constraints between GCRs and UAVs. Moreover, multiple UAVs are coordinated and commanded by various GCRs to execute missions collaboratively.
However, in planning multi-UAV missions, the aforementioned literature overlooks the heterogeneity of GCRs—differences in their capabilities to execute tasks—and the significant impact that fatigue can have on the effectiveness of UAV mission execution. In reality, the existing literature has demonstrated that the capabilities and fatigue levels of GCRs play a crucial role in operating UAV tasks [21]. More skilled GCRs achieve better task outcomes within the same time frame. Moreover, as the workload increases, GCRs experience growing fatigue. With increasing fatigue, the accuracy of GCRs in completing tasks declines rapidly, and consequently, the profit decreases.
A summary of papers on UAV mission planning is provided in Table 1. To the best of our knowledge, current studies assume a fixed 1-to-1 or n-to-1 matching mechanism between UAVs and GCRs, where one or multiple UAVs are continuously controlled by a single GCR. However, in reality, UAVs possess a certain level of autonomous intelligence, allowing them to fly autonomously without being continuously controlled. Ground control intervention is only required when performing tasks at customer locations. Any GCR can connect to any UAV via satellite. There exists a flexible many-to-many (n-to-n) matching mechanism between UAVs and ground control stations, as depicted in Figure 2. Therefore, it is entirely feasible to select the most suitable GCRs to control the UAV based on the current task information.

3. Problem Description and Formulation

In this section, we formally define the H-G&U IOP and then formulate a MILP for the problem.

3.1. Problem Description

Let  G = ( N , A )  be a complete directed graph, where the node set is  N = { 0 , 1 , 2 , , n }  and the arc set is  A = { ( i , j ) | i , j N , i j } . Node 0 represents the airport where the UAV takes off and lands and is the location from which the UAV departs and returns. Let  N ^ = { 1 , 2 , , n }  represent the set of profitable nodes, which we call customers.
Let  T = { 0 , 1 , 2 , , t max }  represent the planning horizon, where  t max = max t max u u U . We consider a set of identical UAVs  u  departing from the airport at the time  t 0 u  to serve a specific customer. The UAV  u  flies for a duration not exceeding  t max u    and returns to the terminal before time  t max u   . Once the UAV starts servicing a customer, the process cannot be interrupted. The service time at customer  i , i.e., the execution time of the task, is  t i . The flight time required to travel from node  i  to node  j  is  t i j . The number of resources required for customer  i  is  R i . The resources that a UAV can carry are  Q u . The capability of UAV  u  for customer  i  can be denoted as  C u i .
Let  S  represent the set of GCRs. During the flight to node  i , the UAV can fly autonomously without GCR control. However, when servicing the customer  i , a GCR is required for the UAV to perform the task. Each GCR has different capabilities for each customer. For the customer, the initial capability value of the GCR can be denoted as  B s i . The capability value of the GCR gradually decreases over time as it continues to operate, following a variation function  φ ( t ) . The definition of  φ ( t )  is provided in Equation (1).
φ ( t ) = 1     β ( t t max ) 2
where  β  is a user-defined parameter that represents the descent rate of the GCR’s capability. When its value is increased, the capability value of GCRs rapidly decreases over time.
The objective of mission planning is to maximize the total profit from all tasks. The profit derived from completing a task depends on the customer’s initial value, the time it takes for the UAV to reach the customer, the UAV’s capability, and the capability of the GCRs.
profit j = v j ( w 1 ψ ( t ) + w 2 C u j + w 3 φ ( t ) B s j )
where  w 1 w 2 , and  w 3  are weights of the arrival time, the capability of the UAV, and the capability of the GCRs, respectively.  v j  represents the initial value of customer  j ψ ( t )  is a monotonically decreasing function of time, which can be defined as in Equation (3).
ψ ( t ) = 1 1 + γ t
where  γ  is a user-defined parameter that represents the descent rate of the task’s profit. When its value is increased, the profit rapidly decreases over time.
Before formulating the mode of the H-G&U IOP, some assumptions are listed as follows:
  • There is only one airport. And all UAVs depart from this airport and return to the same airport after completing their tasks;
  • The service time and resource requirements of customers are certain;
  • GCRs are connected to UAVs via satellites. The GCRs, UAVs, and satellites are in real-time visibility, and their signals are not subject to interference;
  • Within a given time, once a UAV starts serving a customer, it cannot be interrupted.
  • Any GCR can only control one UAV at a time, and each UAV can only be controlled by one GCR at a time.
The fundamental assumptions underlying the H-G&U IOP are designed to simplify its complexity while preserving its core elements. This allows us to focus on investigating how to enhance the overall effectiveness of multiple UAVs through the implementation of rational control strategies.

3.2. Model Formulation

Traditional UAV mission planning primarily focuses on determining the sequence of target point visits and path planning. Building on this foundation, we also incorporate the allocation of GCRs into the decision-making process. Additionally, we take into account constraints such as the UAV’s flight duration and payload capacity. A mathematical model is constructed with the objective of maximizing the total profit from all tasks.

Mathematical Model

max j = 1 n v j ( w 1 ψ ( t j ) + +   w 2 u = 1 | U | i = 0 n C u j x i j u + w 3 s = 1 | S | φ ( t j ) B s j ) w 4 u = 1 | U | a 0 u
Subject to the following:
t j = s = 1 S u = 1 | U | τ = 0 t max τ y s u j τ    j N ^
t j = u = 1 | U | τ = 0 t max τ y s u j τ    j N ^
j = 1 n x 0 , j , u = 1    u U
i = 1 n x i , 0 , u = 1    u U
i = 0 , i k n x i , k , u = j = 0 , j k n x k , j , u    k N ^ , u U
u = 1 | U | j = 0 , j k n x k , j , u = 1    k N ^
t 0 u + t 0 j a j u M ( 1 x 0 j u )    j N ^ , u U
a j u t i t i j a i u M ( x i , j , u 1 )    j N , i N ^ , u U
a j u M i N , i j x i , j , u    j N , u U
a 0 u t max u    i N , u U
k = 1 n | j = 0 j k n R k x k , j , u Q u    u U
a u i s = 1 | S | t = 0 t max t y s u i t = 0    i N ^ , u U
s = 1 | S | u = 1 | U | t = 0 t max y s u i t = 1    i N ^
u = 1 | U | i = 1 n τ = t t i t 1 y s u i τ 1    s S , t T
x i j u { 0 , 1 }    i , j N , u U
y s u i t { 0 , 1 }    s S , u U , i N ^ , t T
a i u 0    i N , u U
The objective function (4) maximizes the overall profit of system, where  w 4 u = 1 | U | a 0 u  is the cost of the UAVs’ flight time from takeoff to landing. Constraint (5) defines the calculation method of parameter  t j  in  ψ ( t j ) . Constraint (6) defines the calculation method of parameter  t j  in  φ ( t j ) . Constraints (7)–(8) ensure that each aircraft departs from the airport and eventually returns to the airport. Constraint (9) is the conservation of flow for each customer node. Constraint (10) ensures that one and only one UAV traverses each customer. Constraints (11)–(12) couple the routing decision with temporal scheduling. Constraint (11) states that if  x 0 , j , u = 1 , then the arrival time of the first customer  j  traversed by UAV  u  from airport 0 must be equal to or greater than the take-off moment  t 0 u  plus the flight time from airport 0 to customer  j . Constraint (12) ensures that if  x 0 , j , u = 1 , i.e., the UAV  u  visits customer  j  directly after departing from customer  i , then the arrival time of UAV  u  at  j  must be equal to or greater than the time of arrival at  i  plus the service time of customer  i  and the flight time between customer  i  and customer  j . Constraint (13) ensures that the variable  a j u  of a virtual node  j  that has not been visited by a UAV  u  is zero. Constraint (14) is the flight duration limit, which ensures that each UAV is able to return to the airport. Constraint (15) ensures that each UAV has the sufficient resources required to satisfy the customer nodes it traverses. Constraint (16) denotes the coupling of the decision of GCR allocation with the time scheduling, where the GCR starts serving the customer  i  on the UAV  u  at a time equal to the time the UAV  u  reaches the customer  i . Constraint (17) ensures that there is one and only one GCR controlling the UAV to serve customer  i . Constraint (18) ensures that the GCR can only control one UAV at the same moment. Constraints (19)–(21) define the domains of decision variables.
It should be noted that the objective function (4) is nonlinear because it contains fractional and exponential terms. For this type of nonlinear objective function, CPLEX is unable to solve it. Therefore, we linearize it by using the precomputation and lookup table method to enable successful solving by CPLEX. Since we adopt a time-indexed modeling approach, time is discrete, i.e.,  T = { 1 , 2 , , t max } . Consequently, we can precompute the values of the fractional and exponential terms at each integer time point and store them. In the model, whenever we need to use a specific time  t , we can directly call the precomputed values.
f a [ t ] = 1 1 + γ t    t T
f b [ t ] = 1 β ( t t max ) 2    t T
ψ ( t ) = s = 1 | S | u = 1 | U | t = 0 t max f a [ t ] y s u j t    j N ^
φ ( t ) = u = 1 | U | t = 0 t max f b [ t ] y s u j t    j N ^
max j = 1 n v j ( w 1 ψ ( t ) + w 2 s = 1 | S | φ ( t ) B s j + w 3 u = 1 | U | i = 0 n C u j x i j u ) w 4 u = 1 | U | a 0 u
The construction of the lookup tables for the exponential and fractional terms is shown in Equations (22) and (23), respectively. Then, when building the model, we directly use the precomputed values from the lookup tables to compute  ψ ( t )  and  φ ( t ) , as shown in Equations (24) and (25). Finally, the objective function can be linearized as shown in Equation (26).

4. Proposed Method

The model developed in Section 3 can be solved for small-scale instances using readily available solvers such as CPLEX. To address large-scale instances, we developed a bi-level programming algorithm (BPA) based on the hybrid genetic algorithm (HGA) framework.
As illustrated in Figure 3, the framework of the entire algorithm is depicted. The upper level of the BPA is dedicated to mission planning. At this level, we determine the task sequence for each UAV. Specifically, we designed three methods for generating initial solutions based on the problem characteristics. Then, we use local search–variable neighborhood descent (LS-VND) to improve the solution quality. Following the upper-level algorithm, we obtain the mission planning schemes for the UAVs. Here, each chromosome represents a mission planning scheme. On this basis, we proceed with the lower level of the BPA—the allocation of heterogeneous GCRs. We employ a greedy algorithm combined with local search to compute the optimal GCR allocation strategy for each chromosome in the HGA. Through the lower-level algorithm, we determine the GCR for each task of each UAV. Finally, we use the fitness function of the HGA to perform a weighted summation of the profit values from both levels. The HGA iterates through operations such as crossover, recombination optimization, mutation, and repair until convergence conditions are met.
In Section 4.1, we introduce the encoding design of the BPA solution. In Section 4.2, we discuss the upper level—mission planning. In Section 4.3, we present the lower level—GCR allocation. Finally, in Section 4.4, we describe the evolutionary process of the HGA.

4.1. Solution Encoding Design

The construction of the solution for the H-G&U IOP problem is illustrated in Figure 4. The solution is represented as  I n d = { T o t a l _ p r o f i t , P r o f i t 1 , P r o f i t 2 , P e n a l t y , C o s t , S _ w o r k ,   C h r o m o s o m e } , where the total profit is calculated as  T o t a l _ p r o f i t = P r o f i t 1 + P r o f i t 2   C o s t . This indicates that the objective value of the solution is the sum of the upper-level profit  P r o f i t 1  and the lower-level profit  P r o f i t 2 , minus the weighted trajectory cost  C o s t . The calculation formula for  P r o f i t 1  is derived from Equation (27) in Section 4.2, while the calculation for  P r o f i t 2  is obtained from Equation (28) in Section 4.3. The variable  S _ w o r k  represents the task sequence that each GCR needs to manage. A chromosome is a collection composed of multiple subchromosomes, denoted as  C h r o m o s o m e =   ( s u b c h r o m o s o m e 1 , s u b c h r o m o s o m e 2 , , s u b c h r o m o s o m e | U | ) . Each subchromosome corresponds to a mission planning scheme for a single UAV.
Within each subchromosome, the task sequence (i.e., the list of customer visits) is encoded using integers, where positive integers represent customers and “0” represents the UAV airport. The variable  s u b _ d e m a n d  indicates the demand of all customers in the planned route of each UAV,  f l i g h t _ d u r a t i o n  represents the flight duration of each UAV from takeoff to landing, and  t i m e _ s e r i e s = { t 0 u , a i u , , a 0 u }  denotes the sequence of times at which each UAV takes off and reaches each node.

4.2. Upper Level—Mission Planning

In the upper level of the BPA, we primarily focus on mission planning. In this level, the objective function  P r o f i t 1  is defined as
P r o f i t 1 = j = 1 n v j ( w 1 ψ ( t j ) + +   w 2 u = 1 | U | i = 0 n C u j x i j u )
where  t j  is derived from  t i m e _ s e r i e s  and other parameter information aligns with the 3.2 MILP model. The variable  x i j u  is decoded from the task sequence in  s u b c h r o m o s o m e u .
In Section 4.2.1, we introduce three initial solution generation strategies. In Section 4.2.2, we employ LS-VND to enhance solution quality and accelerate algorithm convergence. In Section 4.2.3, we discuss the design of crossover, recombination, and mutation operators for the chromosomes.

4.2.1. Initial Solution Generation Strategies

The quality of the initial solutions is crucial for the BPA’s performance and convergence speed. To improve the quality and diversity of the initial population, we designed the following three strategies based on problem characteristics:
A.
Randomized nearest neighbor
The nearest neighbor method is commonly employed in VRP problem. We implemented several improvements based on this approach. Initially, we select a node A with the highest  P r o f i t 1  value from the airport. Subsequently, with a 90% probability, we apply a greedy strategy to choose the next node with the highest value starting from node A, and with a 10% probability, we randomly select a node. This process is repeated until the UAV can no longer support reaching the next node due to range or resource constraints, at which node it returns to the airport.
B.
Randomized average generation
The profit from UAV mission completion is time-dependent; the quicker all tasks are completed, the higher the profit. Therefore, we aim to evenly distribute customers among the available UAVs to complete all tasks as rapidly as possible. The process is as follows:
  • Shuffle the task sequence;
  • Sequentially assign customers to the UAVs in set U until each UAV has been assigned a customer. Remove the selected customer from the sequence;
  • Distribute the remaining customers starting from the first UAV in U, repeating Step 2 until all customers are assigned;
  • Output the task sequence for each UAV.
C.
Distributed nearest neighbor
To ensure an even distribution of tasks among the UAVs and enhance the overall task execution efficiency, we sequentially assign the point with the highest  P r o f i t 1  value from the current node for each UAV. The specific steps are as follows:
  • Following the order in set U, each UAV sequentially selects customers using the nearest neighbor method from the task sequence. Remove the selected customer from the sequence;
  • Repeat Step 1 until the task sequence is empty;
  • Output the task sequence for each UAV.

4.2.2. LS-VND Acceleration Strategy

The objective of the LS-VND is to systematically alter the neighborhood structure set of the current solution during the search process, thereby expanding the search scope and identifying better solutions. For the task sequence of each UAV, we implement the LS-VND algorithm to explore the neighborhood. Local search is a sequence search algorithm used to explore three fundamental neighborhood structures [39], described as follows:
  • Reinsertion: This involves repositioning a node from one location in the task sequence to another, as depicted in Figure 5a. The neighborhood set in this structure is defined by  N h 1 ;
  • Exchange: This involves swapping two nodes between two positions in the task sequence, as shown in Figure 5b. The neighborhood set in this structure is defined by  N h 2 ;
  • Reversal: This involves reversing the nodes within a portion of the task sequence, as illustrated in Figure 5c. The neighborhood set in this structure is defined by  N h 3 .
The specific algorithm is detailed in Algorithm 1:
Algorithm 1 LS-VND framework
1: S 0   initial   sequence ; k 1
2 while   k 3  do
3        S   find   the   best   neighborhood   of   S 0   in N h k
4        if   P r o f i t 1 ( S ) > P r o f i t 1 ( S 0 )  then
5              S 0 S   , k 1
6      else
7              k k + 1
8end while
For each new task sequence generated by LS-VND, we use  P r o f i t 1  to evaluate the quality of these sequences. Additionally, we update the task sequences based on  P r o f i t 1 . From an algorithmic perspective, the adaptability of LS-VND aims to enhance the intensification and diversification of the HGA. The core concept of the LS-VND algorithm is to systematically alter the neighborhood structure of the current solution, expand the search scope, and then identify the current optimal solution through the local search algorithm.

4.2.3. Design of Crossover, Recombination, and Mutation Operators

We designed a series of crossover, recombination, and mutation operators based on the problem characteristics to enhance the solution quality and algorithm efficiency.
  • Subroute Exchange Crossover (SEC): A subroute is randomly selected from a parent chromosome  P 2 , and a fragment  f r a g 2  from this subroute is chosen. The elements in  f r a g 2  are then removed from parent chromosome  P 1 . The fragment  f r a g 2  is randomly inserted into  P 1 , resulting in the offspring chromosome  C 1 . The same method is applied to generate the offspring chromosome  C 2 .
  • Subroute Single-Point Crossover (SSPC): A subroute is randomly selected from parent chromosome  P 2 , and a fragment  f r a g 2  from this subroute is chosen. The elements in  f r a g 2  are then removed from parent chromosome  P 1 . Subsequently, the elements in  f r a g 2  are individually and randomly inserted into  P 1 , resulting in the offspring chromosome  C 1 . The same method is applied to generate the offspring chromosome  C 2 .
After crossover, the newly formed offspring chromosomes undergo further crossover operations, referred to as the recombination optimization operator (ROO). These operations are repeated to find better offspring.
The ROO for the offspring chromosome  C  is detailed in Algorithm 2:
Algorithm 2 ROO framework
1: C   = SEC   ( P 1 *   , P 2 )
2if random() < 0.9 then
3      for _ in range(3) do
4                C = SSPC   ( C , P 2 )
5                if   P r o f i t 1 ( C ) > P r o f i t 1 ( C )  then
6                       C C
7              end if
8      end for
9else
10      for _ in range(3) do
11                C = SSPC   ( C , P 2 )
12                if   P r o f i t 1 ( C ) > P r o f i t 1 ( C )  then
13                       C C
14              end if
15      end for
16end if
To enhance the exploration capability of the genetic algorithm, we designed a simple random mutation: A subroute is randomly selected from the chromosome, a point within this subroute is randomly chosen, and another subroute  S u b 2  is randomly selected. The chosen point is then inserted into a random position within  S u b 2 .

4.3. Lower Level—GCR Allocation

Following the decisions made in the upper-level planning, we obtain the mission planning scheme for each UAV. However, in practical applications, we must also consider the rational allocation of GCR for these missions that require control. Therefore, in the lower level of the BPA, we perform heterogeneous GCR allocation. The objective function  P r o f i t 2  at this level is defined as
P r o f i t 2 = j = 1 n v j ( w 3 s = 1 | S | φ ( t j ) B s j )
where  t j  is derived from  t i m e _ s e r i e s  and other parameter information aligns with the 3.2 MILP model.
We propose a greedy algorithm combined with local search. In Section 4.3.1, we introduce the initial allocation based on the greedy algorithm. Then, in Section 4.3.2, we use local search to improve the solution quality.

4.3.1. Initial Allocation

Based on the upper-level mission planning scheme, we initially adopt a greedy algorithm to obtain the initial allocation scheme for heterogeneous GCRs. The main idea of this method is to allocate GCRs to all tasks in chronological order, prioritizing the GCR that maximizes the profit for tasks that start earlier. The specific steps are as follows:
  • Sort all tasks according to  t i m e _ s e r i e s  in ascending order of start time, creating the task set  G . The GCR set is  S , and  S _ w o r k  is initialized. For the task  g  at the top of  G , the following allocation procedure is applied:
    (a)
    Calculate the profit value of all GCRs for the current task  g . Sort the GCRs in descending order of profit value to obtain the sorted list  S L ;
    (b)
    Allocate the GCR  s  with the highest profit value to the task  g . If this GCR  s  is free, add the allocation result to  S _ w o r k [ s ] . If the GCR is occupied, i.e., the previous task  g p r e  allocated to  s  has not yet finished at the start time of  g , choose the next GCR in  S L  until the allocation is successful;
    (c)
    Proceed to allocate the next task.
  • Iterate the allocation process until all tasks are assigned a GCR, resulting in the final Gantt chart work schedule of GCRs.
The algorithm is as detailed in Algorithm 3:
Algorithm 3 Heterogeneous GCR initial allocation framework
1: S _ w o r k   , G    initial sequence
2 for   g   in   G  do
3      Calculating   the   profit   of   all   s   with   g
4      S L S  sort in descending order according to profit
5      for   s   in   S L  do
6           if   s   is   free   at   the   beginning   of   g  then
7                assign   task   g   to   S _ w o r k [ s ]
8         else
9              continue
10         end if
11    end for
12end for

4.3.2. Local Search Optimization Strategy

After obtaining the initial allocation scheme for heterogeneous GCRs using the greedy algorithm, we employ a local search method based on exchange neighborhood structures  N h 2  to improve the solution quality. The specific steps are as follows:
  • Starting from  t = 0 , traverse all tasks and define the set of overlapping tasks in time as a conflict set  p . As shown in Figure 6, tasks {3, 5, 7} form one conflict set, and tasks {2, 4, 6, 8} form another. Additionally, within a conflict set, a task for the same UAV can only appear once. Ultimately, we obtain a list of all conflict sets  s p ;
  • For each subset  p  in  s p , use a greedy iterative method to search the neighborhood, continuously exchanging their GCRs until no better solution can be found;
  • Based on the local search results, update  S _ w o r k .
The algorithm is as detailed in Algorithm 4:
Algorithm 4 Local search framework
1: S P     Detecting   overlapping   tasks   in   G
2 for   p   in   S P  do
3        r P r o f i t 2 ( p )   , h   1
4        while   h > 0  do
5              h 0
6              p   find   the   best   neighborhood   of   p   in   N h 2
7              r 1 P r o f i t 2 ( p )
8              h r 1 r
9              if   h > 0  then
10                    p p , r r 1
11            end if
12      end while
13end for

4.4. Evolutionary Process of the HGA

We integrated a series of advanced management strategies to enhance the performance of the GA. Based on the problem characteristics, we designed customized crossover, recombination, and mutation operations, as well as repair operators. Additionally, we introduced an elitism strategy and a simulated annealing mechanism, where the simulated annealing mechanism determines the acceptance probability of inferior solutions in the offspring. These mechanisms not only accelerate the convergence of the algorithm but also enhance its exploration capabilities.
This section describes the evolutionary process of the HGA. The crossover, recombination, and mutation operations, which are primarily aimed at the upper level of the BPA, were already discussed in Section 4.2 and are not reiterated here. Instead, we focus on the elitism strategy, fitness function, simulated annealing mechanism, and repair operators.

4.4.1. Elitism Strategy

In the HGA framework, the strategy for generating offspring is diversified and mainly divided into two categories: elitism and general propagation. These strategies work together to balance the exploration and exploitation capabilities of the algorithm.
Elitism: This refers to directly retaining a portion of the chromosomes with the highest fitness in each generation, ensuring that the algorithm does not lose the excellent solutions it has already found. This strategy is a common technique in genetic algorithms and helps in the rapid convergence of the algorithm.
General Propagation: The remaining chromosomes in the population undergo general propagation. A binary tournament selection method is used to select two parent chromosomes, which then undergo crossover, recombination, mutation, and repair operations to generate two offspring chromosomes. The general propagation process is influenced by the crossover rate. This process aims to introduce new gene combinations into the population, helping the algorithm avoid premature convergence to local optima.

4.4.2. Fitness Function

In this section, we introduce the fitness function used in the HGA. This function is utilized to evaluate the performance of solutions or chromosomes, playing a pivotal role in the search process of the genetic algorithm and the quality of the final results. The primary task of the fitness function is to provide a quantifiable score that indicates the quality of the chromosomes, thereby guiding the algorithm towards more optimal solutions.
In the HGA, the fitness mainly consists of two components: one part is the profit, represented as  P r o f i t 1  and  P r o f i t 2  in the solution; the other part is the penalty value for UAV constraint violations ( P e n a l t y ), and the trajectory cost ( C o s t ).
The fitness function can be expressed as
f ( c ) = P r o f i t 1 + P r o f i t 2 C o s t P e n a l t y
The calculation of  P e n a l t y ( c )  is as follows:
P e n a l t y ( c ) = α u = 1 U c max ( 0 , d u Q u ) 2 + u = 1 U c max ( 0 , t u t max u ) 2 i I
where  U c  is the number of UAVs in this chromosome,  d u  represents the resource demand of the UAV  u  task sequence in this chromosome,  Q u  is the rated capacity of the UAV,  t u  is the flying duration of UAV  u t max u  is the rated flight duration of the UAV  u i  is the current iteration count, and  I  is the total number of iterations.
The coefficient  α  is calculated as follows:
α = b U c ( 1 i + 1 ( m 2 ) 2 + ε )
where  b  is the total profit value  T o t a l _ p r o f i t  of the best solution in the current population,  m  is three times the sum of the initial values of all customers, and  ε  is a very small positive number to ensure that the denominator is not zero.
As the iterations progress, the weight of the penalty term  α  gradually increases. This means that the algorithm will progressively increase the penalty for violations while searching for solutions that meet all constraints. This mechanism aids the algorithm in quickly finding solutions that satisfy all constraint conditions.

4.4.3. Simulated Annealing Mechanism

We drew inspiration from the “temperature” parameter concept in the simulated annealing algorithm: the higher the temperature, the greater the probability of accepting poorer solutions; as the temperature gradually decreases, the probability of accepting poorer solutions diminishes, and the algorithm stabilizes, focusing more on finding the optimal or near-optimal solutions. This strategy allows some individuals with lower fitness to remain in the population, maintaining population diversity and avoiding premature convergence.
The probability of accepting poorer solutions,  P accept , is set as follows:
P accept = max ( 0.1 , 0.5 i I )
By skillfully balancing convergence and diversity, our designed hybrid genetic algorithm can efficiently solve the mission planning problem.

4.4.4. Repair Operator

The repair operator plays a crucial role in the HGA, ensuring the diversity and quality of the population. Its task is to adjust solutions that do not meet the constraint conditions (i.e., infeasible chromosomes) so that they satisfy the problem’s actual constraints. In the HGA, we designed a route repair operator.
For the H-G&U IOP problem, the primary goal of the route repair operator is to ensure that each UAV’s route conforms to the constraints regarding demand and flight duration. For those infeasible solutions, the repair operator restores chromosome feasibility by selecting the routes with the highest demand and flight duration and appropriately redistributing nodes between them. This redistribution is accomplished through a series of insertion and deletion operations, as illustrated in Figure 7. For routes that violate the flight duration constraints, the repair operator removes the last customer from the route and reallocates it to the UAV with the shortest flight time; for routes that violate capacity constraints, the operator reallocates the customer to the UAV with the smallest demand.

5. Numerical Experiments and Results

In this section, we conduct numerical experiments using a PC (AMD Ryzen 7-5800H, 2.80 GHz, 16 GB RAM) to validate the effectiveness of our proposed model and the efficiency of our developed solution method. The model is implemented with CPLEX (Version 22.1.1.0) using the concert technology of Python (Version 3.9). The parameter settings are reported in Section 5.1. Section 5.2 presents a comparison between CPLEX and BPA. Section 5.3 reveals the performance of BPA on large-scale instances and its own ablation experiments. Section 5.4 conducts a sensitivity analysis.

5.1. Parameters Setting

Since the mission planning model is similar to the VRP, the parameters for airports, customer locations, and demands in our instances are based on Solomon’s benchmark data [Solomon, 1987]. The customer’s initial value  v j  is set to 10. The task execution time  t j  for customers is randomly generated, referencing actual application data, and ranges between 11 and 15 min. We categorize all tasks into four types, with the GCR capability values for different customer types being random numbers from {0.7, 0.8, 0.9, 1.0}. The UAVs are classified into two types, carrying resource quantities of 150 and 200, respectively, and their capabilities for the four customer types are {1.0, 1.0, 0.7, 0.7} and {0.7, 0.7, 1.0, 1.0}, respectively, with an endurance of 300 min each. The parameter  γ  for the monotonically decreasing function  ψ ( t )  of customer profit is set to 0.004, and the parameter  β  for the monotonically decreasing function  φ ( t )  of GCR capability is set to 0.4, as illustrated in Figure 8.
The population size for the HGA is set to 100, with a crossover rate of 0.9, a mutation rate of 0.1, and an elite individual proportion of 3%, totaling three individuals. We observed that for small-scale problems (where the number of customers is fewer than or equal to five), the HGA converges to the optimal value after just one iteration. Therefore, for small-scale problems, the number of iterations for the HGA is set to 1. For problems with more than 10 customers, we set the maximum number of iterations to 300. If the best solution does not improve over 30 consecutive iterations, the algorithm stops and outputs the current best solution. The solving time for BPA is denoted as  T b , and the resulting solution is denoted as  Z b .

5.2. Comparison between CPELX and BPA

The comparison results between BPA and CPLEX are shown in Table 2, which provides the objective values ” Z ”, computation time “ T ”, Gap =  ( Z b Z c ) / Z c , and Time ratio =  ( T b / T c ) . Here,  Z c  represents the solutions obtained by CPLEX, while  Z b  represents the solutions provided by BPA;  T c  is the computation time of CPLEX in seconds, and  T b  is the computation time of BPA in seconds. We tested four types of instances containing 5, 10, 15, and 20 customers, totaling 16 instances, as shown in Table 2. Each instance ID consists of four parts: the number of customers, the number of GCRs, the number of UAVs, and the instance index. “A” denotes small-scale instances, referring specifically to instances with a number of customers less than 20, while “B” denotes large-scale instances, which are characterized by having 20 or more customers. When the number of customers in instances is “5”, both CPLEX and BPA can obtain the optimal solution, but the average time ratio is 6.564%, demonstrating that BPA solves the problem faster. When the number of customers reaches “10”, for instances 1 and 5, both CPLEX and BPA can achieve the optimal solution, but the average time ratio is 0.183%, indicating that BPA’s solving speed far exceeds that of CPLEX. For instances 2 to 4, CPLEX fails to obtain the optimal solution within two hours, whereas BPA can still quickly find a solution, with results equal to CPLEX. When the number of customers reaches “15”, CPLEX fails to obtain an optimal solution for any instance. In contrast, BPA can achieve solutions that surpass those provided by CPLEX in a remarkably shorter time frame. For all small-scale instances, the average GAP is 0.033%, and the average time ratio is 2.248%. Compared with CPLEX, BPA can obtain better solutions in a shorter time.
Moreover, it can be observed that when facing the three instances with 20 customers, CPLEX is unable to find a feasible solution within two hours for two of these instances. This indicates that CPLEX struggles to solve the proposed MILP model for large-scale instances. However, BPA can still obtain a solution in a relatively short time.

5.3. Performance of the BPA

In scenarios where CPLEX cannot obtain a solution for large-scale instances, our proposed method is capable of providing feasible solutions within a reasonable time frame. As shown in Table 2, we list a total of 36 instances ranging from 20 to 100 customers to test large-scale instances, where the meaning of each instance ID is the same as that in Table 1 for small-scale instances. As depicted in Figure 9, the planning results for each instance include mission planning schemes and a Gantt chart work schedule of GCRs. Additionally, we conducted a comparative experiment to analyze our proposed BPA algorithm, specifically between the version that includes the LS-VND module (Section 4.2.2) and a simplified version without the LS-VND module. Here,  Z w  is the solution obtained by the BPA without LS-VND. Time ratio =  T b / T w , where  T w  is the computing time of the BPA (without LS-VND) in seconds. From the data in Table 3, we can observe that the BPA solves the problem in a relatively short time. Moreover, in all instances, the LS-VND module significantly accelerates the algorithm’s convergence speed and improves the solution quality. The average time ratio between BPA (without LS-VND) and BPA is 83.82%, and the average gap is 0.36%. As shown in Figure 10a, the average gap for all instances of each scale is greater than 0, with the maximum gap occurring when the instance scale is 20. And the average solving time of BPA is less than that of BPA (without LS-VND) for instances of all scales. For instance, for “B100-14-14-1”, the convergence curves of the two algorithms are shown in Figure 11, where we can observe that LS-VND significantly improves the quality of the initial solution, accelerates the convergence speed, enhances convergence accuracy, and shortens the convergence time.
We also compared the solution quality of the two algorithms under the same time constraints, with the results for nine instances ranging from 80 to 100 customers shown in Table 4. The experimental results indicate that the LS-VND module significantly increases the algorithm’s convergence speed and improves the quality of the solutions. The enhancement effect of the LS-VND module is visually evident in Figure 11.

5.4. Sensitivity Analysis

In this section, we primarily analyze parameter  β  of the GCR capability variation function. As shown in Table 5, we set up a total of 10 instances with a customer scale of 90, where  β  ranges from 0 to 0.9. The meaning of each instance ID is the same as that in Table 1 for small-scale instances. Here  V s  represents the initial total value of customers served within the first 100 s in the planning scheme.  Z s  represents the profit within the first 100 s. Aside from the different  β  parameters, the other parameter information for the 10 instances is entirely consistent. For a straightforward comparison, we set the initial value of customers  v j  as random integers between 10 and 15, with other parameter information configured according to the method described in Section 5.1.
According to the data presented in Table 5, we observe that as parameter  β  increases, the total value of customers served within the first 100 s ( V s ) also increases, along with the profit within the first 100 s ( Z s ), as illustrated in Figure 12a. This indicates that the faster the decline in GCR’s capability, the more the system tends to prioritize serving high-value customers earlier. Additionally, we note that the overall profit ( Z b ) for all tasks decreases, while the proportion of profit within the first 100 s ( Z s / Z b ) increases, as shown in Figure 12b. This implies that if the degradation effect of GCR’s capability is significant, higher-profit tasks should be allocated to GCR in the initial service phase. It is noteworthy that when  β  = 0.9, the increase in  V s  encounters a bottleneck. Our analysis suggests that this is because the advantage of prioritizing high-value customers can no longer offset the losses induced by the decline in GCR’s capability.

6. Discussion

The experimental results demonstrate that for small-scale problems, the quality of solutions obtained by our proposed BPA surpasses those generated by CPLEX after 2 h of computation. Additionally, the average computation time required by our algorithm is only 2.248% of that needed by CPLEX. Moreover, BPA is capable of solving some large-scale problems (100 customers, 14 GCRs, and 14 UAVs) within reasonable computation times. This clearly illustrates the efficacy of BPA in addressing the H-G&U IOP.
Additionally, our method comprehensively considers the capability characteristics of heterogeneous GCRs and the degradation effects of GCR operational abilities during the multi-UAV mission planning phase. This allows for a tighter integration of human and UAV systems, thereby fully leveraging the overall efficiency of the human-machine system. The proposed planning method can yield greater practical profits and holds significant real-world implications.
Future research directions may include the following:
  • The model and method proposed in this paper are not only applicable to long-endurance UAVs but also potentially extendable to other multirobot systems, such as multiple unmanned surface vehicles or multiple unmanned ground vehicles;
  • The modeling approach for GCR capabilities could be further refined. However, using our model, any form of function can be linearized for a solution;
  • In practical applications, unforeseen situations such as dynamic changes in customer demand or UAV malfunctions may arise, necessitating mission replanning. This requires a faster response time, and the speed of existing algorithms for large-scale scenarios may be insufficient. Exploring solutions using deep reinforcement learning methods is worth considering.

7. Conclusions

The primary contributions of this study are as follows: First, we conduct an exploratory investigation into the integrated optimization problem of multi-long-endurance unmanned aerial vehicle (UAV) mission planning and multi-heterogeneous ground control resource assignment (H-G&U IOP). The H-G&U IOP comprehensively considers elements not previously examined in the literature, including the heterogeneity of ground control resources (GCR) and the impact of degradation effects on multi-UAV mission planning, as well as the flexible matching mechanism between GCRs and UAVs. Furthermore, the H-G&U IOP is formulated as a mixed-integer linear programming model aimed at maximizing the profit from completing all tasks. To efficiently solve the proposed model, we present a bi-level programming algorithm (BPA) based on a hybrid genetic algorithm framework. The BPA incorporates heuristic methods such as local search–variable neighborhood descent and a greedy algorithm combined with local search. Subsequently, the effectiveness of the proposed solution method is validated through numerical experiments. Finally, we provide a detailed analysis of the experimental results and discuss potential directions for future research.

8. Patents

A patent application has been filed for the work reported in this manuscript: “Method, Device, and Computer Equipment for Long-Endurance UAV Mission Planning”.

Author Contributions

Writing—original draft, K.L. (Kai Li); supervision and resources, C.Z.; project administration, X.P.; data curation, L.X.; validation, K.L. (Kai Liu). All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Defense Basic Scientific Research Program (Grant No. WDZC20235250411).

Data Availability Statement

All data included in this study are available upon request by contact with the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

Indices and Sets:
i , j , k index of a node.
u index of a UAV.
s index of a GCR.
t index of time.
N set of nodes.
N ^ set of customers.
S set of GCRs.
T planning horizon.
Parameters:
t i service   time   of   customer   i .
v i value   of   customer   i  at the initial time.
R i demand   of   customer   i .
t i j travel   time   between   node   i   and   j .
n number of customers.
Q u capacity   of   UAV   u .
t 0 u time   point   that   UAV   u  left the airport.
t max u flight duration of each UAV.
t max longest flight duration of any UAV.
B s i GCR   s s   capability   for   customer   i .
C u i UAV   u s   capability   for   customer   i .
M a sufficiently large positive number.
Decision variables:
a i u time   point   that   UAV   u   arrives   at   node   i .
x i j u binary   equals   one   if   UAV   u   goes   from   node   i   to   node   j ; otherwise, equals zero.
y s u i t binary   equals   one   if   GCR   s   starts   the   task   control   of   UAV   u   at   the   task   i   at   time   t ; otherwise, equals zero.

References

  1. Lu, Q.W.; Qiu, Y.F.; Guan, C.T.; Wang, H.Y.; Zhu, M.Q.; Xu, B.; Li, W.J.; Fan, Z. Coordinated Multi-UAV Reconnaissance Scheme for Multiple Targets. Appl. Sci. 2023, 13, 10920. [Google Scholar] [CrossRef]
  2. Zhang, X.; Zhao, W.; Liu, C.; Li, J. Distributed Multi-Target Search and Surveillance Mission Planning for Unmanned Aerial Vehicles in Uncertain Environments. Drones 2023, 7, 355. [Google Scholar] [CrossRef]
  3. Fan, B.; Jiang, L.; Chen, Y.Y.; Zhang, Y.; Wu, Y. UAV Assisted Traffic Offloading in Air Ground Integrated Networks With Mixed User Traffic. IEEE Trans. Intell. Transp. Syst. 2022, 23, 12601–12611. [Google Scholar] [CrossRef]
  4. Ng, J.S.; Lim, W.Y.B.; Dai, H.N.; Xiong, Z.H.; Huang, J.Q.; Niyato, D.; Hua, X.S.; Leung, C.; Miao, C.Y. Joint Auction-Coalition Formation Framework for Communication-Efficient Federated Learning in UAV-Enabled Internet of Vehicles. IEEE Trans. Intell. Transp. Syst. 2021, 22, 2326–2344. [Google Scholar] [CrossRef]
  5. Saif, A.; Dimyati, K.; Noordin, K.A.; Mosali, N.A.; Deepak, G.C.; Alsamhi, S.H. Skyward bound: Empowering disaster resilience with multi-UAV-assisted B5G networks for enhanced connectivity and energy efficiency. Internet Things 2023, 23, 21. [Google Scholar] [CrossRef]
  6. Zhang, J.; Zhu, Y.F.; Wang, T.; Wang, W.P.; Wang, R.; Li, X.B. An Improved Intelligent Auction Mechanism for Emergency Material Delivery. Mathematics 2022, 10, 2184. [Google Scholar] [CrossRef]
  7. Liu, Y.; Liu, Z.; Shi, J.M.; Wu, G.H.; Pedrycz, W. Two-Echelon Routing Problem for Parcel Delivery by Cooperated Truck and Drone. IEEE Trans. Syst. Man Cybern.-Syst. 2021, 51, 7450–7465. [Google Scholar] [CrossRef]
  8. Song, B.D.; Park, K.; Kim, J. Persistent UAV delivery logistics: MILP formulation and efficient heuristic. Comput. Ind. Eng. 2018, 120, 418–428. [Google Scholar] [CrossRef]
  9. Cheng, J.; Deng, C.J.; Su, Y.Z.; An, Z.Y.; Wang, Q. Methods and datasets on semantic segmentation for Unmanned Aerial Vehicle remote sensing images: A review. ISPRS-J. Photogramm. Remote Sens. 2024, 211, 1–34. [Google Scholar] [CrossRef]
  10. Osco, L.P.; Marcato, J., Jr.; Ramos, A.P.M.; Jorge, L.A.D.; Fatholahi, S.N.; Silva, J.D.; Matsubara, E.T.; Pistori, H.; Gonçalves, W.N.; Li, J. A review on deep learning in UAV remote sensing. Int. J. Appl. Earth Obs. Geoinf. 2021, 102, 21. [Google Scholar] [CrossRef]
  11. Alghamdi, Y.; Munir, A.; La, H.M. Architecture, Classification, and Applications of Contemporary Unmanned Aerial Vehicles. IEEE Consum. Electron. Mag. 2021, 10, 9–20. [Google Scholar] [CrossRef]
  12. Hu, H.H.; Zhou, H.F.; Li, J.H.; Li, K.R.; Pan, B. Automatic and Intelligent Line Inspection using UAV based on Beidou Navigation System. In Proceedings of the 6th International Conference on Information Science and Control Engineering (ICISCE), Shanghai, China, 20–22 December 2019; pp. 1004–1008. [Google Scholar]
  13. Niu, Y.; Shen, L.; Li, J.; Wang, X. Key scientific problems in cooperation control of unmanned-manned aircraft systems. Sci. Sin. Informationis 2019, 49, 538–554. [Google Scholar] [CrossRef]
  14. Gilmore, M. RQ-4B Global Hawk Block 30 Operational Test and Evaluation Report. Department of Defense. Available online: http://pogoarchives.org/m/ns/pentagonot-and-e-eval-rq-4b-global-hawk-20110526.pdf (accessed on 1 September 2012).
  15. Liu, B.; Wang, S.; Li, Q.; Zhao, X.; Pan, Y.; Wang, C. Task Assignment of UAV Swarms Based on Deep Reinforcement Learning. Drones 2023, 7, 297. [Google Scholar] [CrossRef]
  16. Liu, X.; Jing, T.; Hou, L. An FW–GA Hybrid Algorithm Combined with Clustering for UAV Forest Fire Reconnaissance Task Assignment. Mathematics 2023, 11, 2400. [Google Scholar] [CrossRef]
  17. Poudel, S.; Moh, S. Task assignment algorithms for unmanned aerial vehicle networks: A comprehensive survey. Veh. Commun. 2022, 35, 29. [Google Scholar] [CrossRef]
  18. Ramirez-Atencia, C.; Bello-Orgaz, G.; R-Moreno, M.D.; Camacho, D. Solving complex multi-UAV mission planning problems using multi-objective genetic algorithms. Soft Comput. 2017, 21, 4883–4900. [Google Scholar] [CrossRef]
  19. Ramirez-Atencia, C.; Camacho, D. Constrained multi-objective optimization for multi-UAV planning. J. Ambient. Intell. Humaniz. Comput. 2019, 10, 2479–2496. [Google Scholar] [CrossRef]
  20. Ramirez-Atencia, C.; Del Ser, J.; Camacho, D. Weighted strategies to guide a multi-objective evolutionary algorithm for multi-UAV mission planning. Swarm Evol. Comput. 2019, 44, 480–495. [Google Scholar] [CrossRef]
  21. Feng, L.; Wiltsche, C.; Humphrey, L.; Topcu, U. Synthesis of Human-in-the-Loop Control Protocols for Autonomous Systems. IEEE Trans. Autom. Sci. Eng. 2016, 13, 450–462. [Google Scholar] [CrossRef]
  22. Lenstra, J.K.; Kan, A.H.G.R. Complexity of vehicle routing and scheduling problems. Networks 1981, 11, 221–227. [Google Scholar] [CrossRef]
  23. Cui, W.; Li, R.; Feng, Y.; Yang, Y. Distributed Task Allocation for a Multi-UAV System with Time Window Constraints. Drones 2022, 6, 226. [Google Scholar] [CrossRef]
  24. Casbeer, D.W.; Holsapple, R.W. Column generation for a UAV assignment problem with precedence constraints. Int. J. Robust Nonlinear Control 2011, 21, 1421–1433. [Google Scholar] [CrossRef]
  25. Kim, J.; Morrison, J.R. On the Concerted Design and Scheduling of Multiple Resources for Persistent UAV Operations. J. Intell. Robot. Syst. 2014, 74, 479–498. [Google Scholar] [CrossRef]
  26. Escribano, J.; Chang, H.; Angeloudis, P. Integrated Path Planning and Task Assignment Model for On-Demand Last-Mile UAV-Based Delivery. In Proceedings of the 13th International Conference on Computational Logistics (ICCL), Barcelona, Spain, 21–23 September 2022; pp. 198–213. [Google Scholar]
  27. Jin, Z.P.; Shima, T.; Schumacher, C.J. Optimal scheduling for refueling multiple autonomous aerial vehicles. IEEE Trans. Robot. 2006, 22, 682–693. [Google Scholar] [CrossRef]
  28. Yan, F.L. Gauss interference ant colony algorithm-based optimization of UAV mission planning. J. Supercomput. 2020, 76, 1170–1179. [Google Scholar] [CrossRef]
  29. Kim, J.; Oh, H.; Yu, B.; Kim, S. Optimal Task Assignment for UAV Swarm Operations in Hostile Environments. Int. J. Aeronaut. Space Sci. 2021, 22, 456–467. [Google Scholar] [CrossRef]
  30. Sharma, A.; Shoval, S.; Sharma, A.; Pandey, J.K. Path Planning for Multiple Targets Interception by the Swarm of UAVs based on Swarm Intelligence Algorithms: A Review. IETE Tech. Rev. 2022, 39, 675–697. [Google Scholar] [CrossRef]
  31. Luo, R.; Zheng, H.; Guo, J. Solving the Multi-Functional Heterogeneous UAV Cooperative Mission Planning Problem Using Multi-Swarm Fruit Fly Optimization Algorithm. Sensors 2020, 20, 5026. [Google Scholar] [CrossRef] [PubMed]
  32. Shima, T.; Rasmussen, S.J.; Sparks, A.G.; Passino, K.M. Multiple task assignments for cooperating uninhabited aerial vehicles using genetic algorithms. Comput. Oper. Res. 2006, 33, 3252–3269. [Google Scholar] [CrossRef]
  33. Silva, A.; Coelho, L.C.; Darvish, M. Quadratic assignment problem variants: A survey and an effective parallel memetic iterated tabu search. Eur. J. Oper. Res. 2021, 292, 1066–1084. [Google Scholar] [CrossRef]
  34. Wu, Y.; Liang, T.J.; Gou, J.Z.; Tao, C.G.; Wang, H.F. Heterogeneous Mission Planning for Multiple UAV Formations via Metaheuristic Algorithms. IEEE Trans. Aerosp. Electron. Syst. 2023, 59, 3924–3940. [Google Scholar] [CrossRef]
  35. Wu, W.N.; Wang, X.G.; Cui, N.G. Fast and coupled solution for cooperative mission planning of multiple heterogeneous unmanned aerial vehicles. Aerosp. Sci. Technol. 2018, 79, 131–144. [Google Scholar] [CrossRef]
  36. Ye, F.; Chen, J.; Tian, Y.; Jiang, T. Cooperative Task Assignment of a Heterogeneous Multi-UAV System Using an Adaptive Genetic Algorithm. Electronics 2020, 9, 687. [Google Scholar] [CrossRef]
  37. Yu, X.Y.; Gao, X.H.; Wang, L.; Wang, X.W.; Ding, Y.; Lu, C.; Zhang, S. Cooperative Multi-UAV Task Assignment in Cross-Regional Joint Operations Considering Ammunition Inventory. Drones 2022, 6, 77. [Google Scholar] [CrossRef]
  38. Gao, X.H.; Wang, L.; Yu, X.Y.; Su, X.C.; Ding, Y.; Lu, C.; Peng, H.J.; Wang, X.W. Conditional probability based multi-objective cooperative task assignment for heterogeneous UAVs. Eng. Appl. Artif. Intell. 2023, 123, 22. [Google Scholar] [CrossRef]
  39. Zhen, L.; Ma, C.L.; Wang, K.; Xiao, L.Y.; Zhang, W. Multi-depot multi-trip vehicle routing problem with time windows and release dates. Transp. Res. Part E Logist. Transp. Rev. 2020, 135, 21. [Google Scholar] [CrossRef]
Figure 1. Comparison of three strategies: (a) Ignoring the influence of GCRs. (b) Considering the influence of GCRs with a fixed matching mechanism. (c) Considering the influence of GCRs with a flexible matching mechanism.
Figure 1. Comparison of three strategies: (a) Ignoring the influence of GCRs. (b) Considering the influence of GCRs with a fixed matching mechanism. (c) Considering the influence of GCRs with a flexible matching mechanism.
Drones 08 00385 g001
Figure 2. n-to-n matching mechanism.
Figure 2. n-to-n matching mechanism.
Drones 08 00385 g002
Figure 3. BPA algorithm framework.
Figure 3. BPA algorithm framework.
Drones 08 00385 g003
Figure 4. Construction of solution to H-G&U IOP.
Figure 4. Construction of solution to H-G&U IOP.
Drones 08 00385 g004
Figure 5. Three methods of local search: (a) reinsertion; (b) exchange; (c) reversal.
Figure 5. Three methods of local search: (a) reinsertion; (b) exchange; (c) reversal.
Drones 08 00385 g005
Figure 6. Conflict set partitioning method.
Figure 6. Conflict set partitioning method.
Drones 08 00385 g006
Figure 7. The flow of route repair operators.
Figure 7. The flow of route repair operators.
Drones 08 00385 g007
Figure 8. Function graph of  ψ ( t )  and  φ ( t ) .
Figure 8. Function graph of  ψ ( t )  and  φ ( t ) .
Drones 08 00385 g008
Figure 9. “B20-3-3-1” planning scheme: (a) UAV routes; (b) Gantt chart work schedule of GCRs.
Figure 9. “B20-3-3-1” planning scheme: (a) UAV routes; (b) Gantt chart work schedule of GCRs.
Drones 08 00385 g009
Figure 10. Comparison between BPA(without LS-VND) and BPA: (a) Performance of the two algorithms in large-scale instances. (b) Performance of the two algorithms with the one-minute time limit.
Figure 10. Comparison between BPA(without LS-VND) and BPA: (a) Performance of the two algorithms in large-scale instances. (b) Performance of the two algorithms with the one-minute time limit.
Drones 08 00385 g010
Figure 11. Convergence curves of the two algorithms in instance ”B100-14-14-3”.
Figure 11. Convergence curves of the two algorithms in instance ”B100-14-14-3”.
Drones 08 00385 g011
Figure 12. Sensitivity analysis: (a) Variation curves of  V s  and  Z s  across 10 instances. (b) Variation curves of  Z b  and the ratio  Z s / Z b  across 10 instances.
Figure 12. Sensitivity analysis: (a) Variation curves of  V s  and  Z s  across 10 instances. (b) Variation curves of  Z b  and the ratio  Z s / Z b  across 10 instances.
Drones 08 00385 g012
Table 1. Summary of UAV mission planning.
Table 1. Summary of UAV mission planning.
AuthorsGCR
Constraint
Heterogeneous
GCR
Matching
Mechanism
Solution
Tal et al. (2006) [32] GA
Wu et al. (2018) [35] Distributed GA
Ye et al. (2020) [36] Improved GA
Yu et al. (2022) [37] Improved GA
Gao et al. (2023) [38] MOGA
Ramirez et al.
(2017) [18]
  n-to-1 fixed matchNew MOGA
Ramirez et al.
(2019) [19]
  n-to-1 fixed matchMOEA-CSP
Ramirez et al.
(2019) [20]
  n-to-1 fixed matchImproved MOEA
This research     n-to-n flexible matchBilevel programming algorithm
In this table, a vacancy indicates that the literature does not consider this condition. ‘√’ indicates that the literature considers this condition. GA = genetic algorithm; MOGA = multi-objective genetic algorithm; MOEA = multi-objective evolutionary algorithm; MOEA-CSP = multi-objective evolutionary algorithm combined with a constraint satisfaction problem model.
Table 2. Performance of the Cplex and the BPA.
Table 2. Performance of the Cplex and the BPA.
Instance IDCplexBPAGAP (%)Time Ratio (%)
  T c   Z c gap   T b   Z b   ( Z b Z c ) / Z c   T b / T c
A5-2-2-11.042127.68200.066127.68206.334
A5-2-2-21.011127.33800.044127.33804.352
A5-2-2-30.954140.00300.100140.003010.482
A5-2-2-40.855139.33400.055139.33406.433
A5-2-2-50.805140.75400.042140.75405.217
A10-2-2-1739.709260.66602.093260.66600.283
A10-2-2-27200259.9860.0302.744259.98600.038
A10-2-2-37200253.4170.0344.590253.41700.064
A10-2-2-47200253.3840.0325.267253.38400.073
A10-2-2-54036.055262.25403.336262.25400.083
A15-2-2-17200362.2050.0598.793362.6630.1260.122
A15-2-2-27200371.2210.0523.951371.4670.0660.055
A15-2-2-37200380.9750.0654.042381.1580.0480.056
A15-2-2-47200381.9280.0523.052382.5680.1680.042
A15-2-2-57200359.4710.0585.854359.7960.0900.081
B20-3-3-17200none 7.930525.222 0.110
B20-3-3-27200509.5100.08212.680526.0233.2410.176
B20-3-3-37200none 13.987494.905 0.194
Table 3. Performance of BPA (without LS-VND) and BPA for large-scale instances.
Table 3. Performance of BPA (without LS-VND) and BPA for large-scale instances.
Instance IDBPA (without LS-VND)BPAGAP (%)Time Ratio (%)
  T w   Z w   T b   Z b   ( Z b Z w ) / Z w   T b / T w
B20-3-3-17.194520.5527.930525.2220.897110.231
B20-3-3-215.798525.00212.680526.0230.19480.263
B20-3-3-313.588490.84813.987494.9050.827102.936
B30-4-4-118.965761.74014.642768.1870.84677.205
B30-4-4-222.934739.48418.059741.1610.22778.743
B30-4-4-320.845765.43016.272769.0200.46978.062
B40-6-6-133.4781043.77824.9011046.5590.26674.380
B40-6-6-232.132996.52831.2341002.3320.58297.205
B40-6-6-337.1381045.92725.5311050.4990.43768.746
B50-7-7-144.9551309.58138.6371315.0820.42085.946
B50-7-7-233.3991303.35033.0101307.2960.30398.835
B50-7-7-348.5291297.64739.5921298.8160.09081.584
B60-8-8-166.1821547.25549.3291553.3380.39374.535
B60-8-8-264.2891544.07446.4351553.8910.63672.229
B60-8-8-354.1451564.05953.2681568.1410.26198.380
B70-10-10-182.9431752.44154.5341757.8550.30965.749
B70-10-10-299.4101827.90173.3571832.8170.26973.792
B70-10-10-373.0531848.59967.3731855.7390.38692.225
B80-11-11-1107.4902103.97487.1892106.5670.12381.114
B80-11-11-2112.6422078.99588.6082083.9170.23778.663
B80-11-11-3113.5062053.08193.4212058.5090.26482.305
B90-12-12-1153.4592328.142112.5642337.0020.38173.351
B90-12-12-2133.9982384.167115.8962388.7740.19386.491
B90-12-12-3135.7752376.958108.7932378.0130.04480.127
B100-14-14-1197.5812546.527180.9392556.1220.37791.577
B100-14-14-2241.9092634.321207.0482637.7450.13085.589
B100-14-14-3231.0462612.914214.6102617.3160.16892.886
Table 4. The performance of BPA (without LS-VND) and BPA in a limited time of 60 s.
Table 4. The performance of BPA (without LS-VND) and BPA in a limited time of 60 s.
Instance IDBPA (without LS-VND)BPAGAP(%)
  Z w   Z b   ( Z b Z w ) / Z w   T ( s )
B80-11-11-12101.5192106.5670.24060 s
B80-11-11-22076.3992083.9170.36260 s
B80-11-11-32049.2752058.5090.45160 s
B90-12-12-12317.7012332.4570.63760 s
B90-12-12-22378.8472385.4150.27660 s
B90-12-12-32373.7102377.9480.17960 s
B100-14-14-12541.5712550.2470.34160 s
B100-14-14-22631.0492632.1770.04360 s
B100-14-14-32609.9352613.2710.12860 s
Table 5. Comparison of different  β .
Table 5. Comparison of different  β .
Instance ID   β   V s   Z s   Z b   Z s / Z b ( % )
C90-12-12-005501532.5982959.34151.788
C90-12-12-10.15541544.5182945.62952.434
C90-12-12-20.25581549.8252918.71253.100
C90-12-12-30.35611557.1382905.73353.588
C90-12-12-40.45681570.5662889.07654.362
C90-12-12-50.55731588.1302865.33255.426
C90-12-12-60.65771590.8652854.47055.732
C90-12-12-70.75831607.1652848.27856.426
C90-12-12-80.85951635.9932827.68857.856
C90-12-12-90.95951634.2872791.47458.546
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, K.; Zhu, C.; Pan, X.; Xu, L.; Liu, K. A Mission Planning Method for Long-Endurance Unmanned Aerial Vehicles: Integrating Heterogeneous Ground Control Resource Allocation. Drones 2024, 8, 385. https://doi.org/10.3390/drones8080385

AMA Style

Li K, Zhu C, Pan X, Xu L, Liu K. A Mission Planning Method for Long-Endurance Unmanned Aerial Vehicles: Integrating Heterogeneous Ground Control Resource Allocation. Drones. 2024; 8(8):385. https://doi.org/10.3390/drones8080385

Chicago/Turabian Style

Li, Kai, Cheng Zhu, Xiaogang Pan, Long Xu, and Kai Liu. 2024. "A Mission Planning Method for Long-Endurance Unmanned Aerial Vehicles: Integrating Heterogeneous Ground Control Resource Allocation" Drones 8, no. 8: 385. https://doi.org/10.3390/drones8080385

APA Style

Li, K., Zhu, C., Pan, X., Xu, L., & Liu, K. (2024). A Mission Planning Method for Long-Endurance Unmanned Aerial Vehicles: Integrating Heterogeneous Ground Control Resource Allocation. Drones, 8(8), 385. https://doi.org/10.3390/drones8080385

Article Metrics

Back to TopTop