1. Introduction
Heterogeneous multi-UAV systems, by leveraging their diverse capabilities and coordinating complementary functions, offer unmistakable advantages in a wide range of applications. From disaster relief and surveillance to agricultural and forest monitoring, and beyond, they extend their operational scope beyond the limitations inherent in a single UAV, such as voyage duration and resource constraints [
1,
2]. However, to fully harness these benefits, meticulous task planning is essential as the highest-level strategy for deployments. This planning entails optimizing executable solutions that align with specific task requirements and the capabilities of each unit [
3,
4,
5].
Despite the advantages of heterogeneous multi-UAV systems, effective deployment hinges on managing inevitable uncertainties during task execution. The uncertainties can be mainly categorized into environmental (e.g., meteorological condition changes [
6], sudden ground threats [
7]), task-related (e.g., task location ambiguities [
8,
9], resource needs [
10], and time window variations [
11]), and UAV-specific (e.g., inaccuracies in dynamic models [
3,
12,
13,
14], payload performance inconsistencies [
3,
15], and unexpected UAV failures [
16,
17]). It is imperative for task planning to comprehensively consider the potential consequences of these uncertainties and accordingly adapt the task plans, thereby minimizing their disruptive effects on the overall task execution.
The flight time of a UAV is calculated based on the estimated task path [
18,
19] and cruise speed [
3,
18,
19]. It serves as the cornerstone for evaluating the time cost of a task plan. However, due to environmental disturbances and errors in dynamic models [
14], resulting in deviations in actual flight time from the predicted values. Such deviations not only affect the current task but also have ripple effects on multiple tasks through task correlation. Increasing flight speeds can mitigate the influence of flight time uncertainties, but this approach has certain limitations. The range of speed increase is constrained, high-speed flight has a limited duration, and it can affect the loiter time [
20]. Therefore, it is necessary to consider flight time uncertainties during task planning. Current research mainly employs stochastic programming, Markovian methods, and robust optimization techniques to overcome these challenges.
Stochastic programming employs probability distribution functions to depict uncertain factors. In the literature [
3], a two-stage stochastic programming model was adopted to address task planning issues under speed uncertainties. Initially, it filtered plans based on their expected values and then calculated the cost of violations via Monte Carlo simulations. In the literature [
13], a fuzzy chance-constrained programming model was used to time uncertainties and formulate multiple plans according to risk appetites. In the literature [
21], the expected value and worst-case scenario approaches were investigated in the context of agent speed uncertainties, and it was revealed that the latter usually incur excessively high costs due to their overly cautious nature.
Markovian methods employ a model of an agent’s interaction with its environment to solve sequential decision-making processes under uncertainty. Such methods represent uncertainty in the form of probabilities, usually through transition matrices, and they aim to adjust decisions to maximize cumulative rewards. The literature [
22] has focused on the task of moving-target tracking and considered payload confidence and target movement. Transition probabilities were constructed for task paths and observation outcomes, which were then used to solve the optimal observation strategy. By incorporating Markovian principles, the challenges posed by uncertainty in a dynamic and changing environment were addressed.
Robust optimization utilizes uncertainty sets to depict uncertain factors, without relying on probability distribution information. It guarantees the highest feasibility of solutions under the constraints imposed by these sets [
10,
23]. The literature [
24] considered execution time and speed uncertainty by utilizing interval sets to represent task durations. A dominance relationship was designed based on intervals, leading to highly robust solutions, though it was overly conservative. In the literature [
25], uncertainty in travel time and resource requirements was addressed using budget sets. This approach integrated a dynamic programming recursive strategy into the task planning model and accurately evaluated the robustness of the obtained solutions through Monte Carlo simulations.
Sequential coupling constraints cause task correlation to evolve from typical linear chains in traditional vehicle routing to more complex networked structures. Due to uncertainty in flight time, both waiting periods and violations resulting from time window constraints can affect task completion time [
15]. These effects propagate through the network, thereby impacting subsequent tasks and increasing the overall complexity of the problem. When considering the scale of the solution space, the optimization of task completion counts, and the execution time involves a complex intersection of two well-known NP-Hard problems: the maximum coverage problem and the vehicle routing problem [
3,
5,
26]. Despite having the same task assignment, the uncertainty in flight time can lead to variability in task plans. This obscures the previously well-defined sequential coupling constraints, resulting in a nonlinear expansion of the solution space and, consequently, a significant increase in the computational demands placed on the task planning solving algorithm.
This study addresses the uncertainty in flight time by developing a rapid estimation method for task completion time that considers sequential coupling constraint and time window constraints. This approach enables the rapid evaluation of task plan violation probabilities without the need for Monte Carlo simulations. Meanwhile, a dynamic time window method is introduced to guide neighborhood search algorithms in the adjustment of task assignments by evaluating the impact of task insertion on subsequent tasks. This allows the reinsertion of tasks with high probabilities of violating time window constraints into positions that adhere to the time window, thereby mitigating the blindness of the neighborhood search process.
2. Heterogeneous UAV Collaborative Task Planning Model
2.1. Problem Description
The scenario target is a radar station with its approximate location known. The target damage requirements and task time window have been clearly defined. To simplify calculations, the following assumptions are made:
The task area is a two-dimensional region, with no consideration given to interfering factors such as terrain obstacles, no-fly zones, or sudden threats.
Each target involves three types of tasks: reconnaissance, attack, and verification. The number of tasks can be adjusted based on requirements, where each task is executed using a single UAV.
The task execution duration cannot be ignored, and the width of the task time window is larger than the execution duration.
UAVs operate at different altitude levels, subject to resource and range limitations. Upon completion of their tasks, they are required to return to their takeoff points.
2.2. Basic Model
Targets: Let be the set of targets, and the target number is . Assuming that the target has tasks, and is the task set of . (reconnaissance (R), attack (A), and verification (V)) can be employed to establish the target-related sequential coupling constraints.
Tasks: The set of tasks is arranged according to the target serial number to form the total set of tasks , and the total number of tasks is . Let be the task in . Given that is the set of execution requirements for task , including the resource requirements , execution duration , capability requirements , and a specific time window . The tasks without specific requirements are subject to an overall completion time .
UAV: Let be the set of heterogeneous UAVs. Let be a set of UAV index parameters including the flight speed , turning radius , included range , number of loads , and capability index . indicates that UAVs can perform the corresponding tasks. Additionally, denotes the task execution sequence of UAV , which is employed to establish the UAV-related sequential coupling constraints.
2.3. Task Correlation Analysis
The target-related and UAV-related sequential coupling constraints are intricately interconnected, forming a complex network of correlations. This intricate relationship extends from the interaction between a single UAV and a single task to all schedules, encompassing task dependencies and interval demands. It is important to note that task correlations are not static but undergo dynamic changes with adjustments in task plans. This section introduces a matrix-based approach for calculating task correlations, specifically involving the coupling correlation matrix, interval time matrix, and sequential coupling correlation matrix.
Target-related sequential coupling correlation matrix : represents the element in the i-th row and j-th column of the matrix. When and , it suggests that can be executed only after the completion of , i.e., is the prior task of , and is the successor task of .
Target-related interval time matrix : The matrix is related to the target-related correlation matrix. When and , it indicates must execute after the completion of with an interval of .
UAV-related sequential coupling correlation matrix : is the element in the i-th row and j-th column of the matrix. indicates that UAV needs to execute task after the completion of , i.e., is the preceding execution task of , and is the subsequent execution task of .
Synthesized
sequential coupling correlation matrix :
indicates that
can affect the execution of
. The task
meeting
is the susceptible task of task
, and all tasks meeting
constitute the affected susceptible task set
of
.
represents the disjunction operator.
2.4. Task Planning Model
Let the graph model be
. Specifically,
represents the set of take-off points and tasks, and
represents the set of transfer orders between UAV and tasks. Let
be the task assignment variable, and
indicates that the UAV
will travel from
to
and perform task
. The following constraints need to be met:
Equation (3) constrains the allocation relationship; Equation (4) constrains the UAV so that it starts from the takeoff point, returns after completing the task, and the number of UAVs used does not exceed the limit; Equation (5) constrains the resource requirement so that it does not exceed its on-board resource limit; Equation (6) constrains that the UAV, so it can perform the assigned task.
The time-related variables, namely the start time and the completion time of task , are constrained using multiple temporal constraints in the network of correlations. These constraints encompass the target-related sequential coupling constraint, UAV-related sequential coupling constraint, time window constraint, and UAV range constraint.
The complexity of the correlation is reflected in the calculation of the start time and completion time. Taking the start time as an example, as shown in Equations (7)–(9) and
Figure 1, the start time
is calculated as the maximum of the following relevant values: the completion time of the prior task (denoted as
), the earliest arrival time of the assigned UAV (denoted as
), and the open time of the time window (denoted as
). Additionally, the start time must satisfy the time window constraint presented in Equation (11) and the UAV range constraint given in Equation (12).
In Equation (8),
represents the ideal flight time of UAV
between
and
(obtained from the Dubins path [
18] and cruise speed
). In Equation (10),
denotes the waiting time caused by an unopened time window or incomplete prior tasks. In Equation (12),
denotes the departure time of UAV
.
When considering the uncertainty of flight times,
is defined as the flight time for UAV
between
and
.
denotes the start time, and
denotes the completion time. The completion time of prior tasks is represented as
, and the waiting time is denoted as
. However, a significant change occurs when certain variables in Equations (7)–(9) transform into continuous stochastic variables, rendering the original solution method obsolete. A new approach, detailed in
Section 3, has been devised to address this challenge.
After obtaining the probability distribution of the task start time and completion time, this probability information will be used to assess the likelihood of task breaches. To facilitate this assessment, this paper establishes a task benefit function that incorporates a reward for task completion and a penalty for task breaches. Consequently, as the risk of breaching the time window constraints rises, the value of the task benefit function decreases proportionately.
where
represents the probability of
breaching the time window constraint,
denotes the penalty, and
represents the reward. It follows that a higher probability of breaching the time window constraint will result in a lower value of the benefit function. Meanwhile, this paper utilizes the expected task completion time and variance to filter and selects optimal task plans while ensuring equivalent benefits.
3. Rapid Estimation Method for Task Completion Time under Uncertain Flight Times
Under the assumption that flight time follows a normal distribution [
3,
21], this paper proposes a rapid estimation method for task completion time. By fully considering the sequential coupling constraints among tasks, the proposed method can effectively obtain task completion time and evaluate the robustness of the plan.
3.1. Impact of Uncertain Flight Time on Task Completion Time
The analysis presented in
Section 2.4 indicates that both the task’s start time and completion time are influenced by several factors: the earliest arrival time of the assigned UAV, the completion time of prior tasks, and the task time window. The uncertainty inherent in flight time transforms the first two factors into continuous interval random variables. Due to the constraints imposed by the task time window, tasks can be divided into two categories: those completed within the designated timeframe and those that have missed it. Consequently, this section focuses specifically on the calculation of task completion time.
The determination of task completion time involves two key steps. Firstly, it is necessary to find the joint distribution of the two continuous interval random variables representing the completion time of prior tasks and the earliest arrival time of the UAV. Secondly, an analytical expression for the task completion time is derived based on an analysis of the relationship between this joint distribution and the opening and closing times of the task time window. The task completion time is defined as the time when the UAV is ready to depart for the next task, and the sum of the task completion time and flight time yields the earliest possible arrival time at the subsequent destination.
3.2. Maximum Time: Prior Task Completion vs. Earliest UAV Arrival
Denote the completion time of the prior task as
and the earliest arrival time of the UAV as
. Denote the maximum time between the completion time of the prior task and the earliest arrival time of the assigned UAV as
. Denote the probability density function (PDF) for
as
and its cumulative distribution function (CDF) as
.
where
is the time variable,
is the Gaussian error function, and
is the complementary error function.
represents the cumulative probability at
. The expectation and variance of
are given using
Given the distances between tasks and environmental factors, the variance in the earliest arrival time tends to be large. However, since time window constraints could lead to UAV waiting periods, there is smaller variance in the completion time of the prior task. To improve evaluation precision, comparable expected values are assigned to both the arrival time of UAV and completion time of the prior task. More specifically, in
Figure 2a, the earliest arrival time of the UAV is illustrated with the purple curve, while the completion time of prior tasks is depicted using the green curve. The blue line traces the PDF of the maximum time
. Using Equations (17) and (18), the expected value is 31.63 with a variance of 2.0313. The normal distribution curve is shown as a red dashed line in
Figure 2b.
Figure 2c compares the CDFs of the maximum time and the normal distribution. Additionally,
Figure 2d shows a maximum CDF deviation of 0.0167, confirming a close match between empirical and theoretical data.
3.3. Task Completion Time under Time Windows
The maximum time , a continuous random variable within an interval, has four relative relationships with the task time window: being influenced only by the opening time of the time window (), being influenced only by the closing time of the time window (), being influenced by both the opening and closing time of the time window (), and falling within the time window.
3.3.1. Impact of Time Window Opening Time on Task Completion Time
Considering the open time
,
must await its commencement before this designated opening time. Conversely, tasks arriving thereafter can commence execution promptly. Given the execution duration
, the completion time
can be ascertained. The associated PDF
and CDF
are given using
The task start time must not precede the designated opening time of the time window, thus making the completion time conform to a truncated normal distribution with a lower limit (
). When the condition
is met, it indicates that the UAV waits before the time window’s commencement, and the corresponding completion time for different points within all coalesce to
, resulting in a discontinuity at
. The expected value and variance of completion time are given using Equations (22) and (23).
In the context of this study, the maximum time
is derived from the simulation results
presented in
Section 3.2. The analysis focuses on the impact of the time window’s opening time
on the completion time
. To perform the analysis, the time window is deliberately set to [30, 40]. This allows for the introduction of a probability distribution of
to precede the time window’s opening while mitigating any potential influence from the time window’s closing time. Furthermore, to maintain consistency and facilitate comparison, the task execution duration has been fixed at two units throughout the analysis.
In
Figure 3a, the black curve represents the PDF of
, while the black vertical dashed line indicates the time window’s opening time. The yellow area signifies the scenario in which the UAV arrives early and waits for the time window’s opening, with a cumulative probability of 0.1264. In
Figure 3b, the blue curve depicts the PDF of the completion time, which consists of two components: a probabilistic mass point and a curve. Specifically, the former corresponds to the UAV’s arrival before the time window’s opening, with a starting time of 30 and a completion time of 32, yielding a PDF value of 0.1264. The latter is obtained by shifting the distribution after the time window’s opening to the right by two units (task execution duration).
Based on Equations (22) and (23), the expected completion time is calculated to be 33.7196 with a variance of 1.6204. In
Figure 3b, the red dashed line depicts the fitting curve corresponding to the normal distribution. Considering the variance of the completion time, the present variance is lower than the variance of the maximum time
. Additionally, when the opening time of the time window is delayed to 31, the variance of the completion time further decreases to 1.0733. This indicates that the waiting period of the UAV before the time window commencement diminishes the uncertainty surrounding the task plan.
Figure 3c illustrates the CDF before and after fitting, revealing significant fluctuations that are influenced by the time window’s opening time. Specifically,
Figure 3d highlights a maximum error of −0.0871 in the fitted CDF, emphasizing the significance of this particular time point.
3.3.2. Impact of Time Window Closing Time on Task Completion Time
Considering the closing time
, tasks adhere to constraints and are executed directly if
occurs before
. Conversely, if
occurs after
, the task violates the time window constraints, it cannot be executed, and the UAV proceeds to the next task. The PDF
and the CDF
are provided below.
Due to the influence of task execution duration, there is an overlap between the two sub-functions within the domain
in Equation (24). The expected value and variance of completion time are given using Equations (27) and (28), respectively.
is set to the simulation results
presented in
Section 3.2, and the time window is deliberately set to [20, 33]. This allows for a certain probability distribution of
to exist beyond the time window’s closing time while eliminating the influence of the time window’s opening time.
In
Figure 4a, the black curve represents the PDF of
, while the black vertical dashed line indicates the time window’s closing time. The green area depicts the portion that violates the time window constraints. In
Figure 4b, the blue curve illustrates the PDF of the completion time. The increase in probability within the interval (33,35) is attributed to the overlap between two cases: satisfying the time window and violating the time window. For instance, when
has a value of 32, the task satisfies the time window and has a completion time of 34. However, when
has a value of 34, the task violates the time window, but the completion time remains to be 34. The situation is the same for other time points within the range of (33,35).
Using Equations (27) and (28), the expected completion time is 33.2936 with a variance of 1.1581. In
Figure 4b, the red dashed line represents the fitted normal distribution curve corresponding to these values. In this case, the variance of the completion time has decreased compared to the variance of
. However, when the task execution duration is changed to 6, the variance of completion time increases to 2.7697, surpassing that of
. This indicates that the distribution of the task completion time has an intricate nature, which is influenced by both the time window’s closing time and the task execution duration.
Figure 4c illustrates the CDF both before and after fitting. Notably, the main fluctuations are observed at the two endpoints of the overlapping regions. Specifically,
Figure 4d highlights a maximum deviation of −0.0632 in the fitted CDF.
3.3.3. Impact of Time Window Opening and Closing Times on Task Completion Time
When the variance of
is relatively large and the time window is narrow, the task completion time is affected by the time window’s opening time
and closing time
, provided that
and
are satisfied. The PDF
and the CDF
are expressed as follows:
The expected value and variance of the completion time, jointly influenced by the time window’s opening and closing time, are given using Equations (32) and (33), respectively.
The maximum time
corresponds to the simulation result
presented in
Section 3.2. To analyze the impact of both the time window’s opening and closing time on the task completion time, a time window of [30, 33] is established.
In
Figure 5a, the black curve depicts the PDF of
, while the black vertical dashed line indicates the time window. The yellow area depicts the period during which the UAV arrives early and waits for the time window to open. The green area represents the portion that violates the time window constraints. In
Figure 5b, the blue curve illustrates the PDF of the completion time, which comprises probability-weighted points and three discontinuous curve segments and is consistent with the preceding analysis.
The expected value of the task completion time, derived from Equations (32) and (33), is 33.3832 with a variance of 0.8075. The red dashed line in
Figure 5b corresponds to the fitted normal distribution curve. Due to the combined effects of the time window’s opening and closing times, the variance in the completion time has further decreased.
Figure 5c illustrates the CDF both before and after fitting. Notably, significant fluctuations are observed at specific time points, particularly at the time window’s opening time (plus the task execution duration) and the endpoints of the overlapping regions. Specifically,
Figure 5d highlights a maximum deviation of 0.0649 in the fitted CDF, emphasizing the significance of these fluctuations in the data distribution.
3.4. Earliest Arrival Time for the Next Task
The task completion time is defined as the moment when the UAV commences its subsequent assignment. In deterministic situations, determining the earliest possible arrival time for the following task is straightforward, involving the summation of the task completion time and the flight duration. However, when dealing with uncertain scenarios where both these durations are stochastic and characterized by their own probability distributions, the computation becomes more complex. Specifically, it necessitates the convolution of the PDF of the task completion time with the PDF of the flight time to derive the PDF of the arrival time for the subsequent task (denoted as ).
Due to the influence of time windows, as discussed in
Section 3.3, the PDF of the task completion time exhibits irregularity and discontinuity. This irregularity poses a challenge for direct convolution with the flight time PDF due to potential computational expense and inaccuracy. Consequently, this study employs an approximate convolution method that utilizes a normal distribution fitting function for the task completion time. In this section, a comparison is made between this approximate convolution method and direct convolution using the actual distributions of the task completion time. The PDF resulting from the approximate convolution is denoted as
, whereas the PDF resulting from direct convolution is denoted as
.
Let
be the flight time, which is assumed to be independent among different tasks. Following the convolution calculation rules for the normal distribution,
is derived by convolving
with the task completion time
, as shown in Equation (34).
3.4.1. Convolution Calculation of Task Completion Time and Flight Time under the Influence of the Time Window’s Opening Time
In Equation (20), the PDF of the completion time under the influence of the time window’s opening time is mainly divided into two segments. Due to the commutativity property of convolution, the convolution calculation can be independently performed on two distinct segments of the PDF, resulting in two separate functions denoted as
and
. Subsequently,
is obtained by summing the two convolutions.
Based on the simulations presented in
Section 3.3.1, let the UAV flight time
be
.
Figure 6a represents the PDF curve for the earliest arrival time of the subsequent task. The blue curve is obtained through piecewise convolution calculations using Equations (35) and (36).
Figure 6b represents the CDF curve for the earliest arrival time. Despite the discontinuous nature of the task completion time, the convolution results demonstrate a remarkably smooth profile. The red curve represents the convolution calculation results using the normal distribution fitting function. Notably, both calculation strategies demonstrate excellent approximation performances.
By further analyzing the CDF errors associated with the two calculation strategies, it is observed that the maximum error after the fitting completion time in
Section 3.3.1 is −0.0871.
Figure 6c demonstrates that the maximum error after the convolution calculations is reduced to −0.0112. Meanwhile, when the flight time variance is expanded to 3, this error is further diminished to −0.0075.
3.4.2. Convolution Calculation of Task Completion Time and Flight Time under the Influence of the Time Window’s Closing Time
The PDF of the completion time under the influence of the time window’s closing time, as represented in Equation (25), is mainly divided into three segments. According to the properties of convolution calculation, it can be transformed into two function segments. Then, convolution calculations are performed separately on these two segments, denoted as
and
.
is obtained by summing the two convolutions.
Based on the simulations in
Section 3.3.2, considering the UAV flight time
as
,
Figure 7a depicts the PDF curve for the earliest arrival time of the subsequent task.
Figure 7b depicts the CDF curve for the earliest arrival time. The blue curve is derived through piecewise convolution calculations using Equations (37) and (38). The red curve represents the convolution results obtained using the normal distribution fitting function. Both calculation approaches exhibit excellent approximation performance.
Further analysis of the CDF errors associated with these two calculation strategies indicates that the maximum error after the fitting completion time in
Section 3.3.2 is −0.0632. However,
Figure 7c reveals that the maximum error after convolution calculations is reduced to 0.0105. Moreover, as the flight time variance increases to 3, this error is further reduced to 0.0068.
Overall, this indicates that using the normal distribution fitting function for convolution calculations not only simplifies the computational process but also maintains a relatively high degree of solution accuracy. Moreover, the error decreases as the variance of the flight time increases. In the context of this study, the tasks are geographically dispersed, and the flight time between them exhibits significant variance due to environmental factors. In the context of this study, the tasks are geographically dispersed, and the flight time between them exhibits significant variance due to environmental factors. In such cases, directly employing the normal distribution fitting function for convolution calculations is feasible.
3.5. Calculation Process for Task Planning Plan
Based on the aforementioned analysis, the computational process and pseudocode are given below, with specific details provided in Algorithm 1.
Calculate the completion time of the prior task and the earliest arrival time of the UAV .
Compute the maximum time according to Equations (17) and (18).
Determine the relative situation of the maximum time with the time window.
If and , calculate the task completion time using Equations (32) and (33).
If , calculate the task completion time using Equations (22) and (23).
If , calculate the task completion time using Equations (27) and (28).
In all other cases, the task completion time is , .
Algorithm 1: Task completion time calculation |
Input: Task assignment |
Output: Task completion time |
1 | For each task (from 1 to ), perform the following steps: |
2 | Calculate the completion time of the prior task |
3 | Calculate the earliest arrival time of assigned UAV |
4 | Compute the maximum time based on and according to Equations (17) and (18) |
5 | Analyze and time window , and execute corresponding steps based on conditions: |
6 | If and , calculate completion time according to Equations (32) and (33) |
7 | If , calculate completion time according to Equations (22) and (23) |
8 | If , calculate completion time according to Equations (27) and (28) |
9 | Otherwise, set completion time , |
10 | Return the completion times of all tasks |
4. Neighborhood Search Hybrid Evolutionary Algorithm for Uncertain Flight Time
In situations where the flight time is uncertain, task completion time transforms from a specific point to a continuous probability distribution within an interval. Consequently, the actual task completion time may fluctuate within a certain range, and this can lead to variations in the execution outcomes even for the same task assignment, significantly expanding the solution space, and increasing the coordination complexity for temporally coupled tasks. To accurately evaluate the performance of task plans under flight time fluctuations, it is necessary to introduce additional optimization objective functions. However, the increase in optimization objectives will also lead to a higher pressure on solution screening [
27]. The task planning problem with uncertain flight time puts forward higher demands on the performance of solution algorithms.
To overcome this challenge, this paper designs a hybrid evolutionary algorithm based on neighborhood search. This algorithm adopts an evolutionary algorithm as the fundamental update framework [
28] and incorporates neighborhood search to reassign tasks with higher breach probabilities or those affecting task execution in relatively good task plans, thereby enhancing the performance of the task plans. Since the adjustment of a single task in the neighborhood search process may affect multiple tasks, this paper introduces a dynamic time window based on task correlation and temporal constraints. This design allows for precise evaluation of the effects of task adjustments on subsequent tasks, thereby achieving higher computational efficiency in neighborhood search.
4.1. Algorithm Architecture
With an evolutionary algorithm as its fundamental update framework [
28], the hybrid evolutionary algorithm incorporates neighborhood search to conduct the main refined search. During the search process, it selects tasks from promising plans using preset operators and optimizes them through insertion operations. By introducing a dynamic time window, the algorithm can respond to task environment changes in real-time, guiding the reinsertion of tasks to optimize the plan. Task selection, sorting, and insertion are adjusted based on time fluctuations, while the selection probability is dynamically updated according to its performance during the calculation. Additionally, a population reset mechanism is periodically triggered to maintain diversity and prevent stagnation in local optima. The detailed pseudocode of the Algorithm 2 is provided below.
Algorithm 2: Hybrid evolutionary algorithm |
Input: Information of Targets, Tasks and UAVs |
Output: Task plan |
1 | Population initialization |
2 | While (It<ItMax) do |
3 | Crossover and mutation: Perform crossover and mutation operations for task assignment |
4 | Update repository: Update repository based on the ordered optimization objective |
5 | Dynamic time window: Calculate dynamic time window for each task based on task correlation |
6 | Neighborhood search: Accurately adjust individuals in the repository utilizing dynamic time windows as guidance |
7 | Update selection probability: Update selection probability for task selection, task sorting, and task insertion operators |
8 | Population reset: Perform neighborhood search operations on the entire population |
9 | EndWhile |
10 | Return Best feasible solution |
4.2. Dynamic Time Window
According to
Section 2.4, the start time is constrained by multiple temporal constraints. The introduction of a new task can delay the original start time at the insertion point, which subsequently affects other tasks through task correlation, potentially leading to constraint violations. To rapidly determine whether task insertion will cause violations, this study introduces the concept of the dynamic time window.
The dynamic time window, when applied to a specific task plan, is based on the temporal constraints inherent to each individual task. By incorporating task correlations and utilizing backward reasoning, it translates the temporal constraints of associated, affected tasks onto the current task under consideration. This methodology facilitates the determination of the maximum allowable delay for the current task, thereby guaranteeing adherence to the temporal constraints of all tasks without any violations. The dynamic time window for task
is denoted as
, where its length is represented by
and calculated using Equation (39).
where
represents the difference between the task start time and time window’s closing time;
represents the difference between the task start time and the maximum range of UAV, with
denoting the remaining distance of the UAV. The third term
represents the duration of the dynamic time window and waiting time of the susceptible task
in the UAV-related sequential coupling correlation matrix; the fourth term
represents the difference between the dynamic time window and the start time of the susceptible task
in the target-related sequential coupling correlation matrix.
Figure 8 illustrates the calculation of dynamic time windows, encompassing five tasks labeled A through E. Tasks A and B, as well as C and D, form two reconnaissance-attack task sequences with a sequential coupling constraint. A new task, E, is to be inserted before task A. The determination of the dynamic time window for task A relies on task correlation, necessitating a reverse calculation from task D. The closure of task D’s time window dictates its latest commencement. According to the sequential coupling, task C must be completed before task D. It reveals that task D commences earlier than the closure of task C’s time window, thus precluding any postponement of task C beyond the start of D. Task B also precedes D, yet it can be delayed until the closure of its time window after accounting for the requisite flight time. Task A must not only precede task B but also ensure a sufficient flight time between A and C. It indicates that the extended flight time between tasks A and C primarily restricts any delays in task A. The purple shaded area represents the dynamic time window for task A, which is narrower than its original time window, reflecting the temporal constraints imposed by its affected susceptible task. Similarly, the dynamic time windows for tasks B, C, and D can be calculated based on their respective constraints.
To analyze the impact of inserting a new task, it is sufficient to compare the increase in time after the insertion with the length of the dynamic time window of the original task at the corresponding insertion position. After the new task is inserted, if the start time of the original task at the insertion position still falls within its dynamic time window, then despite changes in the start time of some tasks, the overall task plan will continue to satisfy all temporal constraints. Conversely, if the start time of the original task at the insertion position exceeds the range of its dynamic time window, the insertion of the new task will cause some tasks in the plan to violate constraints. During the neighborhood search operation, giving priority to positions that satisfy the dynamic time window requirements as insertion points for new tasks can minimize interference with the existing task plan.
Considering flight time uncertainty, both the task start time and waiting time described in Equation (39) are transformed into continuous random variables. The obtained dynamic time window also requires precise evaluation through probability, which increases the difficulty in determining the suitability of task insertion. To address this issue, this paper introduces the concept of chance constraints and screens suitable tasks for insertion by adopting a dynamic threshold adjustment strategy.
Specifically, a default threshold is preset for task insertion. When the probability of violating the dynamic time window after a new task is inserted does not exceed the threshold, the task can be considered suitable for insertion. Selecting a higher threshold helps reduce the risk of default but may limit the insertion positions of tasks; selecting a lower threshold facilitates rapid task insertion but may increase the risk of default. Thus, it is necessary to dynamically adjust the threshold during the calculation process, i.e., using a relatively conservative threshold in the initial stage and gradually reducing it when there is no feasible insertion position, thereby balancing the benefits of the solution and the risk of default.
4.3. Neighborhood Search Operation
The neighborhood search operation mainly comprises steps such as task selection, task sorting, task insertion, and probability updating of the operation operators. The selection probabilities of task selection, task sorting, and task insertion operators will be adjusted after a predetermined number of generations [
28,
29].
- a.
Constraint-violating tasks: select tasks that definitely violate constraints and tasks with a higher probability of violation (in this paper, tasks with a probability of violating time window constraints greater than 0.1 are selected).
- b.
Predecessor tasks of constraint-violating tasks: Some tasks miss their time windows or have shorter time windows due to the influence of predecessor tasks. In such cases, the predecessor tasks need to be readjusted; otherwise, the tasks cannot be executed.
- c.
Random selection: randomly select a predetermined number of tasks for reassignment to increase the randomness of the algorithm’s search.
- d.
Execution duration: Select tasks where the expected change in drone task time after deletion is significant.
- e.
Path length: select tasks where the change in flight path length after deletion is significant for reassignment and reduction.
Considering the order of task calculation, it is necessary to calculate tasks that satisfy sequential coupling constraints as optional tasks after each task insertion. When there are multiple optional tasks, three sorting operators are used: random sorting, absolute regret sorting, and relative regret sorting [
30] to sort the tasks.
- a.
Time-greedy insertion: select the position where the expected increase in drone time after task insertion is minimal.
- b.
Path-greedy insertion: according to the C-W strategy [
28], select the position where the path increase after task insertion is minimal.
- c.
Noisy path-greedy insertion: add a certain amount of random noise to the path-greedy insertion operation, with a maximum noise amplitude of 10% of the maximum distance between targets.
Considering that the algorithm can simultaneously operate on multiple independent neighborhood actions, it has a fast convergence speed. To avoid the algorithm falling into a local optimal solution, when the optimal solution in the external storage has not been updated for a certain number of iterations, a population reconstruction operation will be triggered. This operation will perform a neighborhood search on the population in hopes of jumping out of the local optimum.
6. Conclusions
This paper designs a rapid estimation method for task completion time, considering the uncertainty of flight time. Expressions are derived for task completion time under varying time window constraints, facilitating the recursive computation of task duration based on task correlation. This approach enables swift and precise evaluation of the robustness of task plans, obviating the need for costly numerical simulations.
Leveraging task correlation and temporal constraints, a dynamic time window for tasks is established to efficiently assess how the introduction of a new task affects subsequent ones. The hybrid evolutionary algorithm precisely adjusts task assignments in response to the dynamic time window, reallocating tasks with higher probabilities of violating constraints or negatively impacting task execution, thereby bolstering the resilience of the task plan.
The proposed methodology comprehensively accounts for the stochastic nature of the problem and exhibits strong adaptability to changes in task scale, temporal fluctuations, and time window variations. It eliminates the redundant verification of numerous probability values typically required in comparative methods, attaining robust task plans with minimal increments in time costs and without significantly increasing computational complexity.
In the future, research efforts will explore alternative flight time distributions, including the Gamma distribution, to gauge changes in calculation accuracy and efficiency across diverse scenarios. Studies will also delve into task execution time and its maximum time distribution, elucidating how time window closures affect task completion variance. Furthermore, the integration of adversarial risks such as interference and hacking into UAV models is crucial for understanding their real-world ramifications on task planning and execution.