1. Introduction
The characteristics of high maneuverability, low operational cost, and the ability to perform tasks in complex environments mean that unmanned aerial vehicles (UAVs) are widely applied in both civil and military fields, such as disaster rescue, environmental monitoring, and cooperative reconnaissance [
1,
2,
3]. Among various UAV-related technologies, cooperative path planning for multiple UAVs stands out as a core challenge, as it directly determines the efficiency, safety, and success rate of multi-UAV mission execution. Simultaneously designing the flight trajectories of multiple UAVs in parallel is a key challenge for this task. In addition to considering various environmental constraints (maximum and minimum flight) and threat avoidance (radar, missile, and artillery threats), it is also necessary to consider collisions between multiple UAVs and spatiotemporal coordination [
4,
5,
6,
7,
8,
9]. Therefore, the design of the path is complex.
The A* algorithm [
10] and Dijkstra algorithm [
11], as traditional and classical path planning methods, are limited by handling high-dimensional and complex constraint spaces, especially when dealing with dynamic threats and multi-UAV collaboration [
12,
13,
14]. Due to the characteristics of strong search capabilities for the high-dimension solution space and flexibility in dealing with nonlinear constraints [
15,
16], metaheuristic algorithms have been widely adopted to emerge promising solutions for optimization of various application scenarios [
17,
18,
19]. The grey wolf optimizer (GWO), proposed by Mirjalili et al., is a representative metaheuristic inspired by the social hierarchy and cooperative hunting behavior of grey wolves [
20]. It has been widely used in UAV path planning due to its simple structure and few control parameters [
21,
22,
23]. However, the standard GWO still has inherent shortcomings: its search performance is highly dependent on the three leader wolves (
,
, and
), which easily leads to premature convergence and trapping in local optima when facing complex optimization tasks (e.g., multi-UAV cooperative path planning with multiple threats and time–space constraints) [
24,
25].
To address the limitations of GWO, numerous improved versions have been proposed [
26,
27,
28]. To mitigate the insufficient exploration and premature convergence of the original GWO, Zhu et al. [
26] adopted chaotic maps integrated into the position update process of GWO. Moreover, differential evolution (DE) combined with a fractal-based multi-scale search strategy is used to realize search with multiple levels of detail to enhance the exploitation ability of GWO. In [
27], Liu et al. integrated Gaussian mutation and dynamic weights of trigonometric functions to adjust the importance of information for three leaders through differentiated weight adjustment. In addition, a spiral function was employed to perturb the optimal individual position, which prevented GWO from getting stuck in local optimal solutions. Due to the traditional GWO relying on leader wolves’ (
,
, and
) guidance for population updates, the population tends to converge in the later stages of iteration, resulting in insufficient diversity and difficulty in breaking through local optima. Zhou [
28] directly integrated crossover and mutation operators of the genetic algorithm (GA) into the GWO population update process, breaking population homogenization through gene recombination and random mutation, thereby improving premature convergence problems. Nevertheless, these improved algorithms still struggle to balance exploration and exploitation in complex solution spaces, such as hybrid and composite benchmark functions or multi-UAV path planning scenarios with dense threats. The imbalance between exploration and exploitation can lead to different inefficient search problems. Excessive exploration makes it difficult for individuals to achieve precise search within the existing search area, thereby impeding the approach of the global optimal solution. Conversely, excessive exploitation behavior causes inefficient searching for unexplored areas, resulting in falling into local optima, especially when the solution space contains multiple local optima [
29,
30]. In addition, existing path planning algorithms for cooperative path planning often ignore the coupling relationship between individual UAV trajectory optimization and global team coordination. For example, some algorithms focus on avoiding threats but may result in excessive flight distance and energy consumption, while others prioritize time coordination but cannot ensure collision avoidance between UAVs [
31]. Therefore, there is an urgent need for a metaheuristic algorithm that can not only balance exploration and development to handle complex optimization tasks, but also effectively integrate multiple drone constraints (time coordination, spatial collision avoidance, threat avoidance) to generate high-quality cooperative trajectories.
Against this background, this study proposes a hybrid search behavior-based adaptive GWO (HSB-GWO) for multi-UAV cooperative path planning. The algorithm introduces dimension learning-based hunting (DLH) and Aquila exploration strategies to enrich the search behavior of the wolf pack, and adopts an adaptive weight adjustment mechanism for leader wolves to improve the utilization of high-quality solutions. By constructing a comprehensive cost model that integrates energy consumption, altitude constraints, threat avoidance, time coordination, and collision avoidance, HSB-GWO is expected to solve the multi-UAV cooperative path planning problem more effectively. The performance of HSB-GWO is verified through extensive experiments on IEEE CEC 2017 benchmark functions and a multi-UAV cooperative path planning scenario, providing a new efficient solution for multi-UAV path planning.
The remainder of this article is structured as follows.
Section 2 introduces the modeling of cooperative path planning for multiple UAVs.
Section 3 designs the proposed HSB-GWO.
Section 4 reports the simulation results. The conclusion and future work are summarized in
Section 5 and
Section 6, respectively.
3. Hybrid Search Behavior-Based Adaptive Grey Wolf Optimizer
GWO, introduced by Mirjalili et al. [
20], is inspired by the social hierarchy and cooperative hunting behavior of grey wolves. GWO emulates the leadership structure within a wolf pack, typically categorized into four levels, alpha (
), beta (
), delta (
), and omega (
), in which the
,
, and
wolves, representing the fittest solutions, are regarded as leaders to guide the search process of the
wolves that follow them. In [
32], it is suggested that GWO is a variant algorithm of PSO (similar to SPSO-2011). Therefore, the shortcomings of GWO in search performance can reflect the common problems of PSO-based metaheuristic algorithms, which is the focus of this work to solve.
GWO mathematically models the strategies of encircling, hunting, and attacking prey. The positions of the search agents (wolves) are updated based on the perceived locations of the , , and wolves, simulating the collaborative effort of the pack to approach the optimal solution (prey). This mechanism is developed to balance the exploration of the search space and the exploitation of promising regions. However, the search performance of the whole population is strongly dependent on the three best-positioned wolves, which easily causes the population to fall into the local optimal solution and weak search stability.
In order to overcome the weakness of traditional GWO, in this paper a novel algorithm named hybrid search behavior-based adaptive GWO (HSB-GWO) is developed. Dimension learning-based learning (DLH) [
33] and Lévy flight [
34] are introduced into the GWO for improve the balance of exploration and exploitation behaviors. In addition, the adaptive strategy is also adopted to dynamically adjust the weights of the
,
, and
wolves in offspring generation. The overall diagram of the HSB-GWO algorithm is depicted in
Figure 1 and the pseudo-code is shown in Algorithm 1.
| Algorithm 1: HSB-GWO |
![Sensors 25 07657 i001 Sensors 25 07657 i001]() |
Remark 3. It is worth noting that HSB-GWO performs fitness calculation twice per generation (see lines 13 and 32 of Algorithm 1). Therefore, G of HSB-GWO should be set half compared to the other algorithms to ensure the fitness evaluations (FEs) are equal for fair comparison.
3.1. Individual Formulation
The cooperative path planning task for multiple UAVs is achieved by generating a certain number of waypoints on a three-dimensional map for each UAV. Therefore, each individual of HSB-GWO contains the coordinates of these waypoints. In addition, the search range of HSB-GWO for paths is determined by the size of the 3D map constructed (flight altitude range
and map area
).
Figure 2 is an example of path planning in a two-dimensional map.
The data structure of each wolf for multi-UAV path planning is illustrated in
Figure 3, where a sequence of waypoints starting from the initial node
and terminating at the goal waypoint
is adopted to record spatial position information about each UAV for the calculation of
and
. In addition, between
and
, state variables
(for
) and
(for
) are integrated into the information set to record constraint states (the number of collisions generated by the path formed by waypoints
or
, and the threat area passed through) for the calculation of
and
. Each UAV has an independently set fixed speed. Therefore, the speed
will also be recorded for the calculation of
.
According to the above content, HSB-GWO will generate waypoints and velocities for each UAV based on search strategies, then update the constraint state according to the coordinates of the waypoints and velocity, and finally record the relevant information in each individual for the calculation of the cost function, thereby quantifying the performance of the UAV under the cooperative path planning task.
3.2. Wolf Pack Dividing
Before the search of each iteration, wolves are divided into two sub groups
and
randomly by
in which
is defined as the dividing probability,
i denotes the
i-th individual in the wolf pack (
),
g is defined as the
g-th iteration (
), and
is a random function to generate disturbance within
.
3.3. Adaptive Search
Firstly, the surround behavior is conducted to update the position of the wolf pack that can be modeled by
where ∘ denotes the Hadamard product,
is defined as the position of the prey,
denotes the position of the wolf in the
g-th generation,
A and
C are the random coefficients, in which
A is a control parameter to adjust the trend of the individual position movement and
C is adopted to generate various distances from the population to the
,
, and
wolves. The update laws of these two correlation coefficient vectors are designed as
where
and
are random vectors generated within
, and
a is defined as the attenuation coefficient to decrease from 2 to 0 with increasing iterations.
Different from the linear decay adopted in the conventional GWO, the update law of
a proposed in [
35] is formed as
in which
G denotes the maximum iteration for the HSB-GWO.
Figure 4 shows the convergence process of the linear decay method and Equation (
22). Nonlinear decay can enrich the behavior of wolf packs throughout the entire search, enabling them to exhibit certain exploitation behavior in the early search stages to accelerate the convergence speed of the population. Similarly, in the later stages of the search, certain exploration behavior can also be exhibited, thereby reducing the possibility of falling into local optima.
The hunting stage is conducted after the positions of the leader wolves (
,
, and
wolves) are updated. In wolf pack hunting, the leader wolves are the closest to the prey. Therefore, considering that
,
, and
wolves have a better knowledge of the location of the optimal global solution, the other wolves are guided by them. The hunting behavior can be mathematically modeled by
where
,
, and
are computed by Equation (21).
is a sign for absolute value. We substitute
,
, and
into Equation (
18), which can be formed as
in which
,
, and
are the top three solutions at the
g-th generation;
,
, and
are calculated by Equation (
20).
In the conventional GWO, the generation of a new solution
is achieved by calculating the mean of
,
, and
, which can be written as
However, by considering the differences in their fitness, their contribution to the generation of a new solution should also be different. For the current best solution
, the importance of its
should be the highest. Therefore, Equation (
29) is rewritten as follows:
where
,
, and
are the fitness of
,
, and
.
F represents the total cost generated during the flight process, in which
always holds and the minimum values of other elements (
,
,
, and
) are 0. Therefore,
always holds.
3.4. Aquila Exploration
Aquila exploration as a hybrid search method is proposed by Ma et al. [
24], in which two search strategies are introduced to prevent wolves from falling into a local optimal solution. Moreover, hybrid search modes combined with the aforementioned adaptive search method can further enrich the diversity of wolves to improve search ability and stability.
3.4.1. Expand Exploration
This search strategy is used to act as an investigator for locating the potential areas in the solution space, which can be formed as
where
is defined as a new position for the wolf,
is adopted as the linear decay factor to control search behavior and
is the average value of all the positions of the wolves participating in the expand exploration that can be calculated by
in which
denotes the number of wolves in expand exploration.
3.4.2. Narrowed Exploration
This search mode imitates the hunting behavior of Aquila after discovering prey, wolves are expected to be able to perform local search behavior while possessing the ability to escape from local optima. Therefore, Levy flight with variable step size conforming to the characteristics of heavy tailed distribution is introduced. The new position updated by this search mode can be formed as follows:
where
is calculated by
in which
c represents a fixed constant (0.01) and the mathematical formula of
is written as
which can be approximated and computed by the Mantegna method [
36] as
in which
is a fixed constant set to be 1.5 and
u is generated by a random function that conforms to a normal distribution with variance
computed by
Similarly, the variance of a random function for
v is 1.
is a gamma function, which can be computed according to Weierstrass’s definition as
where
is the Euler–Mascheroni constant, 0.577216.
3.5. Dimension Learning-Based Hunting (DLH)
The unstable convergence caused by the generation of new individuals relying on the three leader wolves cannot be unavoidable in the conventional GWO. Moreover, hunting, as another interesting social behavior of the non-leader members, is neglected, which is potential information for enhancing search performance. Therefore, DLH is adopted to overcome these issues, which is an effective strategy to achieve knowledge learning for each wolf from its neighbors.
For the new position of each wolf, its
d-th dimension
can be calculated by
in which
is the position of the
i-th wolf,
is a wolf selected randomly from the current population
, and
is defined as the
d-th dimension of a composite individual
combined by neighbor group
. The neighbors of each wolf
represented by
are constructed as follows:
in which
is the Euclidean distance between
and
. Similarly,
, as the Euclidean distance, can be computed by
where
is the current position of the
i-th wolf and
denotes the new position of a wolf updated by the aforementioned two search strategies designed in
Section 3.3 and
Section 3.4.
4. Simulations
In this section, the searching performance of our proposed HSB-GWO algorithm is evaluated through several test functions and the simulation of UAVs for cooperative path planning. All the simulations were conducted on a CPU, Intel Core (TM) i9-13900HX 2.2 GHz and 64.00 GB RAM (Intel, Santa Clara, CA, USA), and the version of Matlab is R2023b.
4.1. Control Parameter Analysis
The search performance evaluations of the proposed HSB-GWO with different control parameter settings were performed by
IEEE Congress on Evolutionary Computation 2017 (IEEE CEC 2017) benchmark suite consisting of 29 test functions [
37]. These test suite combines unimodal (F1, F3), multimodal (F4–F10), hybrid (F11–F20), and composition (F21–F30) functions. All test functions were adopted with three dimensions of
,
, and
by 10 independent runs. The number of fitness evaluations (FEs) was set based on
for fair comparison. The value of population size is
.
The Friedman test [
38] was used for ranking HSB-GWO with seven parameter settings based on their obtained fitness. As shown in
Table 1, for the F1 and F3 group (unimodal functions), the mean ranks of HSB-GWO with
are 1.00 (
), 2.00 (
), and 1.50 (
), which are relatively low in comparison to other
settings. In the F4–F10 group (multimodal functions), the mean ranks for
are 1.57 (
), 1.43 (
), and 2.29 (
), exhibiting superior performance over most other
settings. Regarding the F11–F20 group (hybrid functions), the mean ranks with
are 3.20 (
), 3.30 (
), and 2.40 (
), demonstrating competitiveness. In the F21–F30 group (composition functions), the mean ranks for
are 3.30 (
), 2.20 (
), and 3.20 (
), indicating favorable results.
From the perspective of overall mean ranks across all function groups and dimensions, the HSB-GWO with obtained the lowest mean ranks (2.69 for , 2.38 for , and 2.59 for ) in contrast to other configurations. A lower mean rank signifies better algorithm performance. Moreover, considering the complexity of solution space for the cooperative path planning of multiple UAVs, , which performs well in different types of benchmark functions, is chosen for subsequent experiments. The variation of control parameters can determine the search behavior of HSB-GWO; therefore, the performance of the algorithm in different search tasks can be improved by adjusting the control parameters.
Remark 4. In addition to finding the optimal configuration of control parameters through the above experiments, a variety of tuners, such as CRS-Tuning [39], F-Race [40], REVAC [41], and ParamILS [42], can also be adopted. 4.2. Search Performance Comparison Experiment on Benchmark Functions
To validate the search performance of HSB-GWO, seven metaheuristic algorithms are adopted: AO [
43], AOA [
44], CBOA [
45], NOA [
46], GWO [
20], IGWO [
25], and AGWO [
24]. All the control parameters of seven comparative algorithms were set following the recommended settings provided from their original works. Due to the varying number of fitness evaluation operators used by different algorithms in each iteration, the maximum number of fitness evaluations (MaxFEs) was set based on
. The value of population size is
. Two benchmark function sets (IEEE CEC 2017 and 2019) were adopted to verify the search performance of our proposed HSB-GWO.
The mean fitness errors (MFEs) representing the difference between the best fitness and the global optimum are summarized in
Table 2,
Table 3,
Table 4,
Table 5,
Table 6 and
Table 7. Moreover, standard deviations (STD) of fitness errors were also reported to measure the search performance of each algorithm. “w/t/l” denotes the number of wins (w), ties (t), and losses (l).
4.2.1. Results of IEEE CEC 2017
Exploitation and exploration ability analysis
F1 and F3 as unimodal test functions are well-suited for verifying the exploitation capability to locate the optimal solution. The results shown in
Table 2 demonstrate that the proposed HSB-GWO algorithm can yield highly competitive results on unimodal test functions. Notably, it significantly improved results on F3 across all dimensions (
) compared to the comparative algorithms. Hence, the effective exploitation ability of the HSB-GWO algorithm on the region around the optimal solution is proved.
Multimodal functions (F4–F10), which possess numerous local minima, can be utilized to test both the exploration ability and the local optimum avoidance capability of the HSB-GWO. According to the results presented in
Table 3, HSB-GWO is capable of providing superior results on multimodal functions for different dimensions (
) in F4–F7 and F9, which indicates that the proposed HSB-GWO algorithm is competitive in terms of exploration.
Search performance analysis on complex optimization tasks
Optimization for the hybrid (F11–F20) and composite (F21–F30) functions possessing complex solution space requires strong and stable search performance of algorithms. Thus, the balance between exploitation and exploration of HSB-GWO can be evaluated simultaneously by these test functions.
The results summarized in
Table 4 indicate that the HSB-GWO performs superiorly on all hybrid functions across three different dimensions (
) except F13 and F16. Although IGWO, as a comparative algorithm, achieved better results in F13 with
and F16 with
, the overall search performance of HSB-GWO is superior. Furthermore,
Table 5 presents the solutions obtained by the HSB-GWO and other algorithms for solving composition functions (F21-F30). HSB-GWO won or tied all the other seven algorithms on F22-F24, F26, F29, and F30 with
. Although HSB-GWO lost to AO in F21 with
and IGWO in F25 with
, F27 with
, and F28 with
, its MFEs in these functions are close to the best.
Table 6 summarizes the overall effectiveness of the eight algorithms. It can be seen that the HSB-GWO wins or draws with seven comparative algorithms in 82.76% of optimization tasks. Therefore, our proposed algorithm exhibited a favorable balance between exploration and exploitation, as the results reveal, which illustrates the high-performing search ability on complex optimization tasks.
4.2.2. Results of IEEE CEC 2019
To comprehensively demonstrate the performance of our proposed HSB-GWO, the benchmark functions of IEEE CEC 2019 were determined. The results are summarized in
Table 7. For F1, both HSB-GWO and CBOA can find the optimal solution for all the trials. Although AGWO and CBOA obtained the best results in F2 and F3, respectively, HSB-GWO performed competitively, being the second-best in F2 and F3. For F4–F10, HSB-GWO outperformed the other seven algorithms, demonstrating a strong search ability. The rank results based on the Wilcoxon signed-rank test further verified the superiority of HSB-GWO because it achieved seven wins, one tie, and two losses in the IEEE CEC 2019 benchmark suite.
Remark 5. In addition to verifying the balance of exploitation and exploration behaviors through the experimental results, it can also be measured indirectly [47,48] (diversity-based, entropy-based, fitness-based) or directly [49] (attraction basin-based). 4.2.3. Statistical Analysis by Non-Parametric Friedman Test
To comprehensively evaluate the performance of the HSB-GWO against other algorithms, the Friedman test proposed in [
38] is adopted, and the results are presented in
Table 8. First, observing the “Overall Rank”, HSB-GWO achieved the top rank (rank 1) in all dimensions (
). In contrast, other algorithms have lower overall rankings or show inconsistent performance in different dimensions. Analyzing the “Avg. Rank”, HSB-GWO has the lowest average rank among all algorithms for each dimension. When
, its average rank is 1.66; for
, it is 1.17; and for
, the average rank of HSB-GWO is 1.14. Lower average ranks signify better overall performance in the Friedman test, which evaluates the relative performance of algorithms in 29 test functions. Moreover, examining the performance on individual test functions (F1, F3–F30) across different dimensions, HSB-GWO consistently delivers optimal or near-optimal results. For example, HSB-GWO obtained the lowest scores compared to other algorithms on F3–F7, F9, F11–F15, F22, F26, and F29 in three different dimensions (
). Moreover, in other test functions like F10 (
), F17 (
), and F23 (
), HSB-GWO also achieved competitive rankings compared to the best. In summary, the Friedman test results clearly demonstrate that HSB-GWO outperformed other comparative algorithms in terms of both exploration and exploitation capabilities, exhibiting more excellent search performance.
The Friedman test results of the eight algorithms in IEEE CEC 2019 are shown in
Table 9. HSB-GWO achieved the best performance, as the average rank is only 1.17, indicating that HSB-GWO ranked high in most test functions. For example, HSB-GWO obtained the lowest scores in F4, F5, F6, F7, F9, and F10. In contrast, other algorithms obtained lower or inconsistent average ranks in the benchmark functions. For example, AOA received an average rank of 2.8 in F1. However, it obtained 7.05 average ranks in F2 and F6, which demonstrates the unstable search performance of AOA.
Moreover,
Figure 5 shows the results of the critical difference (CD) in the Friedman test (
Table 8 and
Table 9). For
of IEEE CEC 2017, the high performance of HSB-GWO is statistically significant compared to the other algorithms except IGWO. For
and 50 of IEEE CEC 2017, the superiority of HSB-GWO on the search performance is statistically significant.
Figure 5d suggests that HSB-GWO significantly outperforms AO, AGWO, AOA, and NOA in IEEE CEC 2019.
4.3. Cooperative Path Planning for Multiple UAVs
Eight algorithms were adopted in the two cooperative path planning tasks for multiple UAVs. For all algorithms, we conducted 10 trials, and the maximum number of fitness evaluations (MaxFEs) was set to 2400, and the value of population size
. The parameter setting and threat design regarding the cooperative path planning simulation are summarized in
Table 10,
Table 11,
Table 12 and
Table 13.
4.3.1. Task 1
Table 14 summarizes the fitness results of eight algorithms after we separately performed 10 trials in
Task 1 for the cooperative path planning of multiple UAVs. HSB-GWO achieved the lowest mean value of 30.81 among all algorithms, which indicates the better performance of HSB-GWO in path planning, as it implies a more optimal path with shorter distance or lower cost. Through the
p-value results, it can be seen that the performance improvement of HSB-GWO in cooperative path planning tasks for multiple UAVs is statistically different from other compared algorithms. Therefore, the superiority of HSB-GWO is demonstrated. The distances, flight time, velocity, and collision of each UAV in the best trial obtained by each algorithm are provided in
Table 15.
Figure 6 presents the 3D flight trajectory maps of multiple UAVs planned by each algorithm, where the trajectory corresponding to the best trial (the trial boxed in
Table 14 with the lowest fitness value) of each algorithm is selected for representation. It can be seen that multiple UAVs flying in a straight line as much as possible from the starting point along the target point is a potential optimal path because the flight distance is shorter, thereby decreasing the flight time and saving more energy. However, this path passes through multiple threat areas, and the algorithms need to guide the UAVs to avoid entering the threat area as much as possible while satisfying the constraints of time and space coordination, which increases the search difficulty of the algorithms. Therefore, AOA, CBOA, and NOA tended to select paths that bypassed threat dense regions, which reduced the extra cost incurred by traversing threat zones. Nevertheless, such paths led to an increase in flight distance and energy consumption, rendering them locally optimal solutions. For the result obtained by AO, the path is a trade-off solution, which shortens the path in the vicinity of threat dense regions by traversing some threat areas. However, as indicated by the fitness, this path also corresponds to a locally optimal solution. GWO, IGWO, AGWO, and HSB-GWO have discovered paths with similar characteristics, yet from the comparison of their fitness, it is evident that the path determined by HSB-GWO incurs the lowest cost for the multiple UAVs. Three views and a 3D graph of the best trial result for HSB-GWO are shown in
Figure 7.
Mean fitness convergences of the eight algorithms are shown in
Figure 8. The curves show that HSB-GWO can quickly converge to the local optimal solution in the early stages of iteration and gradually approach the global optimal solution in subsequent searches. For AOA and NOA, their curves showed convergence stagnation during the search process, indicating that their population was trapped in a local optimal solution. For the other comparative algorithms, it can be seen that the fitness stagnation occurred in the middle stage of iteration and converged again in the later stage. The similar convergence characteristics are caused by their hybrid search strategies of dynamical adjustment of exploitation and exploration search behavior. Although these strategies can help the population escape from local optima, the problem of convergence stagnation has not been solved.
4.3.2. Task 2
For
Task 2, the fitness results are summarized in
Table 16. It can be seen that HSB-GWO achieved the lowest mean value of 2.11 compared to other algorithms, indicating the best performance of HSB-GWO in path planning. The results of
p-values verified that the performance improvement of HSB-GWO is statistically different from other comparative algorithms. Therefore, the superiority of HSB-GWO is demonstrated. The distances, flight time, velocity, and collision of each UAV in the best trial of each algorithm are provided in
Table 17.
Figure 9 presents the best flight trajectories of multiple UAVs planned by each algorithm. In
Task 2, the UAVs need to depart from the initial point and bypass the threat zones to reach the goal positions. Due to the fact that UAVs above the queue (
) need to reach the target point at the bottom (
), and UAVs departing from below (
) need to reach the endpoint above (
), they need to avoid collision issues during queue position exchange as much as possible, which increases the difficulty of the path planning.
It can be seen that the strategies adopted by the eight algorithms were different. For example, AO reduced the collisions by implementing a strategy of exchanging positions of UAVs from the starting positions. For AOA and CBOA, similar paths were generated that increased the difference in flight distance between the upper and lower UAVs to reduce the possibility of collision due to the time difference. However, both strategies were considered as local optimal paths due to increased flight distances and energy consumption. The other five algorithms adopted the same strategy, which is to achieve position exchange while moving towards the target positions. However, the conservative routes designed by GWO and IGWO were adopted for some UAVs via a detour to decrease the possibility of collisions. Conversely, AGWO and HSB-GWO achieved coordinated flight by setting different flight velocities for UAVs. Three views and a 3D graph of the best trial result for HSB-GWO are shown in
Figure 10.
Mean fitness convergences of the eight algorithms are shown in
Figure 11, which shows that HSB-GWO exhibited remarkable convergence behavior because the HSB-GWO quickly reduced its mean fitness in the early stage of optimization and converged to the lowest fitness when the search finished. It is worth noting that during the search process, AGWO, IGWO, and GWO converged slowly for a period of time. Although the algorithm ultimately found a better solution, this indicates that there are shortcomings in the algorithm. Although AOA, CBOA, and NOA achieved a rapid decrease in average fitness in the early stages of search, they fell into local optima after
.
4.4. Architecture Effectiveness Analysis of HSB-GWO
The ablation simulation was conducted to verify the architecture effectiveness of the proposed HSB-GWO. The simulation configuration is the same as
Section 4.3 and three ablation algorithms are designed: HSB-GWO without DLH (A-1), HSB-GWO without Aquila exploration (A-2), and HSB-GWO without adaptive search (A-3). The simulation results and mean fitness convergences are shown in
Table 18 and
Figure 12, respectively. From the curve of A-1, it can be seen that the hybrid search mode composed of Aquila exploration and adaptive search can effectively address the problem of convergence stagnation, but A-1 without DLH cannot further approach the global optimal solution, whereby the mean fitness is 40.86. Conversely, the curves of A-2 and A-3 indicate that the fitness convergence stagnation is unavoidable, which affects search efficiency. In addition, the
p-values represented in
Table 18 of the three ablation algorithms demonstrate that the designed architecture has a statistically significant effect on improving search performance of the HSB-GWO.
5. Conclusions
This study proposes a hybrid search behavior-based adaptive grey wolf optimizer (HSB-GWO) to address the limitations of traditional metaheuristic algorithms in complex optimization tasks and multi-UAV cooperative path planning. Through theoretical analysis, benchmark function experiments, and multi-UAV path planning simulations, the effectiveness and superiority of HSB-GWO are systematically verified. According to the results on optimization for the benchmark functions, HSB-GWO exhibits excellent search capabilities across different types of optimization tasks. On the IEEE CEC 2017 benchmark suite (including unimodal, multimodal, hybrid, and composite functions), HSB-GWO outperformed seven metaheuristic algorithms (AO, AOA, CBOA, NOA, GWO, IGWO, and AGWO). Ablation experiments show that the three core components of HSB-GWO (DLH, Aquila exploration, and adaptive search) play crucial roles in improving search performance, in which DLH enhances the search ability of algorithm and hybrid search mode (Aquila exploration and adaptive search) effectively avoids premature convergence and enhances the ability to escape local optima. Statistical analysis confirms that the performance improvement of HSB-GWO is statistically significant. In the multi-UAV cooperative path planning scenario with 10 UAVs and 13 threats, HSB-GWO achieves the lowest mean fitness (30.81) among all algorithms. Additionally, HSB-GWO’s convergence curve shows fast early convergence and stable late-stage optimization, avoiding the convergence stagnation problems.