You are currently viewing a new version of our website. To view the old version click .
Drones
  • Article
  • Open Access

4 February 2025

Multi-Unmanned Aerial Vehicle Path Planning Based on Improved Nutcracker Optimization Algorithm

,
and
College of Artificial Intelligence and Automation, Nanjing 211106, China
*
Author to whom correspondence should be addressed.

Abstract

For the multi-UAV path planning problem, environmental modeling and an improved swarm intelligence-based optimization algorithm are discussed in this paper. Firstly, to align with reality, specific constraints of UAVs in motions, attitudes and altitudes, real-world threats such as radars and no-fly zones, and inter-UAV collisions are considered. Thus, multi-UAV path planning is transformed into a multi-objective constrained optimization problem. Accordingly, an improved nutcracker optimization algorithm is proposed to solve this problem. Through initializing with logistic chaotic mapping and the lens imaging inverse learning strategy, a more fit elite initialization population is produced to increase the efficiency of path planning. Furthermore, by adjusting adaptive parameters and integrating an improved sine-cosine search strategy, a balance between global exploration capability and local exploitation capability during path planning is achieved. Experimental results show that the improved nutcracker optimization algorithm surpasses other algorithms with respect to both convergence speed and convergence value, making it an effective method for multi-UAV path planning.

1. Introduction

Owing to their compact size and light weight, strong adaptability, high concealment, and low risk factor, UAVs have been widely applied as their technology matures. However, due to the environmental complexity, task diversity, and payload limitations of UAVs, a single UAV often struggles to meet many practical mission requirements. Therefore, multiple UAVs are growing in significance and have gradually become a research hotspot.
Path planning is a significant part of the execution of the UAV mission and holds significant research value. Existing UAV path planning algorithms mainly fall into three categories. The first are traditional algorithms, including A* algorithm [], Dijkstra’s algorithm [], artificial potential field method [], probabilistic roadmaps [], and rapid-exploration random trees [], etc. These algorithms perform well in single-UAV path planning but are difficult to apply to large-scale, multi-dimensional and path planning problems. For multiple UAVs, the second are deep learning algorithms, such as deep q-learning (DQN) [], deep deterministic policy gradient (DDPG) [], and proximal policy optimization (PPO) algorithms []. Bohao Li et al. focused on the study of UAV ground target tracking in obstacle-rich environments using deep reinforcement learning and proposed an improved DDPG algorithm []. Xueyuan Wang et al. transformed the path planning problem into a Markov decision process (MDP) with parameterized states, permissible actions, and detailed reward functions []. Then, a dueling double deep Q-network (D3QN) was proposed to learn the decision-making policy of a typical UAV, without any prior knowledge of the environment. However, as the number of variables increases, the computational complexity also grows exponentially; thus, deep learning algorithms require long training times and are difficult to converge. The last are intelligent optimization algorithms, such as the ant colony algorithm (ACO) [], particle swarm optimization (PSO) [], genetic algorithm (GA) [], grey wolf optimizer (GWO) [], and nutcracker optimization algorithm (NOA) [], etc. Compared to other algorithms, intelligent optimization algorithms can effectively solve complex combinatorial optimization problems.
Intelligent optimization algorithms are effective in handling complex path planning problems, especially those with nonlinear, multi-constraint, and multi-objective characteristics. They do not require an accurate mathematical model of the problem and can find the best or nearly the best solutions in large-scale, high-dimensional problem spaces. Tai-shan Lou proposed a hybrid strategy-based golden jackal optimizer (HGJO) algorithm [], in which a pre-decreasing slow nonlinear energy decay strategy is utilized to balance global and local search capabilities. However, this method is only applied under two-dimensional conditions and is not suitable for three-dimensional scenarios of UAVs. Jie Zhang et al. improved the sparrow search algorithm (SSA) [] by applying golden search optimization (GSO) and an adaptive iterative method to adjust local exploitation and global exploration, enhancing overall optimization performance. Xiaobing Yu combined GWO with differential evolution to solve the path planning problem of UAVs []. The position update equation of GWO was improved to enhance the search capability of grey wolves, and a rank-based mutation strategy was implemented to promote exploitation while maintaining exploration capabilities. However, these two methods are mainly applied to single-UAV path planning and have limits in multi-UAV path planning. Liang Xu et al. utilized the ideas of dynamic multi-swarm PSO (DMSPSO) and comprehensive learning PSO (CLPSO) to propose the CL-DMSPSO algorithm [], which further improved the performance of both algorithms. Kai Meng et al. proposed an evolutionary state estimation-based multi-strategy jellyfish search (ESE-MSJS) algorithm for multi-UAV cooperative path planning to search for high-quality paths []. However, CL-DMSPSO has only been compared with PSO algorithms and not with other optimization algorithms, which lacks persuasiveness. The convergence speed and robustness of ESE-MSJS still have room for improvement.
NOA is a highly competitive new intelligent optimization algorithm proposed by Mohamed Abdel-Basset in 2023 []. This algorithm simulates the seasonal behavior of nutcrackers, demonstrating superior performance with a strong global search capability and being easy to implement. NOA can converge extremely fast and possess good robustness. It is applicable not only to single-objective optimization problems, but also effectively handles multi-objective optimization ones. Therefore, it has more potential for application to multi-UAV path planning issues compared with other swarm intelligence optimization algorithms. However, it also has issues such as uneven population initialization and an imbalance between exploration and exploitation capabilities during path planning. To address these issues, an improved nutcracker optimizer algorithm (INOA) is proposed. By using logistic chaotic mapping for initialization and a lens imaging inverse learning strategy, a more fit elite initialization population is obtained. Furthermore, through the integration of parameter adjustment and an improved sine–cosine search strategy, the balance between global exploration and local exploitation skills is achieved. With these improvements, INOA effectively improves the performance of NOA and has significant advantages in addressing multi-UAV path planning problems.
The main contributions of this paper are summarized as follows:
  • A constrained optimization model for multi-UAV path planning is established, which considers realistic constraints and threats that are specific to UAVs. Taking them into account would make the resulting path points more aligned with real-world scenarios of UAVs.
  • An INOA is proposed and then applied to the multi-UAV path planning problem. Specifically, elite individuals are first obtained through chaotic mapping initialization and lens imaging-based reverse learning strategy to increase the efficiency of path planning. Then, parameters in the foraging/storage phase of NOA are adjusted to balance global search and local exploitation capabilities. By integrating with the improved sine–cosine strategy, the convergence speed and precision of searching for optimal path is further enhanced.
  • The effectiveness of the proposed INOA is first verified using the CEC2020 test suite. Then, simulations of multi-UAV path planning are conducted in various scenarios with the comparison of INOA and other related state-of-the-art algorithms, strongly demonstrating the merits and applicability of INOA.
The remainder of this paper is organized as follows: Section 2 provides the problem description for multi-UAV path planning. Section 3 outlines the technical details of the proposed INOA. Section 4 gives a brief introduction to the applied path fitting method. Then, Section 5 presents the results and analysis of the comparative experiment between INOA and other state-of-the-art swarm intelligence optimization algorithms. Section 6 concludes with a summary of the main contributions.

2. Problem Description of Multi-UAV Path Planning

In path planning for multiple UAVs, they are subjected to various threats such as mountains, radars and no-fly zones, and motion and attitude constraints, which were modeled and analyzed in this study. As shown in Figure 1, the radar detection range is described as a sphere based on its operating principle, the no-fly zones are modeled as cuboids since the forbidden areas of UAVs are usually regular-shaped buildings or grounds, and mountains are modeled as undulations. Multiple UAVs should avoid these threats to reach destinations safely and efficiently while satisfying motion and attitude restrictions.
Figure 1. The diagram of the flying environment: the sphere represents the radar detection range, the red cuboid indicates the no-fly zone, and the blue undulations represent mountains.

2.1. Path Length Cost

Suppose that there are U UAVs, each of which has corresponding start and end points. Between the start and end points, there are V trajectory points, denoted as P i , 1 , P i , 2 , P i , j , , P i , V , respectively, and each trajectory point has a corresponding coordinate ( x i , j , y i , j , z i , j ) . By connecting the start point, trajectory points, and the end point, a corresponding path can be obtained. Thus, the path length cost F L is defined as follows:
F L = i = 1 U j = 1 V + 1 L i , j
L i , j = x i , j x i , j 1 2 + y i , j y i , j 1 2 + z i , j z i , j 1 2
where L i , j refers to the distance of the j-th path segment of the i-th UAV.
By constraining the total path length, UAVs can reach the target points with the minimum distance, thereby saving the energy consumption of UAVs.

2.2. Modeling of Flight Environment and Threats

In order to tackle the aforementioned threats properly, the associated penalties are defined as follows to help UAVs avoid the threat areas, and finally synthesized into the total objective function of path planning.

2.2.1. Threat of Mountain Collision

For natural mountains in a flight environment, their mathematical model can be described as follows:
Z k ( x , y ) = k = 1 n h k exp x x k x r k 2 y y k y r k 2
where ( x k , y k ) represents the central coordinate of the i-th mountain, h k represents the height of the mountain, x r k and y r k are the decay amounts along the x-axis and y-axis of the k-th mountain, and n indicates the sum of mountains.
By referring to possible collisions on a mountain peak, effective obstacle avoidance should be carried out. Then, the penalty F C of a mountain collision is defined as follows:
F C = i = 1 U j = 1 V C i , j
C i , j = 100 z i , j Z , i f z i , j < Z 0 , o t h e r w i s e
where z i , j represents the height of the corresponding point on the mountain peak, and Z represents the height of the mountain peak.

2.2.2. Radar Threat

As aforementioned, the detection range of the radar threat that UAVs may encounter is considered as a spherical space, and the corresponding penalty F R is determined as follows based on whether UAVs fly into the detectable areas:
F R = i = 1 U j = 1 V k = 1 R n u m R i , j
R i , j = 100 , i f P i , j O R 0 , o t h e r w i s e
where O represents the center of the radar’s sphere, and R represents the radius range of the radar.

2.2.3. No-Fly Zone Threat

There always exists no-fly zones for UAVs in real applications. Therefore, this kind of threat is also considered and modeled as a cuboid-shaped space, and the penalty F N F is given as follows depending on whether UAVs enter the no-fly zone:
F N F = i = 1 U j = 1 V k = 1 N F n u m N F i , j
N F i , j = 100 , i f ( N x a x i , j N x + a ) a n d   ( N y b y i , j N y + b ) 0 , o t h e r w i s e
where N F i , j is a flag of the no-fly zone threat. ( N x , N y ) , 2 a , and 2 b are the center coordinate, length, and width of the no-fly zone.

2.3. Constraints of Motions and Attitudes of UAVs

Similarly, to address motion and attitude restrictions during path planning, other penalties are defined to ensure UAVs operate within these specified limitations and are also synthesized into the total objective function presented later.

2.3.1. Constraints of Path Segment Distance

When the path segment distance is too small, the UAV cannot complete some state adjustments; accordingly, the penalty F S L based on the constraint of the path segment distance is defined as follows:
F S L = i = 1 U j = 1 V + 1 S L i , j
S L i , j = 100 , i f L i , j l m i n 0 , o t h e r w i s e
where the path segment distance L i , j is defined in Equation (1). l m i n refers to its minimum value and can be determined upon the specifications of the UAV.

2.3.2. Attitudinal Constraints

The UAVs considered in this paper are quadrotors, and their dynamic equations are as follows []:
m p ¨ = 0 0 m g + R 0 0 F 1 + F 2 + F 3 + F 4
I β ˙ ϕ ˙ α ˙ + β ϕ α × I β ϕ α = l ( F 2 F 4 ) l ( F 3 F 1 ) M 1 M 2 + M 3 M 4
where m is the mass of the UAV, p ¨ is the acceleration vector of the UAV’s center of mass, g is the gravity acceleration, and R is a rotation matrix that converts vectors from the body frame to the world frame. F 1 , F 2 , F 3 , F 4 are the upward thrust forces generated by the four rotors, respectively. I is the inertia matrix of the UAV, which is a symmetric matrix containing the moments of inertia of the UAV around the three axes. β , ϕ , α and β ˙ , ϕ ˙ , α ˙ are the angular velocities and accelerations of the UAV, respectively. l is the distance from the rotor to the center of mass of the UAV. M 1 , M 2 , M 3 , M 4 are the counter-torques produced by the four rotors.
As shown in Equations (12) and (13), when UAVs pose large attitude angles, the required thrusts and torques may exceed the maximum values that motors can provide, resulting in insufficient lifts. Therefore, the maximum yaw and pitch angles of UAVs are limited, as well as the curvature of the path, which guarantees the safe operation of UAVs. The schematic diagram of the limitations of the yaw and pitch angles along the path points is shown in Figure 2. As a result, the penalties F Y and F A based on these angular limitations are defined as follows, respectively:
F Y = i = 1 U j = 1 V + 1 Y i , j
Y i , j = 0 , i f α i , j α m a x 100 , o t h e r w i s e
α i , j = arctan P i , j 1 P i , j × P i , j P i , j + 1 P i , j 1 P i , j · P i , j P i , j + 1
F A = i = 1 U j = 1 V + 1 A i , j
A i , j = 0 , i f β i , j β m a x 100 , o t h e r w i s e
β i , j = arctan z i , j z i , j 1 x i , j x i , j 1 2 + y i , j y i , j 1 2
where α m a x , β m a x refer to the maximum yaw and pitch angles, respectively. α i , j , β i , j represent the yaw and pitch angle at the current point, respectively. P i , j 1 P i , j is the horizontal projection of P i , j 1 P i , j .
Figure 2. The diagram of the limitations of yaw and pitch angles along the path points: since the yaw angle α and pitch angle β are within the restricted range, the curvature of the path connecting the previous path point P i , j 1 , the current one P i , j , and the next one P i , j + 1 is limited.

2.3.3. Altitudinal Constraints

In real application, UAVs are usually encouraged to fly at appropriate altitudes to mitigate the effects of winds and ensure safety. Therefore, the penalty F H depending upon the altitudinal constraint is defined as follows:
F H = i = 1 U j = 1 V H i , j
H i , j = h i , j h l b + h u b 2 , i f h l b h i , j h u b 100 , o t h e r w i s e
where h l b and h u b refer to the lowest and highest flight altitudes, respectively, and h i , j = z i , j represents the height of the UAV.

2.3.4. Collision Avoidance Between UAVs

To avoid collisions, each UAV should search for the position of other UAVs at each trajectory point. Assuming that it flies at a constant speed between trajectory points, with a speed of V i , j , the time to pass through each trajectory segment can be calculated by L i , j , and then the total flight time can be obtained accordingly.
Suppose that UAVs p and q have a total flight time of t p , j and t q , j at the j-th point ( p , q , j N + , j > 1 ), respectively. When t q , j 1 < t p , j < t q , j , the current position coordinate of UAV q is given as
P q = P q , j 1 + ( t p , j t q , j 1 ) V q , j
where V q , j = ( v q , j x , v q , j y , v q , j z ) indicates the velocity of UAV q in three directions.
Similarly, the positions of other UAVs can be determined. With these positions, the safety distance, denoted as u s d , could be set according to the application scenarios. So far, the collision penalty F U C between UAVs is defined as follows:
F U C = i = 1 U j = 1 V U C i , j
where U C i , j is a flag of collision. If the distance P i P j between UAVs i and j exceeds u s d , U C i , j = 0 ; otherwise, U C i , j = 100 .

2.4. Objective Function

Therefore, by integrating all the aforementioned penalties based on threats, physical limitations of UAVs, as well as the collision avoidance issue into the path length cost shown in Equation (1), the objective function of the multi-UAV path planning problem is written as follows:
J = w 1 F L + w 2 F S L + w 3 F A + w 4 F Y + w 5 F H + w 6 F C + w 7 F U C + w 8 F R + w 9 F N F
where J represents the total cost, i.e., the fitness, and w i refers to the weight of each cost function.
Then, a swarm-intelligence-based optimization method is proposed and applied to find the optimal value of the above objective function, with which the trajectory points along the optimal path of UAVs can be obtained.

3. Improved Nutcracker Optimization Algorithm for Multi-UAV Path Planning

This section first introduces the principle of the standard NOA, which is the basis of solving the multi-UAV path planning problem, and then its limitations are analyzed. Accordingly, INOA is proposed to further improve the planning performance.

3.1. Standard NOA

NOA is motivated by the foraging and food recovery behavior of nutcrackers. It mimics two seasonal behaviors of the nutcracker, namely, seeking and storing seeds in summer and autumn, and foraging based on memory in winter and spring. As a result, this algorithm shows a strong global search capability, and it is suitable to implement in path planning for multiple UAVs due to its applicability to constrained multi-objective optimization problems.

3.1.1. Foraging Phase and Storage Stage

(1)
Foraging stage
In this stage, a nutcracker checks the initial position first. If there exists food, it would be actuated towards the storage area; otherwise, it goes to another location for continued searching. The modeling of this behavior is given as follows:
X i t + 1 ( n e w ) = X a , j t + γ X U , j t X V , j t + μ g 2 U j L j , i f t T / 2.0 X W , j t + μ X U , j t X V , j t + μ ( g 1 < δ ) g 2 U j L j , o t h e r w i s e
X i t + 1 ( n e w ) = X i , j t , i f h 1 < h 2 E q u a t i o n ( 25 ) , o t h e r w i s e
μ = h 3 , i f g 1 < g 2 h 4 , i f g 2 < g 3 h 5 , i f g 1 < g 3
where X i , j t represents the j-th position of the i-th nutcracker in the iteration t, X i t + 1 ( n e w ) represents the new position of the nutcracker, X a , j t represents the average position of all nutcrackers, X U , j t , X V , j t and X W , j t represent three different nutcracker individuals randomly selected from the population, and U j and L j represent the maximum and minimum values, respectively. γ and h 5 represent the random numbers generated by a Levy flight; g, g 1 , g 2 , g 3 , h 1 , h 2 , and h 3 represent random numbers in the range of [0, 1]; h 4 refers to a random number that follows a normal distribution; and δ is used to improve the local exploitation capability.
(2)
Storage stage
The nutcracker transports the food obtained during the foraging phase to the storage area. This behavior can be mathematically represented as follows:
X i t + 1 ( n e w ) = X i t + μ X b e s t t X i t | λ | + g 1 X U t X V t i f h 1 < h 2 X b e s t t + μ X U t X V t i f h 1 < h 3 X b e s t t l o t h e r w i s e
where λ represents the number generated based on a Levy flight, X b e s t t represents the current optimal solution, and l represents a factor that linearly decreases from 1 to 0.
Then, the following equation is used to adjust the conversion between the foraging phase and storage stage:
X i t + 1 = E q u a t i o n ( 26 ) , i f u < P a 1 E q u a t i o n ( 28 ) , o t h e r w i s e
where u represents random numbers in the range of [0, 1], and P a 1 decreases linearly from 1 to 0.

3.1.2. Cache-Search and Recovery Strategy

(1)
Cache-search stage
As winter approaches, the nutcracker would embark on its second round of exploration, transitioning from its storage mode to a search mode by using two reference points (FPs) as markers for a single storage area. These two reference points F P i , 1 t and F P i , 2 t are defined as follows:
F P i , 1 t = X i t + φ c o s α X U t X V t + φ F P , i f α = π / 2 X i t + φ c o s α X U t X V t , o t h e r w i s e
F P i , 2 t = X i t + φ cos α U L h 3 + L + φ F P U s i f α = π / 2 X i t + φ c o s α U L h 3 + L U s , o t h e r w i s e
U s = 1 , g 2 < P r p 0 , o t h e r w i s e
φ = 1 t T 2 · t T , i f g 1 > g 2 t T 2 t , o t h e r w i s e
where t and T represent the current and maximum number of generations, respectively. α represents a random radian in the range of [ 0 , π ] . P r p represents the percentage likelihood of investigating various areas within the exploration domain.
(2)
Recovery stage
When nutcrackers search for their caches, they may encounter two possibilities. The first possibility is that the nutcracker can use the first F P to remember its cache location. The following formula characterizes this behavior:
X i , j t + 1 = X i , j t , i f h 3 < h 4 X i , j t + g 1 X b e s t , j t X i , j t + g 2 F P i , 1 t X W , j t , o t h e r w i s e
The second possibility is that the nutcracker, using the first F P , does not remember the location of the hidden food, and then it would use the second F P to search for food. This behavior can be characterized as follows:
X i , j t + 1 = X i , j t , i f h 5 < h 6 X i , j t + g 1 X b e s t , j t X i , j t + g 2 F P i , 2 t X W , j t , o t h e r w i s e
where h 6 is also a random number in the interval of [0, 1].
Thus, the recovery behavior can be summarized by the following formula:
X i t + 1 = E q u a t i o n ( 34 ) , i f   h 7 < h 8 E q u a t i o n ( 35 ) , o t h e r w i s e
where h 7 , h 8 are random numbers in the interval of [0, 1].
Update the position based on whether the nutcracker finds food, which is determined by the following formula:
X i t + 1 = X i t , i f f ( X i t ) < i f ( F P i , 1 t ) F P i , 1 t , o t h e r w i s e
X i t + 1 = X i t , i f f ( X i t ) < i f ( F P i , 2 t ) F P i , 2 t , o t h e r w i s e
To balance foraging behaviors through FP, the following formula is utilized:
X i t + 1 = E q u a t i o n ( 37 ) , i f f ( F P i , 1 ) < f ( F P i , 2 ) E q u a t i o n ( 38 ) , o t h e r w i s e
Similarly, the following equation is defined to adjust the conversion between the caching phase and recovery phase:
X i t + 1 = E q u a t i o n ( 36 ) , i f v > P a 2 E q u a t i o n ( 39 ) , o t h e r w i s e
where v represents a random number in the range of [0, 1], and P a 2 represents the probability of exchange between the cache search phase and the recovery phase.

3.2. Improvements on NOA

Although NOA is a competitive and efficient intelligent optimization algorithm for path planning, it also has regular issues, such as uneven population initialization and an imbalance between exploration and exploitation capabilities. Consequently, the following improvements are implemented to enhance its search performance, thereby obtaining more satisfactory paths for UAVs.

3.2.1. Improvement of Initialization Based on Logistic Chaos Map

The basic population initialization is randomly distributed throughout the entire space, which has high randomness and uneven distribution, leading to problems such as a lack of population diversity and low search efficiency. Utilizing the chaotic mapping mechanism to increase the variety of the population can boost the efficiency of the algorithm. Its nonlinear characteristics and periodic nature enable it to generate more complex and random sequences, which helps to enhance the variety of the population and stops the population from converging on local optimal solutions. Commonly used chaotic mapping methods include the logistic map and the tent map. Compared to the other one, the logistic map offers a more flexible parameter adjustment, higher sensitivity to chaos, and advantages in simulating complex systems. Therefore, the logistic map was used for population initialization in this study, which is a nonlinear difference equation used to describe the change in a variable across time. The formula is as follows:
x n + 1 = μ x n 1 x n
where x n represents the state variable value at time step n, which is a number between 0 and 1, x n + 1 is the state variable value at the next time step n + 1 . μ is a positive control parameter, which is typically in the range of [0, 4]. Here, we set it as 4, placing the map in a fully mapped state with ergodicity.
As can be seen in Figure 3 and Figure 4, the sequence produced by the logistic map initialization is more uniform compared to a random initialization, which can find the optimal solution more quickly and avoid being stuck in local minima.
Figure 3. Random initialization adopted in NOA: the initialization sequence is uneven.
Figure 4. The logistic map initialization employed in INOA: the initialization sequence is more uniform.

3.2.2. Lens Imaging Inverse Learning Strategy

A lens imaging inverse learning strategy is also introduced in this study, which further enhances the diversity of the initial solutions. Compared to the ordinary backward learning strategy, the lens imaging inverse learning introduces a scaling factor k, which is more conducive to obtaining the global optimal solution. The schematic diagram is shown in Figure 5. By generating the inverse solution of the current initial solution through inverse learning, and comparing it with the fitness of the original solution, the initial solutions are updated, thereby further increasing the probability of the algorithm escaping from local optima.
Figure 5. The diagram of the lens imaging inverse learning strategy: if P is the current point within the bounds of [ l b , u b ] , then P is the inverse solution of P obtained through by the lens imaging inverse learning strategy.
The lens imaging inverse learning strategy is as follows:
X i , j r = U j + L j / 2 + U j + L j / 2 k X i , j t / k
X i , j t = X i , j r ,   i f f ( X i , j r ) < f ( X i , j t ) X i , j t ,   i f f ( X i , j r ) f ( X i , j t )
where X i , j r is the generated inverse solution, X i , j t is the current solution, and k is a random value between 1 and 10. When the fitness of the inverse solution is lower than that of the current solution, the inverse solution swaps out the current solution; otherwise, it remains unchanged.

3.2.3. Parameter Adjustment

The exploration phase reflects the algorithm’s capability to explore the global search space, determining whether the algorithm can find the optimal solution, while the exploitation phase demonstrates the algorithm’s capability to mine in the local search space, affecting the efficiency of the algorithm in obtaining the optimal solution. However, a limitation of NOA is that its transition between exploration and exploitation is not balanced. In the foraging/storage phase of NOA, the parameter P a 1 is responsible for regulating the conversion between exploration and exploitation, and it decreases linearly. But the actual foraging/storage behavior of the nutcracker may exhibit nonlinear characteristics. Moreover, in the search for the caching/recovery food phase, the parameter P a 2 that controls exploration and exploitation is set to a fixed value of 0.2, which cannot fully describe the complex behavior of the nutcracker in the process of searching for the caching areas and recovery food. Obviously, these parameters ( P a 1 and P a 2 ) do not accurately reflect the changes in the actual process. Therefore, the following improvements are made, as shown in Figure 6. Specifically, the original parameters are set as follows:
P a 1 = T t / T
P a 2 = 0.2
Then, to better balance the exploration and exploitation, the parameters are adjusted as follows:
P a 1 n e w = c o s π t / T + 1 / 2
P a 2 n e w = t / T 1 2 / 10 + 0.2
Figure 6. Parameter adjustment: INOA improves parameters P a 1 n e w and P a 2 n e w in the foraging/storage and the caching/recovery phases as trigonometric and quadratic functions, respectively, to better balance the exploration and exploitation.

3.2.4. Improved Sine–Cosine Strategy

The speed at which the nutcracker finds the food source affects the convergence velocity of the algorithm. By introducing the sine–cosine strategy and utilizing the oscillating characteristics of sine and cosine to search for food sources, the global exploration speed of the algorithm can be effectively improved. The sine–cosine algorithm formula is as follows:
X i , j t + 1 = X i , j t + z 1 sin ( z 2 ) | z 3 X b e s t , j t X i , j t | , z 4 > 0.5 X i , j t + z 1 cos ( z 2 ) | z 3 X b e s t , j t X i , j t | , z 4 0.5
where z 1 = 2 ( 1 t / T ) , z 2 [ 0 , 2 π ] , z 3 [ 0 , 2 ] , and z 4 [ 0 , 1 ] .
However, this algorithm has a weak local search capability; therefore, the following improvements are made on it:
X i , j t + 1 = z 1 sin ( z 2 ) + X b e s t , j t , z 4 > 0.5 z 1 cos ( z 2 ) + X b e s t , j t , z 4 0.5
To guarantee both the global exploration in the early stage and the local exploitation in the later stage, the sine–cosine strategy is finally modified as follows:
X i , j t + 1 = E q u a t i o n ( 48 ) , i f t / T < 0.3 E q u a t i o n ( 49 ) , o t h e r w i s e
By performing the aforementioned improvements, the flowchart of INOA is displayed in Figure 7, and the main steps are as follows:
Step 1: Set the size of the nutcracker population as N and initialize it using the logistic chaotic mapping.
Step 2: Apply the lens imaging inverse learning strategy to the initialized nutcracker population and select better elite initialization solutions through fitness comparison.
Step 3: Create p and q as random values between 0 and 1. If p < q , update the solution using the foraging and storing strategy and optimize the parameter P a 1 .
Step 4: If p q , update the solution using the caching search and recovery strategy and optimize the parameter P a 2 .
Step 5: Integrate the improved sine–cosine strategy to improve the algorithm’s abilities of balancing global exploration and local exploitation.
Step 6: Find the best fitness solution through iterations.
At this point, by applying the proposed INOA, the trajectory points in an optimal path for UAVs can be obtained.
Figure 7. The flowchat of INOA.

4. Path Fitting

To improve the smoothness of the optimal path and ensure the stability of a UAV flight, it is necessary to fit the path. Cubic spline interpolation was used in this study to construct a smooth curve on a given set of data points such that the curve was a cubic polynomial on each segment. For a set of ordered data points ( x 0 , y 0 , z 0 ) , ( x 1 , y 1 , z 1 ) , , ( x n , y n , z n ) , the cubic spline curve is defined as the cubic polynomial S i j k ( x , y ) between each pair of adjacent points ( x i , y i , z i ) and ( x i + 1 , y i + 1 , z i + 1 ) . The entire curve is composed of these polynomials spliced together. Each polynomial can be written as follows:
S i j k ( x , y ) = a i j k + b i j k ( x x i ) + c i j k ( y y j ) + d i j k ( x x i ) 2 + e i j k ( x x i ) ( y y j ) + f i j k ( y y j ) 2 + g i j k ( x x i ) 3 + h i j k ( x x i ) 2 ( y y j ) + i i j k ( x x i ) ( y y j ) 2 + j i j k ( y y j ) 3
where a i j k , b i j k ,..., j i j k represent the coefficients of the polynomial.
Forming a system of linear equations through interpolation conditions and the continuity of the first and the second derivatives, we can obtain all the coefficients of the polynomials by solving this system. Then, the interpolation function is constructed in the entire three-dimensional space, thereby ensuring that the connection between trajectory points is a smooth curve.

5. Simulation Results and Analysis

In this section, we report simulations of the proposed INOA, including those based on the test set, applications in multi-UAV path planning, and comparative studies, conducted on the MATLAB R2020b environment to comprehensively demonstrate the merits of INOA.

5.1. Simulation Based on Test Set

The effectiveness of the proposed INOA was confirmed through the CEC2020 test suite. The CEC2020 test suite is a set of standard test functions used to evaluate and compare the performance of optimization algorithms, which was proposed at the 2020 congress on evolutionary computation (CEC), and it consists of a total of 10 functions.
To show the superiority of INOA, it was compared to NOA, Harris hawks optimization (HHO), and sea horse optimization (SHO). The population size was set to 50, the maximum number of iterations to 500, and the dimensionality to 20. The algorithms were run 30 times, and the average values were taken for the simulation. The simulation results are shown in Figure 8. It is seen that INOA achieved the best results on all 10 functions during the entire search process, except for functions F2 and F6, where the global search performance does not outperform but still keeps pace with NOA in the early stage.
Figure 8. Simulation based on CEC2020: INOA obtains the best results on all 10 functions.

5.2. Simulation for Multi-UAV Path Planning

To further verify the effectiveness and applicability of INOA in multi-UAV path planning, simulation scenarios are designed for the following four different flight situations.
In the first scenario, we set the number of UAVs as 6, with 20 waypoints for each. Suppose that there are 2 no-fly zones, 3 mountains, and 2 radar threats in the flying environment. The start points of the UAVs were set as (0, 0, 0), (0, 5, 0), (5, 0, 0), (5, 5, 0), (0, 10, 0), and (10, 0, 0), respectively. The end points were (90, 90, 100), (90, 100, 100), (100, 90, 100), (100, 100, 90), (85, 85, 100), and (90, 85, 100), respectively. In the second scenario, based on the first one, the number of mountains increased to 6. In the third scenario, building upon the first one, the number of radar threats increased to 4.
To enhance the persuasiveness of the algorithm, the fourth scenario considered a more complex situation, in which the number of UAVs was set as 8, with 20 waypoints for each. The start points for each UAV were set as (0, 0, 0), (0, 0, 5), (0, 5, 0), (0, 5, 5), (5, 0, 0), (5, 0, 5), (5, 5, 0), and (5, 5, 5), while the end points were (95, 95, 100), (95, 95, 95), (95, 100, 95), (95, 100, 100), (100, 95, 95), (100, 95, 100), (100, 100, 95), and (100, 100, 100). Additionally, the number of mountains and radar threats increased to 6 and 4, respectively.
The parameter setting for the problem description is presented in Table 1. In the simulation, we set the population size as 60 and allow for a maximum of 100 iterations.
Table 1. Parameter setting for the problem description.
Figure 9, Figure 10, Figure 11, Figure 12, Figure 13, Figure 14, Figure 15 and Figure 16 show the path planning results in all scenarios. It can be seen that INOA always performs well for multi-UAV path planning tasks. Whether there exists an increase in the mountainous terrain or radar threats does not affect the performance of INOA in complex scenarios such as the fourth scenario. As the safe operating space for multiple UAVs is further compressed, the difficulty of finding feasible paths for multiple UAVs greatly increases. Nevertheless, it is evident that INOA can effectively overcome these challenges and successfully find the optimal paths.
Figure 9. The result of multi-UAV path planning in the first scenario with 6 UAVs, 3 mountains, 2 radars, and 2 no-fly zones.
Figure 10. Top–down view of multi-UAV path planning in the first scenario with 6 UAVs, 3 mountains, 2 radars, and 2 no-fly zones.
Figure 11. The result of multi-UAV path planning in the second scenario with 6 UAVs, 6 mountains, 2 radars, and 2 no-fly zones: compared with the first scenario, the number of mountains increases to 6.
Figure 12. Top–down view of multi-UAV path planning in the second scenario with 6 UAVs, 6 mountains, 2 radars, and 2 no-fly zones: compared with the first scenario, the number of mountains increases to 6.
Figure 13. The result of multi-UAV path planning in the third scenario with 6 UAVs, 3 mountains, 4 radars, and 2 no-fly zones: compared with the first scenario, the number of radar threats increases to 4.
Figure 14. Top–down view of multi-UAV path planning in the third scenario with 6 UAVs, 3 mountains, 4 radars, and 2 no-fly zones: compared with the first scenario, the number of radar threats increases to 4.
Figure 15. The result of multi-UAV path planning in the fourth scenario with 8 UAVs, 6 mountains, 4 radars, and 2 no-fly zones: compared with the first scenario, the numbers of UAVs, mountains, and radars all increased.
Figure 16. Top–down view of multi-UAV path planning in the fourth scenario with 8 UAVs, 6 mountains, 4 radars, and 2 no-fly zones: compared with the first scenario, the numbers of UAVs, mountains, and radars are all increased.
To address the advantages of the proposed INOA, a comparative analysis was performed between PSO, GWO, NOA, and INOA based on the first scenario. The parameters of PSO can be found in [], and those of other algorithms are properly selected and shown in Table 2. To avoid randomness, all algorithms were run 30 times.
Table 2. Parameters in all algorithms.
The comparison of fitness values is displayed in Figure 17, and the corresponding statics are given in Table 3. As shown in them, due to its lack of a mechanism to escape local optima, PSO is trapped in a local optimal solution and cannot escape, resulting in the least ideal performance. GWO lacks convergence velocity and does not converge to the global optimal solution within the number of iterations. NOA converges quickly and has strong multi-objective optimization capabilities such that it is able to find high-quality solutions in a relatively small number of iterations, showing obvious advantages over the other two algorithms, but there is still room for improvement. Based on it, INOA further increases the search efficiency by obtaining a higher-fitness initial solution through logistic mapping; then, it improves the quality of the initial solution through lens imaging inverse learning and subsequently enhances convergence speed and precision through parameter adjustment and the improved sine and cosine strategy. As a result, INOA can apparently improve the capability of both global search and local exploitation, strongly confirming its effectiveness in multi-UAV path planning.
Figure 17. Fitness comparison: INOA achieves the best fitness compared with PSO, GWO, and NOA, with the improved search efficiency.
Table 3. Fitness statistics.

6. Conclusions

The multi-UAV path planning problem faces many challenges. It was effectively modeled in this study with constraints such as path length, segment distance, smoothness, altitude, collision with mountains, inter-UAV collisions, radar detection, and no-fly zones, being converted into a multi-objective restricted optimization problem. To solve this problem, INOA is proposed with a more fit elite initialization population through logistic chaotic mapping initialization and a lens imaging inverse learning strategy. Furthermore, it has a balanced global search capability with a local exploitation capability through parameter adjustments and the integration of an improved sine–cosine search strategy. Simulation results on multi-UAV path planning showed that INOA significantly surpasses other algorithms with regard to convergence velocity and solution precision.
INOA addresses global path planning and can design a feasible general route for each UAV in practical applications. However, from the perspective of absolute safety, the research on the local planning of UAVs is necessary but not adequate enough. Further research will delve into local planning algorithms. The synthesis of these two methods would be applied into the real-world flight of UAVs with a more satisfactory performance.

Author Contributions

Conceptualization, H.Y. and C.X.; methodology, H.Y. and C.X.; software, C.X.; validation, C.X. and B.Z.; formal analysis, H.Y. and C.X.; investigation, C.X.; data curation, C.X.; writing—original draft preparation, C.X. and H.Y.; writing—review and editing, C.X. and H.Y.; visualization, C.X.; supervision, H.Y.; project administration, H.Y.; funding acquisition, H.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by National Natural Science Foundations of China (62003130) and Fundamental Research Funds for the Central Universities (B230201055).

Data Availability Statement

All data are presented in the article.

Acknowledgments

The authors would like to thank everyone who provided valuable suggestions and support to improve the content, quality, and presentation of this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
UAVUnmanned aerial vehicle
DQNDeep q-learning
DDPGDeep deterministic policy gradient
PPOProximal policy optimization
MDPMarkov decision process
D3QNDueling double deep Q-network
ACOAnt colony algorithm
PSOParticle swarm optimization
GAGenetic algorithm
GWOGrey wolf optimizer
NOANutcracker optimization algorithm
HGJOHybrid strategy-based golden jackal optimizer
SSASparrow search algorithm
GSOGolden search optimization
DMSPSODynamic multi-swarm PSO
CLPSOComprehensive learning PSO
INOAImproved nutcracker optimization algorithm
CECCongress on evolutionary computation
HHOHarris hawks optimization
SHOSea horse optimization

References

  1. Han, L.; Wu, X.; Sun, X. Hybrid path planning algorithm for mobile robot based on a* algorithm fused with dwa. In Proceedings of the 2023 IEEE 3rd International Conference on Information Technology, Big Data and Artificial Intelligence (ICIBA), Chongqing, China, 26–28 May 2023; Volume 3, pp. 1465–1469. [Google Scholar]
  2. Luo, M.; Hou, X.; Yang, J. Surface Optimal Path Planning Using an Extended Dijkstra Algorithm. IEEE Access 2020, 8, 147827–147838. [Google Scholar] [CrossRef]
  3. Yang, F.; Su, H. Research on Improving Artificial Potential Field for Drone Obstacle Avoidance. In Proceedings of the 2024 3rd International Conference on Artificial Intelligence and Computer Information Technology (AICIT), Yichang, China, 20–22 September 2024; pp. 1–4. [Google Scholar] [CrossRef]
  4. Jang, K.; Baek, J.; Park, S.; Park, J. Motion Planning for Closed-Chain Constraints Based on Probabilistic Roadmap with Improved Connectivity. IEEE/ASME Trans. Mechatronics 2022, 27, 2035–2043. [Google Scholar] [CrossRef]
  5. Wang, W.; Li, J.; Bai, Z.; Wei, Z.; Peng, J. Toward Optimization of AGV Path Planning: An RRT*-ACO Algorithm. IEEE Access 2024, 12, 18387–18399. [Google Scholar] [CrossRef]
  6. Jiang, W.; Bao, C.; Xu, G.; Wang, Y. Research on Autonomous Obstacle Avoidance and Target Tracking of UAV Based on Improved Dueling DQN Algorithm. In Proceedings of the 2021 China Automation Congress (CAC), Beijing, China, 22–24 October 2021; pp. 5110–5115. [Google Scholar] [CrossRef]
  7. Li, P.; Ding, X.; Ren, W. Research on Path Planning of Cloud Robot in Dynamic Environment Based on Improved DDPG Algorithm. In Proceedings of the 2021 China Automation Congress (CAC), Beijing, China, 22–24 October 2021; pp. 3561–3566. [Google Scholar] [CrossRef]
  8. Li, L.; Li, W.; Wang, J.; Chen, X.; Peng, Q.; Huang, W. UAV Trajectory Optimization for Spectrum Cartography: A PPO Approach. IEEE Commun. Lett. 2023, 27, 1575–1579. [Google Scholar] [CrossRef]
  9. Li, B.; Wu, Y. Path Planning for UAV Ground Target Tracking via Deep Reinforcement Learning. IEEE Access 2020, 8, 29064–29074. [Google Scholar] [CrossRef]
  10. Wang, X.; Gursoy, M.C.; Erpek, T.; Sagduyu, Y.E. Learning-Based UAV Path Planning for Data Collection with Integrated Collision Avoidance. IEEE Internet Things J. 2022, 9, 16663–16676. [Google Scholar] [CrossRef]
  11. Chen, Y.; Bai, G.; Zhan, Y.; Hu, X.; Liu, J. Path Planning and Obstacle Avoiding of the USV Based on Improved ACO-APF Hybrid Algorithm with Adaptive Early-Warning. IEEE Access 2021, 9, 40728–40742. [Google Scholar] [CrossRef]
  12. Tao, B.; Kim, J.H. Mobile robot path planning based on bi-population particle swarm optimization with random perturbation strategy. J. King Saud Univ.-Comput. Inf. Sci. 2024, 36, 101974. [Google Scholar] [CrossRef]
  13. Pehlivanoglu, Y.V.; Pehlivanoğlu, P. An enhanced genetic algorithm for path planning of autonomous UAV in target coverage problems. Appl. Soft Comput. 2021, 112, 107796. [Google Scholar] [CrossRef]
  14. Zhang, W.; Zhang, S.; Wu, F.; Wang, Y. Path Planning of UAV Based on Improved Adaptive Grey Wolf Optimization Algorithm. IEEE Access 2021, 9, 89400–89411. [Google Scholar] [CrossRef]
  15. Abdel-Basset, M.; Mohamed, R.; Jameel, M.; Abouhawwash, M. Nutcracker optimizer: A novel nature-inspired metaheuristic algorithm for global optimization and engineering design problems. Knowl.-Based Syst. 2023, 262, 110248. [Google Scholar] [CrossRef]
  16. Lou, T.S.; Yue, Z.P.; Jiao, Y.Z.; He, Z.D. A hybrid strategy-based GJO algorithm for robot path planning. Expert Syst. Appl. 2024, 238, 121975. [Google Scholar] [CrossRef]
  17. Zhang, J.; Chen, D.; Han, G.; Qian, Y. Formation Path Planning for Collaborative Autonomous Underwater Vehicles Based on Consensus-Sparrow Search Algorithm. IEEE Internet Things J. 2023, 11, 13810–13823. [Google Scholar] [CrossRef]
  18. Yu, X.; Jiang, N.; Wang, X.; Li, M. A hybrid algorithm based on grey wolf optimizer and differential evolution for UAV path planning. Expert Syst. Appl. 2023, 215, 119327. [Google Scholar] [CrossRef]
  19. Xu, L.; Cao, X.; Du, W.; Li, Y. Cooperative path planning optimization for multiple UAVs with communication constraints. Knowl.-Based Syst. 2023, 260, 110164. [Google Scholar] [CrossRef]
  20. Meng, K.; Chen, C.; Wu, T.; Xin, B.; Liang, M.; Deng, F. Evolutionary State Estimation-Based Multi-Strategy Jellyfish Search Algorithm for Multi-UAV Cooperative Path Planning. IEEE Trans. Intell. Veh. 2024, 1–19. [Google Scholar] [CrossRef]
  21. Mellinger, D.; Kumar, V. Minimum snap trajectory generation and control for quadrotors. In Proceedings of the 2011 IEEE International Conference on Robotics and Automation, Shanghai, China, 9–13 May 2011; pp. 2520–2525. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.