1. Introduction
Since its outstanding performance in the Gulf War in 1991, drones have made good achievements in the Afghanistan War, the Iraq War, the fight against the Islamic State (ISIS) terrorist group, the “Neptune Spear” decapitation operation in 2011, and the Russia–Ukraine conflict in 2022. Their success has caused countries around the world to invest a large amount of manpower and financial resources in the research of UAV [
1], as shown in
Figure 1. After decades of development, UAVs have not only been applied in the military fields of reconnaissance, surveillance, communication relay, electronic countermeasures, combat assessment, harassment, decoy, anti-submarine, target attack, etc. At the same time, they have been widely used in agriculture [
2], energy [
3], civil [
4] and other very important fields. However, there are some problems with a single drone performing its mission. For example, when a single UAV performs a reconnaissance mission, it may be limited by the observation angle and cannot observe the target area from multiple different orientations [
5]; when faced with a large-scale search task, a single UAV cannot effectively cover the entire reconnaissance area [
6]; during the attack, the combat range, killing radius, destruction capability and attack accuracy are limited, thus affecting the success rate of the entire combat mission [
7]; if a single drone fails in the middle of a mission, it must immediately interrupt the mission and return, but in a war, it may delay the aircraft and destroy the entire operation plan. In order to improve combat effectiveness and make up for the deficiency of a single UAV, a multi-UAV cooperative formation (cluster) combat task is proposed. It refers to the formation, maintenance or reconstruction of a certain geometric formation during the execution of a task by multiple UAVs to adapt to the battlefield situation and task requirements.
Unmanned aerial vehicle formation has incomparable advantages over a single unmanned aerial vehicle. When UAV formation is not possible in extreme weather, maintaining different formations can improve the aerodynamic efficiency of UAVs to varying degrees, thus reducing the overall flight resistance and saving fuel [
8]; it can realize the all-round reconnaissance or observation of the target, such as the enemy target monitoring and reconnaissance, resource exploration and so on, and it can greatly improve the scope of target monitoring; 0069t can also realize simultaneous strikes on multiple mission targets, disrupt the enemy’s combat command system, improve the lethality and hit rate of targets, and improve combat effectiveness, as shown in
Figure 2. In the process of use, UAVs are equipped with intelligent devices, which can simulate the transport environment in real time, determine their own position, control their flight status, select effective trajectory points, and calculate safe trajectory. These are important guarantees for UAVs formation to reach the target point from the take-off point as well as important prerequisites for a UAV formation to complete tasks. Therefore, it is important to select a suitable algorithm for UAV formation trajectory planning.
The purpose of trajectory planning of UAV formation is based on the specific tasks, terrain, weather and other environmental factors of each UAV as well as its own flight performance. Under the premise of satisfying multiple constraints, the specified performance index can be optimized or better so that all UAVs in the formation can safely reach the target from the starting point. The trajectory planning of UAV formation is a complex multi-objective optimization and decision problem under multiple constraints. With the increasing number of UAVs, the analytic space of the problem will increase exponentially. In the study of UAV trajectory planning, the algorithm is the soul of UAV trajectory planning, which is directly related to the efficiency and results of trajectory planning. Compared with single UAV trajectory planning, the complexity of UAV formation trajectory planning is mainly reflected as follows:
- (1)
In many cases, the scope of planning space is large and complex: for example, there are various spatial obstacles and dynamic threats in the modern battlefield environment;
- (2)
There are many constraints. Not only should the planned flyable trajectory conform to the actual dynamics and kinematic characteristics of the UAV, but also the coordination between time and space and the concealment of the trajectory should be considered;
- (3)
Multi-UAV trajectory planning can adapt to battlefield dynamic changes and adjust trajectories online in real time.
For the trajectory planning of UAV formation, many papers have proposed solutions from different perspectives, but there are still many unsolved problems and many limitations, resulting in numerous and complex papers without a comprehensive and systematic classification, which is not conducive to research and reading.
The reference [
9] classifies and statistically analyzes the cooperative flight path planning of various UAV formations from the three elements of a UAV system (mission, UAV crew and environment) and the three elements of UAV formation cooperative flight path planning (UAV flight path, target and constraint), but it does not discuss the flight path planning algorithm of UAV formation.
Stochastic Heuristic Algorithms (SHA) are reviewed in reference [
10], and the characteristics, improvements, applications, advantages and disadvantages of some of them are discussed, but non-SHA algorithms in UAV formation flight paths are not discussed.
The reference [
11] divides the flight path planning algorithms of UAV formation into five categories, including optimal algorithm, graph theoretics-based planning method, heuristic information-based planning algorithm, swarm intelligence algorithm and neural network algorithm. Then, a simple description is given to these categories, but no specific algorithms are discussed.
Reference [
12] reviews swarm intelligence algorithms from four aspects, such as collision avoidance processing, task allocation, track planning and formation recombination, and it discusses classical algorithms among them. However, it does not discuss non-swarm intelligence algorithms, which has certain limitations.
Compared with many studies in the literature on UAV formation trajectory planning [
9,
10,
11,
12], the contributions of this paper are as follows.
In this review, the UAV formation trajectory planning algorithms used in recent decades are classified in detail, and the basic principles of these algorithms are described and compared so as to find out the shortcomings of UAV formation trajectory planning algorithms. The challenges and future research directions of the algorithm are summarized and prospected, which provides reference information for researchers and workers engaged in the formation flight of UAVs.
This paper can be divided into the following parts: Firstly, a simple classification framework of the UAV formation trajectory planning algorithm is introduced in
Section 2. Then, the global trajectory planning algorithms are summarized in time order in
Section 3 and
Section 4. Among them,
Section 3 summarizes the traditional algorithm and
Section 4 summarizes the intelligent algorithm.
Section 5 summarizes the local trajectory planning algorithm.
Section 6 summarizes the challenges the algorithm faces.
Section 7 summarizes the focus and direction of future research.
Section 8 summarizes the full text.
3. Traditional Algorithm
Traditional methods must build the map environment for the target before performing trajectory planning. Firstly, the map environment was discretized into graphs, and feasible trajectories were generated by the search algorithm to complete the global trajectory planning of UAV formation. The existing algorithms are the Dijkstra algorithm, Dubins Curve, Floyd algorithm, Voronoi graph method, Probabilistic Roadmaps (PRM), and Rapidly-Exploring Random Tree (RRT).
3.1. Dijkstra Algorithm
The Dijkstra algorithm is the classical shortest trajectory method in the geometric graph method, in which the vertex represents trajectory points, the edge represents a feasible trajectory, the line between nodes is called an edge, and each edge has a corresponding weight, which is the distance or cost of the journey; it is suitable for two-dimensional static obstacle avoidance scenes with a non-negative side weight. The key to using this algorithm is to select effective trajectory points, shorten the planning time, expand from the starting point, find the shortest trajectory for a node in each step, select the node with the smallest distance from the node that has never been visited to register, then traverse the adjacent nodes of the node after the node is included, and then update the distance. The cost diagram of a Dijkstra algorithm is shown in
Figure 5.
Aiming at the uncertain region search problem, Sujit and Ghose [
13] proposed a search algorithm based on the K-shortest trajectory algorithm for UAV to search targets in an unknown environment. It satisfies the requirements of endurance time of each UAV and the location of the base station of UAV operation, and it enables each UAV to search in the area of maximum uncertainty so as to maximize the search benefit.
In order to meet the needs of searching an unknown environment and tracking moving targets in a balanced way, Tin [
14] improved on Dijkstra’s algorithm and proposed a robust shortest algorithm (ARSP) to deal with arc uncertainty. The influence of information uncertainty and environmental change on the trajectory planning algorithm is overcome, and the trajectory is quickly replanned at the same time.
Ueno and Kwon [
15] applied the Dijkstra algorithm to the minimum time reconstruction of UAV formation in order to meet the requirements of optimality and short computing time, and the trajectory generated within the shortest time is close to the optimal trajectory.
Aggarwal et al. [
16] proposed an approximate trajectory generation method to generate an approximate trajectory length under the condition of meeting the safety constraints of a UAV. This method is based on the total cost of the Lagrange relaxation (LARAC) algorithm, and it iteratively uses the Dijkstra algorithm (iDijkstra) to modify the edge cost, which solves the safety constraints and flight energy consumption of UAV caused by extreme high temperature.
3.2. Dubins Curve
The Dubins curve is the shortest locus connecting two two-dimensional planes (i.e., the X-Y plane) under the condition that the curvature constraint is satisfied and the tangent directions of the specified beginning and end are satisfied. In 1957, Lester Eli Dubins proved that any locus can consist of a maximum curvature arc or straight segment (the locus between two points must exist). In other words, the shortest path connecting two points will be constituted by the circular arc of the maximum curvature and the straight line segment. The Dubins of any starting point to the end point are composed of not more than three original motions, and the sequence constituted by the three original motions is called a kind of trajectory. As two continuous and identical primordial motions can be combined into one primordial motion, Dubins proved that the optimal trajectory can only be one of the following six combinations: namely, RSR, LSL, RSL, LSR, RLR, LRL. The first four are collectively called a CSC trajectory, and the last two are collectively called a CCC trajectory, where the primordial motion R represents right turn, S stands for straight and L stands for left.
Figure 6 is the trajectory diagram of one Dubins curve LRL.
D’Amato, Mattei, and Notaro [
17] modeled the UAV as a Dubins vehicle, using a method based on the Reduced Visibility Graph (RVG), connecting selected nodes by arcs and segments, and adding the Rendez-Vous Waypoints (RVWs). It was based on the leader–follower Stackelberg model’s two-layer game theory method to optimize the location of the trajectory point and the trajectory of the UAV as much as possible in order to find the optimal trajectory while maintaining the shape of the formation in many places.
3.3. Floyd Algorithm
The Floyd algorithm, also known as the interpolation method, is a relatively classic algorithm for solving graph theory problems. It is an algorithm to solve the shortest trajectory between vertices in a given weighted graph, and it can correctly handle the shortest trajectory problem of directed graphs. At the same time, it is a dynamic programming algorithm, and the connection weight between nodes can be positive or negative; similar to Dijkstra’s algorithm, but different from it is that Floyd’s algorithm is used to find the distance between any two points, which is the shortest path of multiple sources, and it can be calculated with negative weights, while Dijkstra’s algorithm is used to find the shortest route from one vertex to all other vertices, is the single-source shortest path, and negative weight circuits cannot be calculated.
Faced with the problem of multi-UAV cooperative patrol trajectory planning under constraints such as time windows, mandatory patrol nodes, UAV flight time and imaging sensors, Yang et al. [
18] proposed a new cooperative patrol trajectory planning method, using the Floyd algorithm to generate the initial trajectory, and then used the improved forward insertion heuristic algorithm (PFIH) to obtain the optimal trajectory.
Zhou and Nie [
19] proposed a graph-based trajectory planning method for multi-UAV systems, using the Floyd algorithm to update the adjacent cost matrix and trajectory matrix, and solved the problem of UAV formation trajectory planning.
3.4. Fast Marching Method
The fast marching method (FM) is an efficient numerical algorithm for solving the optical path function equation (Eikonal equation), and the optical path function equation is as follows:
where (
x, y) is the coordinate of the calculation point in the pose space,
T(
x, y) is the time when the interface function arrives at the calculation point,
V(
x, y) is the propagation velocity set by the interface function, and it is a fixed value in trajectory planning. The optical path function solution model is shown in
Figure 7.
The fast marching method first establishes a rasterized space for storing time values, and then, the time cost will be converted into the distance cost during planning. Then, we set reachable points and unreachable points and complete the minimum value search operation by continuously updating the distance cost to obtain the distance matrix. We use it to construct the potential field and then use the gradient descent method to iterate continuously from the starting point along the direction of the fastest gradient descent in the generated potential field, obtaining a smooth trajectory without collision.
Aiming at the problem that the trajectory obtained by the traditional FM algorithm in the 3D environment will be too close to obstacles and the trajectory is not smooth enough, López et al. [
20,
21] proposed a fast marching square algorithm (FM2), which improves the FM algorithm by changing the propagation speed in space so that the wave will tend to follow the track travel.
3.5. Voronoi Diagram Method
The Voronoi graph method (also known as Dirichlet tessellation) is a space segmentation algorithm proposed by Russian mathematician Georgy Voronoy. It divides the space into many sub-regions through a series of seed nodes (Seed Points), each sub-region is called a Cell, and the distance between all points in each Cell and the Seed Points in the current Cell is less than Distance to all other Seed Points. According to the distribution of obstacles, the Voronoi diagram squares the free space between the edges of the obstacles, and at the same time, it draws the vertical line of adjacent obstacles to form a polygon around the obstacles so that each side is equidistant from the surrounding obstacles. Then, the origin and destination nodes can be connected into the graph by constructing trajectories from the nodes to the edges closest to each node.
Figure 8 is a Voronoi trajectory diagram.
Unavoidable accidents or environmental interference problems will inevitably occur when UAV formations perform multi-mission planning and collaborative trajectory planning. In order to cope with this situation, Meng et al. [
22] proposed an algorithm to deal with multi-UAV multi-task trajectory re-planning in an unexpected event environment. Each UAV uses a Voronoi diagram to plan its own initial, optimal or sub-optimal trajectory; then, it replans its trajectory according to the new multi-task requirements corresponding to some unexpected events.
To solve the coverage problem with average Voronoi partitions, Chen et al. [
23] proposed a distributed coverage algorithm to cover the convex area of the average Voronoi partition of the UAV formation. By exchanging local information with neighbors, the Voronoi partition is continuously iteratively updated, and the UAV direction of movement is calculated. The algorithm can theoretically make the area difference infinitely small so as to achieve the actual average area coverage.
Chen et al. [
24] proposed a method based on consistency theory, using the Voronoi diagram method to create a threat domain, and designing a cost function for trajectory planning of multiple UAVs, so that multiple UAVs can take off at the same time and reach the specified target Point, solving the problem of UAV formation attacking multiple targets in a static threat environment.
Hu et al. [
25] proposed a distributed formation control and collision avoidance method based on the Voronoi partition and traditional artificial potential field, using the Voronoi partition theory to divide the entire space into non-overlapping regions, and further proposed the target switching scheme; this method solves the problem of local optimum when an artificial potential field is used as motion control law.
3.6. Probabilistic Roadmap Algorithm
The Probabilistic Roadmap Algorithm (PRM) is a method based on graph search, which converts continuous space into discrete space. Trajectory planning is mainly divided into two stages. In the offline learning stage, a large number of robot pose points are randomly sampled; then, neighbor nodes are searched and connections are established to construct a landmark map. In the online query phase, a feasible trajectory is searched from the landmark map using a heuristic search algorithm based on the starting point, target point and landmark map information.
Figure 9 is a trajectory diagram of a roadmap algorithm.
Madridano et al. [
26] proposed a multi-trajectory PRM-based planning method by establishing a parameter to define three different modes, so that different UAVs in the UAV formation can achieve different mission goals.
3.7. Rapidly Exploring Random Trees
Rapidly exploring Random Trees (RRT) is a single query random search algorithm based on sampling. Its basic idea is to randomly sample in the state space, use the graph structure or tree structure extension to build a feasible trajectory set, and then find a complete feasible trajectory from the trajectory set. The RRT algorithm takes the starting point in the state space as the root node and then generates a random extended tree by gradually increasing the leaf nodes at random. If the newly generated node conflicts with the obstacle area during the generation process, the node is discarded and reselected. When the target point is included in the leaf node of the random tree, the expansion of the random tree stops, and an obstacle avoidance route from the starting point to the target point can be obtained. A fast random search tree locus is shown in
Figure 10.
Aiming at the trajectory planning problem of UAV formation with static, ejection and dynamic obstacles, Kothar et al. [
27] proposed a trajectory planning algorithm based on fast search random tree (RRTs) and introduced an anytime algorithm and guidance law based on tracking and line of sight into the algorithm to generate low-cost UAV formation trajectories under kinematic constraints in real time.
Zu et al. [
28] proposed an improved Rapid Exploration Random Trees (RRTs) UAV formation collaborative trajectory planning algorithm, using a trajectory pruning method to delete redundant nodes on the trajectory. The UAV uses a trajectory planner, which enables the UAV to share information within the communication range.
When UAV formation faces sudden threat trajectory replanning, the classical RRT algorithm has some problems such as low efficiency, large storage space and slow convergence. Huang and Sun [
29] proposed a bidirectional fast search random tree algorithm based on greedy strategy, improved the expansion mode of algorithm nodes, and adopted an adaptive step size rolling detection method to improve the sensitivity of UAV formation to sudden threats.
In addition, the traditional RRT algorithm also has the problem that it cannot adapt to the possible changes in the high-order dynamic characteristics of the autonomous movement of the UAV and the mission process during trajectory planning. In response to this problem, Shi et al. [
30] proposed a trajectory generation algorithm based on the integration of the RRT algorithm and the minimum capture algorithm, using the RRT algorithm to generate the initial trajectory, and then using the minimum capture algorithm to smooth the initial trajectory, and using the concept of flight corridors to limit the flight trajectory of drones.
Table 1 summarizes the contents of our survey on traditional trajectory planning algorithms.
6. Problems of UAV Formation Trajectory Planning Algorithm
With the advancement of low-airspace reforms and the innovation of artificial intelligence and information technology [
164], new theories and new achievements related to swarm intelligence continue to emerge, and the improvement of UAV formation trajectory planning algorithms is facing many challenges.
6.1. Physical Constraints of UAV Formation
In the process of trajectory planning, the UAV in the formation is usually simplified into three degrees of freedom particles, ignoring its own constraints such as minimum turning radius, rolling angle and other restrictions on the running state. As a result, the current trajectory planning algorithm is difficult to adapt to the UAV with high maneuverability, there are errors between the release route and the planned route when the actual formation UAV performs the task, and the execution effect will also be affected.
6.2. Performance Problems of UAV Formation Carrying Equipment
In the UAV formation trajectory planning algorithm, the performance of the UAV itself is not considered enough. An insufficient consideration of problems such as fuel consumption, load, and onboard sensor errors in practice makes it difficult to accurately detect complex environments, and the trajectory planning that can be achieved by simulation cannot be realized or has poor robustness in the actual environment.
6.3. Complex Environment Modeling Problem
Most of the current formation trajectory planning algorithms are hypothetical ideal obstacles, but the actual operating environment of UAVs is complex and diverse, especially the detection and description of scenes such as complex concave obstacle environments and dense dynamic obstacles, which need further exploration.
6.4. Algorithm Real-Time Problems
Realistic environmental information is usually time-varying. Regarding UAV formation trajectory planning in an unknown environment, the success rate of trajectory planning strategies used by traditional algorithms and local trajectory planning algorithms in the face of emergencies and dynamic environments is low; in addition, the amount of calculation is large, and the trajectory is not optimal. It is difficult for drones to complete real-time trajectory updates.
6.5. Adaptability of UAV Formation Route Planning Algorithm
When performing formation trajectory planning in a complex dynamic environment, there is a lot of information exchange between UAVs, which leads to an increase in the amount of calculations, and intelligent algorithms are prone to fall into problems such as local optima that exist in themselves.
6.6. UAV Formation Communication Problem
With the development of science and technology, the application scenarios of drones in the future will become more and more complex. In certain scenarios, there will be communication interference problems, which will cause the UAV to fail to work normally and even cause irreversible damage.
7. Future Research Focus and Direction
7.1. Improved Model
Constraints such as six degrees of freedom, minimum turning radius, roll angle, and the onboard sensor error of each UAV are added to the modeling to enhance the robustness of actual control. For complex environment modeling, the influence of multiple factors in the complex environment on the effect of trajectory planning must be considered; reliable and accurate data must be obtained through specific measurements or the use of accurate 3D maps, and at the same time, the data must be used to verify the model to make the simulation closer to reality. It can also be better applied to the actual platform in the future.
7.2. Real time Planning
In the face of increasingly complex environments and tasks, in order to meet the requirements of fast optimal solution, computational complexity, convergence speed and rationality, the computational memory is allocated reasonably. Executing trajectory planning algorithms to generate efficient trajectories in the case of limited computing power of UAVs is of great significance for UAV formations to complete tasks in complex environments.
7.3. Fusion Algorithm
It is an important current research trend to integrate different types of trajectory planning algorithms to make up for the defects and deficiencies of existing single methods.
For example, the local trajectory planning method can be combined with artificial intelligence technology represented by machine learning to complement each other. On the one hand, it can solve the problem of easy falling into local optimum in the local trajectory planning method, and on the other hand, it can also make up for the poor real-time performance of the machine learning-based track planning algorithm to a certain extent. It is also possible to combine the characteristics of heuristic algorithms and machine learning algorithms that are easy to integrate with each other to help analyze the performance of the algorithm and expand the application range of the algorithm. At the same time, experiments show that the hybrid algorithm has better adaptability.
7.4. New Algorithm
At present, the existing UAV formation trajectory planning algorithms have more or less defects. Therefore, developing an algorithm that reduces computational requirements, saves time, allows real-time planning, and is more efficient in terms of energy is also a direction worth exploring.
7.5. Fault Tolerance Mechanism
Since the maneuvering area of each UAV is very small, once a collision occurs, it will affect the adjacent UAVs, and a chain effect will be generated between the UAV clusters, which will cause the mission to fail. Therefore, the fault-tolerant redundancy mechanism is an important link to ensure the safe operation of the UAV system. At present, there is no fault-tolerant mechanism design for the core and weak links of the trajectory planning algorithm so as to improve the fault-tolerant ability of unmanned formation flight. In future research, we should focus on the design of the fault-tolerant mechanism when the function of the UAV fails to avoid uncontrollable events.
7.6. Hybrid Frame
It is unrealistic to use a centralized framework to solve problems in the process of carrying out missions in large-scale UAV formations. Therefore, a hybrid framework should be adopted; how to design an appropriate conflict resolution mechanism and how to effectively combine UAV formation trajectory planning with collaborative control to generate feasible flight trajectories are topics worthy of further study.
7.7. Behavior Decision
Recently, UAV swarm-to-swarm dynamic confrontation has become a hot research direction. At the same time, the autonomous decision-making behavior of UAV formations such as autonomous reconnaissance and detection, autonomous target recognition, and autonomous task coordination in complex terrain such as cities and mountainous areas can effectively reduce the loss of manpower and material resources. How to plan and generate the optimal trajectory of UAV formation from the perspective of game theory is undoubtedly a problem worthy of further exploration.
7.8. Allocation of Resources
In the future, drone formations will be widely used in battlefields and anti-terrorism operations. The environment in which UAVs perform these tasks may become very complex, resulting in increased mission difficulty, and the environment may provide extremely strong support for UAV formations, such as satellite links and energy supplies for continuous flight. How to reasonably allocate available resources to each UAV during mission execution is also a challenging problem.
7.9. Communication Networking
The confrontation between UAVs has become more and more information-based. When the UAV formation is performing tasks, it is necessary to ensure that the UAVs can communicate and share information normally and at the same time deal with external communication interference. Although some scholars have noticed related problems, the problem of how to solve communication interference in UAV formation trajectory planning is still a difficult problem.