Distributed Formation Planning for Unmanned Aerial Vehicles

Zhao, Zeming; Zhang, Xiaozhen; Fang, Hao; Yang, Qingkai

doi:10.3390/drones9040306

Open AccessArticle

Distributed Formation Planning for Unmanned Aerial Vehicles

¹

School of Automation, Beijing Institute of Technology, Beijing 100081, China

²

The National Key Laboratory of Autonomous Intelligent Unmanned Systems, Beijing 100081, China

³

Faculty of Marine Science and Technology, Beijing Institute of Technology, Zhuhai 519088, China

^*

Author to whom correspondence should be addressed.

Drones 2025, 9(4), 306; https://doi.org/10.3390/drones9040306

Submission received: 27 February 2025 / Revised: 11 April 2025 / Accepted: 11 April 2025 / Published: 14 April 2025

(This article belongs to the Section Drone Communications)

Download

Browse Figures

Versions Notes

Abstract

:

Formation flying of multiple unmanned aerial vehicles (UAVs) has attracted much attention for its versatility in cooperative tasks. In this paper, a distributed formation planning method is proposed for UAVs. First, we design a path searching algorithm, swarm-A*, which can enhance the cohesion of a swarm, i.e., preventing the disintegration of the swarm when it encounters an obstacle. Then, after waypoint reallocation, a formation trajectory optimization framework is formulated. Smooth formation trajectories for UAVs to travel safely in obstacle-laden environments can be obtained by solving the optimization problem. Next, a tracking controller based on sliding mode control is designed, ensuring that the UAVs follow the planned formation trajectories under dynamic constraints. Finally, numerical simulations and experiments are conducted to validate the effectiveness of the proposed method.

Keywords:

UAV formation; path searching; trajectory optimization

1. Introduction

Unmanned aerial vehicles (UAVs) have gained significant attention because of their numerous advantages, including high mobility, low operational cost, and the ability to access difficult or hazardous environments. These characteristics have led to their widespread application in various fields. However, when a single UAV is tasked with complex operations, its limited capacity in terms of payload, coverage area, and redundancy often restricts working effectiveness. To overcome these challenges, there is a growing need for multiple UAVs to collaborate and complete tasks more efficiently as a swarm. Due to their flexibility and efficiency, UAV swarms can be used for various kinds of cooperative tasks, such as surveillance [1], search [2], rescue [3], exploration [4], cooperative encirclement [5], and object transportation [6,7].

In the domain of UAV swarms, formation flying has emerged as a key strategy for enabling coordinated behavior. It allows UAVs to work together in predefined spatial configurations, thus optimizing resource usage, increasing coverage, and ensuring fault tolerance. The ability to manage and maintain formation not only enhances the operational capabilities of UAV swarms but also improves their overall robustness and efficiency in task execution, making it an essential approach in modern UAV systems.

Planning is an indispensable component of any autonomous system, offering safe and efficient guidance to complete specific tasks. Therefore, formation planning is a critical aspect of UAV swarms. Formation planning refers to the process of determining the optimal movement strategy for UAVs to ensure safe task completion while maintaining the desired formation shape. Collision avoidance between UAVs and formation shape maintenance are the two main challenges of the formation planning problem. Many researchers have devoted attention to this field in recent years.

Extensive research has been carried out on the collision avoidance problem (collision with obstacles and collision between UAVs) in multi-UAV planning. Alonso-Mora et al. [8] used the velocity obstacle method to establish a local obstacle avoidance planning problem for multiple UAVs. Collision avoidance, obstacle avoidance, and motion continuity problems are considered in the cost functions of the proposed optimization problems. Safe and feasible real-time local trajectories are obtained by solving the proposed problems. Zhou et al. [9] presented EGO-swarm, a decentralized approach for autonomous navigation by multiple UAVs using only onboard resources. The planning system is formulated under gradient-based local planning framework, where collision avoidance is realized by formulating the collision risk as a penalty of a nonlinear optimization problem. On that basis, ref. [10] used MINCO instead of a B-spline to parameterize trajectories, thus solving the difficulty of time adjustment when UAVs need to pass through the same area. It also produced smoother trajectories and a lower optimization time. Tordesillas et al. [11] presented MADER, a 3D decentralized and asynchronous trajectory planner for UAVs that generates collision-free trajectories. Collision with other UAVs can be realized by including their committed trajectories as constraints in the optimization and then executing a collision check–recheck scheme. Recently, Toumieh et al. [12] proposed a high-speed, decentralized, and synchronous motion planning framework (HDSM), which generated a time-aware safe corridor (TASC) to guarantee the safety of the UAV trajectories. Zhao et al. [13] introduced a new Theta*–APF method for drone swarm path planning in 3D space; the method reduces the searching time and the path length. Collision avoidance for the agents is realized utilizing repulsive force fields.

Although the above work offers outstanding performance in collision avoidance for UAV swarms, the formation maintenance problem is not taken into account. Existing formation maintenance methods are summarized below.

First, from the perspective of control, a high-precision formation shape can be maintained. Leader–follower control is a hierarchical approach used in UAV swarm operations, where one or several UAVs are designated as leaders, and the others follow these leaders based on predefined rules [14]. This approach depends on the stability and reliability of the leader(s), meaning that disruption of the leader’s communication could impact the entire formation. Therefore, when faced with intricate scenarios and constraints, such methods [15,16,17] need further research and refinement to address all the requirements. Virtual structure methods establish a geometric formation in which the UAVs move together as a cohesive unit. Each UAV preserves its relative position as the formation moves, allowing the entire formation to translate, rotate, or scale as required to meet mission goals or avoid obstacles. These approaches enable precise control over the formation shape and provides an intuitive way to modify the formation in response to changing conditions. However, as the number of UAVs increases, the stability and accuracy of the virtual structure may become extremely difficult to maintain. Consensus control is also a classic and promising formation maintenance approach, enabling the UAVs to agree on specific parameters such as position, velocity, or angle, making the swarm move cohesively. It promotes flexible adaptation to environmental changes and enhances collaborative task performance. Ref. [18,19,20] are all recent works utilizing consensus theory to design formation planning algorithms. However, the effectiveness of consensus algorithms depends heavily on the quality of communication between UAVs and the convergence speed of the algorithms. Other control methods such as fuzzy control and adaptive control [21] can also tackle the formation maintenance problem.

Second, from the perspective of searching and optimization, flexibility and robustness can be improved by treating the formation maintenance requirement as a soft constraint. Nguyen et al. [22] formulated a distributed optimization problem based on dynamic consensus and solved the problem using ADMM (Alternating Direction Method of Multipliers), achieving time-varying formation shape maintenance with inter-robot collision avoidance. They took advantage of MPC-based motion planning approaches to design reference trajectories at each replanning instant. Quan et al. [23] designed a differentiable cost function based on graph theory to evaluate UAV formations. By considering formation similarity, obstacle avoidance, and dynamic feasibility, they realized a balance between UAV formation maintenance and safety during flight. Peng et al. [24] proposed a distributed and synchronous motion planning framework for a formation of multiple UAVs equipped with an active sensing system in an obstacle-based environment using a gradient-based method. They used expanding FOVs to enhance safety in the UAV swarm motion planning task. The planning problem was solved with the distributed particle swarm optimization algorithm. Zhang et al. [25] proposed an online formation planning method for a tethered multirotor UAV cooperative transportation system. An optimization problem was constructed considering asymmetric tension-based swarm reciprocal avoidance, obstacle avoidance, and target transportation. All constraints are represented as soft constraints to realize task requirements. Mikkelsen et al. [26] introduced a distributed planner for rigid formations. They first determined the scaling, rotation, and translation of a base configuration to obtain the desired velocities at each time step for the swarm. The desired velocities of the agents were mapped to a parameter space to guarantee consensus and constraints, and they were then remapped to the velocity space of the agents, ensuring that the robots maintained the shape of their formation. Liu et al. [27] proposed a global formation planning method with obstacle avoidance, which conceptualized robot formations into distinct configurations. The feasible configurations and the transitions between two configurations were represented as vertexes and edges, respectively, to construct an undirected graph where the optimal formation path could be found using searching algorithms.

Third, formation maintenance can also be realized from the perspective of deep reinforcement learning (DRL) [28,29,30,31]. In recent years, significant attention has been focused on DRL, in which deep neural networks are employed to approximate the value function, the policy, or both within reinforcement learning algorithms, allowing agents to effectively manage high-dimensional states. By utilizing DRL, UAVs are capable of independently coordinating their movements to preserve desired formation patterns, avoid collisions, and adapt to changing environments. Through continuous interaction with the environment, DRL methods allow UAVs to learn optimal control strategies, resulting in more efficient and resilient formation control compared to conventional approaches. DRL methods offer benefits such as complex decision-making, long-term reward optimization, and adaptability. However, there exist challenges, including sample inefficiency, potential training instability, the need for hyperparameter tuning, and safety risks during the learning process. Furthermore, the selection of these approaches depends on various factors, including the specific needs of the formation task, available data, computational resources, and safety concerns [14]. However, the high computational demands and long computing time are the main challenges for DRL-based methods.

In summary, different kinds of methods have their own advantages and shortages. A detailed comparison is shown in Table 1. Control-based methods have lower computational demands and higher real-time performance, which make them easier to deploy, yet there is a lack of environmental adaptability because control-based methods rely on accurate environmental models and strict communication among agents. The local minimum problem is also a challenge during intricate obstacle avoidance. Search-optimization-based methods perform well with relatively high scalability and moderate computational complexity when the environment is known (even if it is intricate). This paper mainly considers the formation planning problem in terms of known static environments. DRL-based methods realize much higher scalability and adaptability, but the training process is time-consuming and requires enormous computing power. In summary, control-based methods are suitable for smaller swarms in easier environments; search-optimization methods are good at handling normal formations in environments that allow offline planning; and DRL-based methods are recommended for large-scale formations in unknown and complex environments under the condition of high computational power. Since this paper focuses on the formation planning problem in static environments for medium-scale swarms, the search-optimization method is utilized. Compared with the existing search-optimization-based algorithms, our proposed method simplifies the expression of formation cost using the reference relative vectors instead of the Laplacian matrix used in [23]. The path searching algorithm is also improved to avoid disintegration of the formation. The safe corridor approach is utilized instead of soft constraints to ensure flight safety in [23,24].

In this paper, we introduce the swarm-A* algorithm, which enhances the cohesion of the swarm and prevents disintegration of the formation during the path searching process, and we also present a distributed formation trajectory optimization framework that can balance collision avoidance and formation shape maintenance. In detail, we propose a distributed formation planning approach for UAVs, which can generate collision-free trajectories in environments with static obstacles. The proposed formation planning method mainly consists of swarm path searching and formation trajectory optimization. A sliding mode controller is designed to validate the dynamic feasibility of the trajectories. Simulations and experiments verify the effectiveness and adaptability of the proposed method.

The main contributions of this work are as follows:

A path searching algorithm that prevents formation disintegration is proposed. A swarm heuristic cost is designed to be applied during the search, and it observably enhances the cohesion of the swarm paths. As a result, the difficulty of solving the optimization problem can be greatly reduced compared to searching without consideration of swarm cohesion.
We propose a distributed formation trajectory optimization method that takes formation maintenance, obstacle avoidance, and kinematics into account. By solving the optimization problem, smooth rotation and translation of the UAV formation can be realized. The method enables the UAV system to balance between moving in the reference formation shape and avoiding obstacles.
A series of simulations and a real-world experiment are conducted, validating the effectiveness of our proposed method.

The remaining of this paper is organized as follows. In Section 2, we describe the studied system and formulate the problem. Some basic knowledge used in this paper is also introduced. In Section 3, the proposed formation planning method is described in detail. Section 4 introduces the formation tracking control method. In Section 5, simulations under different circumstances is introduced. Section 6 shows the experimental results and analysis. Finally, Section 7 summarizes the paper and discusses future work.

2. System Description and Problem Formulation

2.1. System Description

In the formation planning task, the UAV formation system under consideration consists of

n (n \geq 3)

identical UAVs.

In the body-fixed frame of the ith UAV, the attitude is denoted as

ρ_{i} = {[ϕ_{i} θ_{i} ψ_{i}]}^{T}

.

ϕ_{i}

,

θ_{i}

, and

ψ_{i}

represent the roll, pitch, and yaw, respectively. The dynamics of the ith UAV is given as follows [19,32,33]:

\begin{matrix} \ddot{x_{i}} & = \frac{(cos ψ_{i} sin θ_{i} cos ϕ_{i} + sin ψ_{i} sin ϕ_{i}) F_{i}}{m} \end{matrix}

(1a)

\begin{matrix} \ddot{y_{i}} & = \frac{(sin ψ_{i} sin θ_{i} cos ϕ_{i} - cos ψ_{i} sin ϕ_{i}) F_{i}}{m} \end{matrix}

(1b)

\begin{matrix} \ddot{z_{i}} & = \frac{cos ϕ_{i} cos θ_{i} F_{i}}{m} - g \end{matrix}

(1c)

\begin{matrix} \ddot{ϕ_{i}} & = \frac{τ_{x i} + \dot{θ_{i}} \dot{ψ_{i}} (I_{y} - I_{z})}{I_{x}} \end{matrix}

(1d)

\begin{matrix} \ddot{θ_{i}} & = \frac{τ_{y i} + \dot{ϕ_{i}} \dot{ψ_{i}} (I_{z} - I_{x})}{I_{y}} \end{matrix}

(1e)

\begin{matrix} \ddot{ψ_{i}} & = \frac{τ_{z i} + \dot{ϕ_{i}} \dot{θ_{i}} (I_{x} - I_{y})}{I_{z}} \end{matrix}

(1f)

where m represents the mass of a UAV;

F_{i}

represents the magnitude of thrust;

τ = {[τ_{x i} τ_{y i} τ_{z i}]}^{T}

is the control torque;

I_{x}, I_{y},

and

I_{z}

are the rotational inertia; and g is the gravitational acceleration. Typically, (1a)–(1c) are the position dynamics, describing the position motion of a UAV; (1d)–(1f) are the attitude dynamics, describing the attitude motion of a UAV.

The obstacle avoidance task in this paper is constrained to the two-dimensional horizontal plane (

x O y

). There are several static obstacles and some unfeasible regions in the environment. The 2D position of the ith UAV in the

x O y

plane is denoted as

q_{i} = {[x_{i} y_{i}]}^{T}

. The 2D linear velocity and acceleration of the ith UAV in the

x O y

plane are denoted as

v_{i} = {[v_{x} v_{y}]}^{T}

and

a_{i} = {[a_{x} a_{y}]}^{T}

, respectively. The acceleration

a_{i}

is limited by

a_{i} \in A

, where

A = {{[a_{x} a_{y}]}^{T} | a_{m i n} < a_{x} < a_{m a x}, a_{m i n} < a_{y} < a_{m a x} .}

To better describe the motion of a multirotor UAV, the second-order integrator model is utilized to describe the UAV’s motion in the

x O y

plane, which is written as

\begin{matrix} \{\begin{matrix} \dot{q_{i}} = v_{i} \\ \dot{v_{i}} = a_{i} \end{matrix} \end{matrix}

(2)

2.2. Problem Formulation

Each UAV has its own ID number i; the corresponding starting point is

s_{i}

, and the target point is

g_{i}

. The planning task for the each UAV is represented as

〈s_{i}, g_{i}〉

; the objective of each UAV is to find a safe, smooth, and feasible trajectory that can take it from the starting point to the goal point. Meanwhile, the UAVs should maintain the reference formation shape.

The reference formation shape

F^{r} = [F_{1}^{r}, F_{2}^{r}, \dots F_{n}^{r}]

is defined using relative positions between UAVs. UAV1 is set as the coordinate origin, and all other UAVs’ positions relative to UAV1 can be determined. Hence, it is possible to obtain a set of reference positions of UAVs in the formation, which can be written as

\begin{matrix} F_{i}^{r} = \{\begin{matrix} {[0, 0]}^{T}, i f i = 1 \\ {[x_{i}^{r}, y_{i}^{r}]}^{T}, i f i \neq 1 \end{matrix} \end{matrix}

(3)

where

{[x_{i}^{r}, y_{i}^{r}]}^{T}

is the relative position of UAVi (

i > 1

) to UAV1. By taking the average of the vectors in

F^{r}

, the reference position of the formation center is expressed as

F_{c}^{r} = (\sum_{i = 1}^{n} F_{i}^{r}) / n

.

2.3. Graph Theory

In this paper, we utilize a directed graph

G = {V, E, A}

to describe the communication topology among UAVs.

V = {1, 2, \dots, n}

is the set of nodes, which represents the n UAVs in the swarm.

E \subseteq V \times V

denotes the edges between any two nodes. Let

e_{i j} = (i, j) \in E

denote that there is a directed path from node i to node j.

A : E \to R^{+}

is a function allocating a weight to each edge; for example, for

e_{i j} = (i, j) \in E

, there exists

A (e_{i j}) = a_{i j}

. If

e_{i j} = (i, j) \notin E

, then

a_{i j} = 0

. The neighbor set of a UAV

i \in V

is denoted as

N_{i} = {j \in V | (i, j) \in E}

.

3. Formation Planning

The overall framework of the proposed formation planning method is shown in Figure 1. First, the swarm-A* algorithm is designed to search for collision-free discrete waypoints for the center of the formation and for the UAVs. Second, the waypoints are densified, and the numbers of waypoints are equalized. Safe corridors, a set of convex polygons covering the free motion space of the UAVs, are constructed using these processed waypoints. Finally, a nonlinear distributed trajectory optimization problem is presented and solved to obtain safe and smooth trajectories that maintain the reference formation shape. Detailed explanations of the proposed method are provided in the subsections below.

3.1. Path Searching

This subsection presents a path searching method called the swarm-A* algorithm, which aims to obtain discrete collision-free path points for UAVs in consideration of their cohesion. The search space is a two-dimensional grid map that contains free grid squared and occupied grid squares. We define eight available directions in each search iteration.

In terms of formation path searching, conventional algorithms such as A* are not applicable because if we use them to search paths for multiple UAVs sequentially, the UAVs are very likely to bypass obstacles on different sides, which is not conducive to maintaining formation and will lead to a large computational burden, sometimes even to the point of unsolvability, for the subsequent trajectory optimization. Additionally, some large obstacles may lead to communication blockage if UAVs are located on different sides. To overcome this problem, we designed the swarm-A* algorithm, whose principle is presented below.

The swarm-A* algorithm is shown as Algorithm 1.

n_{p}

and

n_{c}

represent the parent node and the child node, respectively, during the searching process. In Line 2, the algorithm searches for a reference path

p_{c}

for the center of the formation using a hybrid A* algorithm [34].

p_{c}

contains both position and orientation information, and its kth waypoint is denoted as

q_{c}^{k} = {[x_{c}^{k} y_{c}^{k} ω_{c}^{k}]}^{T}

, where

ω_{c}^{k}

represents the orientation. In Line 12,

n_{r}

, the closest path point to

n_{p}

, is found in

p_{c}

, acting as a reference point. In Line 17, the algorithm judges whether the path passing

n_{p}

towards

n_{c}

is shorter than the path passing its previous parent node. If so, the parent of

n_{c}

is replaced with

n_{p}

. In Line 21, a node extension formula for swarm-A* is proposed, where

g (n_{c})

is the cumulative path length from the starting point to the current node

n_{p}

,

h_{1} (n_{c}) = \sqrt{{(n_{c} . x - g_{i} . x)}^{2} + {(n_{c} . y - g_{i} . y)}^{2}}

is the heuristic cost between the current node and the goal point, and

h_{2} (n_{c}) = \sqrt{{(n_{c} . x - n_{r} . x)}^{2} + {(n_{c} . y - n_{r} . y)}^{2}}

is the heuristic cost between the current node and the reference point.

α_{1}

and

α_{2}

are the weights of the two heuristic costs, respectively.

f (n_{c}) = g (n_{c}) + α_{1} h_{1} (n_{c}) + α_{2} h_{2} (n_{c})

is the total cost. A comparison of different values of swarm weight is shown in Figure 2. It can be observed that, as

α_{2}

increases, the cohesion of the searched paths also improves, i.e., the UAVs’ path become closer to the reference path. When

α_{2} = 1

, the paths bypass all obstacles on the same side. It proves that the proposed swarm-A* algorithm is capable of enhancing the cohesion of swarm movements.

Algorithm 1 Swarm-A*

Input:: UAV number n, starting and goal points ${s_{1}, s_{2}, \dots, s_{n}} {g_{1}, g_{2}, \dots, g_{n}}$
Output:: UAV paths $p_{c}, p_{1}, p_{2}, \dots, p_{n}$
1:: Compute the starting point and goal point of the formation center $s_{c} = (Σ_{i = 1}^{n} s_{i}) / n$ , $g_{c} = (Σ_{i = 1}^{n} g_{i}) / n$
2:: Use Hybrid A* algorithm to search for a reference path for the center of the formation, $p_{c} = H y b r i d A s t a r (s_{c}, g_{c})$
3:: for $i = 1 \to n$ do
4:: $o p e n l i s t \leftarrow s_{i}$ ;
5:: $c l o s e d l i s t \leftarrow ⌀$ ;
6:: while $o p e n l i s t \neq ⌀$ do
7:: Choose $n_{p}$ with minimal total cost f in $o p e n l i s t$ ;
8:: $c l o s e d l i s t \leftarrow n_{p}$ ;
9:: if $n_{p} = g_{i}$ then
10:: break;
11:: end if
12:: Find the point $n_{r}$ in $p_{c}$ that is closest to $n_{p}$ ;
13:: for all $n_{c} \in n e i g h b o r s (n_{p})$ do
14:: if $n_{c} \in c l o s e d l i s t$ or $n_{c}$ is unfeasible then
15:: continue;
16:: else if $n_{c} \in o p e n l i s t$ then
17:: $J u d g e (n_{c}, n_{p})$ ;
18:: else
19:: $o p e n l i s t \leftarrow n_{c}$ ;
20:: $k (n_{c}) \leftarrow | | n_{c} - n_{r} {| |}_{2}$ ;
21:: $f (n_{c}) \leftarrow g (n_{c}) + α_{1} h_{1} (n_{c}) + α_{2} h_{2} (n_{c})$ ;
22:: end if
23:: end for
24:: end while
25:: Obtain $p_{i}$ ;
26:: end for
27:: return $p_{c}, p_{1}, p_{2}, \dots p_{n}$

3.2. Waypoint Reallocation

Due to the diverse starting and goal points of the UAVs, the number of waypoints for each UAV may be different. Since the relative position between UAVs will be calculated at each time step during the trajectory optimization process, the length of the path for each UAV should be the same. Besides, since the reference path (formation center path) will be used to offer a formation rotation angle for UAVs at each time step (Section 4), the length of the reference path should be close to that of the UAVs. To achieve this requirement, the path points are densified by inserting h points equidistantly into the line segments between two adjacent points. An equalization algorithm (Algorithm 2) is proposed to calculate the number h of points that need to be inserted and then insert them into the previously searched paths. The process of Algorithm 2 is as follows.

Algorithm 2 Waypoint reallocation method

Input:: UAV paths 1, waypoint number $l_{1}, l_{2}, \dots, l_{n}$ , waypoint number of the formation center $l_{c}$ , maximum of the inserted point number $i_{m a x}$
Output:: reallocated paths $p_{c}, p_{1}, p_{2}, \dots p_{n}$
1:: $l_{m}$ = $m a x {l_{1}, l_{2}, \dots, l_{n}}$
2:: $E x t e n d (p_{1}, p_{2}, \dots p_{n})$
3:: for $i = 0 \to i_{m a x}$ do
4:: if $l_{m} + (l_{m} - 1) * i < l_{c}$ and $l_{m} + (l_{m} - 1) * (i + 1) > l_{c}$ then
5:: if $l_{c} - (l_{m} + (l_{m} - 1) * i) < l_{m} + (l_{m} - 1) * (i + 1) - l_{c}$ then
6:: $h = i$
7:: else $h = i + 1$
8:: end if
9:: end if
10:: end for
11:: for $j = 1 \to h$ do
12:: $I n s e r t (p_{j}, h)$
13:: end for
14:: $l = l_{m} + h * (l_{m} - 1)$
15:: if $l > l_{c}$ then
16:: $E x t e n d (p_{c}, l)$
17:: else $E x t e n d ({p_{1}, p_{2}, \dots p_{n}}, l_{c})$
18:: end if
19:: return $p_{c}, p_{1}, p_{2}, \dots p_{n}$

In Line 1, the longest path among the UAVs is chosen, and its waypoint number is denoted as

l_{m}

. In Line 2, each UAV’s path (except for the longest one) is extended by copying and appending its last waypoint to the end (function

E x t e n d ()

), thus making all UAVs’ numbers of waypoints equivalent. In Lines 3–10, h is determined by comparing the numbers of waypoints on the reference path and the inserted paths. In Lines 11–18, the UAVs’ paths are densified (function

I n s e r t ()

), and the length of the reallocated path is denoted as l. After that, the lengths of the reference path and the reallocated path are equalized.

L = m a x {l, l_{c}}

represents the final number of waypoints on all discrete paths.

To ensure the clarity and conciseness of the representation, the discrete path point set of UAV i obtained through path searching and waypoint reallocation is denoted as

p_{i} = {q_{i}^{k}}_{0}^{L}

, where

q_{i}^{k} = {[x_{i}^{k} y_{i}^{k}]}^{T}

represents the position of the kth path point.

3.3. Safe Corridor Generation

Safe corridors are several convex polygons covering the feasible space in the environment that ensures collision-free trajectories. The safe corridor set of the path

p_{i}

is represented as

S (p_{i}) = {S_{i}^{k} | k = 1, 2, \dots L}

, where

S_{i}^{k}

represents the safe corridor of the kth waypoint. Note that the safe corridor needs to be sequentially connected, i.e., the safe corridors of two adjacent points must overlap, which is denoted as

S_{i}^{k} \cap S_{i}^{k + 1} \neq ⌀, \forall k \in {1, 2, \dots L - 1}

(4)

In this paper, a rectangular safe corridor is generated around each waypoint by expanding a safe region centered at that point.

S_{i}^{j}

is initialized as the starting point

s_{i}

. Expansion proceeds in the four cardinal directions

{+ x, - x, + y, - y}

until the distance between the corridor’s boundary and nearby obstacles is reduced to a specified safe distance in all directions. This expansion is repeated sequentially for each point in the path

p_{i}

until the final corridor for

g_{i}

is obtained. Inspired by [35], the connectivity problem is solved by inserting points into the line segments between two adjacent waypoints (the same process as in Section 3.2) of both the UAV paths and the reference path. The number of inserted points is chosen in consideration of the density of obstacles.

Figure 3 demonstrates a safe corridor generation result. The yellow dots represent the original waypoints, and the blue ones represent the inserted waypoints (

h = 1

). The dots surrounded by red dashed lines represent the ‘expansion points’. The green rectangles represent the generated safe corridor for the path. During the generation process, if a waypoint

q_{i}^{k}

does not lie within the boundaries of the previously generated safe corridor

S_{i}^{k - 1}

(

2 < k < L

), i.e.,

q_{i}^{k} \notin S_{i}^{k - 1}

, then it is called an ‘expansion point’. Otherwise, if

q_{i}^{k} \in S_{i}^{k - 1}

, then the corridor generation process for the expansion point is skipped, and we suppose that

S_{i}^{k} = S_{i}^{k - 1}

. As is shown in Figure 3, after the first corridor

S_{i}^{1}

is constructed, the next waypoint that is not within

S_{i}^{1}

is

p_{i}^{3}

. Therefore,

p_{i}^{3}

is defined as an expansion point, whose safe corridor, denoted as

S_{i}^{3}

, is then generated, etc. When the last waypoint

p_{i}^{19}

receives its corresponding corridor

S_{i}^{19}

, the generation process is finished.

3.4. Trajectory Optimization

The discrete waypoints are refined into smooth formation trajectories in this subsection. The trajectory of the UAV i obtained through trajectory optimization is denoted as

{q_{i}^{k}, t_{i}^{k}}_{0}^{L}

, where

t_{k} = k Δ t

represents the time from the start to the kth trajectory point,

Δ t

is the unit time, and L represents the total number of points contained in the optimized trajectory.

First, the formation rotation matrix at each time step is calculated. By combining the reference formation shape with the rotation matrix, the formation cost is determined. Subsequently, a distributed trajectory optimization problem is formulated. Safe and smooth formation trajectories are obtained by solving the optimization problem.

3.4.1. Smoothness Cost

The smoothness cost involves two parts. The first part,

Δ v_{i}^{k}

, describes the difference in linear speed between two adjacent waypoints, i.e., the acceleration of the UAV. The second part,

Δ a_{i}^{k}

, describes the difference in acceleration between two adjacent trajectory segments, i.e., the jerk of the UAV. The smoothness cost is represented by

s_{i}^{k} = β_{1} Δ v_{i}^{k} + β_{2} Δ a_{i}^{k}

(5)

Δ v_{i}^{k} = v_{i}^{k + 1} - v_{i}^{k}

(6)

Δ a_{i}^{k} = a_{i}^{k + 1} - a_{i}^{k}

(7)

where

s_{i}^{k}

is the smoothness cost of UAVi at the kth trajectory point, while

β_{1}

and

β_{2}

are the weighting factors of the two parts of the smoothness cost.

3.4.2. Formation Cost

The formation cost is designed to realize the formation maintenance through the optimization process. First, the calculation of the formation rotation matrix is introduced as follows.

For a two-dimensional vector, we define its rotation matrix R as

R = [\begin{matrix} c o s (η) & - s i n (η) \\ s i n (η) & c o s (η) \end{matrix}]

(8)

where

η

represents the rotation angle. The counterclockwise rotation of a two-dimensional vector by an angle

η

relative to its original orientation can be achieved by left-multiplying it by the rotation matrix R.

The orientation of the UAV formation is denoted as

ω

. Let

ω = 0

when the formation is towards the positive direction of the x-axis in the xOy plane, and

ω

gradually increases as the formation rotates counterclockwise. The starting and goal orientation angles of the formation are defined as

ω_{s}

and

ω_{g}

, respectively. To realize the rotation of the moving formation, the previously searched reference path (in Line 2 of Algorithm 1) is utilized to obtain the rotation angle of the formation at each time step, denoted as

p_{c} = {q_{c}^{k}}_{0}^{L}

. The kth waypoint of the reference path is denoted as

q_{c}^{k} = {[x_{c}^{k} y_{c}^{k} ω_{c}^{k}]}^{T}

, with

ω_{c}^{1} = ω_{s}, ω_{c}^{L} = ω_{g}

. Therefore, the rotation matrix of the reference formation at the kth trajectory point can be written as

R_{k} = [\begin{matrix} c o s (ω_{c}^{k} - ω_{s}) & - s i n (ω_{c}^{k} - ω_{s}) \\ s i n (ω_{c}^{k} - ω_{s}) & c o s (ω_{c}^{k} - ω_{s}) \end{matrix}]

(9)

where

ω_{c}^{k}

represents the orientation angle of the reference path at the kth point. A transformation and rotation process of a triangle reference formation with 3 UAVs is illustrated in Figure 4. The black circles and the red circles represent UAVs and the center of the formation, respectively. The red dashed line represents the reference path (the path taken by formation’s center). It shows that the orientation of the formation’s center is used to describe the rotation of the overall formation. As k increases from 1 to L, the formation’s orientation angle (shown as dark blue arrows) gradually reduces from

ω_{s}

to

ω_{g}

.

After the rotation matrix is obtained, the calculation of the formation cost is explained as follows.

We construct the formation cost of the optimization problem using two values,

{\tilde{F}}_{i c}^{k}

and

{\tilde{F}}_{i j}^{k}

, both representing the deviation between reference and actual relative position vectors:

{\tilde{F}}_{i c}^{k}

is the deviation of the relative position vector between each UAV and the formation’s center at the kth trajectory point. Based on the rotation matrix and the reference formation shape, the reference value can be calculated by

{\hat{F}}_{i c}^{k} = R_{k} \cdot (F_{i}^{r} - F_{c}^{r})

, where

F_{c}^{r}

represents the reference position of the formation’s center. The actual value is

F_{i c}^{k} = q_{i}^{k} - q_{c}^{k}

, where

q_{i}^{k}

and

q_{c}^{k}

represent the k-th waypoint of the i-th UAV’s path and the reference path, respectively. Then,

{\tilde{F}}_{i c}^{k} = R_{k} \cdot (F_{i}^{r} - F_{c}^{r}) - (q_{i}^{k} - q_{c}^{k})

(10)

{\tilde{F}}_{i j}^{k}

is the deviation of relative position vectors between two UAVs at the kth trajectory point. Only one neighbor

j = i - 1 (i \geq 2)

is taken into consideration for UAV i in order to reduce the computational burden. The reference value is

{\hat{F}}_{i j}^{k} = R_{k} \cdot (F_{i}^{r} - F_{j}^{r})

, while the actual value is

F_{i j}^{k} = q_{i}^{k} - q_{j}^{k}

. Then,

{\tilde{F}}_{i j}^{k} = R_{k} \cdot (F_{i}^{r} - F_{j}^{r}) - (q_{i}^{k} - q_{j}^{k})

(11)

In Figure 4, the gray arrows are examples of the two reference vectors mentioned above.

{\hat{F}}_{21}^{k}

represents the reference relative position between UAV1 and UAV2, while

{\hat{F}}_{2 c}^{k}

represents the reference relative position between UAV2 and the formation’s center. The sum of the two terms above yields the formation error vector at the kth trajectory point (i.e., at time step

t = k Δ t

) as follows:

m_{i}^{k} = γ_{1} {\tilde{F}}_{i c}^{k} + γ_{2} {\tilde{F}}_{i j}^{k}

(12)

where

m_{i}^{k}

is the formation cost of UAVi at the kth trajectory point, while

γ_{1}, γ_{2}

are positive weighting constants. To obtain optimal formation shape maintenance, the objective of the optimization is to minimize the norm of

m_{i}^{k}

, i.e.,

∥ m_{i}^{k} ∥ \to 0, \forall k \in {1, 2, \dots, L}

.

3.4.3. Trajectory Optimization Problem

The proposed trajectory optimization problem is as follows:

\begin{matrix} \min & \sum_{k = 1}^{L - 1} s_{i}^{k T} P s_{i}^{k} + \sum_{k = 1}^{L} m_{i}^{k T} Q m_{i}^{k} \end{matrix}

(13a)

\begin{matrix} s . t . & q_{i}^{0} = s_{i}, q_{i}^{L} = g_{i}, \forall i \end{matrix}

(13b)

\begin{matrix} q_{i}^{k + 1} = ξ (q_{i}^{k}, v_{i}^{k}), \forall i, k \end{matrix}

(13c)

\begin{matrix} v_{i}^{k + 1} = ζ (v_{i}^{k}, a_{i}^{k}), \forall i, k \end{matrix}

(13d)

\begin{matrix} q_{i}^{k} \in S_{i}^{k}, \forall i, k \end{matrix}

(13e)

\begin{matrix} a_{i}^{k} \in A, \forall i, k \end{matrix}

(13f)

\begin{matrix} ∥q_{i}^{k} - q_{j}^{k}∥ \geq 2 R_{s a f e}, \forall i > 1, j = {1, 2, . . ., i - 1}, k; \end{matrix}

(13g)

where

P \in R^{+}

and

Q \in R^{+}

are weight constants. The number of the trajectory points is denoted as L. The cost function (13a) involves a smoothness cost

\sum_{k = 1}^{L - 1} s_{i}^{k T} P s_{i}^{k}

and a formation cost

\sum_{k = 1}^{L} m_{i}^{k T} Q m_{i}^{k}

, which are represented in quadratic form. The optimization variable is the acceleration of the UAVs. Equations (13b)–(13g) are the hard constraints of the optimization problem. The starting position and goal position of each UAV are limited by (13b), where

i \in {1, 2, \dots n}

. The kinematics of a UAV is demonstrated by (13c) and (13d), where

i \in {1, 2, \dots n}, k \in {1, 2, \dots L - 1}

. The specific forms of (13c) and (13d) are denoted as

\begin{matrix} \{\begin{matrix} q_{i}^{k + 1} = ξ (q_{i}^{k}, v_{i}^{k}) = q_{i}^{k} + v_{i}^{k} Δ t \\ v_{i}^{k + 1} = ζ (v_{i}^{k}, a_{i}^{k}) = v_{i}^{k} + a_{i}^{k} Δ t \end{matrix} \end{matrix}

(14)

The safe corridor constraint is denoted as (13e), where

i \in {1, 2, \dots n}, k \in {1, 2, \dots L}

. Equation (13f) determines the upper bound and lower bound of the control input. Equation (13g) represents the collision avoidance constraint among UAVs, where

i \in {2, \dots n}, j \in {1, 2, \dots, i - 1}, k \in {1, 2, \dots L}

.

R_{s a f e}

is the collision radius of the UAV. From UAV1 to UAVn, an optimization problem (13a) is constructed and solved sequentially. The prior optimized trajectories are utilized by the latter ones to calculate formation cost and to achieve reciprocal avoidance in the swarm. A directed graph is used to describe the communication topology between UAVs. Take a formation with 4 UAVs as an example (see Figure 5), UAV2 receives the trajectory of UAV1, and UAV3 can receive the trajectories of both UAV1 and UAV2, etc.

Note that the proposed optimization problem (13a) is a non-convex optimization with nonlinear constraints. This may preclude the finding of an optimal solution. However, if a slight deviation from the reference formation shape is acceptable, classic nonlinear optimization techniques can still be utilized to solve the problem and obtain a feasible solution.

4. Formation Tracking Control

This section describes a formation tracking control method to track the generated trajectories (Section 3), aiming to confirm that the planned trajectories in Section 3 satisfy the UAV dynamics and can be executed by UAVs. The framework of the control scheme is shown in Figure 6. Sliding mode control is utilized to design the controller. This method is inspired by a previous study [25].

A UAV can be classified as an under-actuated system because it has four control inputs but six state variables. The horizontal control is closely related to the roll and pitch control. The desired roll angle and the desired pitch angle need to be derived from the horizontal control component, which can be expressed by the following nonlinear equations:

\begin{matrix} \{\begin{matrix} u_{x i} = cos ϕ_{i} sin θ_{i} cos ψ_{i} + sin ψ_{i} sin ψ_{i} \\ u_{y i} = cos ϕ_{i} sin θ_{i} sin ψ_{i} + sin ψ_{i} cos ψ_{i} . \end{matrix} \end{matrix}

(15)

Solving (15) derives the desired attitude angle, which can be written as

\begin{matrix} \{\begin{matrix} ϕ_{i}^{d} = arcsin (u_{x i} sin ψ_{i} - u_{y i} cos ψ_{i}) \\ θ_{i}^{d} = arcsin (\frac{u_{x i} cos ψ_{i} - u_{y i} sin ψ_{i}}{cos ψ_{i}^{d}}) \end{matrix} \end{matrix}

(16)

The position control in this paper is designed as follows:

\begin{matrix} \{\begin{matrix} u_{x i} & = \frac{m_{i} ({\ddot{x}}_{i}^{d} - k_{x} s_{x i} + c_{x} {\dot{e}}_{x i})}{F_{i}} \\ u_{y i} & = \frac{m_{i} ({\ddot{y}}_{i}^{d} - k_{y} s_{y i} + c_{y} {\dot{e}}_{y i})}{F_{i}} \\ F_{i} & = \frac{m_{i} ({\ddot{z}}_{i}^{d} - k_{z} s_{z i} + c_{z} {\dot{e}}_{z i} - g)}{cos ϕ_{i} cos θ_{i}} \end{matrix} \end{matrix}

(17)

where

e_{x i} = x_{i}^{d} - x_{i}

,

e_{y i} = y_{i}^{d} - y_{i}

, and

e_{z i} = z_{i}^{d} - z_{i}

are position tracking errors;

s_{x i} = c_{x} e_{x i} + {\dot{e}}_{x i}

,

s_{y i} = c_{y} e_{y i} + {\dot{e}}_{y i}

, and

s_{z i} = c_{z} e_{z i} + {\dot{e}}_{z i}

are designed sliding mode surfaces; and

K_{p} =

diag

(k_{x}, k_{y}, k_{z})

and

C_{p} =

diag

(c_{x}, c_{y}, c_{z})

are two positive definite gain matrices.

Theorem 1.

For the position dynamics of a UAV (1a)–(1c), if

K_{p}

and

C_{p}

are positive definite, the position tracking error coordinates

(e_{x i}, e_{y i}, e_{z i})

are stable under position control (17).

Proof.

See Appendix A. □

The procedure for attitude control in this paper is designed as follows:

\begin{matrix} \{\begin{matrix} τ_{ϕ i} & = ({\ddot{ϕ}}_{i}^{d} - k_{ϕ} s_{ϕ i} - c_{ϕ} {\dot{e}}_{ϕ i}) I_{x} - (I_{y} - I_{z}) {\dot{θ}}_{i} {\dot{ψ}}_{i} \\ τ_{θ i} & = ({\ddot{θ}}_{i}^{d} - k_{θ} s_{θ i} - c_{θ} {\dot{e}}_{θ i}) I_{y} - (I_{z} - I_{x}) {\dot{ψ}}_{i} {\dot{ϕ}}_{i} \\ τ_{ψ i} & = ({\ddot{ψ}}_{i}^{d} - k_{ψ} s_{ψ i} - c_{ψ} {\dot{e}}_{ψ i}) I_{z} - (I_{x} - I_{y}) {\dot{ϕ}}_{i} {\dot{θ}}_{i} \end{matrix} \end{matrix}

(18)

where

e_{ϕ i} = ψ_{i}^{d} - ψ_{i}, e_{θ i} = θ_{i}^{d} - θ_{i}

, and

e_{ψ} i = ψ_{i}^{d} - ψ_{i}

are attitude tracking errors;

s_{ϕ i} = c_{ϕ} e_{ϕ i} + {\dot{e}}_{ϕ i}, s_{θ i} = c_{θ} e_{θ i} + {\dot{e}}_{θ i}

, and

s_{ψ i} = c_{ψ} e_{ψ i} + {\dot{e}}_{ψ i}

are designed sliding mode surfaces; and

K_{a} =

diag

(k_{ϕ}, k_{θ}, k_{ψ})

and

C_{a} =

diag

(c_{ϕ}, c_{θ}, c_{ψ})

are two positive definite gain matrices.

Theorem 2.

For the attitude dynamics of a UAV (1d)–(1f), if

K_{a}

and

C_{a}

are positive definite, the attitude control procedure (18) can force the attitude tracking errors (

e_{ϕ i}

,

e_{θ i}

,

e_{ψ i}

) to converge to zero.

Proof.

The proof of Theorem 2 is similar to that of Theorem 1. The detailed proof can be seen in [25]. □

5. Simulation

To verify the effectiveness of the proposed method, simulations were conducted in different environments. The simulations were implemented in MATLAB R2024b and Simulink on a laptop with an Intel i7-14650HX @2.20 GHz CPU and 32 GB of RAM. SQP (Sequential Quadratic Programming) was utilized to solve the trajectory optimization problems.

5.1. Simulation Setup

The size of the UAVs’ working space was 50 m × 50 m, which was described by a grid map with a size of 50 × 50. The radius of the UAVs’ collision range was

R_{s a f e} = 0.4

m, and the acceleration of each UAV was constrained by

a_{m i n} = - 3

m/s²,

a_{m a x} = 3

m/s². The time interval between two adjacent trajectory points was set to

Δ t = 0.2

s. The weight coefficients in the cost function were chosen as

P = 1, Q = 1, β_{1} = β_{2} = 1, γ_{1} = 20, γ_{2} = 1

. The weight coefficients in the path search were chosen as

α_{1} = 1, α_{2} = 2.5

. The parameters of the tracking controller were selected as

C_{p} = C_{a} = d i a g (8, 8, 8)

,

K_{p} = d i a g (0.1, 0.8, 0.8)

, and

K_{a} = d i a g (10, 10, 10)

.

5.2. Simulation of Formation Planning

Figure 7 shows the trajectory optimization result of three UAVs maintaining a square formation in an environment with static obstacles. We captured the positions of the UAVs at certain trajectory points, including

k = 1, \frac{L}{6}, \frac{L}{3}, \frac{2 L}{3}, \frac{5 L}{6}

, and L. The starting and goal orientation angles of the formation were both

\frac{π}{2}

. The results showed that the formation could be maintained during the flight. The safety of the UAVs was guaranteed by sacrificing the quality of formation shape maintenance when they moved through the narrow areas between obstacles. After the UAVs passed through the obstacle-rich regions, they quickly regrouped in the reference formation shape. To validate the ability of UAVs to maintain formation with the proposed method, the formation errors are shown in Figure 8. The two subgraphs demonstrate the changes in

| | F_{i j}^{k} | |

and

\frac{F_{i j}^{k}}{| | F_{i j}^{k} | |}

, which represent the position error and the angle error between UAVs, respectively. It can be seen that the formation errors nearly converged to zero except when the swarm traversed the narrow gap between obstacles and during the subsequent regrouping phase. Figure 9 and Figure 10 are the simulation results in a different environment. The starting and goal orientation angles of the formation were changed into

\frac{π}{2}

and

π

, respectively. The statistical data of the formation errors were calculated and shown in Table 2. It can be observed that the mean values of both position error and angle error were very small, indicating that the formation was well maintained. Additionally, the standard deviations of the errors were relatively low, suggesting that the formation remained stable over time with minimal fluctuations in individual agents’ deviations. This demonstrates the robustness of the planned trajectories in maintaining formation integrity. The above simulation results prove that the proposed method has the ability to balance between safety and formation maintenance. The adaptability of the proposed formation planning method to different environments was also verified through the simulation results.

5.3. Simulation of Formation Tracking Control

In this subsection, the controller designed in Section 4 is utilized to track the formation trajectories shown in Figure 9. As is shown in Figure 11a, the control results drawn in blue lines track the desired trajectories well. The tracking errors, illustrated in Figure 11b, also prove that the tracking performance is quite satisfactory. It also indicates that the trajectories generated by the proposed method satisfy the dynamic constraints of quadrotor UAVs.

6. Experiment

The UAVs utilized in the experiment are Crazyflies (https://www.bitcraze.io/products/crazyflie-2-1/, accessed on 10 April 2025). The Crazyflie is a light, versatile, open-source quadrotor platform. The position measurements were supported by the OptiTrack (https://www.optitrack.com/, accessed on 10 April 2025) motion capture system. The proposed distributed formation planning was realized and solved using MATLAB R2024b. The control commands were broadcast using the Crazyradio PA (https://www.bitcraze.io/products/crazyradio-pa/, accessed on 10 April 2025) data transmission module through a data relay laptop running Ubuntu 20.04.

Four Crazyflies were considered in a planar obstacle environment. The reference formation and shape was a square with a side length of 1 m. The task of the UAVs was to pass by two obstacles, which were located at

{[3.7, - 1.8]}^{T}

and

{[0.75, 0.7]}^{T}

, respectively. To guarantee the safety of the flight, the cuboid obstacles were expanded in the xOy plane, i.e., the obstacles are modeled as cylinders with a radius of 1m. The starting and goal positions of the four UAVs were

\begin{matrix} \{\begin{matrix} s_{1} = {[1.0, - 1.8]}^{T} \\ s_{2} = {[2.0, - 1.8]}^{T} \\ s_{3} = {[2.0, - 2.8]}^{T} \\ s_{4} = {[1.0, - 2.8]}^{T} \end{matrix} \{\begin{matrix} g_{1} = {[2.25, 2.2]}^{T} \\ g_{2} = {[3.25, 2.2]}^{T} \\ g_{3} = {[3.25, 1.2]}^{T} \\ g_{4} = {[2.25, 1.2]}^{T} \end{matrix} \end{matrix}

(19)

The parameters in the experiment were selected as

\begin{matrix} \{\begin{matrix} a_{m i n} & = - 2 {m / s}^{2} \\ a_{m a x} & = 2 {m / s}^{2} \\ Δ t & = 0.25 s \\ R_{s a f e} & = 0.3 m \end{matrix} \{\begin{matrix} P & = 1 \\ Q & = 1 \\ β_{1} & = β_{2} = 1 \\ γ_{1} & = γ_{2} = 1 \end{matrix} \{\begin{matrix} α_{1} & = 1 \\ α_{2} & = 2.5 \end{matrix} \end{matrix}

(20)

Figure 12 shows snapshots at t = 0, 23, and 50 s during the real-world experiment. The Crazyflies are marked with flashing blue lights, and the colored cubes are the obstacles (modeled as cylinders). The red dashed lines represent the square formation of the UAVs during the flight, and the yellow arrows illustrate the reference path of the formation’s center. It can be seen that four UAVs maintaining a square formation were able to move along the reference path from the starting points (the upper right corner in the scene) to the goal points (the bottom left corner in the scene). Collisions with the two obstacles and collisions between UAVs were avoided. The reference formation shape was maintained well, and the formation scale was automatically transformed to pass through the narrow gap between obstacles. Figure 13 illustrates the UAV position data collected during the experiment. Note that the figure is displayed upside down from reality to better show the process of moving from the starting points to the goal points of the UAVs. The blue dots represent the four UAVs. The light blue lines and the red line represent the trajectories of the UAVs and the reference path, respectively. Obstacles are drawn in gray. It was found that the UAVs took off and formed an initial reference square formation at 0 s. As the UAVs drew closer to the obstacles during the flight, the formation shape was compressed because of the obstacle avoidance consciousness of each individual UAV (e.g., at 23 s). After that, the UAVs regrouped into the reference formation and finally land at 50 s.

7. Conclusions and Future Work

This paper proposed a distributed formation planning method for multiple UAVs in environments with static obstacles. The proposed method consists of swarm path searching and distributed trajectory optimization. We designed a path searching method named swarm-A* to find discrete, collision-free UAV paths that prevent formation disintegration when encountering an obstacle. The main result is a distributed trajectory optimization that is constructed and solved to transform the paths into smooth formation trajectories, with safe flight corridors being the safety constraints. A rotation matrix is applied to realize rotation of the whole formation, and the relative position vectors between the UAV and the reference points are utilized to build the cost function of the optimization. A tracking controller is designed to track the generated formation trajectories, confirming that the trajectories satisfy quadrotor dynamics. According to the simulation and real-world experiment results, the UAV swarm can travel in an obstacle-containing environment safely with flexible formations, realizing a balance between obstacle avoidance and formation shape maintenance. Smooth rotation and transformation of the UAV formation can be achieved. In summary, the main novel aspects of this paper are (1) the swarm-A* algorithm, which enhances the cohesion of the swarm and prevents disintegration of the formation during the path searching process, and (2) a distributed formation trajectory optimization framework that can balance collision avoidance and formation shape maintenance.

In future work, the proposed method can be enhanced by addressing the following. First, the method can only handle static obstacles. Real-time planning will be considered to handle moving obstacles, improving adaptability to different environments. Second, our current method is validated only in a planar setting. Extending it to full 3D space with altitude control will enable application to more complex aerial missions. Third, future work could explore adaptive strategies inspired by driving behavior research [36,37]. Such techniques could provide enhancement for UAV formations to handle multi-agent interactions, uncertainty, and individual differences better. Moreover, another promising research interest is to combine the proposed formation planning method with the theory of affine formation (in [19,20]) to achieve better performance in formation variation.

Author Contributions

Conceptualization, H.F., Q.Y., X.Z. and Z.Z.; methodology, Q.Y., X.Z. and Z.Z.; software, Z.Z.; validation, Z.Z.; formal analysis, Q.Y., X.Z. and Z.Z.; investigation, Z.Z.; resources, Z.Z.; data curation, Z.Z.; writing—original draft preparation, Z.Z.; writing—review and editing, Z.Z.; visualization, Z.Z.; supervision, H.F., Q.Y. and X.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the NSFC under grants 62373048, 62088101, in part by the National Key Research and Development Program of China under No. 2022YFB4702000, No. 2022YFA1004703, in part by the NSFC under Grants 62133002, U1913602, in part by the Fundamental Research Funds for the Central Universities and in part by the Shanghai Municipal Science and Technology Major Project (2021SHZDZX0100).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

According to (1a)–(1c), there exists

\begin{matrix} {\ddot{p}}_{i} = f (u_{i}) + g_{i} \end{matrix}

where

f (u_{i}) = [\begin{matrix} u_{x i} F_{i} / m_{i} \\ u_{y i} F_{i} / m_{i} \\ cos ϕ_{i} cos θ_{i} F_{i} / m_{i} \end{matrix}]

,

g_{i} = [\begin{matrix} 0 \\ 0 \\ - g \end{matrix}]

.

The position tracking errors are written as

\begin{matrix} {\hat{p}}_{i} = p_{i}^{d} - p_{i} \\ {\dot{\hat{p}}}_{i} = {\dot{p}}_{i}^{d} - {\dot{p}}_{i} . \end{matrix}

The sliding mode surface is denoted as

\begin{matrix} s_{p i} = C_{p} {\hat{p}}_{i} + {\dot{\hat{p}}}_{i} \end{matrix}

where

s_{p i} = {[s_{x i} s_{y i} s_{z i}]}^{T}

. The chosen Lyapunov function is

V_{p i} = \frac{1}{2} s_{p i}^{T} s_{p i}

. Its derivative is written as

\begin{matrix} {\dot{V}}_{p i} & = s_{p i}^{T} {\dot{s}}_{p i} \\ = - s_{p i}^{T} K_{p} s_{p i} \leq - 2 λ_{m i n} V_{p i} . \end{matrix}

where

λ_{m i n}

is the minimum eigenvalue of

K_{p}

. It implies that if

K_{p}

is positive definite, then

s_{p i} \to 0

. Therefore, for

s_{p i} = C_{p} {\hat{p}}_{i} + {\dot{\hat{p}}}_{i} = 0

, it can be observed that

\hat{p}

and

\dot{\hat{p}}

will converge to 0 as well.

References

Gu, J.; Su, T.; Wang, Q.; Du, X.; Guizani, M. Multiple moving targets surveillance based on a cooperative network for multi-UAV. IEEE Commun. Mag. 2018, 56, 82–89. [Google Scholar] [CrossRef]
Luo, Y.; Zhuang, Z.; Pan, N.; Feng, C.; Shen, S.; Gao, F.; Cheng, H.; Zhou, B. Star-Searcher: A Complete and Efficient Aerial System for Autonomous Target Search in Complex Unknown Environments. IEEE Robot. Autom. Lett. 2024, 9, 4329–4336. [Google Scholar] [CrossRef]
Scherer, J.; Yahyanejad, S.; Hayat, S.; Yanmaz, E.; Andre, T.; Khan, A.; Vukadinovic, V.; Bettstetter, C.; Hellwagner, H.; Rinner, B. An autonomous multi-UAV system for search and rescue. In Proceedings of the First Workshop on Micro Aerial Vehicle Networks, Systems, and Applications for Civilian Use, Florence, Italy, 18 May 2015; pp. 33–38. [Google Scholar]
Zhou, B.; Xu, H.; Shen, S. Racer: Rapid collaborative exploration with a decentralized multi-uav system. IEEE Trans. Robot. 2023, 39, 1816–1835. [Google Scholar] [CrossRef]
Hafez, A.T.; Marasco, A.J.; Givigi, S.N.; Iskandarani, M.; Yousefi, S.; Rabbath, C.A. Solving multi-UAV dynamic encirclement via model predictive control. IEEE Trans. Control Syst. Technol. 2015, 23, 2251–2265. [Google Scholar] [CrossRef]
Zhang, X.; Zhang, F.; Huang, P.; Gao, J.; Yu, H.; Pei, C.; Zhang, Y. Self-triggered based coordinate control with low communication for tethered multi-UAV collaborative transportation. IEEE Robot. Autom. Lett. 2021, 6, 1559–1566. [Google Scholar] [CrossRef]
Chai, Y.; Liang, X.; Han, J. A Unified Collision Avoidance Trajectory Planning with Dual Variables for Collaborative Aerial Transportation Systems. Drones 2024, 8, 637. [Google Scholar] [CrossRef]
Alonso-Mora, J.; Naegeli, T.; Siegwart, R.; Beardsley, P. Collision avoidance for aerial vehicles in multi-agent scenarios. Auton. Robot. 2015, 39, 101–121. [Google Scholar] [CrossRef]
Zhou, X.; Zhu, J.; Zhou, H.; Xu, C.; Gao, F. Ego-swarm: A fully autonomous and decentralized quadrotor swarm system in cluttered environments. In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an, China, 30 May–5 June 2021; pp. 4101–4107. [Google Scholar]
Zhou, X.; Wang, Z.; Wen, X.; Zhu, J.; Xu, C.; Gao, F. Decentralized spatial-temporal trajectory planning for multicopter swarms. arXiv 2021, arXiv:2106.12481. [Google Scholar]
Tordesillas, J.; How, J.P. MADER: Trajectory planner in multiagent and dynamic environments. IEEE Trans. Robot. 2021, 38, 463–476. [Google Scholar] [CrossRef]
Toumieh, C.; Floreano, D. High-Speed Motion Planning for Aerial Swarms in Unknown and Cluttered Environments. arXiv 2024, arXiv:2402.19033. [Google Scholar] [CrossRef]
Zhao, W.; Li, L.; Wang, Y.; Zhan, H.; Fu, Y.; Song, Y. Research on A Global Path-Planning Algorithm for Unmanned Arial Vehicle Swarm in Three-Dimensional Space Based on Theta*–Artificial Potential Field Method. Drones 2024, 8, 125. [Google Scholar] [CrossRef]
Bu, Y.; Yan, Y.; Yang, Y. Advancement Challenges in UAV Swarm Formation Control: A Comprehensive Review. Drones 2024, 8, 320. [Google Scholar] [CrossRef]
Yun, B.; Chen, B.M.; Lum, K.Y.; Lee, T.H. Design and implementation of a leader-follower cooperative control system for unmanned helicopters. J. Control Theory Appl. 2010, 8, 61–68. [Google Scholar] [CrossRef]
Jasim, W.; Gu, D. Robust team formation control for quadrotors. IEEE Trans. Control Syst. Technol. 2017, 26, 1516–1523. [Google Scholar] [CrossRef]
He, L.; Bai, P.; Liang, X.; Zhang, J.; Wang, W. Feedback formation control of UAV swarm with multiple implicit leaders. Aerosp. Sci. Technol. 2018, 72, 327–334. [Google Scholar] [CrossRef]
Zhang, J.; Zhang, P.; Li, L.; Ye, L. Distributed Cooperative Tracking Control for Multi-UAV Formation with Communication Constraints. In Proceedings of the China Conference on Command and Control, Beijing, China, 17–19 May 2024; Springer: Berlin/Heidelberg, Germany, 2024; pp. 225–236. [Google Scholar]
Zhang, X.; Yang, Q.; Lyu, J.; Zhao, X.; Fang, H. Distributed Variation Parameter Design for Dynamic Formation Maneuvers With Bearing Constraints. IEEE Trans. Autom. Sci. Eng. 2024, 21, 3664–3677. [Google Scholar] [CrossRef]
Zhang, X.; Lv, J.; Lu, S.; Yang, Q. Distributed Decision Making on Scaling Size for Obstacle Avoidance in Affine Formation Control. In Proceedings of the 2022 37th Youth Academic Annual Conference of Chinese Association of Automation (YAC), Beijing, China, 19–20 November 2022; pp. 1001–1006. [Google Scholar]
Gong, B.; Li, Y.; Zhang, L.; Ai, J. Adaptive Factor Fuzzy Controller for Keeping Multi-UAV Formation While Avoiding Dynamic Obstacles. Drones 2024, 8, 344. [Google Scholar] [CrossRef]
Nguyen, B.; Nghiem, T.; Nguyen, L.; Nguyen, T.; La, H.; Sookhak, M.; Nguyen, T. Distributed formation trajectory planning for multi-vehicle systems. In Proceedings of the 2023 American Control Conference (ACC), San Diego, CA, USA, 31 May–2 June 2023; pp. 1325–1330. [Google Scholar]
Quan, L.; Yin, L.; Xu, C.; Gao, F. Distributed swarm trajectory optimization for formation flight in dense environments. In Proceedings of the 2022 International Conference on Robotics and Automation (ICRA), Philadelphia, PA, USA, 23–27 May 2022; pp. 4979–4985. [Google Scholar]
Peng, P.; Dong, W.; Chen, G.; Zhu, X. Obstacle avoidance of resilient UAV swarm formation with active sensing system in the dense environment. In Proceedings of the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 23–27 October 2022; pp. 10529–10535. [Google Scholar]
Zhang, X.; Zhang, F.; Huang, P. Formation planning for tethered multirotor uav cooperative transportation with unknown payload and cable length. IEEE Trans. Autom. Sci. Eng. 2023, 21, 3449–3460. [Google Scholar] [CrossRef]
Mikkelsen, J.H.; Fumagalli, M. Distributed Planning for Rigid Robot Formations using Consensus on the Transformation of a Base Configuration. In Proceedings of the 2023 21st International Conference on Advanced Robotics (ICAR), Abu Dhabi, United Arab Emirates, 5–8 December 2023; pp. 627–632. [Google Scholar]
Liu, W.; Hu, J.; Zhang, H.; Wang, M.Y.; Xiong, Z. A Novel Graph-based Motion Planner of Multi-Mobile Robot Systems with Formation and Obstacle Constraints. IEEE Trans. Robot. 2023, 40, 714–728. [Google Scholar] [CrossRef]
Wen, G.; Chen, C.P.; Li, B. Optimized formation control using simplified reinforcement learning for a class of multiagent systems with unknown dynamics. IEEE Trans. Ind. Electron. 2019, 67, 7879–7888. [Google Scholar] [CrossRef]
Wang, L.; Wang, K.; Pan, C.; Xu, W.; Aslam, N.; Hanzo, L. Multi-agent deep reinforcement learning-based trajectory planning for multi-UAV assisted mobile edge computing. IEEE Trans. Cogn. Commun. Netw. 2020, 7, 73–84. [Google Scholar] [CrossRef]
Cao, Y.; Cheng, X.; Mu, J. Concentrated coverage path planning algorithm of UAV formation for aerial photography. IEEE Sens. J. 2022, 22, 11098–11111. [Google Scholar] [CrossRef]
Zhou, J.; Zhang, H.; Hua, M.; Wang, F.; Yi, J. P-DRL: A Framework for Multi-UAVs Dynamic Formation Control under Operational Uncertainty and Unknown Environment. Drones 2024, 8, 475. [Google Scholar] [CrossRef]
Bouabdallah, S.; Murrieri, P.; Siegwart, R. Design and control of an indoor micro quadrotor. In Proceedings of the IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA’04. 2004, New Orleans, LA, USA, 26 April–1 May 2004; Volume 5, pp. 4393–4398. [Google Scholar]
Labbadi, M.; Cherkaoui, M. Adaptive Fractional-Order Nonsingular Fast Terminal Sliding Mode Based Robust Tracking Control of Quadrotor UAV With Gaussian Random Disturbances and Uncertainties. IEEE Trans. Aerosp. Electron. Syst. 2021, 57, 2265–2277. [Google Scholar] [CrossRef]
Dolgov, D.; Thrun, S.; Montemerlo, M.; Diebel, J. Practical search techniques in path planning for autonomous driving. Ann Arbor 2008, 1001, 18–80. [Google Scholar]
Li, J.; Ran, M.; Xie, L. Efficient Trajectory Planning for Multiple Non-Holonomic Mobile Robots via Prioritized Trajectory Optimization. IEEE Robot. Autom. Lett. 2021, 6, 405–412. [Google Scholar] [CrossRef]
Mohammadnazar, A.; Arvin, R.; Khattak, A.J. Classifying travelers’ driving style using basic safety messages generated by connected vehicles: Application of unsupervised machine learning. Transp. Res. Part C Emerg. Technol. 2021, 122, 102917. [Google Scholar] [CrossRef]
Deng, Z.; Hu, W.; Sun, C.; Chu, D.; Huang, T.; Li, W.; Yu, C.; Pirani, M.; Cao, D.; Khajepour, A. Eliminating Uncertainty of Driver’s Social Preferences for Lane Change Decision-Making in Realistic Simulation Environment. IEEE Trans. Intell. Transp. Syst. 2025, 26, 1583–1597. [Google Scholar] [CrossRef]

Figure 1. Framework of the formation planning method. The dashed lines represent the discrete paths and the smooth curves represent the optimized trajectories.

Figure 2. Comparison of path searching results as the weight parameter

α_{2}

changes.

Figure 2. Comparison of path searching results as the weight parameter

α_{2}

changes.

Figure 3. An illustration of safe corridor generation.

Figure 4. An illustration of the rotation process of a reference formation. The black circles represent the UAVs, which are labeled by i.

Figure 5. The communication topology of four UAVs in a formation. The UAVs are represented by blue circles and are labeled by the numbers in the circles.

Figure 6. The tracking control framework of a single quadrotor.

Figure 7. Formation planning results for four UAVs in a square formation passing through a narrow aisle (

ω_{g} = ω_{s} = \frac{π}{2}

).

Figure 7. Formation planning results for four UAVs in a square formation passing through a narrow aisle (

ω_{g} = ω_{s} = \frac{π}{2}

).

Figure 8. Formation error of four UAVs in a square formation passing through a narrow aisle (

ω_{g} = ω_{s} = \frac{π}{2}

).

Figure 8. Formation error of four UAVs in a square formation passing through a narrow aisle (

ω_{g} = ω_{s} = \frac{π}{2}

).

Figure 9. Formation planning results of four UAVs in a square formation circumventing obstacles (

ω_{g} = \frac{π}{2}, ω_{s} = π

).

Figure 9. Formation planning results of four UAVs in a square formation circumventing obstacles (

ω_{g} = \frac{π}{2}, ω_{s} = π

).

Figure 10. Formation error of four UAVs in a square formation circumventing obstacles (

ω_{g} = \frac{π}{2}, ω_{s} = π

).

Figure 10. Formation error of four UAVs in a square formation circumventing obstacles (

ω_{g} = \frac{π}{2}, ω_{s} = π

).

Figure 11. (a) Formation tracking control results. The UAVs are labeled by the numbers in the blue dots. (b) Formation tracking control error.

Figure 12. Snapshots of four UAVs in a square formation in the experiment. The yellow arrows represent the reference path and the red dashed boxes represent the formation maintained by the UAVs.

Figure 13. Recorded trajectories of the UAVs in the experiment. The UAVs are labeled by the numbers in the blue dots.

Table 1. Comparison of different kinds of formation planning methods.

Method	Computational Complexity	Scalability	Environmental Adaptability
Control-based	relatively low	moderate	relatively low
Search-optimization	moderate	relatively high	relatively high
DRL-based	very high	high	high

Table 2. Error analysis of the simulation results (

ω_{g} = \frac{π}{2}, ω_{s} = π

).

Table 2. Error analysis of the simulation results (

ω_{g} = \frac{π}{2}, ω_{s} = π

).

	Mean of Position Error (m)	Standard Deviation of Position Error (m)	Mean of Angle Error (Degrees)	Standard Deviation of Angle Error (Degrees)
UAVs 1 and 2	0.0518	0.0798	1.7392	2.7281
UAVs 1 and 3	0.1007	0.1380	1.4751	2.3894
UAVs 1 and 4	0.1201	0.2062	1.6443	2.8327
UAVs 2 and 3	0.0685	0.0945	1.2044	2.1282
UAVs 2 and 4	0.1501	0.2802	1.5088	2.3690
UAVs 3 and 4	0.0788	0.1385	2.5375	4.6287

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhao, Z.; Zhang, X.; Fang, H.; Yang, Q. Distributed Formation Planning for Unmanned Aerial Vehicles. Drones 2025, 9, 306. https://doi.org/10.3390/drones9040306

AMA Style

Zhao Z, Zhang X, Fang H, Yang Q. Distributed Formation Planning for Unmanned Aerial Vehicles. Drones. 2025; 9(4):306. https://doi.org/10.3390/drones9040306

Chicago/Turabian Style

Zhao, Zeming, Xiaozhen Zhang, Hao Fang, and Qingkai Yang. 2025. "Distributed Formation Planning for Unmanned Aerial Vehicles" Drones 9, no. 4: 306. https://doi.org/10.3390/drones9040306

APA Style

Zhao, Z., Zhang, X., Fang, H., & Yang, Q. (2025). Distributed Formation Planning for Unmanned Aerial Vehicles. Drones, 9(4), 306. https://doi.org/10.3390/drones9040306

Article Menu

Distributed Formation Planning for Unmanned Aerial Vehicles

Abstract

1. Introduction

2. System Description and Problem Formulation

2.1. System Description

2.2. Problem Formulation

2.3. Graph Theory

3. Formation Planning

3.1. Path Searching

3.2. Waypoint Reallocation

3.3. Safe Corridor Generation

3.4. Trajectory Optimization

3.4.1. Smoothness Cost

3.4.2. Formation Cost

3.4.3. Trajectory Optimization Problem

4. Formation Tracking Control

5. Simulation

5.1. Simulation Setup

5.2. Simulation of Formation Planning

5.3. Simulation of Formation Tracking Control

6. Experiment

7. Conclusions and Future Work

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI