Adaptive Formation Control for Multi-UAV Swarms in Cluttered Environments with Communication Delays Under Directed Switching Topologies

Zhang, Yingzheng; Jin, Zhenghong

doi:10.3390/act15030163

Open AccessArticle

Adaptive Formation Control for Multi-UAV Swarms in Cluttered Environments with Communication Delays Under Directed Switching Topologies

by

Yingzheng Zhang

^1,2 and

Zhenghong Jin

^3,*

¹

School of Electronic Information Engineering, Henan Institute of Technology, Xinxiang 453003, China

²

Henan Industrial Equipment Intelligent Engineering Technology Research Center, Xinxiang 453003, China

³

School of Mechanical and Aerospace Engineering, Nanyang Technological University, Singapore 639798, Singapore

^*

Author to whom correspondence should be addressed.

Actuators 2026, 15(3), 163; https://doi.org/10.3390/act15030163

Submission received: 9 February 2026 / Revised: 24 February 2026 / Accepted: 10 March 2026 / Published: 12 March 2026

(This article belongs to the Section Aerospace Actuators)

Download

Browse Figures

Versions Notes

Abstract

This paper addresses distributed formation control for multiple unmanned aerial vehicles (UAVs) operating in obstacle-dense environments under directed switching communication topologies. A leader–follower architecture is adopted, wherein the leader performs online trajectory replanning while followers rely on delayed and intermittently available neighbor information. To simultaneously tackle collision avoidance, formation feasibility under narrow passages, and communication intermittency, we propose an integrated deformable formation navigation framework. The framework couples Safe Flight Corridor (SFC)-constrained Bézier trajectory planning with a dynamic formation scaling mechanism, allowing the swarm to adaptively shrink or expand its geometric configuration when traversing constricted spaces, thereby ensuring all agents remain within certified collision-free corridors. A nonlinear distributed consensus-based estimator is designed to propagate leader reference states under directed switching graphs with bounded delays. Using a max-min contraction analytical approach, we establish guaranteed practical convergence for both leader tracking and inter-follower agreement without requiring persistent connectivity. Extensive simulations in complex cluttered environments demonstrate that the proposed approach enables flexible and real-time formation reshaping, enhancing navigational safety and robustness while maintaining cohesive swarm behavior under challenging communication and spatial constraints.

Keywords:

adaptive formation control; unmanned aerial vehicle; safe flight corridor; communication delays; switching topology

1. Introduction

Autonomous multi-UAV systems have attracted increasing attention due to their potential in surveillance, environmental monitoring, disaster response, and cooperative transportation [1,2,3,4,5,6,7,8]. Compared with single vehicle systems [9,10], UAV formations offer enhanced robustness, scalability, and task efficiency by exploiting spatial cooperation and information sharing among multiple agents [11,12]. However, in practical applications, UAV formations [13,14] are often required to operate in obstacle-rich environments, where collision avoidance, trajectory feasibility, and formation maintenance must be simultaneously ensured. Moreover, realistic communication networks are subject to limited bandwidth, time-varying connectivity, and non-negligible delays, which significantly complicate the design and analysis of distributed formation control strategies [15,16]. These considerations motivate the development of formation control frameworks that explicitly account for environmental constraints and communication imperfections, while maintaining rigorous performance and safety guarantees.

Formation control under switching communication topologies and communication delays has been extensively studied in recent years [12,17,18,19,20,21]. Existing research has extensively studied consensus and formation maintenance under directed or undirected switching graphs. Convergence and robustness results have been established under various connectivity conditions [22,23], such as joint connectivity or uniformly quasi-strong connectivity [24,25]. In parallel, obstacle avoidance and safe navigation for multi-UAV systems have been addressed using potential fields, artificial constraints, and optimization-based trajectory planning methods [26,27,28]. Nevertheless, most existing formation control approaches [29] treat obstacle avoidance and formation maintenance as loosely coupled problems, often assuming either static formations or centralized planning. When communication topologies switch and delays are present, these approaches may suffer from degraded performance, loss of feasibility, or even safety violations, especially in narrow passages where the geometric footprint of the formation becomes critical.

Several techniques have been proposed to handle switching topologies and delays in distributed control problem of multi-agent systems. Lyapunov-based methods and common quadratic Lyapunov functions have been widely used to establish stability under switching graphs, but they typically impose restrictive assumptions on the switching signals or require conservative dwell-time conditions [30,31,32]. Alternatively, delay-tolerant consensus and estimation schemes have been developed using predictor-based designs or augmented-state formulations [33], which often lead to increased computational complexity and limited scalability. More recently, nonlinear agreement and contraction-based approaches have been explored to relax the need for fixed topologies and common Lyapunov functions [34]. Despite these advances, integrating such communication-aware online control mechanisms, obstacle-aware trajectory planning, and formation feasibility guarantees remains challenging. In particular, few existing works provide a unified framework that simultaneously addresses switching directed graphs, bounded delays, and formation-level safety constraints induced by obstacle-rich environments.

In obstacle-dense environments, the formation control problem becomes significantly more challenging due to geometric feasibility constraints imposed by limited free space [35,36]. Unlike open environments where a fixed formation geometry can be maintained throughout the mission, narrow passages and irregular obstacle distributions may render a rigid formation configuration infeasible. In such cases, the effective footprint of the formation, determined by inter-agent spacing, directly affects whether the entire team can remain within collision-free regions. Moreover, obstacle avoidance and formation maintenance are intrinsically coupled [37,38]. A leader trajectory that is safe for a single UAV may not guarantee safety for the entire formation unless formation size is explicitly incorporated into the planning stage. When communication topologies are switching and subject to delays, this coupling becomes even more critical, as followers rely on distributed and potentially outdated information to reconstruct the leader reference. Consequently, ensuring corridor feasibility, formation coherence, and distributed stability simultaneously in cluttered environments remains a nontrivial and underexplored problem. These considerations motivate the development of a unified framework that integrates Safe Flight Corridor-based trajectory planning with adaptive formation scaling and delay-robust distributed estimation under directed switching communication graphs.

The considered problem introduces several intertwined technical difficulties. First, in obstacle-dense environments, a fixed-geometry formation may become geometrically infeasible when traversing narrow passages, as the formation footprint may exceed the available free space. Second, obstacle avoidance and formation maintenance are strongly coupled: a trajectory that is collision-free for a single UAV does not necessarily guarantee safety for the entire formation unless formation size is explicitly incorporated into the planning stage. Third, under directed switching communication topologies with bounded delays, followers may only have intermittent and outdated access to leader information, making distributed reference reconstruction nontrivial. Finally, the overall closed-loop system forms a cascade interconnection between distributed estimation dynamics and nonlinear tracking control, requiring careful stability analysis to guarantee practical convergence.

In this paper, we develop a distributed formation planning and control framework that explicitly accounts for switching directed communication topologies, bounded delays, and obstacle-induced environmental constraints. A leader–follower architecture is adopted in which the leader performs online trajectory planning based on Safe Flight Corridors (SFCs), while followers reconstruct the leader reference through a nonlinear agreement mechanism under intermittent and delayed information exchange. Unlike conventional formation control frameworks that assume fixed formation geometry or treat obstacle avoidance and formation maintenance as loosely coupled modules, the proposed method embeds the formation footprint directly into corridor tightening and incorporates formation size as a planning variable. As a result, the geometric feasibility of the entire formation is preserved when traversing narrow passages under switching communication and delayed leader information. Furthermore, the nonlinear agreement protocol is not introduced as an isolated delay consensus scheme, but as an estimator layer tightly coupled with adaptive formation resizing. The theoretical analysis establishes a cascade structure that quantitatively links estimator contraction rate to ultimate formation tracking accuracy, thereby providing a unified safety–communication–tracking guarantee. The effectiveness and robustness of the framework are demonstrated through simulations of multi-UAV formations navigating obstacle-rich environments under time-varying communication conditions. Our main contributions are as follows:

(1): We propose a formation control framework that systematically addresses directed switching communication topologies and bounded transmission delays. By reformulating leader state dissemination as a nonlinear agreement process, the scheme achieves practical leader tracking and inter-follower consensus under uniformly quasi-strongly connected (UQSC) switching conditions, without requiring fixed topologies or global synchronization.
(2): We develop a planning--control co-design methodology that couples SFC-constrained Bézier trajectory planning with online optimization of time-varying formation size. The formation size is adaptively adjusted according to corridor feasibility, allowing the entire formation to safely contract or expand when navigating narrow passages. This integration bridges the gap between single-agent motion planning and multi-agent formation constraints in cluttered environments.
(3): Using nonlinear agreement theory and a window-based max-min contraction analysis, we establish formal proofs of follower agreement and practical leader tracking under directed switching graphs with bounded delays. The analysis avoids restrictive quadratic Lyapunov assumptions and naturally accommodates the hybrid dynamics arising from replanning and communication switching, providing explicit bounds on estimation and tracking errors.

The rest of this article is organized as follows. Section 2 presents the model of the UAV, followed by the problem formulation that explicitly states the control objectives and the assumptions. Section 3 explains the proposed trajectory planning and distributed formation control framework and main results. Section 4 demonstrates the effectiveness of the proposed algorithm through the simulation study. Finally, Section 5 concludes the paper. Appendix A provides the key lemmas and main proof, deriving the sufficient condition for constraint satisfaction and closed-loop stability under bounded disturbances.

Notation: In this paper,

R

denotes the set of real numbers and

R^{n}

denotes the Euclidean Space. We use

I_{n}

and

0_{m \times n}

to represent the n-dimensional identity matrix, and

n \times m

-dimensional zero matrix, respectively. Let

∥ \cdot ∥

be the Euclidean norm. For a signal

x (\cdot)

, define

\bar{x} (t) : = {sup}_{τ \in [t - \bar{d}, t]} x (τ)

and

\underset{̲}{x} (t) : = {inf}_{τ \in [t - \bar{d}, t]} x (τ)

for a given delay bound

\bar{d} \geq 0

. For a locally Lipschitz function

V (\cdot)

, denote

D^{+} V (t)

its upper right Dini derivative. For a directed graph

G = (V, E)

, let

N_{i}

be the in-neighborhood of node i. We use

1_{n}

to denote the all-one vector of length n.

R_{i} \in SO (3)

denotes the rotation matrix from the body-fixed frame to the inertial frame. The operator × denotes the vector cross product in

R^{3}

.

2. Preliminaries and Problem Formulation

2.1. UAV Model

Consider a team of

N + 1

UAVs indexed by

{0, 1, \dots, N}

, where UAV 0 is the leader and UAVs

i \in V : = {1, \dots, N}

are followers. Each UAV is described by the standard quadrotor rigid-body dynamics

\begin{matrix} m_{i} {\ddot{p}}_{i} = & m_{i} g e_{3} - F_{i} R_{i} e_{3}, \end{matrix}

(1)

\begin{matrix} J_{i} {\dot{Ω}}_{i} = & - Ω_{i} \times J_{i} Ω_{i} + M_{i}, \end{matrix}

(2)

where

p_{i} = {[x_{i}, y_{i}, z_{i}]}^{T} \in R^{3}

is the position in the inertial frame,

e_{3} = {[0, 0, 1]}^{⊤}

denotes the unit vector along the z-axis of the inertial frame,

m_{i}

denotes the mass of the i-th UAV,

g > 0

is the gravitational acceleration,

R_{i} \in SO (3)

is the attitude,

Ω_{i} \in R^{3}

is the body angular velocity,

F_{i} \in R

is the thrust, and

M_{i} \in R^{3}

is the body moment. As in the differential flatness framework, a geometric tracking controller can asymptotically track a sufficiently smooth reference position and yaw trajectory for each UAV. We will use this fact as a modular tracking layer.

For each follower

i \in V

, define its desired relative direction to the leader using angles

(ϕ_{i}, θ_{i})

,

c_{i} : = c (ϕ_{i}, θ_{i}) = [\begin{matrix} \sin θ_{i} \cos ϕ_{i} \\ \sin θ_{i} \sin ϕ_{i} \\ \cos θ_{i} \end{matrix}] \in R^{3} .

(3)

Let

d_{r} (t) \geq d_{\min} > 0

denote the time-varying formation size. Then the ideal formation reference for follower i is

p_{r, i} (t) = p_{r} (t) + c_{i} d_{r} (t),

(4)

where

p_{r} (t)

is the leader-planned safe trajectory and

d_{\min}

is a positive constant.

2.2. Switching Communication Topology

Communication among followers and the leader is described by a directed switching graph

G (t) = (V, E (t)), t \geq 0,

(5)

where

E (t) \subseteq V \times V

is piecewise constant with switching times

{t_{k}}_{k \in N}

. An edge

(j, i) \in E (t)

means that follower i can receive information from follower j at time t. Let

a_{i j} (t) \geq 0

be the weight on edge

(j, i)

with

a_{i i} (t) = 0

. Define the neighborhood set

N_{i} (t) = {j \in V : (j, i) \in E (t)}

and the normalized weights

{\bar{a}}_{i j} (t) = \{\begin{matrix} \frac{a_{i j} (t)}{\sum_{k \in N_{i} (t)} a_{i k} (t) + μ_{i} (t)} & j \in N_{i} (t), \\ 0 & otherwise, \end{matrix} {\bar{μ}}_{i} (t) = \frac{μ_{i} (t)}{\sum_{k \in N_{i} (t)} a_{i k} (t) + μ_{i} (t)} .

(6)

Here

μ_{i} (t) \in {0, 1}

indicates whether follower i has direct access from the leader at time t. We allow bounded time-varying communication delays:

d_{i j} (t) \in [0, \bar{d}], d_{i 0} (t) \in [0, \bar{d}], \forall i, j \in V .

(7)

where

\bar{d}

is the upper bound of the time delays.

2.3. Problem Description

In this paper, the main controlled system under consideration consists of a multi-UAV system (1) operating in an obstacle-rich environment. The leader UAV is equipped with access to a local map and capable of online replanning. In contrast, the follower UAVs typically lack sufficient computational resources for trajectory planning and do not necessarily receive the planning information from the leader directly.

Control objectives: Given global goal

p_{g}

and a sequence of local goals

{p_{l g, k}}

for replanning, we aim to design: (i) an online planner that generates a safe leader trajectory

p_{r} (t)

and formation size

d_{r} (t)

, and (ii) a distributed formation controller under switching directed communication such that:

all UAVs remain inside a Safe Flight Corridor $χ_{free} (t)$ :

$\begin{matrix} p_{i} (t) \in χ_{free} (t, k), \forall t \in [t_{k}, t_{k + 1}), i = 0, 1, \dots, N . \end{matrix}$

(8)
the leader tracks the planned trajectory:

$\begin{matrix} \underset{t \to \infty}{lim sup} ∥ p_{0} (t) - p_{r} (t) ∥ = 0 . \end{matrix}$

(9)
each follower asymptotically tracks its formation reference:

$\begin{matrix} \underset{t \to \infty}{lim sup} ∥ p_{i} (t) - p_{r, i} (t) ∥ = 0 . \end{matrix}$

(10)
the planned leader trajectory reaches each local goal within finite time during each replanning iteration:

$\begin{matrix} p_{r} (T) = p_{l g, k}, \exists 0 < T < \infty . \end{matrix}$

(11)

We adopt a leader-following analogue of the UQSC condition:

Assumption 1.

There exist constants

λ > 0

and

λ_{D} > 0

such that for any

t \geq 0

, (i) each active edge in

E (t)

persists for at least

λ_{D}

once it appears; and (ii) the union graph

G ([t, t + λ)) : = (V, \cup_{τ \in [t, t + λ)} E (τ))

contains a directed spanning tree whose root belongs to the set

{i \in V : \exists τ \in [t, t + λ) s . t . μ_{i} (τ) = 1}

, i.e., within every window of length λ, leader information can reach all followers through directed paths.

Remark 1.

Assumption 1 is a leader-following version of the UQSC/QSC-type joint connectivity used in switching directed consensus with delays [39]. It is strictly weaker than requiring a fixed spanning tree at all times.

Assumption 2.

Each follower i can access its prescribed relative angles

(ϕ_{i}, θ_{i})

.

The theoretical analysis is established under bounded delays and the joint leader reachability condition (Assumption 1), which ensure window-based contraction of the nonlinear agreement dynamics. In practice, UAV communication links may experience packet loss or temporary blackouts, leading to occasional violations of these conditions. During such periods, missing packets can be interpreted as effectively increased delays or intermittent unavailability of leader information. The proposed agreement dynamics remain well-posed and the estimator states stay bounded due to the dissipative feedback structure; however, the contraction rate may temporarily reduce and the tracking error bound can grow. Once communication quality recovers such that Assumption 1 holds again, the window-based contraction mechanism reactivates and the estimator error contracts back to its nominal neighborhood.

3. Trajectory Planning and Adaptive Formation Control

3.1. Safe Flight Corridor

Let

W_{w} \in R^{3}

denote the w-th waypoint of the guidance path. The endpoints

W_{0} = p_{0} (t_{k})

and

W_{L} = p_{l g, k}

are fixed, while the intermediate waypoints

{W_{1}, \dots, W_{L - 1}}

are decision variables to be optimized for smoothness and obstacle clearance.

At each replanning iteration k, the leader generates an initial waypoint list

W_{ini}

, then optimizes waypoints to improve smoothness and safety:

W_{opt} = \arg \min_{W_{1}, \dots, W_{L - 1}} λ_{s} \sum_{w = 1}^{L - 1} {∥ 2 W_{w} - W_{w - 1} - W_{w + 1} ∥}^{2} + λ_{c} \sum_{w = 1}^{L - 1} Ψ (d (W_{w}, χ)),

(12)

where

χ

denotes the surround environment,

d (W_{w}, χ)

is the distance from waypoint

W_{w}

to the closest obstacle,

Ψ (\cdot)

is a barrier/penalty enforcing a safe distance, and

λ_{s}

and

λ_{c}

are two positive constants.

As shown in Figure 1, the Safe Flight Corridor is represented as a sequence of adjacent convex polyhedra

χ_{free} : = ⋃_{ℓ = 1}^{L_{c}} P_{ℓ}, P_{ℓ} : = {x \in R^{3} ∣ A_{ℓ} x \leq c_{ℓ}},

(13)

where

L_{c} \in N

is the number of corridor cells. For each cell ℓ, the matrix

A_{ℓ} \in R^{n_{ℓ} \times 3}

and vector

c_{ℓ} \in R^{n_{ℓ}}

collect

n_{ℓ}

half-space inequalities

a_{ℓ r}^{⊤} x \leq c_{ℓ r}

(

r = 1, \dots, n_{ℓ}

), with

a_{ℓ r}^{⊤}

being the r-th row of

A_{ℓ}

.

To guarantee collision-free motion for a formation with size

d \geq d_{\min}

, we tighten each corridor cell by shrinking every supporting half-space inward. Assuming the half-space normals are normalized, i.e.,

∥ a_{ℓ r} ∥ = 1

, the tightened corridor is

A_{ℓ} x \leq c_{ℓ} - Δ_{ℓ} (d), Δ_{ℓ} (d) : = ζ d 1_{n_{ℓ}},

(14)

where

ζ > 1

is a safety margin. If

a_{ℓ r}

is not normalized, the shrinkage can be written as

Δ_{ℓ r} (d) = ζ d ∥ a_{ℓ r} ∥

, so that each face is offset inward by the geometric distance

ζ d

. The step of this method is concluded as Algorithm 1.

Algorithm 1 Safe Flight Corridor Construction

Require: Initial waypoint list

W_{ini}

, obstacle set

χ

, formation diameter d
Ensure: Corridor cells

{P_{ℓ}}

1: Generate initial piecewise linear path
2: for each segment do
3: Compute separating hyperplanes
4: Construct polyhedral cell
5: end for
6: Tighten constraints

A_{ℓ} x \leq c_{ℓ} - ζ d

3.2. Trajectory and Formation Size Generation

Within replanning iteration k, the leader trajectory is represented by an M-segment n-th order Bézier curve

p_{r, k} (t) = \sum_{j = 0}^{n} b_{j}^{n} (τ) P_{m, j}, τ = \frac{t - T_{m}}{T_{m + 1} - T_{m}} \in [0, 1], t \in [T_{m}, T_{m + 1}],

(15)

where

{P_{m, j}}

are control points and

b_{j}^{n}

are Bernstein polynomials. Using the convex hull property, containment in the Safe Flight Corridor can be enforced by requiring all control points in the corresponding polyhedron.

We choose the formation size

d_{k}

as an optimization variable jointly with control points and polyhedron assignment binaries

e_{m ℓ} \in {0, 1}

:

\begin{matrix} \min_{{P_{m, j}}, {e_{m ℓ}}, d_{k}} & λ_{traj} \sum_{m = 1}^{M} \int_{T_{m}}^{T_{m + 1}} {∥ {\overset{⃛}{p}}_{r, k} (t) ∥}^{2} d t + λ_{form} {(d_{k} - \bar{d})}^{2} \\ s . t . & dynamic feasibility bounds on \dot{p}, \ddot{p}, \overset{⃛}{p}, p^{(4)}, \\ \sum_{ℓ = 1}^{L_{c}} e_{m ℓ} = 1, e_{m ℓ} = 1 \Rightarrow A_{ℓ} P_{m, j} \leq c_{ℓ} - Δ_{ℓ} (d_{k}), \forall j, \\ p_{r, k} (T_{1}) = p_{0} (t_{k}), p_{r, k} (T_{M + 1}) = p_{l g, k} . \end{matrix}

(16)

A smooth transition

d_{r} (t)

between

d_{k - 1}

and

d_{k}

is generated by a high-order polynomial, satisfying boundary derivative matching up to order 4.

Remark 2.

The trajectory smoothness term is chosen as the integral of the squared jerk, i.e.,

\int ∥ {\overset{⃛}{p}}_{r, k} {(t) ∥}^{2} d t

, for the following reasons. First, the jerk directly reflects the rate of change of acceleration and is closely related to the smoothness of thrust and attitude commands of quadrotor UAVs, making it more suitable than acceleration-based costs for execution on real platforms. Second, when the trajectory is parameterized by Bézier curves, the jerk remains a low order polynomial, which leads to a quadratic cost in the control points and enables efficient MIQP formulation. Finally, higher-order smoothness is enforced through explicit bounds on snap, guaranteeing

C^{4}

continuity and compatibility with geometric tracking controllers. This choice provides a favorable trade-off between trajectory smoothness, tracking feasibility, and computational efficiency.

3.3. Distributed Time-Varying Formation Control Under Switching Topologies

To enable followers to compute

p_{r} (t)

and its derivatives, the leader publishes a parameter vector

s (t) \in R^{n_{s}}

, which contains the current segment Bézier control points and timing information and the current formation-size polynomial parameters. Given

s (t)

and time t, any agent can compute

p_{r} (t), {\dot{p}}_{r} (t), \dots, p_{r}^{(4)} (t)

, and

d_{r} (t), {\dot{d}}_{r} (t), \dots, d_{r}^{(4)} (t)

analytically. Since replanning occurs at discrete instants,

s (t)

is piecewise constant with bounded jumps and bounded update frequency.

For each coordinate

ℓ = 1, \dots, n_{s}

, define the scalar estimate

{\hat{s}}_{i, ℓ}

and the leader signal

s_{ℓ} (t)

. We propose the nonlinear agreement protocol with delays:

{\dot{\hat{s}}}_{i, ℓ} (t) = φ_{ℓ} ({\hat{s}}_{i, ℓ} (t) - κ_{i, ℓ} (t)),

(17)

where

κ_{i, ℓ} (t) : = \sum_{j \in N_{i} (t)} {\bar{a}}_{i j} (t) {\hat{s}}_{j, ℓ} (t - d_{i j} (t)) + {\bar{μ}}_{i} (t) s_{ℓ} (t - d_{i 0} (t)),

(18)

and the nonlinearity

φ_{ℓ} : R \to R

satisfies the strict dissipativity condition.

Assumption 3.

For each coordinate ℓ,

φ_{ℓ} (0) = 0

, and for all

r \neq 0

,

r φ_{ℓ} (r) < 0

. Moreover,

φ_{ℓ}

is locally Lipschitz and there exists a class-

K

function

α_{ℓ}

such that

| φ_{ℓ} (r) | \geq α_{ℓ} (| r |)

for all r in the semi-global domain of interest.

Define the leader oscillation over the delay window:

{osc}_{L} (t) : = S_{L}^{\max} (t) - S_{L}^{\min} (t), S_{L}^{\max} (t) : = sup_{τ \in [t - \bar{d}, t]} s (τ), S_{L}^{\min} (t) : = inf_{τ \in [t - \bar{d}, t]} s (τ) .

Equations (17) and (18) are a direct leader-following extension of the agreement model in [39]: each follower is driven toward a time-varying convex combination of delayed neighbor estimates and delayed leader signal. The core advantage is that its convergence can be established via max–min contraction arguments under Assumption 1, without requiring a common quadratic Lyapunov function for switching directed graphs.

Define the estimation error for coordinate ℓ:

e_{i, ℓ} (t) : = {\hat{s}}_{i, ℓ} (t) - s_{ℓ} (t)

. Let

E_{ℓ}^{\max} (t) : = \max_{i \in V} e_{i, ℓ} (t), E_{ℓ}^{\min} (t) : = \min_{i \in V} e_{i, ℓ} (t), W_{ℓ} (t) : = E_{ℓ}^{\max} (t) - E_{ℓ}^{\min} (t) .

Note that due to the leader time-variation, the asymptotic tracking generally requires additional internal model structure. Here we establish agreement among followers and practical tracking to the leader with a bound proportional to

∥ {\dot{s}}_{ℓ} ∥

.

Theorem 1.

Suppose Assumptions 1 and 3 and bounded delays (7) hold, then for each coordinate ℓ:

1.: The follower disagreement width $W_{ℓ} (t)$ converges to a neighborhood of zero:

$\underset{t \to \infty}{lim sup} W_{ℓ} (t) \leq γ_{ℓ} (∥ {\dot{s}}_{ℓ} ∥_{\infty}),$

for some class- $K$ functions, $γ_{ℓ} (\cdot)$ is determined by $α_{ℓ} (\cdot)$ and the joint leader reachability window $(λ, λ_{D})$ . In particular, if $s_{ℓ} (t)$ is piecewise constant, then $W_{ℓ} (t) \to 0$ on each constant interval.
2.: Each follower error satisfies the practical bound

$\underset{t \to \infty}{lim sup} \max_{i \in V} | e_{i, ℓ} (t) | \leq Γ_{ℓ} (∥ {\dot{s}}_{ℓ} ∥_{\infty}),$

where $Γ_{ℓ}$ is explicitly constructible via the window-based contraction recursion as in [39].

Proof.

We prove the result for a single scalar coordinate of the leader parameter vector

s (t)

. The vector case follows by applying the same argument componentwise and taking the maximum bounds.

Define the follower envelope over the delay window

{\hat{S}}^{\max} (t) : = \max_{i \in V} sup_{τ \in [t - \bar{d}, t]} {\hat{s}}_{i} (τ), {\hat{S}}^{\min} (t) : = \min_{i \in V} inf_{τ \in [t - \bar{d}, t]} {\hat{s}}_{i} (τ),

and the width

W (t) : = {\hat{S}}^{\max} (t) - {\hat{S}}^{\min} (t)

. By Lemma A3, for all i and all t

κ_{i} (t) \in [\min {{\hat{S}}^{\min} (t), S_{L}^{\min} (t)}, \max {{\hat{S}}^{\max} (t), S_{L}^{\max} (t)}] .

Applying Lemma A2 to each estimator

{\dot{\hat{s}}}_{i} = φ ({\hat{s}}_{i} - κ_{i})

implies that

{\hat{s}}_{i} (t)

cannot exit the convex hull generated by

{\hat{S}}^{\min} (t)

,

{\hat{S}}^{\max} (t)

and the leader window. Hence

W (t)

is well-defined and uniformly bounded for all

t \geq 0

.

By Assumption 1, for any t there exists a time window

[t, t + λ)

whose union graph contains a directed spanning tree rooted at some follower r that receives leader information (i.e.,

{\bar{μ}}_{r} (t) \geq μ_{*} > 0

on a subinterval of length at least

λ_{D}

). Applying Lemma A4 on this subinterval shows that

{\hat{s}}_{r}

moves away from the envelope extremes by a margin proportional to

(1 - e^{- k λ_{D}}) μ_{*} W (t)

, up to the leader window oscillation

{osc}_{L} (t) : = S_{L}^{\max} (t) - S_{L}^{\min} (t)

.

Along each directed edge of the spanning tree, Assumption 1 guarantees persistence for at least

λ_{D}

and a uniform weight lower bound

a_{*} > 0

. By Lemma A5, the contraction margin at the root propagates to each downstream node with at least a factor

a_{*} (1 - e^{- k λ_{D}})

per edge. Since the depth of the spanning tree is at most

| V | - 1

, after the window

[t, t + λ)

, every follower contracts toward the interior of the envelope.

Then there exist explicit constants

ρ : = (1 - e^{- k λ_{D}}) a_{*}^{N - 1} μ_{*} \in (0, 1), η : = 2 \sum_{m = 0}^{N - 1} a_{*}^{m} \in (0, \infty),

(19)

such that for all

t \geq 0

W (t + λ) \leq (1 - ρ) W (t) + η {osc}_{L} (t + λ) .

(20)

Iterating this recursion proves that (i) if

s (t)

is constant on

[t, t + λ]

, then

W (t)

strictly contracts and

W (t + n λ) \to 0

as

n \to \infty

; and (ii) for time-varying

s (t)

,

W (t)

is ultimately bounded by

\frac{η}{ρ} {lim sup}_{t \to \infty} {osc}_{L} (t)

.

The proof is complete. □

Each follower i reconstructs the leader reference and its derivatives from

{\hat{s}}_{i} (t)

:

{\hat{p}}_{r}^{(j)} (t) = P^{(j)} ({\hat{s}}_{i} (t), t), j = 0, 1, 2, 3, 4, {\hat{d}}_{r}^{(j)} (t) = D^{(j)} ({\hat{s}}_{i} (t), t), j = 0, 1, 2, 3, 4,

where

P^{(j)} (\cdot)

and

D^{(j)} (\cdot)

are analytic maps induced by Bézier/polynomial parameterization. Then the follower’s desired trajectory and derivatives are

{\hat{p}}_{r, i}^{(j)} (t) = {\hat{p}}_{r}^{(j)} (t) + c_{i} {\hat{d}}_{r}^{(j)} (t), j = 0, 1, 2, 3, 4 .

(21)

Finally, each follower applies a geometric tracking controller using

{\hat{p}}_{r, i} (t)

,

{\dot{\hat{p}}}_{r, i} (t)

,

{\ddot{\hat{p}}}_{r, i} (t)

, etc.

Given the implemented reference

{\hat{p}}_{r, i} (t)

and yaw

{\hat{ψ}}_{r, i} (t)

with bounded derivatives up to order four, we adopt a standard geometric tracking controller on

SE (3)

. Define the position/velocity errors

e_{p, i} : = p_{i} - {\hat{p}}_{r, i}, e_{v, i} : = {\dot{p}}_{i} - {\dot{\hat{p}}}_{r, i} .

A commanded acceleration is chosen as

a_{c, i} : = {\ddot{\hat{p}}}_{r, i} - K_{p} e_{p, i} - K_{v} e_{v, i},

and the thrust is set to

F_{i} : = m_{i} a_{c, i}^{⊤} (R_{i} e_{3}) .

The desired attitude

R_{c, i} (t) \in SO (3)

is constructed from the commanded acceleration and yaw. Its corresponding reference angular velocity is defined by

{\hat{Ω}}_{r, i} (t) : = {(R_{c, i}^{⊤} (t) {\dot{R}}_{c, i} (t))}^{\lor},

where

{(\cdot)}^{\lor}

denotes the vee map from

so (3)

to

R^{3}

.

The desired attitude

R_{c, i}

is constructed such that

R_{c, i} e_{3}

aligns with

a_{c, i} - g e_{3}

and the yaw matches

{\hat{ψ}}_{r, i}

. Then the moment input

M_{i}

is designed by a standard attitude error feedback controller

\begin{matrix} M_{i} : = - K_{R} e_{R, i} - K_{Ω} e_{Ω, i} + Ω_{i} \times J_{i} Ω_{i} - J_{i} ({\hat{Ω}}_{r, i}^{\times} R_{i}^{⊤} R_{c, i} {\hat{Ω}}_{r, i} - R_{i}^{⊤} R_{c, i} {\dot{\hat{Ω}}}_{r, i}), \end{matrix}

(22)

which guarantees exponential tracking of

(p_{i}, R_{i})

to

({\hat{p}}_{r, i}, R_{c, i})

for sufficiently smooth references. This yields the ISS-type bound in Lemma A7. For a detailed ISS definition, see [40].

Assumption 4.

For each coordinate ℓ, the function

φ_{ℓ} : R \to R

is continuous, locally Lipschitz,

φ_{ℓ} (0) = 0

, and there exists a constant

k_{ℓ} > 0

such that

r φ_{ℓ} (r) \leq - k_{ℓ} r^{2}, \forall r \in R .

(23)

Remark 3.

Assumption 4 is a standard strong dissipativity/sector condition. It implies the sign condition

r φ_{ℓ} (r) < 0

for

r \neq 0

used in the switching agreement literature, while additionally providing a quantitative contraction rate

k_{ℓ}

that yields explicit recursion constants. A canonical choice is

φ_{ℓ} (r) = - k_{ℓ} r

or

φ_{ℓ} (r) = - k_{ℓ} tanh (r / ϵ)

.

Proposition 1

(Corridor tightening implies formation safety under time-varying radius). Fix a replanning interval

t \in [t_{k}, t_{k + 1})

and a corridor cell

P_{ℓ} : = {x \in R^{n} ∣ A_{ℓ} x \leq c_{ℓ}}

. Assume each row

a_{ℓ r}^{⊤}

of

A_{ℓ}

is normalized, i.e.,

∥ a_{ℓ r} ∥ = 1

. Let the leader reference satisfy the tightened constraints

A_{ℓ} p_{r} (t) \leq c_{ℓ} - ζ {\bar{d}}_{k} 1, \forall t \in [t_{k}, t_{k + 1}),

(24)

for some tightening radius

{\bar{d}}_{k} > 0

and safety factor

ζ > 1

. Suppose the executed formation radius after smoothing,

d_{r} (t)

, satisfies

d_{r} (t) \leq {\bar{d}}_{k}, \forall t \in [t_{k}, t_{k + 1}),

(25)

and each follower remains within distance

ζ d_{r} (t)

of the leader reference, i.e.,

∥ p_{i} (t) - p_{r} (t) ∥ \leq ζ d_{r} (t), \forall i \in {1, \dots, N} .

(26)

Then all agents remain inside the original corridor cell:

A_{ℓ} p_{i} (t) \leq c_{ℓ}, \forall t \in [t_{k}, t_{k + 1}), \forall i \in {0, 1, \dots, N} .

(27)

Proof.

For any face inequality

a_{ℓ r}^{⊤} x \leq c_{ℓ r}

, by (24)

a_{ℓ r}^{⊤} p_{r} (t) \leq c_{ℓ r} - ζ {\bar{d}}_{k} .

For any follower i, write

p_{i} (t) = p_{r} (t) + Δ_{i} (t)

. Using

∥ a_{ℓ r} ∥ = 1

and Cauchy–Schwarz

a_{ℓ r}^{⊤} p_{i} (t) = a_{ℓ r}^{⊤} p_{r} (t) + a_{ℓ r}^{⊤} Δ_{i} (t) \leq c_{ℓ r} - ζ {\bar{d}}_{k} + ∥ Δ_{i} (t) ∥ .

By (26) and (25),

∥ Δ_{i} (t) ∥ \leq ζ d_{r} (t) \leq ζ {\bar{d}}_{k}

; hence,

a_{ℓ r}^{⊤} p_{i} (t) \leq c_{ℓ r}

. Since this holds for all faces,

A_{ℓ} p_{i} (t) \leq c_{ℓ}

. □

Theorem 2.

Under Assumptions 1–3, and bounded delays (7), the closed-loop system consisting of the online planner (12)–(16), the nonlinear estimators (17) and (18), and the tracking controllers, satisfies:

1.: All estimation signals ${\hat{s}}_{i} (t)$ are uniformly bounded for all $t \geq 0$ .
2.: The reconstructed references ${\hat{p}}_{r, i} (t)$ , ${\hat{d}}_{r} (t)$ are uniformly bounded and satisfy the practical tracking bounds implied by Theorem 1.
3.: The formation tracking error $e_{i} (t) : = p_{i} (t) - p_{r, i} (t)$ is ultimately bounded:

$\underset{t \to \infty}{lim sup} ∥ e_{i} (t) ∥ \leq Ξ (∥ \dot{s} ∥_{\infty}),$

where $Ξ (\cdot)$ can be made arbitrarily small by increasing the estimator contraction gain.

Proof.

The closed-loop system can be interpreted as a cascade interconnection of three subsystems:

(i): The nonlinear distributed estimator layer (17) and (18).
(ii): The reference reconstruction layer induced by the Bézier and formation-size parameterization.
(iii): The geometric tracking control layer for each UAV.

We analyze these subsystems sequentially.

Step 1: Estimator practical convergence.

By Theorem 1, for each follower i

\underset{t \to \infty}{lim sup} ∥ {\hat{s}}_{i} (t) - s (t) ∥ \leq \frac{η}{ρ} \underset{t \to \infty}{lim sup} {osc}_{L} (t),

(28)

where

η

and

ρ

are positive constants defined in (19). Moreover, on any interval where

s (t)

is constant (i.e., between replanning instants), the estimation error converges exponentially to zero. Hence, all estimator states remain uniformly bounded for all

t \geq 0

.

Step 2: Reference reconstruction error bound.

By Lemma A6, the mappings

p_{r}^{(j)} (t) = P^{(j)} (s (t), t), d_{r}^{(j)} (t) = D^{(j)} (s (t), t), j = 0, \dots, 4,

are locally Lipschitz with respect to the parameter vector s on the compact set ensured by the planner.

Therefore, for each follower i and

j = 0, \dots, 4

, there exist constants

L_{j} > 0

such that

sup_{t \geq 0} ∥ {\hat{p}}_{r, i}^{(j)} (t) - p_{r, i}^{(j)} (t) ∥ \leq L_{j} sup_{t \geq 0} ∥ {\hat{s}}_{i} (t) - s (t) ∥ .

(29)

Thus the reference mismatch is uniformly bounded and inherits the practical convergence property of the estimator layer.

Step 3: Tracking layer ISS property.

From Lemma A7, the geometric tracking controller satisfies the input-to-state stability (ISS) type bound

∥ e_{i} (t) ∥ \leq c_{i} e^{- γ_{i} t} ∥ e_{i} (0) ∥ + c_{i} sup_{τ \in [0, t]} ∥ {\hat{p}}_{r, i} (τ) - p_{r, i} (τ) ∥,

(30)

where

e_{i} (t) : = p_{i} (t) - p_{r, i} (t)

.

Taking the limit superior of (30) yields

\underset{t \to \infty}{lim sup} ∥ e_{i} (t) ∥ \leq c_{i} \underset{t \to \infty}{lim sup} ∥ {\hat{p}}_{r, i} (t) - p_{r, i} (t) ∥ .

(31)

Step 4: Composite bound.

Combining (31) with the reconstruction bound (29) gives

\underset{t \to \infty}{lim sup} ∥ e_{i} (t) ∥ \leq C_{i} \underset{t \to \infty}{lim sup} ∥ {\hat{s}}_{i} (t) - s (t) ∥,

(32)

for some constant

C_{i} > 0

.

Substituting the estimator bound (28) yields

\underset{t \to \infty}{lim sup} ∥ e_{i} (t) ∥ \leq Ξ (∥ \dot{s} ∥_{\infty}),

(33)

where

Ξ (r) = C_{i} \frac{η}{ρ} r .

Since the contraction factor

ρ

increases with the estimator gain

k_{ℓ}

(Assumption 4), the ultimate formation tracking bound can be made arbitrarily small by increasing the estimator contraction gain.

This completes the strengthened proof. □

Remark 4.

Compared with existing formation control methods under obstacle environments, the proposed framework offers several advantages. First, rather than treating obstacle avoidance and formation maintenance as loosely coupled problems, the formation footprint is explicitly embedded into the Safe Flight Corridor construction through corridor tightening. This guarantees the geometric feasibility of the entire formation in constrained spaces. Second, unlike approaches that rely on fixed or persistently connected communication graphs, the proposed nonlinear agreement protocol only requires joint leader reachability under switching directed topologies with bounded delays. Third, the analysis establishes an explicit cascade relationship between the estimator contraction rate and ultimate formation tracking accuracy, providing quantitative performance guarantees instead of purely qualitative convergence claims.

3.4. Computational Complexity Discussion

The Safe Flight Corridor construction consists of two main steps: (i) obstacle separation and half-space generation, and (ii) mixed-integer trajectory optimization.

For corridor construction, suppose each path segment interacts with at most

N_{o}

nearby obstacles. Computing separating hyperplanes requires

O (N_{o})

operations per segment. If the path contains L segments, the overall complexity of corridor generation is

O (L N_{o})

.

For trajectory optimization, let M denote the number of Bézier segments and n the polynomial order. The number of continuous decision variables is

O (M n)

, while the number of binary variables equals the number of possible cell assignments per segment. The resulting optimization is a mixed-integer quadratic program (MIQP), whose worst-case complexity is exponential in the number of binary variables. However, since the corridor decomposition restricts feasible assignments locally, the practical computation remains tractable for moderate environment complexity.

Overall, the dominant computational cost arises from the MIQP solver, whereas corridor construction scales linearly with the number of obstacles and path segments.

4. Simulation Results

In this section, we employ a numerical example to verify the effectiveness of the proposed algorithm.

To further evaluate robustness, we consider stochastic packet delays and intermittent leader communication. Specifically, communication delays are randomly generated within

[0, \bar{d}]

following a uniform distribution, and leader access is temporarily disabled with a fixed probability to emulate packet loss. Simulation results indicate that although transient estimator errors increase during temporary connectivity violations, all states remain bounded due to the dissipative agreement structure. Once joint leader reachability is restored, the window-based contraction mechanism drives the estimator errors back to their nominal bounds. These results support the robustness of the proposed framework under more realistic communication conditions.

We consider a planar multi-UAV formation navigation task in an obstacle environment. The team consists of five UAVs, including one leader indexed by 0 and

N = 4

followers indexed by

V = {1, 2, 3, 4}

. The leader starts from

p_{0} = {[0, 0]}^{⊤}

and aims to reach the goal

p_{g} = {[13, 0]}^{⊤}

. The environment contains six static circular obstacles, specified by their centers and radius:

{(3.5, 0.0, 0.7), (6.0, 1.2, 0.8), (6.5, - 1.4, 0.9), (9.0, 0.0, 0.7), (11.0, 1.4, 0.8), (11.0, - 2, 0.65)},

where each triple denotes

(x, y, r)

.

Followers are required to maintain a time-varying circular formation around the leader reference trajectory

p_{r} (t)

. Let

θ_{i} \in {0, \frac{π}{2}, π, \frac{3 π}{2}}

be the desired angular offsets equally spaced on a circle. Define

c_{i} = {[\cos θ_{i}, \sin θ_{i}]}^{⊤}

. The ideal formation reference for follower i is

p_{r, i} (t) = p_{r} (t) + d_{r} (t) c_{i},

(34)

where

d_{r} (t) \geq d_{\min} = 0.45 > 0

is the time-varying formation size.

Communication among followers is described by a directed switching graph with three modes. We use the adjacency matrix

A^{(m)} = [a_{i j}^{(m)}] \in R^{4 \times 4}

, where

a_{i j}^{(m)} = 1

indicates a directed edge

(i \to j)

(i.e., follower j receives information from follower i) under mode m. The three modes are:

A^{(1)} = [\begin{matrix} 0 & 1 & 1 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \end{matrix}], A^{(2)} = [\begin{matrix} 0 & 0 & 0 & 0 \\ 1 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \end{matrix}], A^{(3)} = [\begin{matrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 1 & 0 & 0 & 0 \end{matrix}] .

(35)

The topology switches periodically every

T_{s} = 1.2 s

. In addition, the leader information is directly available to exactly one follower per mode, encoded by

μ^{(m)} \in {0, 1}^{4}

:

μ^{(1)} = [\begin{matrix} 1 \\ 0 \\ 0 \\ 0 \end{matrix}], μ^{(2)} = [\begin{matrix} 0 \\ 1 \\ 0 \\ 0 \end{matrix}], μ^{(3)} = [\begin{matrix} 0 \\ 0 \\ 1 \\ 0 \end{matrix}] .

(36)

Communication delays are time-varying but bounded by

\bar{d} = 0.1 s

.

This subsection lists the simulation parameters and briefly verifies that the theoretical assumptions are satisfied. The simulation runs with sampling time

Δ t = 0.05 s

over

T_{end} = 70 s

. The leader replans every

T_{replan} = 2 s

. The input of UAVs are sped with saturation

∥ v_{0} ∥ \leq v_{\max, L} = 0.9

for the leader and

∥ v_{i} ∥ \leq v_{\max, F} = 1.1

for followers. The leader tracks the planned reference point

p_{r} (t)

by a proportional law

v_{0} (t) = sat (k_{L} (p_{r} (t) - p_{0} (t)), v_{\max, L}),

with

k_{L} = 1.5

. Followers track their formation references using

v_{i} (t) = sat (- k_{track} (p_{i} (t) - {\hat{p}}_{r, i} (t)) + v_{i}^{rep} (t), v_{\max, F}),

with

k_{track} = 2.0

, where

v_{i}^{rep} (t)

is a light obstacle repulsion term for safety.

The leader generates a waypoint list

W_{ini} = [W_{0}, \dots, W_{L}]

(with

W_{0} = p_{0} (t_{k})

and

W_{L} = p_{g}

) and constructs a Safe Flight Corridor as a sequence of convex polyhedra

χ_{free} = ⋃_{ℓ = 1}^{L_{c}} P_{ℓ}, P_{ℓ} = {x \in R^{2} : A_{ℓ} x \leq c_{ℓ}} .

In implementation, each

P_{ℓ}

is initialized as an axis-aligned corridor box inflated by

1.0 m

and then tightened by obstacle separating half-planes. The leader trajectory over each corridor segment is parameterized by a quintic Bézier curve (

n = 5

). The control points are obtained by solving a quadratic program that minimizes a discrete proxy of the integrated squared jerk (third differences of control points), with weight

λ_{traj} = w_{jerk} = 1.0

.

The nominal formation size is

d_{nom} = 1.0

and the minimum radius is

d_{\min} = 0.45

. The corridor tightening uses the safety factor

ζ = 1.1

, so that the corridor constraints are reduced by approximately

ζ d_{k}

at each replanning iteration (consistent with the formation footprint). The planned radius

d_{k}

is smoothed into

d_{r} (t)

using a first-order filter with time constant

τ_{d} = 0.6 s

.

Each follower maintains an estimate

{\hat{p}}_{r, i} (t) \in R^{2}

of the leader reference point

p_{r} (t)

using the nonlinear agreement protocol with delays:

{\dot{\hat{p}}}_{r, i} (t) = φ ({\hat{p}}_{r, i} (t) - κ_{i} (t)), φ (r) = - k_{φ} tanh (r / ε), k_{φ} = 3.0, ε = 0.18,

where

κ_{i} (t)

is a convex combination of delayed neighbor estimates and (when available) the leader reference:

κ_{i} (t) = \sum_{j \in N_{i} (t)} {\bar{a}}_{i j} (t) {\hat{p}}_{r, j} (t - d_{i j} (t)) + {\bar{μ}}_{i} (t) p_{r} (t - d_{i 0} (t)),

with normalized weights

{\bar{a}}_{i j} (t)

and

{\bar{μ}}_{i} (t)

defined as in (6). Delays satisfy

d_{i j} (t), d_{i 0} (t) \in [0, \bar{d}]

and are simulated by random discrete delays up to

D = ⌈ \bar{d} / Δ t ⌉

steps.

The directed switching graphs (35) together with leader access vectors (36) ensure that within each switching window, the leader information can reach all followers through directed paths, satisfying the joint leader reachability condition. The delay bound

\bar{d}

is enforced by construction. The nonlinearity

φ (r)

satisfies

φ (0) = 0

and

r φ (r) < 0

for all

r \neq 0

, fulfilling the strict dissipativity requirement. Therefore, Assumptions 1–4 of the theoretical analysis are satisfied in the simulation.

The leader starts at

p_{0} = {[0, 0]}^{⊤}

. Each follower starts near the leader with small random perturbations around the desired circular formation positions. The initial estimator states

{\hat{p}}_{r, i} (0)

are also perturbed, resulting in nonzero initial estimation errors. The simulation results are shown as in Figure 2, Figure 3, Figure 4, Figure 5, Figure 6 and Figure 7. Figure 2 shows the trajectories of the leader and followers. The leader successfully navigates through the cluttered environment and reaches the goal, while followers maintain the desired circular formation. As the team approaches narrow passages, the formation size

d_{r} (t)

decreases, enabling the entire formation to remain within the safe corridor, and then increases again in open regions. Figure 3 compares the planned leader reference trajectory

p_{r} (t)

and the executed leader trajectory

p_{0} (t)

. The executed trajectory closely tracks the planned reference, demonstrating that the planned trajectory is sufficiently smooth for tracking. Figure 4 plots the time evolution of the formation size

d_{r} (t)

, showing adaptive shrink behavior consistent with obstacle proximity. Figure 5 reports the distributed estimator errors

∥ {\hat{p}}_{r, i} (t) - p_{r} (t) ∥

for all followers. Figure 6 illustrates the mode switching process of the topology, while Figure 7 shows a screenshot of the animation depicting the collective motion of the moving bodies in formation. Despite switching directed communication topologies and bounded delays, the estimator errors remain bounded and converge to small neighborhoods of zero, which corroborates the practical agreement guarantee.

The simulation results validate the proposed framework in a cluttered environment: (i) online planning generates smooth safe trajectories; (ii) adaptive formation sizing enables obstacle avoidance for the entire team; and (iii) the distributed nonlinear estimator achieves robust leader reference tracking under directed switching graphs with delays, leading to successful distributed formation control.

In practical UAV systems, the proposed framework can be implemented in a distributed manner. The leader computes the Safe Flight Corridor and formation size optimization at each replanning step, while followers run lightweight nonlinear agreement dynamics and geometric tracking controllers. Communication overhead is limited to parameter vector exchange rather than full trajectory broadcasting. The dissipative structure naturally tolerates moderate delay variations and packet loss, provided that joint connectivity is periodically restored. Computationally, corridor construction scales linearly with nearby obstacles, and the MIQP problem remains tractable for moderate environment complexity.

5. Conclusions

This paper presented a distributed formation planning and control framework for multi-UAV systems operating under directed switching communication topologies and environmental constraints. By integrating SFC-based trajectory planning with adaptive formation sizing, the proposed method enables the entire formation to safely navigate obstacle-rich environments. A nonlinear agreement protocol was introduced to handle delayed and intermittent leader information, and rigorous analysis established practical tracking and agreement guarantees under switching graphs. Simulation results validated the theoretical findings and demonstrated robust performance in challenging scenarios. Future work will focus on extending the framework to fully three-dimensional environments, incorporating more complex vehicle dynamics, and investigating experimental validation on real UAV platforms.

The present study considers a single-leader formation architecture. Extending the framework to multiple leaders introduces additional challenges, including potential inconsistency among leader references, multi-source information fusion in the distributed estimator, and more complex cascade stability analysis. In multi-leader scenarios, additional coordination mechanisms among leaders or consensus-based aggregation strategies would be required to ensure coherent formation behavior. Although the proposed nonlinear agreement protocol can serve as a foundation for such extensions under suitable connectivity assumptions, rigorous analysis for multi-leader formations is left for future investigation.

Despite the demonstrated effectiveness of the proposed framework, several limitations remain. First, the theoretical guarantees rely on bounded communication delays and a joint leader reachability condition; performance under severe packet loss or prolonged disconnections is not addressed. Second, the Safe Flight Corridor construction assumes accurate obstacle representation and convex decomposition, which may become conservative in highly complex 3D environments. Third, the formation control strategy focuses on practical convergence rather than strict asymptotic consensus under time-varying leader signals. Finally, the current validation is limited to numerical simulations, and experimental implementation on real UAV platforms remains as future work.

Author Contributions

Conceptualization, Y.Z. and Z.J.; methodology, Y.Z. and Z.J.; software, Y.Z.; validation, Z.J.; formal analysis, Y.Z. and Z.J.; investigation, Y.Z.; writing—original draft preparation, Y.Z. and Z.J.; writing—review and editing, Y.Z. and Z.J.; visualization, Y.Z.; supervision, Y.Z. and Z.J.; project administration, Y.Z. and Z.J. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Natural Science Foundation of China under Grant 62403423, and the Natural Science Foundation of Zhejiang Province under Grant LMS25F030003. Henan Province Science and Technology Research Project (242102240056, 252102241020).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Lemmas and Main Proofs

Before presenting the technical Lemmas, we briefly summarize their roles in the overall proof structure. The analysis follows a cascade logic. First, Lemmas A1–A5 establish contraction of the nonlinear agreement dynamics under switching directed graphs with delays. In particular, Lemma A5 characterizes how contraction generated at a root node propagates along persistent directed edges, which is essential for proving window-based max–min contraction in Theorem 1. Second, Lemma A6 provides a Lipschitz continuity result that maps parameter estimation errors to trajectory reconstruction errors induced by the Bézier and formation-size parameterization. Finally, Lemma A7 establishes an ISS-type tracking bound for the geometric controller, showing that the UAV formation tracking error is ultimately bounded by the reference mismatch. This layered structure enables the derivation of Theorem 2 through a cascade argument from distributed agreement to formation tracking.

Lemma A1.

Under Assumptions 3 and bounded delays (7), for each ℓ, the follower envelope satisfies for almost all t

\begin{matrix} D^{+} E_{ℓ}^{\max} (t) \leq & - α_{ℓ} (E_{ℓ}^{\max} (t) - {\bar{κ}}_{ℓ} (t)) + {∥ {\dot{s}}_{ℓ} ∥}_{\infty}, \end{matrix}

(A1)

\begin{matrix} D^{+} E_{ℓ}^{\min} (t) \geq & α_{ℓ} ({\underset{̲}{κ}}_{ℓ} (t) - E_{ℓ}^{\min} (t)) - {∥ {\dot{s}}_{ℓ} ∥}_{\infty}, \end{matrix}

(A2)

where

{\bar{κ}}_{ℓ} (t)

and

{\underset{̲}{κ}}_{ℓ} (t)

are appropriate max/min convex combination bounds induced by (18).

Proof.

Pick an index

i^{*} (t) \in \arg \max_{i} e_{i, ℓ} (t)

. Using the Dini derivative of a pointwise maximum and the dynamics (17)

\begin{matrix} D^{+} E_{ℓ}^{\max} (t) = D^{+} e_{i^{*}, ℓ} (t) = φ_{ℓ} ({\hat{s}}_{i^{*}, ℓ} - κ_{i^{*}, ℓ}) - {\dot{s}}_{ℓ} (t) . \end{matrix}

(A3)

By Assumption 3,

φ_{ℓ} (\cdot)

is sign-definite and drives

{\hat{s}}_{i^{*}, ℓ}

toward

κ_{i^{*}, ℓ}

. Since

κ_{i^{*}, ℓ}

is a convex combination of delayed neighbor estimates and leader signal, it lies between the delayed envelopes. This yields an inequality in terms of

E_{ℓ}^{\max}

and the delayed envelopes. The bound for

E_{ℓ}^{\min}

is analogous by choosing

i_{*} (t) \in \arg \min_{i} e_{i, ℓ} (t)

. □

Lemma A2.

Consider the scalar system

\dot{x} (t) = φ (x (t) - κ (t)),

(A4)

where

κ (\cdot)

is measurable and satisfies

κ (t) \in [\underset{̲}{κ} (t), \bar{κ} (t)]

for all t. Assume (23) holds with gain

k > 0

for φ. Then for almost all t

D^{+} {(x (t) - \bar{κ} (t))}^{+} \leq - k {(x (t) - \bar{κ} (t))}^{+} + {(D^{+} \bar{κ} (t))}^{+},

(A5)

and

D^{+} {(\underset{̲}{κ} (t) - x (t))}^{+} \leq - k {(\underset{̲}{κ} (t) - x (t))}^{+} + {(D^{+} \underset{̲}{κ} (t))}^{+},

(A6)

where

y^{+} = \max {y, 0}

. In particular, if

\bar{κ}

and

\underset{̲}{κ}

are constant on an interval

[t_{0}, t_{0} + T]

, then for all

t \in [t_{0}, t_{0} + T]

{(x (t) - \bar{κ})}^{+} \leq e^{- k (t - t_{0})} {(x (t_{0}) - \bar{κ})}^{+}, {(\underset{̲}{κ} - x (t))}^{+} \leq e^{- k (t - t_{0})} {(\underset{̲}{κ} - x (t_{0}))}^{+} .

(A7)

Proof.

Let

y (t) : = x (t) - \bar{κ} (t)

. When

y (t) > 0

, we have

x (t) > \bar{κ} (t) \geq κ (t)

; hence,

r (t) : = x (t) - κ (t) \geq y (t) > 0

. By (23),

φ (r) \leq - k r \leq - k y

for

r > 0

. Thus, for almost all t with

y (t) > 0

\dot{y} (t) = \dot{x} (t) - \dot{\bar{κ}} (t) = φ (x - κ) - \dot{\bar{κ}} (t) \leq - k y (t) - \dot{\bar{κ}} (t) .

Taking upper Dini derivatives and using

D^{+} (y^{+}) \leq (D^{+} y) 1_{{y > 0}}

yields (A5). The lower bound (A6) is analogous by considering

z (t) : = \underset{̲}{κ} (t) - x (t)

. Since

\bar{κ}

and

\underset{̲}{κ}

are constants, Grönwall inequality gives (A7). □

Lemma A3.

For all

t \geq 0

and each follower i

κ_{i} (t) \in [\min {{\hat{S}}^{\min} (t), S_{L}^{\min} (t)}, \max {{\hat{S}}^{\max} (t), S_{L}^{\max} (t)}] .

(A8)

Moreover, if

{\bar{μ}}_{i} (t) \geq μ_{*} > 0

on a time subinterval, then

κ_{i} (t)

is a strict convex combination in the sense that it lies at least a fraction

μ_{*}

toward the leader window interval.

Proof.

By definition (18),

κ_{i} (t)

is a convex combination of the delayed neighbor values

{\hat{s}}_{j} (t - d_{i j} (t)) \in [{\hat{S}}^{\min} (t), {\hat{S}}^{\max} (t)]

and the delayed leader value

s (t - d_{i 0} (t)) \in [S_{L}^{\min} (t), S_{L}^{\max} (t)]

. Thus, (A8) holds. If

{\bar{μ}}_{i} (t) \geq μ_{*}

, write

κ_{i} (t) = (1 - {\bar{μ}}_{i} (t)) ξ (t) + {\bar{μ}}_{i} (t) η (t)

with

ξ (t) \in [{\hat{S}}^{\min} (t), {\hat{S}}^{\max} (t)]

and

η (t) \in [S_{L}^{\min} (t), S_{L}^{\max} (t)]

; the strictness follows. □

Lemma A4.

Under Assumption 4 and bounded delays. Fix an interval

I = [t_{0}, t_{0} + T]

with

T \geq λ_{D}

such that for some follower r,

{\bar{μ}}_{r} (t) \geq μ_{*} > 0

for all

t \in I

. Then on I

{\hat{s}}_{r} (t_{0} + λ_{D}) \in [{\hat{S}}^{\min} (t_{0}) + α_{R}, {\hat{S}}^{\max} (t_{0}) - α_{R}] \oplus [S_{L}^{\min} (t_{0}) - δ_{L}, S_{L}^{\max} (t_{0}) + δ_{L}],

(A9)

where ⊕ denotes Minkowski sum of intervals,

δ_{L} : = {sup}_{t \in I} | s (t) - s (t_{0}) |

, and the contraction margin

α_{R} : = μ_{*} (1 - e^{- k λ_{D}}) \frac{W (t_{0})}{2} .

(A10)

In particular, if

s (\cdot)

is constant on I, then

δ_{L} = 0

and

{\hat{s}}_{r}

moves at least

α_{R}

away from the follower envelope extremes.

Proof.

Let

U : = {\hat{S}}^{\max} (t_{0})

and

L : = {\hat{S}}^{\min} (t_{0})

. Consider the upper deviation

y (t) : = {({\hat{s}}_{r} (t) - U)}^{+}

on I. By Lemma A3,

κ_{r} (t) \leq (1 - μ_{*}) U + μ_{*} S_{L}^{\max} (t)

. Hence,

{\hat{s}}_{r} - κ_{r} \geq {\hat{s}}_{r} - [(1 - μ_{*}) U + μ_{*} S_{L}^{\max}]

. Applying Lemma A2 with

\bar{κ} (t) = (1 - μ_{*}) U + μ_{*} S_{L}^{\max} (t)

yields an exponential decay of

y (t)

up to the variation of

S_{L}^{\max} (t)

. A symmetric argument holds for the lower deviation

{(L - {\hat{s}}_{r})}^{+}

using

\underset{̲}{κ} (t) = (1 - μ_{*}) L + μ_{*} S_{L}^{\min} (t)

. Combining both bounds and noting that the leader window variation within I is captured by

δ_{L}

gives (A9). The explicit margin (A10) follows by integrating the exponential contraction over

λ_{D}

and using that strict leader weight;

μ_{*}

pulls

κ_{r}

toward the leader interval by a fraction

μ_{*}

. □

Lemma A5.

Under Assumption 4, consider an interval

I = [t_{0}, t_{0} + T]

with

T \geq λ_{D}

during which a directed edge

(j, i)

is continuously active and has weight lower bounded as

{\bar{a}}_{i j} (t) \geq a_{*} > 0

for all

t \in I

. If at time

t_{0}

the source node satisfies

{\hat{s}}_{j} (t) \in [L + β, U - β] \forall t \in [t_{0} - \bar{d}, t_{0}]

for some

β \in (0, W (t_{0}) / 2]

, then the target node satisfies at

t_{0} + λ_{D}

{\hat{s}}_{i} (t_{0} + λ_{D}) \in [L + β^{'}, U - β^{'}] \oplus [- δ_{L}, + δ_{L}], β^{'} : = a_{*} (1 - e^{- k λ_{D}}) β,

(A11)

where

δ_{L} : = {sup}_{t \in I} | s (t) - s (t_{0}) |

accounts for leader variation entering through other neighbors/leader.

Proof.

On the set I,

κ_{i} (t)

includes the term

{\bar{a}}_{i j} (t) {\hat{s}}_{j} (t - d_{i j} (t))

with weight at least

a_{*}

. Since delayed

{\hat{s}}_{j}

stays in

[L + β, U - β]

over the relevant delay window, and all other terms lie in

[L, U]

(by definition of envelopes), we obtain bounds:

κ_{i} (t) \leq (1 - a_{*}) U + a_{*} (U - β) = U - a_{*} β, κ_{i} (t) \geq (1 - a_{*}) L + a_{*} (L + β) = L + a_{*} β,

up to leader variation

δ_{L}

. Then Lemma A2 yields that

{\hat{s}}_{i}

is attracted to the tightened interval with exponential rate k, producing the margin

β^{'} = a_{*} (1 - e^{- k λ_{D}}) β

after time

λ_{D}

. □

Lemma A6

(Lipschitz reference reconstruction). Let

s (t)

encode the active Bézier segment control points and timing information, and the formation-size polynomial parameters, so that

p_{r}^{(j)} (t) = P^{(j)} (s (t), t)

and

d_{r}^{(j)} (t) = D^{(j)} (s (t), t)

for

j = 0, 1, 2, 3, 4

. Assume the planner ensures the boundedness of

s (t)

and excludes singular timing (i.e., segment durations are bounded away from 0). Then for each compact time interval between replannings and for each

j \leq 4

, there exist constants

L_{P, j}, L_{D, j} > 0

such that

∥ P^{(j)} (\hat{s}, t) - P^{(j)} (s, t) ∥ \leq L_{P, j} ∥ \hat{s} - s ∥, | D^{(j)} (\hat{s}, t) - D^{(j)} (s, t) | \leq L_{D, j} ∥ \hat{s} - s ∥ .

Proof.

For Bézier curves and polynomials,

P^{(j)} (\cdot, t)

and

D^{(j)} (\cdot, t)

are smooth functions of the parameters on any set where segment durations are bounded away from zero. Thus, their Jacobians are bounded on the compact parameter set induced by the planner bounds, implying local Lipschitz continuity with constants

L_{P, j}, L_{D, j}

. □

Lemma A7.

Consider follower UAV i with dynamics (1) controlled by a geometric tracking controller that achieves exponential tracking for a

C^{4}

reference trajectory. Let the controller be driven by an implemented reference

{\hat{p}}_{r, i} (t)

and its derivatives up to order 4, while the ideal formation reference is

p_{r, i} (t)

. Define the position tracking error

e_{i} (t) : = p_{i} (t) - p_{r, i} (t)

. Then there exist constants

c_{i} > 0

and

γ_{i} > 0

such that

∥ e_{i} (t) ∥ \leq c_{i} e^{- γ_{i} t} ∥ e_{i} (0) ∥ + c_{i} sup_{τ \in [0, t]} ∥ {\hat{p}}_{r, i} (τ) - p_{r, i} (τ) ∥ .

(A12)

Proof.

For flatness-based geometric controllers, the closed-loop tracking error dynamics can be written in a cascade form with an exponentially stable linear part plus bounded higher-order terms that are dominated by the reference mismatch. Under bounded reference derivatives and standard gain conditions, one obtains an exponential Lyapunov function

V_{i}

, satisfying

{\dot{V}}_{i} \leq - 2 γ_{i} V_{i} + α {∥ {\hat{p}}_{r, i} - p_{r, i} ∥}^{2}

. Gronwall’s inequality yields (A12). □

References

Chowdhury, M.M.U.; Maeng, S.J.; Bulut, E.; Güvenç, Ĭ. 3-D Trajectory Optimization in UAV-Assisted Cellular Networks Considering Antenna Radiation Pattern and Backhaul Constraint. IEEE Trans. Aerosp. Electron. Syst. 2020, 56, 3735–3750. [Google Scholar] [CrossRef]
Bao, T.; Wang, H.; Wang, W.J.; Yang, H.C.; Hasna, M.O. Secrecy Outage Performance Analysis of UAV-Assisted Relay Communication Systems with Multiple Aerial and Ground Eavesdroppers. IEEE Trans. Aerosp. Electron. Syst. 2022, 58, 2592–2600. [Google Scholar]
Hung, H.A.; Hsu, H.H.; Cheng, T.H. Optimal Sensing for Tracking Task by Heterogeneous Multi-UAV Systems. IEEE Trans. Control Syst. Technol. 2024, 32, 282–289. [Google Scholar]
Gong, X.; Gui, J.; Chen, Y.; Yang, X.; Yu, W.; Huang, T. Resilient Human-in-the-Loop Formation-Tracking of Multi-UAV Systems Against Byzantine Attacks. IEEE Trans. Autom. Sci. Eng. 2025, 22, 3797–3809. [Google Scholar] [CrossRef]
Hai, X.; Tan, L.; Feng, Q.; Duan, H.; Wen, C. Replanning-Oriented Framework for Efficient Real-Time Decision-Making in Multi-UAV Systems. IEEE Trans. Ind. Inform. 2025, 21, 5127–5137. [Google Scholar] [CrossRef]
Ma, J.; Chen, X.; Wen, G.; Wang, J.; Zhao, F.; Qiu, J. Dynamic Memory Event-Triggered Lag Consensus of Multi-UAV Systems with Hybrid Attacks Over Stochastic Switching Topology. IEEE Trans. Autom. Sci. Eng. 2025, 22, 16999–17009. [Google Scholar] [CrossRef]
Savkin, A.V.; Huang, H. Multi-UAV Navigation for Optimized Video Surveillance of Ground Vehicles on Uneven Terrains. IEEE Trans. Intell. Transp. Syst. 2023, 24, 10238–10242. [Google Scholar] [CrossRef]
Cao, X.; Liu, L. A Multi-Timescale Method for State of Charge Estimation for Lithium-Ion Batteries in Electric UAVs Based on Battery Model and Data-Driven Fusion. Drones 2025, 9, 247. [Google Scholar] [CrossRef]
El-Malek, A.H.A.; Aboulhassan, M.A.; Salhab, A.M.; Zummo, S.A. Performance Analysis and Optimization of UAV-Assisted Networks: Single UAV with Multiple Antennas Versus Multiple UAVs with Single Antenna. IEEE Syst. J. 2023, 17, 3468–3479. [Google Scholar] [CrossRef]
Li, Y.; Zhang, X.; Li, X.; Chen, Z.; Hu, Y.; Yang, J.; Schmeink, A. Cooperative Elliptic Positioning Through Single UAV During GNSS Outages. IEEE Trans. Wirel. Commun. 2024, 23, 12749–12764. [Google Scholar] [CrossRef]
Dong, X.; Yu, B.; Shi, Z.; Zhong, Y. Time-Varying Formation Control for Unmanned Aerial Vehicles: Theories and Applications. IEEE Trans. Control Syst. Technol. 2015, 23, 340–348. [Google Scholar] [CrossRef]
Ranjan, P.K.; Sinha, A.; Cao, Y.; Casbeer, D.; Weintraub, I. Relational Maneuvering of Leader-Follower Unmanned Aerial Vehicles for Flexible Formation. IEEE Trans. Cybern. 2024, 54, 5598–5609. [Google Scholar] [CrossRef]
Yang, Y.; Sun, L.; Fu, Y.; Feng, W.; Xu, K. Three-Dimensional UAV Trajectory Planning Based on Improved Sparrow Search Algorithm. Symmetry 2025, 17, 2071. [Google Scholar] [CrossRef]
Seyyedabbasi, A. Multi-Strategy Variable Secretary Bird Optimization Algorithm (MSVSBOA) for Global Optimization and UAV 3D Path Planning. Symmetry 2026, 18, 273. [Google Scholar] [CrossRef]
Jin, Z.; Bai, L.; Wang, Z.; Zhang, P. Self-Triggered Distributed Formation Control of Fixed-Wing Unmanned Aerial Vehicles Subject to Velocity and Overload Constraints. IEEE Trans. Autom. Sci. Eng. 2024, 21, 4082–4093. [Google Scholar] [CrossRef]
Jin, Z.; Li, H.; Qin, Z.; Wang, Z. Gradient-Free Cooperative Source-Seeking of Quadrotor Under Disturbances and Communication Constraints. IEEE Trans. Ind. Electron. 2025, 72, 1969–1979. [Google Scholar] [CrossRef]
Dong, X.; Zhou, Y.; Ren, Z.; Zhong, Y. Time-Varying Formation Tracking for Second-Order Multi-Agent Systems Subjected to Switching Topologies with Application to Quadrotor Formation Flying. IEEE Trans. Ind. Electron. 2017, 64, 5014–5024. [Google Scholar]
Zou, Y.; Zhou, Z.; Dong, X.; Meng, Z. Distributed Formation Control for Multiple Vertical Takeoff and Landing UAVs with Switching Topologies. IEEE/ASME Trans. Mechatron. 2018, 23, 1750–1761. [Google Scholar] [CrossRef]
Shi, S.; Wu, S.; Wei, B. Neural-Network-Based Event-Triggered Formation Tracking for Nonlinear Multi-UAV Systems with Switching Topologies Under DoS Attacks. IEEE Trans. Autom. Sci. Eng. 2025, 22, 11656–11667. [Google Scholar] [CrossRef]
Dong, Z.; Shi, S.; Zhen, Z. Asynchronously Integral Event-Triggered Formation Tracking in UAV Swarm Systems Featuring Switching Directed Topologies. IEEE Trans. Control Netw. Syst. 2025, 12, 1251–1263. [Google Scholar] [CrossRef]
Zhang, H.; Chen, C.; Xiang, Z. Adaptive Fuzzy Prescribed-Time Formation Control for Nonlinear Multi-Agent Systems. IEEE Trans. Autom. Sci. Eng. 2025, 22, 10986–10996. [Google Scholar] [CrossRef]
Abbasi, M.; Marquez, H.J. Dynamic Event-Triggered Formation Control of Multi-Agent Systems with Non-Uniform Time-Varying Communication Delays. IEEE Trans. Autom. Sci. Eng. 2025, 22, 8988–9000. [Google Scholar] [CrossRef]
Chang, W.J.; Lin, Y.H.; Lee, Y.C.; Ku, C.C. Investigating Formation and Containment Problem for Nonlinear Multiagent Systems by Interval Type-2 Fuzzy Sliding Mode Tracking Approach. IEEE Trans. Fuzzy Syst. 2024, 32, 4163–4177. [Google Scholar] [CrossRef]
Proskurnikov, A.V.; Calafiore, G.C. Delay Robustness of Consensus Algorithms: Continuous-Time Theory. IEEE Trans. Autom. Control 2023, 68, 5301–5316. [Google Scholar] [CrossRef]
Wu, H.; Meng, D. Synchronizability-Based Distributed Learning Control for Multi-Agent Systems. IEEE Trans. Circuits Syst. II Express Briefs 2024, 71, 2109–2113. [Google Scholar] [CrossRef]
Li, F.; Wang, C.; Mikulski, D.; Wagner, J.R.; Wang, Y. Unmanned Ground Vehicle Platooning Under Cyber Attacks: A Human-Robot Interaction Framework. IEEE Trans. Intell. Transp. Syst. 2022, 23, 18113–18128. [Google Scholar] [CrossRef]
Schäfer, L.; Manzinger, S.; Althoff, M. Computation of Solution Spaces for Optimization-Based Trajectory Planning. IEEE Trans. Intell. Veh. 2023, 8, 216–231. [Google Scholar] [CrossRef]
Guo, Z.; Wang, Y.; Yu, H.; Xi, J. Fast Optimization-Based Trajectory Planning with Cumulative Key Constraints for Automated Parking in Unstructured Environments. IEEE Trans. Veh. Technol. 2025, 74, 11820–11831. [Google Scholar] [CrossRef]
Ren, L.; Li, M.; Fan, S.; Zhang, Y.; Yu, F.; Yang, J. Cooperative Control Method of Multi-Agent Formation and Obstacle Avoidance. In Proceedings of the 2025 IEEE International Conference on Mechatronics and Automation (ICMA), Beijing, China; IEEE: Piscataway, NJ, USA, 2025; pp. 1066–1072. [Google Scholar]
Olfati-Saber, R.; Murray, R.M. Consensus Problems in Networks of Agents with Switching Topology and Time-Delays. IEEE Trans. Autom. Control 2004, 49, 1520–1533. [Google Scholar] [CrossRef]
Lin, Z.; Francis, B.; Maggiore, M. State Agreement for Continuous-Time Coupled Nonlinear Systems. SIAM J. Control Optim. 2007, 46, 288–307. [Google Scholar] [CrossRef]
Shi, G.; Hong, Y. Global Target Aggregation and State Agreement of Nonlinear Multi-Agent Systems with Switching Topologies. Automatica 2009, 45, 1165–1175. [Google Scholar] [CrossRef]
Cong, X.; Zi, L.; Du, D.Z. DTNB: A Blockchain Transaction Framework with Discrete Token Negotiation for the Delay Tolerant Network. IEEE Trans. Netw. Sci. Eng. 2021, 8, 1584–1599. [Google Scholar] [CrossRef]
Liu, T.; Qin, Z.; Hong, Y.; Jiang, Z.P. Distributed Optimization of Nonlinear Multiagent Systems: A Small-Gain Approach. IEEE Trans. Autom. Control 2022, 67, 676–691. [Google Scholar] [CrossRef]
Chang, Z.; Zong, G.; Wang, W.; Yue, M.; Zhao, X. Formation Control and Obstacle Avoidance Design for Networked USV Swarm with Exogenous Disturbance Under Intermittent Communication. IEEE Trans. Netw. Sci. Eng. 2025, 12, 3234–3243. [Google Scholar] [CrossRef]
Xiao, Q.; Yang, S.; Zeng, Z.; Huang, T.; Pal, N. Null-Space-Based Prescribed-Time Formation Control with Collision and Obstacle Avoidance. IEEE Trans. Control Netw. Syst. 2025, 12, 3003–3014. [Google Scholar] [CrossRef]
Wang, L.; Zhu, D.; Pang, W.; Luo, C. A Novel Obstacle Avoidance Consensus Control for Multi-AUV Formation System. IEEE/CAA J. Autom. Sin. 2023, 10, 1304–1318. [Google Scholar] [CrossRef]
Yuan, Z.; Yao, C.; Liu, X.; Gao, Z.; Zhang, W. Multiagent Formation Control and Dynamic Obstacle Avoidance Based on Deep Reinforcement Learning. IEEE Trans. Ind. Inform. 2025, 21, 4672–4682. [Google Scholar] [CrossRef]
Wang, Z.X.; Jin, Z.H.; Li, H. Semi-global asymptotic state agreement of nonlinear multi-agent systems with communication delays under directed switching topologies. Nonlinear Anal. Hybrid Syst. 2024, 52, 101458. [Google Scholar] [CrossRef]
Jin, Z. Global Asymptotic Stability Analysis for Autonomous Optimization. IEEE Trans. Autom. Control 2025, 70, 6953–6960. [Google Scholar] [CrossRef]

Figure 1. Schematic of the Safe Flight Corridor (SFC). The original corridor

χ_{free}

(solid blue) consists of convex polyhedra

P_{ℓ}

connecting optimized waypoints. Each supporting half-space

a_{ℓ r}^{⊤} x \leq c_{ℓ r}

is shifted inward by

ζ_{d}

to obtain the tightened corridor (dashed blue), ensuring collision-free motion for a formation of diameter d.

Figure 1. Schematic of the Safe Flight Corridor (SFC). The original corridor

χ_{free}

(solid blue) consists of convex polyhedra

P_{ℓ}

connecting optimized waypoints. Each supporting half-space

a_{ℓ r}^{⊤} x \leq c_{ℓ r}

is shifted inward by

ζ_{d}

to obtain the tightened corridor (dashed blue), ensuring collision-free motion for a formation of diameter d.

Figure 2. The trajectories of the leader and followers. The black line represents the trajectory of the leader, while the other four colors represent the trajectories of the followers.

Figure 3. The planned leader reference trajectory

p_{r} (t)

and the executed leader trajectory

p_{0} (t)

.

Figure 3. The planned leader reference trajectory

p_{r} (t)

and the executed leader trajectory

p_{0} (t)

.

Figure 4. Time evolution of the formation size

d_{r} (t)

.

Figure 4. Time evolution of the formation size

d_{r} (t)

.

Figure 5. The distributed estimator errors

∥ {\hat{p}}_{r, i} (t) - p_{r} (t) ∥

for all followers.

Figure 5. The distributed estimator errors

∥ {\hat{p}}_{r, i} (t) - p_{r} (t) ∥

for all followers.

Figure 6. The mode of switching topology.

Figure 7. Four snapshots (2 × 2) of the formation navigation under switching topologies. The solid black circles represent the leader, the other four colors represent the followers, and the red circles are obstacles.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, Y.; Jin, Z. Adaptive Formation Control for Multi-UAV Swarms in Cluttered Environments with Communication Delays Under Directed Switching Topologies. Actuators 2026, 15, 163. https://doi.org/10.3390/act15030163

AMA Style

Zhang Y, Jin Z. Adaptive Formation Control for Multi-UAV Swarms in Cluttered Environments with Communication Delays Under Directed Switching Topologies. Actuators. 2026; 15(3):163. https://doi.org/10.3390/act15030163

Chicago/Turabian Style

Zhang, Yingzheng, and Zhenghong Jin. 2026. "Adaptive Formation Control for Multi-UAV Swarms in Cluttered Environments with Communication Delays Under Directed Switching Topologies" Actuators 15, no. 3: 163. https://doi.org/10.3390/act15030163

APA Style

Zhang, Y., & Jin, Z. (2026). Adaptive Formation Control for Multi-UAV Swarms in Cluttered Environments with Communication Delays Under Directed Switching Topologies. Actuators, 15(3), 163. https://doi.org/10.3390/act15030163

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Adaptive Formation Control for Multi-UAV Swarms in Cluttered Environments with Communication Delays Under Directed Switching Topologies

Abstract

1. Introduction

2. Preliminaries and Problem Formulation

2.1. UAV Model

2.2. Switching Communication Topology

2.3. Problem Description

3. Trajectory Planning and Adaptive Formation Control

3.1. Safe Flight Corridor

3.2. Trajectory and Formation Size Generation

3.3. Distributed Time-Varying Formation Control Under Switching Topologies

3.4. Computational Complexity Discussion

4. Simulation Results

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A. Lemmas and Main Proofs

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI