Next Article in Journal
Discriminating Vibrotactile Signals: The Relative Roles of Amplitude and Frequency
Previous Article in Journal
Shape Memory Alloy Actuators in Robotics
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Adaptive Formation Control for Multi-UAV Swarms in Cluttered Environments with Communication Delays Under Directed Switching Topologies

1
School of Electronic Information Engineering, Henan Institute of Technology, Xinxiang 453003, China
2
Henan Industrial Equipment Intelligent Engineering Technology Research Center, Xinxiang 453003, China
3
School of Mechanical and Aerospace Engineering, Nanyang Technological University, Singapore 639798, Singapore
*
Author to whom correspondence should be addressed.
Actuators 2026, 15(3), 163; https://doi.org/10.3390/act15030163
Submission received: 9 February 2026 / Revised: 24 February 2026 / Accepted: 10 March 2026 / Published: 12 March 2026
(This article belongs to the Section Aerospace Actuators)

Abstract

This paper addresses distributed formation control for multiple unmanned aerial vehicles (UAVs) operating in obstacle-dense environments under directed switching communication topologies. A leader–follower architecture is adopted, wherein the leader performs online trajectory replanning while followers rely on delayed and intermittently available neighbor information. To simultaneously tackle collision avoidance, formation feasibility under narrow passages, and communication intermittency, we propose an integrated deformable formation navigation framework. The framework couples Safe Flight Corridor (SFC)-constrained Bézier trajectory planning with a dynamic formation scaling mechanism, allowing the swarm to adaptively shrink or expand its geometric configuration when traversing constricted spaces, thereby ensuring all agents remain within certified collision-free corridors. A nonlinear distributed consensus-based estimator is designed to propagate leader reference states under directed switching graphs with bounded delays. Using a max-min contraction analytical approach, we establish guaranteed practical convergence for both leader tracking and inter-follower agreement without requiring persistent connectivity. Extensive simulations in complex cluttered environments demonstrate that the proposed approach enables flexible and real-time formation reshaping, enhancing navigational safety and robustness while maintaining cohesive swarm behavior under challenging communication and spatial constraints.

1. Introduction

Autonomous multi-UAV systems have attracted increasing attention due to their potential in surveillance, environmental monitoring, disaster response, and cooperative transportation [1,2,3,4,5,6,7,8]. Compared with single vehicle systems [9,10], UAV formations offer enhanced robustness, scalability, and task efficiency by exploiting spatial cooperation and information sharing among multiple agents [11,12]. However, in practical applications, UAV formations [13,14] are often required to operate in obstacle-rich environments, where collision avoidance, trajectory feasibility, and formation maintenance must be simultaneously ensured. Moreover, realistic communication networks are subject to limited bandwidth, time-varying connectivity, and non-negligible delays, which significantly complicate the design and analysis of distributed formation control strategies [15,16]. These considerations motivate the development of formation control frameworks that explicitly account for environmental constraints and communication imperfections, while maintaining rigorous performance and safety guarantees.
Formation control under switching communication topologies and communication delays has been extensively studied in recent years [12,17,18,19,20,21]. Existing research has extensively studied consensus and formation maintenance under directed or undirected switching graphs. Convergence and robustness results have been established under various connectivity conditions [22,23], such as joint connectivity or uniformly quasi-strong connectivity [24,25]. In parallel, obstacle avoidance and safe navigation for multi-UAV systems have been addressed using potential fields, artificial constraints, and optimization-based trajectory planning methods [26,27,28]. Nevertheless, most existing formation control approaches [29] treat obstacle avoidance and formation maintenance as loosely coupled problems, often assuming either static formations or centralized planning. When communication topologies switch and delays are present, these approaches may suffer from degraded performance, loss of feasibility, or even safety violations, especially in narrow passages where the geometric footprint of the formation becomes critical.
Several techniques have been proposed to handle switching topologies and delays in distributed control problem of multi-agent systems. Lyapunov-based methods and common quadratic Lyapunov functions have been widely used to establish stability under switching graphs, but they typically impose restrictive assumptions on the switching signals or require conservative dwell-time conditions [30,31,32]. Alternatively, delay-tolerant consensus and estimation schemes have been developed using predictor-based designs or augmented-state formulations [33], which often lead to increased computational complexity and limited scalability. More recently, nonlinear agreement and contraction-based approaches have been explored to relax the need for fixed topologies and common Lyapunov functions [34]. Despite these advances, integrating such communication-aware online control mechanisms, obstacle-aware trajectory planning, and formation feasibility guarantees remains challenging. In particular, few existing works provide a unified framework that simultaneously addresses switching directed graphs, bounded delays, and formation-level safety constraints induced by obstacle-rich environments.
In obstacle-dense environments, the formation control problem becomes significantly more challenging due to geometric feasibility constraints imposed by limited free space [35,36]. Unlike open environments where a fixed formation geometry can be maintained throughout the mission, narrow passages and irregular obstacle distributions may render a rigid formation configuration infeasible. In such cases, the effective footprint of the formation, determined by inter-agent spacing, directly affects whether the entire team can remain within collision-free regions. Moreover, obstacle avoidance and formation maintenance are intrinsically coupled [37,38]. A leader trajectory that is safe for a single UAV may not guarantee safety for the entire formation unless formation size is explicitly incorporated into the planning stage. When communication topologies are switching and subject to delays, this coupling becomes even more critical, as followers rely on distributed and potentially outdated information to reconstruct the leader reference. Consequently, ensuring corridor feasibility, formation coherence, and distributed stability simultaneously in cluttered environments remains a nontrivial and underexplored problem. These considerations motivate the development of a unified framework that integrates Safe Flight Corridor-based trajectory planning with adaptive formation scaling and delay-robust distributed estimation under directed switching communication graphs.
The considered problem introduces several intertwined technical difficulties. First, in obstacle-dense environments, a fixed-geometry formation may become geometrically infeasible when traversing narrow passages, as the formation footprint may exceed the available free space. Second, obstacle avoidance and formation maintenance are strongly coupled: a trajectory that is collision-free for a single UAV does not necessarily guarantee safety for the entire formation unless formation size is explicitly incorporated into the planning stage. Third, under directed switching communication topologies with bounded delays, followers may only have intermittent and outdated access to leader information, making distributed reference reconstruction nontrivial. Finally, the overall closed-loop system forms a cascade interconnection between distributed estimation dynamics and nonlinear tracking control, requiring careful stability analysis to guarantee practical convergence.
In this paper, we develop a distributed formation planning and control framework that explicitly accounts for switching directed communication topologies, bounded delays, and obstacle-induced environmental constraints. A leader–follower architecture is adopted in which the leader performs online trajectory planning based on Safe Flight Corridors (SFCs), while followers reconstruct the leader reference through a nonlinear agreement mechanism under intermittent and delayed information exchange. Unlike conventional formation control frameworks that assume fixed formation geometry or treat obstacle avoidance and formation maintenance as loosely coupled modules, the proposed method embeds the formation footprint directly into corridor tightening and incorporates formation size as a planning variable. As a result, the geometric feasibility of the entire formation is preserved when traversing narrow passages under switching communication and delayed leader information. Furthermore, the nonlinear agreement protocol is not introduced as an isolated delay consensus scheme, but as an estimator layer tightly coupled with adaptive formation resizing. The theoretical analysis establishes a cascade structure that quantitatively links estimator contraction rate to ultimate formation tracking accuracy, thereby providing a unified safety–communication–tracking guarantee. The effectiveness and robustness of the framework are demonstrated through simulations of multi-UAV formations navigating obstacle-rich environments under time-varying communication conditions. Our main contributions are as follows:
(1)
We propose a formation control framework that systematically addresses directed switching communication topologies and bounded transmission delays. By reformulating leader state dissemination as a nonlinear agreement process, the scheme achieves practical leader tracking and inter-follower consensus under uniformly quasi-strongly connected (UQSC) switching conditions, without requiring fixed topologies or global synchronization.
(2)
We develop a planning--control co-design methodology that couples SFC-constrained Bézier trajectory planning with online optimization of time-varying formation size. The formation size is adaptively adjusted according to corridor feasibility, allowing the entire formation to safely contract or expand when navigating narrow passages. This integration bridges the gap between single-agent motion planning and multi-agent formation constraints in cluttered environments.
(3)
Using nonlinear agreement theory and a window-based max-min contraction analysis, we establish formal proofs of follower agreement and practical leader tracking under directed switching graphs with bounded delays. The analysis avoids restrictive quadratic Lyapunov assumptions and naturally accommodates the hybrid dynamics arising from replanning and communication switching, providing explicit bounds on estimation and tracking errors.
The rest of this article is organized as follows. Section 2 presents the model of the UAV, followed by the problem formulation that explicitly states the control objectives and the assumptions. Section 3 explains the proposed trajectory planning and distributed formation control framework and main results. Section 4 demonstrates the effectiveness of the proposed algorithm through the simulation study. Finally, Section 5 concludes the paper. Appendix A provides the key lemmas and main proof, deriving the sufficient condition for constraint satisfaction and closed-loop stability under bounded disturbances.
Notation: In this paper, R denotes the set of real numbers and R n denotes the Euclidean Space. We use I n and 0 m × n to represent the n-dimensional identity matrix, and n × m -dimensional zero matrix, respectively. Let · be the Euclidean norm. For a signal x ( · ) , define x ¯ ( t ) : = sup τ [ t d ¯ , t ] x ( τ ) and x ̲ ( t ) : = inf τ [ t d ¯ , t ] x ( τ ) for a given delay bound d ¯ 0 . For a locally Lipschitz function V ( · ) , denote D + V ( t ) its upper right Dini derivative. For a directed graph G = ( V , E ) , let N i be the in-neighborhood of node i. We use 1 n to denote the all-one vector of length n. R i SO ( 3 ) denotes the rotation matrix from the body-fixed frame to the inertial frame. The operator × denotes the vector cross product in  R 3 .

2. Preliminaries and Problem Formulation

2.1. UAV Model

Consider a team of N + 1 UAVs indexed by { 0 , 1 , , N } , where UAV 0 is the leader and UAVs  i V : = { 1 , , N } are followers. Each UAV is described by the standard quadrotor rigid-body dynamics
m i p ¨ i = m i g e 3 F i R i e 3 ,
J i Ω ˙ i = Ω i × J i Ω i + M i ,
where p i = [ x i , y i , z i ] T R 3 is the position in the inertial frame, e 3 = [ 0 , 0 , 1 ] denotes the unit vector along the z-axis of the inertial frame, m i denotes the mass of the i-th UAV, g > 0 is the gravitational acceleration, R i SO ( 3 ) is the attitude, Ω i R 3 is the body angular velocity, F i R is the thrust, and M i R 3 is the body moment. As in the differential flatness framework, a geometric tracking controller can asymptotically track a sufficiently smooth reference position and yaw trajectory for each UAV. We will use this fact as a modular tracking layer.
For each follower i V , define its desired relative direction to the leader using angles  ( ϕ i , θ i ) ,
c i : = c ( ϕ i , θ i ) = sin θ i cos ϕ i sin θ i sin ϕ i cos θ i R 3 .
Let d r ( t ) d min > 0 denote the time-varying formation size. Then the ideal formation reference for follower i is
p r , i ( t ) = p r ( t ) + c i d r ( t ) ,
where p r ( t ) is the leader-planned safe trajectory and d min is a positive constant.

2.2. Switching Communication Topology

Communication among followers and the leader is described by a directed switching graph
G ( t ) = ( V , E ( t ) ) , t 0 ,
where E ( t ) V × V is piecewise constant with switching times { t k } k N . An edge ( j , i ) E ( t ) means that follower i can receive information from follower j at time t. Let a i j ( t ) 0 be the weight on edge ( j , i ) with a i i ( t ) = 0 . Define the neighborhood set N i ( t ) = { j V : ( j , i ) E ( t ) } and the normalized weights
a ¯ i j ( t ) = a i j ( t ) k N i ( t ) a i k ( t ) + μ i ( t ) j N i ( t ) , 0 otherwise , μ ¯ i ( t ) = μ i ( t ) k N i ( t ) a i k ( t ) + μ i ( t ) .
Here μ i ( t ) { 0 , 1 } indicates whether follower i has direct access from the leader at time t. We allow bounded time-varying communication delays:
d i j ( t ) [ 0 , d ¯ ] , d i 0 ( t ) [ 0 , d ¯ ] , i , j V .
where d ¯ is the upper bound of the time delays.

2.3. Problem Description

In this paper, the main controlled system under consideration consists of a multi-UAV system (1) operating in an obstacle-rich environment. The leader UAV is equipped with access to a local map and capable of online replanning. In contrast, the follower UAVs typically lack sufficient computational resources for trajectory planning and do not necessarily receive the planning information from the leader directly.
Control objectives: Given global goal p g and a sequence of local goals { p l g , k } for replanning, we aim to design: (i) an online planner that generates a safe leader trajectory p r ( t ) and formation size d r ( t ) , and (ii) a distributed formation controller under switching directed communication such that:
  • all UAVs remain inside a Safe Flight Corridor χ free ( t ) :
    p i ( t ) χ free ( t , k ) , t [ t k , t k + 1 ) , i = 0 , 1 , , N .
  • the leader tracks the planned trajectory:
    lim sup t p 0 ( t ) p r ( t ) = 0 .
  • each follower asymptotically tracks its formation reference:
    lim sup t p i ( t ) p r , i ( t ) = 0 .
  • the planned leader trajectory reaches each local goal within finite time during each replanning iteration:
    p r ( T ) = p l g , k , 0 < T < .
We adopt a leader-following analogue of the UQSC condition:
Assumption 1.
There exist constants λ > 0 and λ D > 0 such that for any t 0 , (i) each active edge in E ( t ) persists for at least λ D once it appears; and (ii) the union graph
G ( [ t , t + λ ) ) : = V , τ [ t , t + λ ) E ( τ )
contains a directed spanning tree whose root belongs to the set { i V : τ [ t , t + λ ) s . t . μ i ( τ ) = 1 } , i.e., within every window of length λ, leader information can reach all followers through directed paths.
Remark 1.
Assumption 1 is a leader-following version of the UQSC/QSC-type joint connectivity used in switching directed consensus with delays [39]. It is strictly weaker than requiring a fixed spanning tree at all times.
Assumption 2.
Each follower i can access its prescribed relative angles ( ϕ i , θ i ) .
The theoretical analysis is established under bounded delays and the joint leader reachability condition (Assumption 1), which ensure window-based contraction of the nonlinear agreement dynamics. In practice, UAV communication links may experience packet loss or temporary blackouts, leading to occasional violations of these conditions. During such periods, missing packets can be interpreted as effectively increased delays or intermittent unavailability of leader information. The proposed agreement dynamics remain well-posed and the estimator states stay bounded due to the dissipative feedback structure; however, the contraction rate may temporarily reduce and the tracking error bound can grow. Once communication quality recovers such that Assumption 1 holds again, the window-based contraction mechanism reactivates and the estimator error contracts back to its nominal neighborhood.

3. Trajectory Planning and Adaptive Formation Control

3.1. Safe Flight Corridor

Let W w R 3 denote the w-th waypoint of the guidance path. The endpoints W 0 = p 0 ( t k ) and W L = p l g , k are fixed, while the intermediate waypoints { W 1 , , W L 1 } are decision variables to be optimized for smoothness and obstacle clearance.
At each replanning iteration k, the leader generates an initial waypoint list W ini , then optimizes waypoints to improve smoothness and safety:
W opt = arg min W 1 , , W L 1 λ s w = 1 L 1 2 W w W w 1 W w + 1 2 + λ c w = 1 L 1 Ψ ( d ( W w , χ ) ) ,
where χ denotes the surround environment, d ( W w , χ ) is the distance from waypoint W w to the closest obstacle, Ψ ( · ) is a barrier/penalty enforcing a safe distance, and λ s and λ c are two positive constants.
As shown in Figure 1, the Safe Flight Corridor is represented as a sequence of adjacent convex polyhedra
χ free : = = 1 L c P , P : = { x R 3 A x c } ,
where L c N is the number of corridor cells. For each cell , the matrix A R n × 3 and vector c R n collect n half-space inequalities a r x c r ( r = 1 , , n ), with a r being the r-th row of A .
To guarantee collision-free motion for a formation with size d d min , we tighten each corridor cell by shrinking every supporting half-space inward. Assuming the half-space normals are normalized, i.e., a r = 1 , the tightened corridor is
A x c Δ ( d ) , Δ ( d ) : = ζ d 1 n ,
where ζ > 1 is a safety margin. If a r is not normalized, the shrinkage can be written as Δ r ( d ) = ζ d a r , so that each face is offset inward by the geometric distance ζ d . The step of this method is concluded as Algorithm 1.
Algorithm 1 Safe Flight Corridor Construction
Require: Initial waypoint list W ini , obstacle set χ , formation diameter d
Ensure: Corridor cells { P }
  1:  Generate initial piecewise linear path
  2:  for each segment do
  3:        Compute separating hyperplanes
  4:        Construct polyhedral cell
  5:  end for
  6:  Tighten constraints A x c ζ d

3.2. Trajectory and Formation Size Generation

Within replanning iteration k, the leader trajectory is represented by an M-segment n-th order Bézier curve
p r , k ( t ) = j = 0 n b j n ( τ ) P m , j , τ = t T m T m + 1 T m [ 0 , 1 ] , t [ T m , T m + 1 ] ,
where { P m , j } are control points and b j n are Bernstein polynomials. Using the convex hull property, containment in the Safe Flight Corridor can be enforced by requiring all control points in the corresponding polyhedron.
We choose the formation size d k as an optimization variable jointly with control points and polyhedron assignment binaries e m { 0 , 1 } :
min { P m , j } , { e m } , d k λ traj m = 1 M T m T m + 1 p r , k ( t ) 2 d t + λ form ( d k d ¯ ) 2 s . t . dynamic feasibility bounds on p ˙ , p ¨ , p , p ( 4 ) , = 1 L c e m = 1 , e m = 1 A P m , j c Δ ( d k ) , j , p r , k ( T 1 ) = p 0 ( t k ) , p r , k ( T M + 1 ) = p l g , k .
A smooth transition d r ( t ) between d k 1 and d k is generated by a high-order polynomial, satisfying boundary derivative matching up to order 4.
Remark 2.
The trajectory smoothness term is chosen as the integral of the squared jerk, i.e., p r , k ( t ) 2 d t , for the following reasons. First, the jerk directly reflects the rate of change of acceleration and is closely related to the smoothness of thrust and attitude commands of quadrotor UAVs, making it more suitable than acceleration-based costs for execution on real platforms. Second, when the trajectory is parameterized by Bézier curves, the jerk remains a low order polynomial, which leads to a quadratic cost in the control points and enables efficient MIQP formulation. Finally, higher-order smoothness is enforced through explicit bounds on snap, guaranteeing C 4 continuity and compatibility with geometric tracking controllers. This choice provides a favorable trade-off between trajectory smoothness, tracking feasibility, and computational efficiency.

3.3. Distributed Time-Varying Formation Control Under Switching Topologies

To enable followers to compute p r ( t ) and its derivatives, the leader publishes a parameter vector s ( t ) R n s , which contains the current segment Bézier control points and timing information and the current formation-size polynomial parameters. Given s ( t ) and time t, any agent can compute p r ( t ) , p ˙ r ( t ) , , p r ( 4 ) ( t ) , and d r ( t ) , d ˙ r ( t ) , , d r ( 4 ) ( t ) analytically. Since replanning occurs at discrete instants, s ( t ) is piecewise constant with bounded jumps and bounded update frequency.
For each coordinate = 1 , , n s , define the scalar estimate s ^ i , and the leader signal s ( t ) . We propose the nonlinear agreement protocol with delays:
s ^ ˙ i , ( t ) = φ s ^ i , ( t ) κ i , ( t ) ,
where
κ i , ( t ) : = j N i ( t ) a ¯ i j ( t ) s ^ j , t d i j ( t ) + μ ¯ i ( t ) s t d i 0 ( t ) ,
and the nonlinearity φ : R R satisfies the strict dissipativity condition.
Assumption 3.
For each coordinate ℓ, φ ( 0 ) = 0 , and for all r 0 , r φ ( r ) < 0 . Moreover, φ is locally Lipschitz and there exists a class- K function α such that | φ ( r ) | α ( | r | ) for all r in the semi-global domain of interest.
Define the leader oscillation over the delay window:
osc L ( t ) : = S L max ( t ) S L min ( t ) , S L max ( t ) : = sup τ [ t d ¯ , t ] s ( τ ) , S L min ( t ) : = inf τ [ t d ¯ , t ] s ( τ ) .
Equations (17) and (18) are a direct leader-following extension of the agreement model in [39]: each follower is driven toward a time-varying convex combination of delayed neighbor estimates and delayed leader signal. The core advantage is that its convergence can be established via max–min contraction arguments under Assumption 1, without requiring a common quadratic Lyapunov function for switching directed graphs.
Define the estimation error for coordinate : e i , ( t ) : = s ^ i , ( t ) s ( t ) . Let
E max ( t ) : = max i V e i , ( t ) , E min ( t ) : = min i V e i , ( t ) , W ( t ) : = E max ( t ) E min ( t ) .
Note that due to the leader time-variation, the asymptotic tracking generally requires additional internal model structure. Here we establish agreement among followers and practical tracking to the leader with a bound proportional to s ˙ .
Theorem 1.
Suppose Assumptions 1 and 3 and bounded delays (7) hold, then for each coordinate ℓ:
1.
The follower disagreement width W ( t ) converges to a neighborhood of zero:
lim sup t W ( t ) γ s ˙ ,
for some class- K functions, γ ( · ) is determined by α ( · ) and the joint leader reachability window ( λ , λ D ) . In particular, if s ( t ) is piecewise constant, then W ( t ) 0 on each constant interval.
2.
Each follower error satisfies the practical bound
lim sup t max i V | e i , ( t ) | Γ s ˙ ,
where Γ is explicitly constructible via the window-based contraction recursion as in [39].
Proof. 
We prove the result for a single scalar coordinate of the leader parameter vector s ( t ) . The vector case follows by applying the same argument componentwise and taking the maximum bounds.
Define the follower envelope over the delay window
S ^ max ( t ) : = max i V sup τ [ t d ¯ , t ] s ^ i ( τ ) , S ^ min ( t ) : = min i V inf τ [ t d ¯ , t ] s ^ i ( τ ) ,
and the width W ( t ) : = S ^ max ( t ) S ^ min ( t ) . By Lemma A3, for all i and all t
κ i ( t ) min { S ^ min ( t ) , S L min ( t ) } , max { S ^ max ( t ) , S L max ( t ) } .
Applying Lemma A2 to each estimator s ^ ˙ i = φ ( s ^ i κ i ) implies that s ^ i ( t ) cannot exit the convex hull generated by S ^ min ( t ) , S ^ max ( t ) and the leader window. Hence W ( t ) is well-defined and uniformly bounded for all t 0 .
By Assumption 1, for any t there exists a time window [ t , t + λ ) whose union graph contains a directed spanning tree rooted at some follower r that receives leader information (i.e., μ ¯ r ( t ) μ * > 0 on a subinterval of length at least λ D ). Applying Lemma A4 on this subinterval shows that s ^ r moves away from the envelope extremes by a margin proportional to ( 1 e k λ D ) μ * W ( t ) , up to the leader window oscillation osc L ( t ) : = S L max ( t ) S L min ( t ) .
Along each directed edge of the spanning tree, Assumption 1 guarantees persistence for at least λ D and a uniform weight lower bound a * > 0 . By Lemma A5, the contraction margin at the root propagates to each downstream node with at least a factor a * ( 1 e k λ D ) per edge. Since the depth of the spanning tree is at most | V | 1 , after the window [ t , t + λ ) , every follower contracts toward the interior of the envelope.
Then there exist explicit constants
ρ : = 1 e k λ D a * N 1 μ * ( 0 , 1 ) , η : = 2 m = 0 N 1 a * m ( 0 , ) ,
such that for all t 0
W ( t + λ ) ( 1 ρ ) W ( t ) + η osc L ( t + λ ) .
Iterating this recursion proves that (i) if s ( t ) is constant on [ t , t + λ ] , then W ( t ) strictly contracts and W ( t + n λ ) 0 as n ; and (ii) for time-varying s ( t ) , W ( t ) is ultimately bounded by η ρ lim sup t osc L ( t ) .
The proof is complete. □
Each follower i reconstructs the leader reference and its derivatives from s ^ i ( t ) :
p ^ r ( j ) ( t ) = P ( j ) s ^ i ( t ) , t , j = 0 , 1 , 2 , 3 , 4 , d ^ r ( j ) ( t ) = D ( j ) s ^ i ( t ) , t , j = 0 , 1 , 2 , 3 , 4 ,
where P ( j ) ( · ) and D ( j ) ( · ) are analytic maps induced by Bézier/polynomial parameterization. Then the follower’s desired trajectory and derivatives are
p ^ r , i ( j ) ( t ) = p ^ r ( j ) ( t ) + c i d ^ r ( j ) ( t ) , j = 0 , 1 , 2 , 3 , 4 .
Finally, each follower applies a geometric tracking controller using p ^ r , i ( t ) , p ^ ˙ r , i ( t ) , p ^ ¨ r , i ( t ) , etc.
Given the implemented reference p ^ r , i ( t ) and yaw ψ ^ r , i ( t ) with bounded derivatives up to order four, we adopt a standard geometric tracking controller on SE ( 3 ) . Define the position/velocity errors
e p , i : = p i p ^ r , i , e v , i : = p ˙ i p ^ ˙ r , i .
A commanded acceleration is chosen as
a c , i : = p ^ ¨ r , i K p e p , i K v e v , i ,
and the thrust is set to
F i : = m i a c , i ( R i e 3 ) .
The desired attitude R c , i ( t ) SO ( 3 ) is constructed from the commanded acceleration and yaw. Its corresponding reference angular velocity is defined by
Ω ^ r , i ( t ) : = R c , i ( t ) R ˙ c , i ( t ) ,
where ( · ) denotes the vee map from so ( 3 ) to R 3 .
The desired attitude R c , i is constructed such that R c , i e 3 aligns with a c , i g e 3 and the yaw matches ψ ^ r , i . Then the moment input M i is designed by a standard attitude error feedback controller
M i : = K R e R , i K Ω e Ω , i + Ω i × J i Ω i J i Ω ^ r , i × R i R c , i Ω ^ r , i R i R c , i Ω ^ ˙ r , i ,
which guarantees exponential tracking of ( p i , R i ) to ( p ^ r , i , R c , i ) for sufficiently smooth references. This yields the ISS-type bound in Lemma A7. For a detailed ISS definition, see [40].
Assumption 4.
For each coordinate ℓ, the function φ : R R is continuous, locally Lipschitz, φ ( 0 ) = 0 , and there exists a constant k > 0 such that
r φ ( r ) k r 2 , r R .
Remark 3.
Assumption 4 is a standard strong dissipativity/sector condition. It implies the sign condition r φ ( r ) < 0 for r 0 used in the switching agreement literature, while additionally providing a quantitative contraction rate k that yields explicit recursion constants. A canonical choice is φ ( r ) = k r or φ ( r ) = k tanh ( r / ϵ ) .
Proposition 1
(Corridor tightening implies formation safety under time-varying radius). Fix a replanning interval t [ t k , t k + 1 ) and a corridor cell P : = { x R n A x c } . Assume each row a r of A is normalized, i.e., a r = 1 . Let the leader reference satisfy the tightened constraints
A p r ( t ) c ζ d ¯ k 1 , t [ t k , t k + 1 ) ,
for some tightening radius d ¯ k > 0 and safety factor ζ > 1 . Suppose the executed formation radius after smoothing, d r ( t ) , satisfies
d r ( t ) d ¯ k , t [ t k , t k + 1 ) ,
and each follower remains within distance ζ d r ( t ) of the leader reference, i.e.,
p i ( t ) p r ( t ) ζ d r ( t ) , i { 1 , , N } .
Then all agents remain inside the original corridor cell:
A p i ( t ) c , t [ t k , t k + 1 ) , i { 0 , 1 , , N } .
Proof. 
For any face inequality a r x c r , by (24)
a r p r ( t ) c r ζ d ¯ k .
For any follower i, write p i ( t ) = p r ( t ) + Δ i ( t ) . Using a r = 1 and Cauchy–Schwarz
a r p i ( t ) = a r p r ( t ) + a r Δ i ( t ) c r ζ d ¯ k + Δ i ( t ) .
By (26) and (25), Δ i ( t ) ζ d r ( t ) ζ d ¯ k ; hence, a r p i ( t ) c r . Since this holds for all faces, A p i ( t ) c . □
Theorem 2.
Under Assumptions 1–3, and bounded delays (7), the closed-loop system consisting of the online planner (12)–(16), the nonlinear estimators (17) and (18), and the tracking controllers, satisfies:
1.
All estimation signals s ^ i ( t ) are uniformly bounded for all t 0 .
2.
The reconstructed references p ^ r , i ( t ) , d ^ r ( t ) are uniformly bounded and satisfy the practical tracking bounds implied by Theorem 1.
3.
The formation tracking error e i ( t ) : = p i ( t ) p r , i ( t ) is ultimately bounded:
lim sup t e i ( t ) Ξ s ˙ ,
where Ξ ( · ) can be made arbitrarily small by increasing the estimator contraction gain.
Proof. 
The closed-loop system can be interpreted as a cascade interconnection of three subsystems:
(i)
The nonlinear distributed estimator layer (17) and (18).
(ii)
The reference reconstruction layer induced by the Bézier and formation-size parameterization.
(iii)
The geometric tracking control layer for each UAV.
We analyze these subsystems sequentially.
  • Step 1: Estimator practical convergence.
By Theorem 1, for each follower i
lim sup t s ^ i ( t ) s ( t ) η ρ lim sup t osc L ( t ) ,
where η and ρ are positive constants defined in (19). Moreover, on any interval where s ( t ) is constant (i.e., between replanning instants), the estimation error converges exponentially to zero. Hence, all estimator states remain uniformly bounded for all t 0 .
  • Step 2: Reference reconstruction error bound.
By Lemma A6, the mappings
p r ( j ) ( t ) = P ( j ) ( s ( t ) , t ) , d r ( j ) ( t ) = D ( j ) ( s ( t ) , t ) , j = 0 , , 4 ,
are locally Lipschitz with respect to the parameter vector s on the compact set ensured by the planner.
Therefore, for each follower i and j = 0 , , 4 , there exist constants L j > 0 such that
sup t 0 p ^ r , i ( j ) ( t ) p r , i ( j ) ( t ) L j sup t 0 s ^ i ( t ) s ( t ) .
Thus the reference mismatch is uniformly bounded and inherits the practical convergence property of the estimator layer.
  • Step 3: Tracking layer ISS property.
From Lemma A7, the geometric tracking controller satisfies the input-to-state stability (ISS) type bound
e i ( t ) c i e γ i t e i ( 0 ) + c i sup τ [ 0 , t ] p ^ r , i ( τ ) p r , i ( τ ) ,
where e i ( t ) : = p i ( t ) p r , i ( t ) .
Taking the limit superior of (30) yields
lim sup t e i ( t ) c i lim sup t p ^ r , i ( t ) p r , i ( t ) .
  • Step 4: Composite bound.
Combining (31) with the reconstruction bound (29) gives
lim sup t e i ( t ) C i lim sup t s ^ i ( t ) s ( t ) ,
for some constant C i > 0 .
Substituting the estimator bound (28) yields
lim sup t e i ( t ) Ξ s ˙ ,
where
Ξ ( r ) = C i η ρ r .
Since the contraction factor ρ increases with the estimator gain k (Assumption 4), the ultimate formation tracking bound can be made arbitrarily small by increasing the estimator contraction gain.
This completes the strengthened proof. □
Remark 4.
Compared with existing formation control methods under obstacle environments, the proposed framework offers several advantages. First, rather than treating obstacle avoidance and formation maintenance as loosely coupled problems, the formation footprint is explicitly embedded into the Safe Flight Corridor construction through corridor tightening. This guarantees the geometric feasibility of the entire formation in constrained spaces. Second, unlike approaches that rely on fixed or persistently connected communication graphs, the proposed nonlinear agreement protocol only requires joint leader reachability under switching directed topologies with bounded delays. Third, the analysis establishes an explicit cascade relationship between the estimator contraction rate and ultimate formation tracking accuracy, providing quantitative performance guarantees instead of purely qualitative convergence claims.

3.4. Computational Complexity Discussion

The Safe Flight Corridor construction consists of two main steps: (i) obstacle separation and half-space generation, and (ii) mixed-integer trajectory optimization.
For corridor construction, suppose each path segment interacts with at most N o nearby obstacles. Computing separating hyperplanes requires O ( N o ) operations per segment. If the path contains L segments, the overall complexity of corridor generation is O ( L N o ) .
For trajectory optimization, let M denote the number of Bézier segments and n the polynomial order. The number of continuous decision variables is O ( M n ) , while the number of binary variables equals the number of possible cell assignments per segment. The resulting optimization is a mixed-integer quadratic program (MIQP), whose worst-case complexity is exponential in the number of binary variables. However, since the corridor decomposition restricts feasible assignments locally, the practical computation remains tractable for moderate environment complexity.
Overall, the dominant computational cost arises from the MIQP solver, whereas corridor construction scales linearly with the number of obstacles and path segments.

4. Simulation Results

In this section, we employ a numerical example to verify the effectiveness of the proposed algorithm.
To further evaluate robustness, we consider stochastic packet delays and intermittent leader communication. Specifically, communication delays are randomly generated within [ 0 , d ¯ ] following a uniform distribution, and leader access is temporarily disabled with a fixed probability to emulate packet loss. Simulation results indicate that although transient estimator errors increase during temporary connectivity violations, all states remain bounded due to the dissipative agreement structure. Once joint leader reachability is restored, the window-based contraction mechanism drives the estimator errors back to their nominal bounds. These results support the robustness of the proposed framework under more realistic communication conditions.
We consider a planar multi-UAV formation navigation task in an obstacle environment. The team consists of five UAVs, including one leader indexed by 0 and N = 4 followers indexed by V = { 1 , 2 , 3 , 4 } . The leader starts from p 0 = [ 0 , 0 ] and aims to reach the goal p g = [ 13 , 0 ] . The environment contains six static circular obstacles, specified by their centers and radius:
{ ( 3.5 , 0.0 , 0.7 ) , ( 6.0 , 1.2 , 0.8 ) , ( 6.5 , 1.4 , 0.9 ) , ( 9.0 , 0.0 , 0.7 ) , ( 11.0 , 1.4 , 0.8 ) , ( 11.0 , 2 , 0.65 ) } ,
where each triple denotes ( x , y , r ) .
Followers are required to maintain a time-varying circular formation around the leader reference trajectory p r ( t ) . Let θ i { 0 , π 2 , π , 3 π 2 } be the desired angular offsets equally spaced on a circle. Define c i = [ cos θ i , sin θ i ] . The ideal formation reference for follower i is
p r , i ( t ) = p r ( t ) + d r ( t ) c i ,
where d r ( t ) d min = 0.45 > 0 is the time-varying formation size.
Communication among followers is described by a directed switching graph with three modes. We use the adjacency matrix A ( m ) = [ a i j ( m ) ] R 4 × 4 , where a i j ( m ) = 1 indicates a directed edge ( i j ) (i.e., follower j receives information from follower i) under mode m. The three modes are:
A ( 1 ) = 0 1 1 0 0 0 1 0 0 0 0 1 0 0 0 0 , A ( 2 ) = 0 0 0 0 1 0 1 0 0 0 0 1 0 0 0 0 , A ( 3 ) = 0 1 0 0 0 0 1 0 0 0 0 1 1 0 0 0 .
The topology switches periodically every T s = 1.2 s . In addition, the leader information is directly available to exactly one follower per mode, encoded by μ ( m ) { 0 , 1 } 4 :
μ ( 1 ) = 1 0 0 0 , μ ( 2 ) = 0 1 0 0 , μ ( 3 ) = 0 0 1 0 .
Communication delays are time-varying but bounded by d ¯ = 0.1 s .
This subsection lists the simulation parameters and briefly verifies that the theoretical assumptions are satisfied. The simulation runs with sampling time Δ t = 0.05 s over T end = 70 s . The leader replans every T replan = 2 s . The input of UAVs are sped with saturation v 0 v max , L = 0.9 for the leader and v i v max , F = 1.1 for followers. The leader tracks the planned reference point p r ( t ) by a proportional law
v 0 ( t ) = sat k L ( p r ( t ) p 0 ( t ) ) , v max , L ,
with k L = 1.5 . Followers track their formation references using
v i ( t ) = sat k track ( p i ( t ) p ^ r , i ( t ) ) + v i rep ( t ) , v max , F ,
with k track = 2.0 , where v i rep ( t ) is a light obstacle repulsion term for safety.
The leader generates a waypoint list W ini = [ W 0 , , W L ] (with W 0 = p 0 ( t k ) and W L = p g ) and constructs a Safe Flight Corridor as a sequence of convex polyhedra
χ free = = 1 L c P , P = { x R 2 : A x c } .
In implementation, each P is initialized as an axis-aligned corridor box inflated by 1.0 m and then tightened by obstacle separating half-planes. The leader trajectory over each corridor segment is parameterized by a quintic Bézier curve ( n = 5 ). The control points are obtained by solving a quadratic program that minimizes a discrete proxy of the integrated squared jerk (third differences of control points), with weight λ traj = w jerk = 1.0 .
The nominal formation size is d nom = 1.0 and the minimum radius is d min = 0.45 . The corridor tightening uses the safety factor ζ = 1.1 , so that the corridor constraints are reduced by approximately ζ d k at each replanning iteration (consistent with the formation footprint). The planned radius d k is smoothed into d r ( t ) using a first-order filter with time constant τ d = 0.6 s .
Each follower maintains an estimate p ^ r , i ( t ) R 2 of the leader reference point p r ( t ) using the nonlinear agreement protocol with delays:
p ^ ˙ r , i ( t ) = φ p ^ r , i ( t ) κ i ( t ) , φ ( r ) = k φ tanh ( r / ε ) , k φ = 3.0 , ε = 0.18 ,
where κ i ( t ) is a convex combination of delayed neighbor estimates and (when available) the leader reference:
κ i ( t ) = j N i ( t ) a ¯ i j ( t ) p ^ r , j ( t d i j ( t ) ) + μ ¯ i ( t ) p r ( t d i 0 ( t ) ) ,
with normalized weights a ¯ i j ( t ) and μ ¯ i ( t ) defined as in (6). Delays satisfy d i j ( t ) , d i 0 ( t ) [ 0 , d ¯ ] and are simulated by random discrete delays up to D = d ¯ / Δ t steps.
The directed switching graphs (35) together with leader access vectors (36) ensure that within each switching window, the leader information can reach all followers through directed paths, satisfying the joint leader reachability condition. The delay bound d ¯ is enforced by construction. The nonlinearity φ ( r ) satisfies φ ( 0 ) = 0 and r φ ( r ) < 0 for all r 0 , fulfilling the strict dissipativity requirement. Therefore, Assumptions 1–4 of the theoretical analysis are satisfied in the simulation.
The leader starts at p 0 = [ 0 , 0 ] . Each follower starts near the leader with small random perturbations around the desired circular formation positions. The initial estimator states p ^ r , i ( 0 ) are also perturbed, resulting in nonzero initial estimation errors. The simulation results are shown as in Figure 2, Figure 3, Figure 4, Figure 5, Figure 6 and Figure 7. Figure 2 shows the trajectories of the leader and followers. The leader successfully navigates through the cluttered environment and reaches the goal, while followers maintain the desired circular formation. As the team approaches narrow passages, the formation size d r ( t ) decreases, enabling the entire formation to remain within the safe corridor, and then increases again in open regions. Figure 3 compares the planned leader reference trajectory p r ( t ) and the executed leader trajectory p 0 ( t ) . The executed trajectory closely tracks the planned reference, demonstrating that the planned trajectory is sufficiently smooth for tracking. Figure 4 plots the time evolution of the formation size d r ( t ) , showing adaptive shrink behavior consistent with obstacle proximity. Figure 5 reports the distributed estimator errors p ^ r , i ( t ) p r ( t ) for all followers. Figure 6 illustrates the mode switching process of the topology, while Figure 7 shows a screenshot of the animation depicting the collective motion of the moving bodies in formation. Despite switching directed communication topologies and bounded delays, the estimator errors remain bounded and converge to small neighborhoods of zero, which corroborates the practical agreement guarantee.
The simulation results validate the proposed framework in a cluttered environment: (i) online planning generates smooth safe trajectories; (ii) adaptive formation sizing enables obstacle avoidance for the entire team; and (iii) the distributed nonlinear estimator achieves robust leader reference tracking under directed switching graphs with delays, leading to successful distributed formation control.
In practical UAV systems, the proposed framework can be implemented in a distributed manner. The leader computes the Safe Flight Corridor and formation size optimization at each replanning step, while followers run lightweight nonlinear agreement dynamics and geometric tracking controllers. Communication overhead is limited to parameter vector exchange rather than full trajectory broadcasting. The dissipative structure naturally tolerates moderate delay variations and packet loss, provided that joint connectivity is periodically restored. Computationally, corridor construction scales linearly with nearby obstacles, and the MIQP problem remains tractable for moderate environment complexity.

5. Conclusions

This paper presented a distributed formation planning and control framework for multi-UAV systems operating under directed switching communication topologies and environmental constraints. By integrating SFC-based trajectory planning with adaptive formation sizing, the proposed method enables the entire formation to safely navigate obstacle-rich environments. A nonlinear agreement protocol was introduced to handle delayed and intermittent leader information, and rigorous analysis established practical tracking and agreement guarantees under switching graphs. Simulation results validated the theoretical findings and demonstrated robust performance in challenging scenarios. Future work will focus on extending the framework to fully three-dimensional environments, incorporating more complex vehicle dynamics, and investigating experimental validation on real UAV platforms.
The present study considers a single-leader formation architecture. Extending the framework to multiple leaders introduces additional challenges, including potential inconsistency among leader references, multi-source information fusion in the distributed estimator, and more complex cascade stability analysis. In multi-leader scenarios, additional coordination mechanisms among leaders or consensus-based aggregation strategies would be required to ensure coherent formation behavior. Although the proposed nonlinear agreement protocol can serve as a foundation for such extensions under suitable connectivity assumptions, rigorous analysis for multi-leader formations is left for future investigation.
Despite the demonstrated effectiveness of the proposed framework, several limitations remain. First, the theoretical guarantees rely on bounded communication delays and a joint leader reachability condition; performance under severe packet loss or prolonged disconnections is not addressed. Second, the Safe Flight Corridor construction assumes accurate obstacle representation and convex decomposition, which may become conservative in highly complex 3D environments. Third, the formation control strategy focuses on practical convergence rather than strict asymptotic consensus under time-varying leader signals. Finally, the current validation is limited to numerical simulations, and experimental implementation on real UAV platforms remains as future work.

Author Contributions

Conceptualization, Y.Z. and Z.J.; methodology, Y.Z. and Z.J.; software, Y.Z.; validation, Z.J.; formal analysis, Y.Z. and Z.J.; investigation, Y.Z.; writing—original draft preparation, Y.Z. and Z.J.; writing—review and editing, Y.Z. and Z.J.; visualization, Y.Z.; supervision, Y.Z. and Z.J.; project administration, Y.Z. and Z.J. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Natural Science Foundation of China under Grant 62403423, and the Natural Science Foundation of Zhejiang Province under Grant LMS25F030003. Henan Province Science and Technology Research Project (242102240056, 252102241020).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Lemmas and Main Proofs

Before presenting the technical Lemmas, we briefly summarize their roles in the overall proof structure. The analysis follows a cascade logic. First, Lemmas A1–A5 establish contraction of the nonlinear agreement dynamics under switching directed graphs with delays. In particular, Lemma A5 characterizes how contraction generated at a root node propagates along persistent directed edges, which is essential for proving window-based max–min contraction in Theorem 1. Second, Lemma A6 provides a Lipschitz continuity result that maps parameter estimation errors to trajectory reconstruction errors induced by the Bézier and formation-size parameterization. Finally, Lemma A7 establishes an ISS-type tracking bound for the geometric controller, showing that the UAV formation tracking error is ultimately bounded by the reference mismatch. This layered structure enables the derivation of Theorem 2 through a cascade argument from distributed agreement to formation tracking.
Lemma A1.
Under Assumptions 3 and bounded delays (7), for each ℓ, the follower envelope satisfies for almost all t
D + E max ( t ) α E max ( t ) κ ¯ ( t ) + s ˙ ,
D + E min ( t ) α κ ̲ ( t ) E min ( t ) s ˙ ,
where κ ¯ ( t ) and κ ̲ ( t ) are appropriate max/min convex combination bounds induced by (18).
Proof. 
Pick an index i * ( t ) arg max i e i , ( t ) . Using the Dini derivative of a pointwise maximum and the dynamics (17)
D + E max ( t ) = D + e i * , ( t ) = φ ( s ^ i * , κ i * , ) s ˙ ( t ) .
By Assumption 3, φ ( · ) is sign-definite and drives s ^ i * , toward κ i * , . Since κ i * , is a convex combination of delayed neighbor estimates and leader signal, it lies between the delayed envelopes. This yields an inequality in terms of E max and the delayed envelopes. The bound for E min is analogous by choosing i * ( t ) arg min i e i , ( t ) . □
Lemma A2.
Consider the scalar system
x ˙ ( t ) = φ x ( t ) κ ( t ) ,
where κ ( · ) is measurable and satisfies κ ( t ) [ κ ̲ ( t ) , κ ¯ ( t ) ] for all t. Assume (23) holds with gain k > 0 for φ. Then for almost all t
D + x ( t ) κ ¯ ( t ) + k x ( t ) κ ¯ ( t ) + + D + κ ¯ ( t ) + ,
and
D + κ ̲ ( t ) x ( t ) + k κ ̲ ( t ) x ( t ) + + D + κ ̲ ( t ) + ,
where y + = max { y , 0 } . In particular, if κ ¯ and κ ̲ are constant on an interval [ t 0 , t 0 + T ] , then for all t [ t 0 , t 0 + T ]
x ( t ) κ ¯ + e k ( t t 0 ) x ( t 0 ) κ ¯ + , κ ̲ x ( t ) + e k ( t t 0 ) κ ̲ x ( t 0 ) + .
Proof. 
Let y ( t ) : = x ( t ) κ ¯ ( t ) . When y ( t ) > 0 , we have x ( t ) > κ ¯ ( t ) κ ( t ) ; hence, r ( t ) : = x ( t ) κ ( t ) y ( t ) > 0 . By (23), φ ( r ) k r k y for r > 0 . Thus, for almost all t with y ( t ) > 0
y ˙ ( t ) = x ˙ ( t ) κ ¯ ˙ ( t ) = φ ( x κ ) κ ¯ ˙ ( t ) k y ( t ) κ ¯ ˙ ( t ) .
Taking upper Dini derivatives and using D + ( y + ) ( D + y ) 1 { y > 0 } yields (A5). The lower bound (A6) is analogous by considering z ( t ) : = κ ̲ ( t ) x ( t ) . Since κ ¯ and κ ̲ are constants, Grönwall inequality gives (A7). □
Lemma A3.
For all t 0 and each follower i
κ i ( t ) min { S ^ min ( t ) , S L min ( t ) } , max { S ^ max ( t ) , S L max ( t ) } .
Moreover, if μ ¯ i ( t ) μ * > 0 on a time subinterval, then κ i ( t ) is a strict convex combination in the sense that it lies at least a fraction μ * toward the leader window interval.
Proof. 
By definition (18), κ i ( t ) is a convex combination of the delayed neighbor values s ^ j ( t d i j ( t ) ) [ S ^ min ( t ) , S ^ max ( t ) ] and the delayed leader value s ( t d i 0 ( t ) ) [ S L min ( t ) , S L max ( t ) ] . Thus, (A8) holds. If μ ¯ i ( t ) μ * , write κ i ( t ) = ( 1 μ ¯ i ( t ) ) ξ ( t ) + μ ¯ i ( t ) η ( t ) with ξ ( t ) [ S ^ min ( t ) , S ^ max ( t ) ] and η ( t ) [ S L min ( t ) , S L max ( t ) ] ; the strictness follows. □
Lemma A4.
Under Assumption 4 and bounded delays. Fix an interval I = [ t 0 , t 0 + T ] with T λ D such that for some follower r, μ ¯ r ( t ) μ * > 0 for all t I . Then on I
s ^ r ( t 0 + λ D ) S ^ min ( t 0 ) + α R , S ^ max ( t 0 ) α R S L min ( t 0 ) δ L , S L max ( t 0 ) + δ L ,
where ⊕ denotes Minkowski sum of intervals, δ L : = sup t I | s ( t ) s ( t 0 ) | , and the contraction margin
α R : = μ * 1 e k λ D W ( t 0 ) 2 .
In particular, if s ( · ) is constant on I, then δ L = 0 and s ^ r moves at least α R away from the follower envelope extremes.
Proof. 
Let U : = S ^ max ( t 0 ) and L : = S ^ min ( t 0 ) . Consider the upper deviation y ( t ) : = ( s ^ r ( t ) U ) + on I. By Lemma A3, κ r ( t ) ( 1 μ * ) U + μ * S L max ( t ) . Hence, s ^ r κ r s ^ r [ ( 1 μ * ) U + μ * S L max ] . Applying Lemma A2 with κ ¯ ( t ) = ( 1 μ * ) U + μ * S L max ( t ) yields an exponential decay of y ( t ) up to the variation of S L max ( t ) . A symmetric argument holds for the lower deviation ( L s ^ r ) + using κ ̲ ( t ) = ( 1 μ * ) L + μ * S L min ( t ) . Combining both bounds and noting that the leader window variation within I is captured by δ L gives (A9). The explicit margin (A10) follows by integrating the exponential contraction over λ D and using that strict leader weight; μ * pulls κ r toward the leader interval by a fraction μ * . □
Lemma A5.
Under Assumption 4, consider an interval I = [ t 0 , t 0 + T ] with T λ D during which a directed edge ( j , i ) is continuously active and has weight lower bounded as a ¯ i j ( t ) a * > 0 for all t I . If at time t 0 the source node satisfies
s ^ j ( t ) [ L + β , U β ] t [ t 0 d ¯ , t 0 ]
for some β ( 0 , W ( t 0 ) / 2 ] , then the target node satisfies at t 0 + λ D
s ^ i ( t 0 + λ D ) [ L + β , U β ] [ δ L , + δ L ] , β : = a * 1 e k λ D β ,
where δ L : = sup t I | s ( t ) s ( t 0 ) | accounts for leader variation entering through other neighbors/leader.
Proof. 
On the set I, κ i ( t ) includes the term a ¯ i j ( t ) s ^ j ( t d i j ( t ) ) with weight at least a * . Since delayed s ^ j stays in [ L + β , U β ] over the relevant delay window, and all other terms lie in [ L , U ] (by definition of envelopes), we obtain bounds:
κ i ( t ) ( 1 a * ) U + a * ( U β ) = U a * β , κ i ( t ) ( 1 a * ) L + a * ( L + β ) = L + a * β ,
up to leader variation δ L . Then Lemma A2 yields that s ^ i is attracted to the tightened interval with exponential rate k, producing the margin β = a * ( 1 e k λ D ) β after time λ D . □
Lemma A6
(Lipschitz reference reconstruction). Let s ( t ) encode the active Bézier segment control points and timing information, and the formation-size polynomial parameters, so that p r ( j ) ( t ) = P ( j ) ( s ( t ) , t ) and d r ( j ) ( t ) = D ( j ) ( s ( t ) , t ) for j = 0 , 1 , 2 , 3 , 4 . Assume the planner ensures the boundedness of s ( t ) and excludes singular timing (i.e., segment durations are bounded away from 0). Then for each compact time interval between replannings and for each j 4 , there exist constants L P , j , L D , j > 0 such that
P ( j ) ( s ^ , t ) P ( j ) ( s , t ) L P , j s ^ s , | D ( j ) ( s ^ , t ) D ( j ) ( s , t ) | L D , j s ^ s .
Proof. 
For Bézier curves and polynomials, P ( j ) ( · , t ) and D ( j ) ( · , t ) are smooth functions of the parameters on any set where segment durations are bounded away from zero. Thus, their Jacobians are bounded on the compact parameter set induced by the planner bounds, implying local Lipschitz continuity with constants L P , j , L D , j . □
Lemma A7.
Consider follower UAV i with dynamics (1) controlled by a geometric tracking controller that achieves exponential tracking for a C 4 reference trajectory. Let the controller be driven by an implemented reference p ^ r , i ( t ) and its derivatives up to order 4, while the ideal formation reference is p r , i ( t ) . Define the position tracking error e i ( t ) : = p i ( t ) p r , i ( t ) . Then there exist constants c i > 0 and γ i > 0 such that
e i ( t ) c i e γ i t e i ( 0 ) + c i sup τ [ 0 , t ] p ^ r , i ( τ ) p r , i ( τ ) .
Proof. 
For flatness-based geometric controllers, the closed-loop tracking error dynamics can be written in a cascade form with an exponentially stable linear part plus bounded higher-order terms that are dominated by the reference mismatch. Under bounded reference derivatives and standard gain conditions, one obtains an exponential Lyapunov function V i , satisfying V ˙ i 2 γ i V i + α p ^ r , i p r , i 2 . Gronwall’s inequality yields (A12). □

References

  1. Chowdhury, M.M.U.; Maeng, S.J.; Bulut, E.; Güvenç, Ĭ. 3-D Trajectory Optimization in UAV-Assisted Cellular Networks Considering Antenna Radiation Pattern and Backhaul Constraint. IEEE Trans. Aerosp. Electron. Syst. 2020, 56, 3735–3750. [Google Scholar] [CrossRef]
  2. Bao, T.; Wang, H.; Wang, W.J.; Yang, H.C.; Hasna, M.O. Secrecy Outage Performance Analysis of UAV-Assisted Relay Communication Systems with Multiple Aerial and Ground Eavesdroppers. IEEE Trans. Aerosp. Electron. Syst. 2022, 58, 2592–2600. [Google Scholar]
  3. Hung, H.A.; Hsu, H.H.; Cheng, T.H. Optimal Sensing for Tracking Task by Heterogeneous Multi-UAV Systems. IEEE Trans. Control Syst. Technol. 2024, 32, 282–289. [Google Scholar]
  4. Gong, X.; Gui, J.; Chen, Y.; Yang, X.; Yu, W.; Huang, T. Resilient Human-in-the-Loop Formation-Tracking of Multi-UAV Systems Against Byzantine Attacks. IEEE Trans. Autom. Sci. Eng. 2025, 22, 3797–3809. [Google Scholar] [CrossRef]
  5. Hai, X.; Tan, L.; Feng, Q.; Duan, H.; Wen, C. Replanning-Oriented Framework for Efficient Real-Time Decision-Making in Multi-UAV Systems. IEEE Trans. Ind. Inform. 2025, 21, 5127–5137. [Google Scholar] [CrossRef]
  6. Ma, J.; Chen, X.; Wen, G.; Wang, J.; Zhao, F.; Qiu, J. Dynamic Memory Event-Triggered Lag Consensus of Multi-UAV Systems with Hybrid Attacks Over Stochastic Switching Topology. IEEE Trans. Autom. Sci. Eng. 2025, 22, 16999–17009. [Google Scholar] [CrossRef]
  7. Savkin, A.V.; Huang, H. Multi-UAV Navigation for Optimized Video Surveillance of Ground Vehicles on Uneven Terrains. IEEE Trans. Intell. Transp. Syst. 2023, 24, 10238–10242. [Google Scholar] [CrossRef]
  8. Cao, X.; Liu, L. A Multi-Timescale Method for State of Charge Estimation for Lithium-Ion Batteries in Electric UAVs Based on Battery Model and Data-Driven Fusion. Drones 2025, 9, 247. [Google Scholar] [CrossRef]
  9. El-Malek, A.H.A.; Aboulhassan, M.A.; Salhab, A.M.; Zummo, S.A. Performance Analysis and Optimization of UAV-Assisted Networks: Single UAV with Multiple Antennas Versus Multiple UAVs with Single Antenna. IEEE Syst. J. 2023, 17, 3468–3479. [Google Scholar] [CrossRef]
  10. Li, Y.; Zhang, X.; Li, X.; Chen, Z.; Hu, Y.; Yang, J.; Schmeink, A. Cooperative Elliptic Positioning Through Single UAV During GNSS Outages. IEEE Trans. Wirel. Commun. 2024, 23, 12749–12764. [Google Scholar] [CrossRef]
  11. Dong, X.; Yu, B.; Shi, Z.; Zhong, Y. Time-Varying Formation Control for Unmanned Aerial Vehicles: Theories and Applications. IEEE Trans. Control Syst. Technol. 2015, 23, 340–348. [Google Scholar] [CrossRef]
  12. Ranjan, P.K.; Sinha, A.; Cao, Y.; Casbeer, D.; Weintraub, I. Relational Maneuvering of Leader-Follower Unmanned Aerial Vehicles for Flexible Formation. IEEE Trans. Cybern. 2024, 54, 5598–5609. [Google Scholar] [CrossRef]
  13. Yang, Y.; Sun, L.; Fu, Y.; Feng, W.; Xu, K. Three-Dimensional UAV Trajectory Planning Based on Improved Sparrow Search Algorithm. Symmetry 2025, 17, 2071. [Google Scholar] [CrossRef]
  14. Seyyedabbasi, A. Multi-Strategy Variable Secretary Bird Optimization Algorithm (MSVSBOA) for Global Optimization and UAV 3D Path Planning. Symmetry 2026, 18, 273. [Google Scholar] [CrossRef]
  15. Jin, Z.; Bai, L.; Wang, Z.; Zhang, P. Self-Triggered Distributed Formation Control of Fixed-Wing Unmanned Aerial Vehicles Subject to Velocity and Overload Constraints. IEEE Trans. Autom. Sci. Eng. 2024, 21, 4082–4093. [Google Scholar] [CrossRef]
  16. Jin, Z.; Li, H.; Qin, Z.; Wang, Z. Gradient-Free Cooperative Source-Seeking of Quadrotor Under Disturbances and Communication Constraints. IEEE Trans. Ind. Electron. 2025, 72, 1969–1979. [Google Scholar] [CrossRef]
  17. Dong, X.; Zhou, Y.; Ren, Z.; Zhong, Y. Time-Varying Formation Tracking for Second-Order Multi-Agent Systems Subjected to Switching Topologies with Application to Quadrotor Formation Flying. IEEE Trans. Ind. Electron. 2017, 64, 5014–5024. [Google Scholar]
  18. Zou, Y.; Zhou, Z.; Dong, X.; Meng, Z. Distributed Formation Control for Multiple Vertical Takeoff and Landing UAVs with Switching Topologies. IEEE/ASME Trans. Mechatron. 2018, 23, 1750–1761. [Google Scholar] [CrossRef]
  19. Shi, S.; Wu, S.; Wei, B. Neural-Network-Based Event-Triggered Formation Tracking for Nonlinear Multi-UAV Systems with Switching Topologies Under DoS Attacks. IEEE Trans. Autom. Sci. Eng. 2025, 22, 11656–11667. [Google Scholar] [CrossRef]
  20. Dong, Z.; Shi, S.; Zhen, Z. Asynchronously Integral Event-Triggered Formation Tracking in UAV Swarm Systems Featuring Switching Directed Topologies. IEEE Trans. Control Netw. Syst. 2025, 12, 1251–1263. [Google Scholar] [CrossRef]
  21. Zhang, H.; Chen, C.; Xiang, Z. Adaptive Fuzzy Prescribed-Time Formation Control for Nonlinear Multi-Agent Systems. IEEE Trans. Autom. Sci. Eng. 2025, 22, 10986–10996. [Google Scholar] [CrossRef]
  22. Abbasi, M.; Marquez, H.J. Dynamic Event-Triggered Formation Control of Multi-Agent Systems with Non-Uniform Time-Varying Communication Delays. IEEE Trans. Autom. Sci. Eng. 2025, 22, 8988–9000. [Google Scholar] [CrossRef]
  23. Chang, W.J.; Lin, Y.H.; Lee, Y.C.; Ku, C.C. Investigating Formation and Containment Problem for Nonlinear Multiagent Systems by Interval Type-2 Fuzzy Sliding Mode Tracking Approach. IEEE Trans. Fuzzy Syst. 2024, 32, 4163–4177. [Google Scholar] [CrossRef]
  24. Proskurnikov, A.V.; Calafiore, G.C. Delay Robustness of Consensus Algorithms: Continuous-Time Theory. IEEE Trans. Autom. Control 2023, 68, 5301–5316. [Google Scholar] [CrossRef]
  25. Wu, H.; Meng, D. Synchronizability-Based Distributed Learning Control for Multi-Agent Systems. IEEE Trans. Circuits Syst. II Express Briefs 2024, 71, 2109–2113. [Google Scholar] [CrossRef]
  26. Li, F.; Wang, C.; Mikulski, D.; Wagner, J.R.; Wang, Y. Unmanned Ground Vehicle Platooning Under Cyber Attacks: A Human-Robot Interaction Framework. IEEE Trans. Intell. Transp. Syst. 2022, 23, 18113–18128. [Google Scholar] [CrossRef]
  27. Schäfer, L.; Manzinger, S.; Althoff, M. Computation of Solution Spaces for Optimization-Based Trajectory Planning. IEEE Trans. Intell. Veh. 2023, 8, 216–231. [Google Scholar] [CrossRef]
  28. Guo, Z.; Wang, Y.; Yu, H.; Xi, J. Fast Optimization-Based Trajectory Planning with Cumulative Key Constraints for Automated Parking in Unstructured Environments. IEEE Trans. Veh. Technol. 2025, 74, 11820–11831. [Google Scholar] [CrossRef]
  29. Ren, L.; Li, M.; Fan, S.; Zhang, Y.; Yu, F.; Yang, J. Cooperative Control Method of Multi-Agent Formation and Obstacle Avoidance. In Proceedings of the 2025 IEEE International Conference on Mechatronics and Automation (ICMA), Beijing, China; IEEE: Piscataway, NJ, USA, 2025; pp. 1066–1072. [Google Scholar]
  30. Olfati-Saber, R.; Murray, R.M. Consensus Problems in Networks of Agents with Switching Topology and Time-Delays. IEEE Trans. Autom. Control 2004, 49, 1520–1533. [Google Scholar] [CrossRef]
  31. Lin, Z.; Francis, B.; Maggiore, M. State Agreement for Continuous-Time Coupled Nonlinear Systems. SIAM J. Control Optim. 2007, 46, 288–307. [Google Scholar] [CrossRef]
  32. Shi, G.; Hong, Y. Global Target Aggregation and State Agreement of Nonlinear Multi-Agent Systems with Switching Topologies. Automatica 2009, 45, 1165–1175. [Google Scholar] [CrossRef]
  33. Cong, X.; Zi, L.; Du, D.Z. DTNB: A Blockchain Transaction Framework with Discrete Token Negotiation for the Delay Tolerant Network. IEEE Trans. Netw. Sci. Eng. 2021, 8, 1584–1599. [Google Scholar] [CrossRef]
  34. Liu, T.; Qin, Z.; Hong, Y.; Jiang, Z.P. Distributed Optimization of Nonlinear Multiagent Systems: A Small-Gain Approach. IEEE Trans. Autom. Control 2022, 67, 676–691. [Google Scholar] [CrossRef]
  35. Chang, Z.; Zong, G.; Wang, W.; Yue, M.; Zhao, X. Formation Control and Obstacle Avoidance Design for Networked USV Swarm with Exogenous Disturbance Under Intermittent Communication. IEEE Trans. Netw. Sci. Eng. 2025, 12, 3234–3243. [Google Scholar] [CrossRef]
  36. Xiao, Q.; Yang, S.; Zeng, Z.; Huang, T.; Pal, N. Null-Space-Based Prescribed-Time Formation Control with Collision and Obstacle Avoidance. IEEE Trans. Control Netw. Syst. 2025, 12, 3003–3014. [Google Scholar] [CrossRef]
  37. Wang, L.; Zhu, D.; Pang, W.; Luo, C. A Novel Obstacle Avoidance Consensus Control for Multi-AUV Formation System. IEEE/CAA J. Autom. Sin. 2023, 10, 1304–1318. [Google Scholar] [CrossRef]
  38. Yuan, Z.; Yao, C.; Liu, X.; Gao, Z.; Zhang, W. Multiagent Formation Control and Dynamic Obstacle Avoidance Based on Deep Reinforcement Learning. IEEE Trans. Ind. Inform. 2025, 21, 4672–4682. [Google Scholar] [CrossRef]
  39. Wang, Z.X.; Jin, Z.H.; Li, H. Semi-global asymptotic state agreement of nonlinear multi-agent systems with communication delays under directed switching topologies. Nonlinear Anal. Hybrid Syst. 2024, 52, 101458. [Google Scholar] [CrossRef]
  40. Jin, Z. Global Asymptotic Stability Analysis for Autonomous Optimization. IEEE Trans. Autom. Control 2025, 70, 6953–6960. [Google Scholar] [CrossRef]
Figure 1. Schematic of the Safe Flight Corridor (SFC). The original corridor χ free (solid blue) consists of convex polyhedra P connecting optimized waypoints. Each supporting half-space a r x c r is shifted inward by ζ d to obtain the tightened corridor (dashed blue), ensuring collision-free motion for a formation of diameter d.
Figure 1. Schematic of the Safe Flight Corridor (SFC). The original corridor χ free (solid blue) consists of convex polyhedra P connecting optimized waypoints. Each supporting half-space a r x c r is shifted inward by ζ d to obtain the tightened corridor (dashed blue), ensuring collision-free motion for a formation of diameter d.
Actuators 15 00163 g001
Figure 2. The trajectories of the leader and followers. The black line represents the trajectory of the leader, while the other four colors represent the trajectories of the followers.
Figure 2. The trajectories of the leader and followers. The black line represents the trajectory of the leader, while the other four colors represent the trajectories of the followers.
Actuators 15 00163 g002
Figure 3. The planned leader reference trajectory p r ( t ) and the executed leader trajectory p 0 ( t ) .
Figure 3. The planned leader reference trajectory p r ( t ) and the executed leader trajectory p 0 ( t ) .
Actuators 15 00163 g003
Figure 4. Time evolution of the formation size d r ( t ) .
Figure 4. Time evolution of the formation size d r ( t ) .
Actuators 15 00163 g004
Figure 5. The distributed estimator errors p ^ r , i ( t ) p r ( t ) for all followers.
Figure 5. The distributed estimator errors p ^ r , i ( t ) p r ( t ) for all followers.
Actuators 15 00163 g005
Figure 6. The mode of switching topology.
Figure 6. The mode of switching topology.
Actuators 15 00163 g006
Figure 7. Four snapshots (2 × 2) of the formation navigation under switching topologies. The solid black circles represent the leader, the other four colors represent the followers, and the red circles are obstacles.
Figure 7. Four snapshots (2 × 2) of the formation navigation under switching topologies. The solid black circles represent the leader, the other four colors represent the followers, and the red circles are obstacles.
Actuators 15 00163 g007
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, Y.; Jin, Z. Adaptive Formation Control for Multi-UAV Swarms in Cluttered Environments with Communication Delays Under Directed Switching Topologies. Actuators 2026, 15, 163. https://doi.org/10.3390/act15030163

AMA Style

Zhang Y, Jin Z. Adaptive Formation Control for Multi-UAV Swarms in Cluttered Environments with Communication Delays Under Directed Switching Topologies. Actuators. 2026; 15(3):163. https://doi.org/10.3390/act15030163

Chicago/Turabian Style

Zhang, Yingzheng, and Zhenghong Jin. 2026. "Adaptive Formation Control for Multi-UAV Swarms in Cluttered Environments with Communication Delays Under Directed Switching Topologies" Actuators 15, no. 3: 163. https://doi.org/10.3390/act15030163

APA Style

Zhang, Y., & Jin, Z. (2026). Adaptive Formation Control for Multi-UAV Swarms in Cluttered Environments with Communication Delays Under Directed Switching Topologies. Actuators, 15(3), 163. https://doi.org/10.3390/act15030163

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop