RRT*-APF Path Planning and MA-AADRC-SMC Control for Cooperative 3-D Obstacle Avoidance in Multi-UAV Formations

Yan, Yuehao; Liu, Songlin; Hao, Rui

doi:10.3390/drones9090611

Open AccessArticle

RRT*-APF Path Planning and MA-AADRC-SMC Control for Cooperative 3-D Obstacle Avoidance in Multi-UAV Formations

by

Yuehao Yan

^1,2,

Songlin Liu

^1,2

and

Rui Hao

^1,2,*

¹

Institute of Electronic and Electrical Engineering, Civil Aviation Flight University of China, 46 Nanchang Road, Guanghan 618307, China

²

Low Altitude Economy Industry Innovation Research Center, Civil Aviation Flight University of China, 46 Nanchang Road, Guanghan 618307, China

^*

Author to whom correspondence should be addressed.

Drones 2025, 9(9), 611; https://doi.org/10.3390/drones9090611

Submission received: 29 July 2025 / Revised: 22 August 2025 / Accepted: 26 August 2025 / Published: 29 August 2025

(This article belongs to the Section Innovative Urban Mobility)

Download

Browse Figures

Versions Notes

Abstract

To enable safe cooperative flight of multi-UAV formations in urban 3-D airspace with wind-field disturbances, we develop an integrated planning-control framework.The planning layer uses an APF-guided RRT* with continuous collision prediction and explicit velocity/acceleration limits, and compensates wind online.The control layer adopts a dual-loop MA-AADRC-SMC design. An adaptive ESO estimates disturbances for feed-forward cancellation, and an SMC term improves robustness and tracking accuracy. By coupling the planned trajectory with speed-weighted repulsive fields, the framework coordinates path and attitude in closed loop, enabling collision-free and overshoot-free formation flight in wind and clutter. Simulations show higher tracking accuracy and better formation stability than ADRC, PID and SMC. A Lyapunov analysis proves uniform boundedness and asymptotic stability. The framework is scalable to applications such as disaster assessment and urban air transport.

Keywords:

multi-agent unmanned aerial vehicles (UAVs); collaborative collision avoidance; velocity and acceleration constraints; active disturbance rejection control (ADRC); sliding-mode control (SMC); formation flight

1. Introduction

In complex 3-D environments, multi-UAV formations must remain stable while reacting quickly to unknown disturbances and obstacles. Meeting both goals is still challenging for research and practice.

In UAV control, active disturbance rejection control (ADRC) has become a research focus for robust UAV operation owing to its extended state observer (ESO), which estimates and compensates internal and external disturbances online. Shi et al. [1] employed an improved ESO with a sliding-average low-pass filter in a nested inner–outer loop structure to achieve rapid attitude response without overshoot; Yang et al. [2] enhanced the ESO’s noise rejection by using a nonlinear smooth GALN function, significantly improving convergence speed and tracking accuracy; Dou et al. [3] introduced fuzzy logic into the ESO and error-feedback loop, effectively reducing overshoot and output fluctuation; and Chen et al. [4] optimized ADRC parameters via the integral of time-weighted absolute error (ITAE) criterion combined with a particle swarm optimization (PSO) algorithm, further enhancing tethered UAV tracking precision under cable interference.

Active Disturbance Rejection Control (ADRC) is typically composed of three main components: a Tracking Differentiator (TD), an Extended-State Observer (ESO), and a Nonlinear State Error Feedback (NLSEF). Specifically, the TD is employed to smooth the desired input and generate usable derivative signals; the ESO estimates the system states in real time while compensating for the “total disturbance” (including external perturbations and model uncertainties); and the NLSEF, together with a linear feedback term, achieves dynamic correction of the closed-loop control law. This architecture significantly reduces reliance on an accurate mathematical model and thus provides strong robustness and adaptability. In contrast, sliding-mode control (SMC) is based on the invariance principle, where a sliding surface is designed and a discontinuous control law drives the system trajectory toward and maintains it on this surface, ensuring finite-time convergence and strong disturbance rejection. Its strengths lie in design simplicity and robustness, particularly against both structured and unstructured uncertainties. Nevertheless, SMC often suffers from the chattering phenomenon in practical implementations, while the performance of ADRC critically depends on the tuning of ESO parameters, the selection of disturbance compensation bandwidth, and the smoothing capability of the TD.

Sliding-mode control (SMC), renowned for its strong robustness to parameter uncertainty and external disturbances, has been widely applied to quadrotor attitude control. Zhang et al. [5] proposed a robust SMC strategy based on ESO that reduced rotor speed jump amplitude by nearly 90 %; Wang et al. [6] constructed a generalized sliding-mode manifold for the underactuated system and verified closed-loop stability; An et al. [7] applied geometric SMC within the Lie group–Lie algebra framework to achieve globally stable attitude convergence; and Feng et al. [8] combined fuzzy adaptive mechanisms with sliding-mode variable-structure control to suppress chattering and enhance disturbance rejection. Although these works significantly improve the robustness of single-vehicle attitude and velocity loops, most experiments remain confined to indoor or static hover scenarios, lacking validation for high-speed three-dimensional formation coordination and real-time obstacle avoidance. Moreover, these controllers have not explicitly incorporated kinematic constraints such as maximum velocity and acceleration, nor analyzed global stability under strong wind fields and communication delays.

In path planning, the rapidly exploring random tree (RRT) algorithm is widely adopted due to its map-free nature, but its global randomness limits convergence efficiency and path quality. Chen et al. [9] proposed an APF-RRT fusion framework based on goal biasing, improved artificial potential fields, and a genetic algorithm to accelerate convergence and shorten paths; Yin et al. [10] incorporated dynamic constraints and distance weighting into node expansion and employed B-spline curves for path smoothing; Li et al. [11] used chaotic sequences and fuzzy inference to dynamically adjust sampling parameters and developed bilateral and rolling-window replanning schemes; Wang et al. [12] introduced bias-extension, probability guidance, and node pruning strategies to significantly speed up tree growth and simplify paths. Traditional artificial potential field (APF) methods are computationally simple and yield smooth paths but often suffer from local minima. Xu et al. [13] presented an APF-RRT* fusion approach that guides sampling with potential fields and self-tunes the sampling radius for efficient global planning; Zhou and Kong [14] adopted hexagonal guidance and virtual subgoals to escape local traps; Yang et al. [15] developed an online planner based on reference-route attractive fields and adaptive disturbance factors to ensure safety in dynamic environments; Zhou et al. [16] introduced angular factors and adaptive weights into attractive and repulsive functions to suppress oscillations and overcome local optima. Existing RRT*/APF enhancements primarily focus on sampling efficiency and geometric path smoothing, rarely incorporating kinematic constraints such as velocity/acceleration or continuous-time collision prediction in the expansion and rewiring phases; generated paths often require post-processing before execution. Furthermore, most algorithms are validated only in single-UAV, static-obstacle settings, lacking systematic evaluation of inter-UAV interference, communication bandwidth, and safety-distance requirements.

To enhance autonomous flight capability in complex dynamic environments, the integration of path planning and attitude control has emerged as a key research topic. Wang et al. [17] constructed a neural-network-based joint planning-tracking framework to achieve real-time obstacle avoidance for six-degree-of-freedom vehicles in complex obstacle fields, while Huang et al. [18] fused an improved APF algorithm with inverse-dynamics backstepping control to effectively address local optima. Despite these advances, challenges remain in attitude-path modeling consistency, global optimality under multi-objective constraints, and overall system robustness. For example, Rudnick-Cohen et al. [19] jointly optimized UAV design parameters and path planning to reduce risk and mission time, and Pang et al. [20] emphasized balancing third-party risk and energy consumption in path optimization. Although deep-learning or multi-objective joint optimization frameworks have attempted to synchronize planning and control, they rely on large-scale offline training with poor interpretability and a high transfer cost; the results are predominantly confined to single-vehicle simulations, lacking multi-UAV coordination and real-wind validation. Offline path optimization trade-offs for risk and energy have not been deeply coupled with real-time control loops, nor has robust analysis against disturbances and uncertainties been fully addressed.

This paper proposes a three-dimensional multi-UAV formation cooperative obstacle-avoidance framework that integrates an APF-guided RRT* planner with kinematic constraints, continuous-time collision prediction, and a four-layer obstacle filtering pipeline, together with a hierarchical hybrid MA–AADRC–SMC controller. The planner produces dynamically feasible and executable trajectories via potential-field-biased sampling and predictive collision checks, while the controller combines an adaptive extended state observer, feedforward correction, and a robust sliding-mode law in a dual-loop structure to achieve high-precision tracking under disturbances. Large-scale simulations and Lyapunov-based analysis in composite urban wind fields and heterogeneous three-dimensional environments validate superior tracking accuracy, formation keeping, and disturbance rejection, supporting reliable applications such as disaster assessment and urban air mobility.

Research Gap and Contributions.

Existing planners mainly focus on sampling efficiency and geometric smoothness, but seldom incorporate kinematic limits or continuous-time safety constraints directly into the planning process. Most controllers are validated only in single-UAV or low-dynamics settings, with weak planning–control coupling under realistic disturbances. To address these gaps, our contributions are (1) An APF-guided RRT* planner with embedded velocity/acceleration constraints and continuous-time collision prediction for executable trajectories; (2) A four-layer obstacle filtering pipeline that preserves temporal completeness with near-constant computational cost in cluttered environments; (3) A hybrid MA–AADRC–SMC controller combining bandwidth-scheduled ESO, sliding-mode robustness, and speed-weighted repulsion for overshoot-suppressing, disturbance-rejecting tracking with enforced safety spacing; (4) A tightly coupled planning–control framework with Lyapunov-guaranteed stability, validated through representative urban scenarios and micro-sensitivity studies confirming robustness.

2. Environment Modeling and Kinematic Decoupling

The simulation environment developed in this study is rigorously modeled in the three-dimensional Euclidean space

R^{3}

, fully accounting for the physical constraints of UAVs and the complex geometric boundary conditions encountered in real-world operations. To guarantee both representativeness and generality, the obstacle system is designed to capture the morphological heterogeneity of multiple entity types and the stochasticity of their spatial distribution, while simultaneously preserving the computational tractability required for high-efficiency collision detection and motion planning.

2.1. Dynamic Environmental Disturbances and Obstacle Representation

2.1.1. Composite Wind-Field Disturbance Modeling

\begin{matrix} w_{i} (t, z) = [\sum_{k = 1}^{N_{f}} A_{i, k} \sin (ω_{k} t + φ_{i, k}) + Δ_{i} H (t - t_{0})] (1 + λ_{z} \frac{z}{z_{ref}}), i \in {x, y, z} . \end{matrix}

(1)

where

A_{i, k}

,

ω_{k}

and

φ_{i, k}

are the amplitude, angular frequency and initial phase of the k-th harmonic in the i-direction (

N_{f}

total harmonics);

Δ_{i}

is a step gust activated at

t \geq t_{0}

with

H (\cdot)

the Heaviside function;

λ_{z}

linearly scales wind speed with altitude z relative to the reference height

z_{ref}

.

This model decomposes the wind into three components: (i) multiple harmonic terms that capture periodic or quasi-periodic gust and turbulence energy peaks; (ii) step gust terms

Δ_{i} H (\cdot)

representing sudden wind shear or gust events; and (iii) vertical scaling

(1 + λ_{z} z / z_{ref})

that incorporates the empirical rule that wind tends to be stronger at higher altitudes. In intuitive terms, the horizontal and vertical directions each combine several “tones” (

ω_{k}

) with their “volumes” (

A_{i, k}

), plus a sudden “accent” (

Δ_{i}

), then scaled linearly with height, so that the simulated wind field exhibits multiple time scales, controllable intensity, and height dependence.

The superposition of multi-frequency harmonics plus the segmented gust term captures temporal variability, multi-scale perturbations and vertical gradients, providing a stringent disturbance baseline for testing autonomous navigation and robust-control algorithms. As illustrated in Figure 1, the simulated wind-velocity components in x, y, and z directions show distinct multi-scale variations and altitude-dependent strengthening. Furthermore, the height-time heatmap in Figure 2 highlights the onset of gust at

t_{0}

and the clear vertical strengthening effect.

2.1.2. Super-Quadric Cylinder Obstacles

To model axis-symmetric obstacles such as trees or poles that are common in operational scenes, we employ super-quadric surfaces as a unified parametric framework. The corresponding constraint set is

O_{scyl} = \{p = {(x, y, z)}^{T} ∣ {({| \frac{x - c_{x}}{r} |}^{2 / ϵ} + {| \frac{y - c_{y}}{r} |}^{2 / ϵ})}^{ϵ / 2} \leq 1, 0 \leq z \leq h\} .

(2)

where

(c_{x}, c_{y})

is the base-centre, r is the equivalent radius, h is the column height, and

ϵ

is the shape index (

ϵ = 1

degenerates to a right cylinder). Obstacle centres are generated via Poisson-disk sampling on

R^{2}

, ensuring a minimum pairwise distance of 20 units. Radii follow

r \sim U (3, 7)

and heights are sampled as

h = 10 + 50 U (0, 1)

, thereby reflecting size diversity.

2.1.3. Oriented Bounding-Box Rectangular Obstacles

To capture the spatial characteristics of cuboid obstacles such as buildings or containers, we adopt an oriented bounding-box (OBB) model. For a centre

c = (c_{x}, c_{y})

, the local coordinates are defined by

\tilde{p} = R^{T} (p - c),

where

R

is a planar rotation about the z-axis with random orientation angle

γ \sim U (0, 2 π)

. The OBB constraint set reads

O_{obb} = {p || \tilde{x} | \leq \frac{W}{2}, | \tilde{y} | \leq \frac{L}{2}, 0 \leq \tilde{z} \leq h\},

(3)

with

W, L \sim U (6, 14)

denoting the minor and major side lengths, respectively, and

h = 10 + 50 U (0, 1)

the height. Likewise, centres are generated by spatial Poisson-disk sampling to avoid overlap. This model is fully compatible with the super-quadric parameterization in Section 2.1.2, enabling seamless integration with potential fields and distance functions while preserving obstacle diversity and orientation.

2.2. Attitude–Acceleration Decoupling

In multirotor UAV control, the strong coupling between attitude (Euler angles) and translational acceleration complicates controller design—especially under high-dynamic performance demands or strict trajectory-tracking requirements. To mitigate this, we adopt an attitude–acceleration decoupling strategy that transforms the original six-degree-of-freedom dynamics into a position–velocity double integrator, greatly simplifying the high-level control problem.

Starting from the full rigid-body equations and applying an appropriate change of variables, the nonlinear coupling between the body-axis thrust direction and the inertial-frame acceleration can be abstracted as a desired acceleration input. The thrust magnitude T and attitude angles

(ϕ, θ)

are treated as fast inner-loop variables, which are regulated so that the generated acceleration closely follows the outer-loop reference. This creates a clear time-scale separation between acceleration tracking (outer loop) and attitude regulation/thrust allocation (inner loop). At the high level, the system can thus be expressed as

\dot{p} = v, \dot{v} = u,

(4)

By delegating fast attitude and thrust control to the inner loop, the outer loop only needs to command the desired inertial acceleration

u

. This abstraction turns the full six-degree-of-freedom coupled dynamics into a simple position–velocity–acceleration “double integrator” model, making high-level planning and gain tuning more straightforward while preserving robustness.

Within this framework, upper-layer planning and control focus solely on generating velocity/acceleration targets and closing the outer loop, while low-level actuators handle attitude and thrust execution. Compared with direct attitude-space control, this decoupling improves implementability and robustness, and offers a solid theoretical and engineering foundation for integrating advanced algorithms. In this study, kinematic discretisation and control-input synthesis both follow this decoupling principle, reducing the UAV dynamics to a linear discrete double integrator with emphasis on precise velocity modulation and disturbance rejection, which are key for high-accuracy, robust autonomous flight.

Section summary: We modeled realistic wind fields and heterogeneous obstacles (super-quadric cylinders and OBBs) and introduced attitude–acceleration decoupling to reduce UAV dynamics to a discrete double integrator. This abstraction enables planners and outer-loop controllers to operate on feasible velocity/acceleration targets while inner loops realize fast thrust/aSection summary:ttitude tracking.

3. Path-Planning Algorithm

3.1. Improved RRT*-APF Algorithm

Rapidly exploring Random Tree Star (RRT*) is widely adopted for UAV path planning owing to its efficient sampling in high-dimensional configuration spaces and its asymptotic optimality. Nevertheless, the standard algorithm focuses mainly on spatial connectivity and collision-free feasibility, while neglecting indispensable kinematic constraints—such as maximum velocity, acceleration, and turning limits—so that certain planned trajectories become infeasible or unsafe when executed on real vehicles.

To overcome these deficiencies, an improved RRT* augmented with an Artificial Potential Field (APF) is developed. The APF serves as a heuristic guide for random sampling, whereas the UAV kinematic model is embedded directly into both collision checking and path-extension procedures. Consequently, the proposed planner generates obstacle-avoidance trajectories in three-dimensional environments that honour both geometric clearance and kinematic feasibility. During tree expansion, the superposition of goal-attractive and obstacle-repulsive potentials dynamically biases sample selection toward higher-quality segments and expedites convergence of the tree to the goal region. Simultaneously, continuous-time kinematic simulation combined with collision discrimination guarantees that each path segment satisfies prescribed velocity and acceleration bounds of the UAV.

3.2. Random Sampling and APF-Corrected Sampling Strategy

Samples are drawn from the goal

x_{goal}

with probability p, and otherwise uniformly over the workspace

[x_{\min}, x_{max}] \times [y_{\min}, y_{max}] \times [z_{\min}, z_{max}]

as

x_{rand} = {(x_{rand}, y_{rand}, z_{rand})}^{T}

.

For each super-quadric cylinder obstacle with base centre

c^{(x y)} = (c_{x}, c_{y})

, radius R, and height range

[z_{\min}, z_{max}]

, let

d_{xy} = ∥ x^{(x y)} - c^{(x y)} ∥

. We define an effective clearance

d_{eff}

that blends lateral and vertical separations, and a repulsive force

F_{rep}

that grows as the clearance shrinks; both precise expressions are provided in Appendix A.1 (cf. [21]). If

d_{eff} < d_{0}

, the obstacle contributes a nonzero repulsion. Summing over all obstacles yields the potential-field gradient

\nabla U (x) = - \sum_{k} F_{rep, k} .

Finally, the raw sample is refined using a single gradient step

x_{rand} \leftarrow x_{rand} - α \nabla U (x_{rand}),

(5)

which biases samples away from obstacles and accelerates planner convergence.

The attractive term pulls the sample toward the goal, while the repulsive term pushes it away from nearby obstacles. Their sum shapes the local sampling bias, guiding expansion into safer, more goal-oriented regions.

The effective clearance metric and the obstacle repulsive force used to compute the APF gradient are detailed in Appendix A.1. Intuitively,

d_{eff}

prevents grazing either the sides or the top of obstacles, and

F_{rep}

smoothly increases as the clearance decreases, producing a stable guidance away from hazardous regions. The overall procedure of the improved RRT*-APF planner, including guided sampling, predictive feasibility checks, and rewiring, is summarized in the flowchart shown in Figure 3.

3.3. Nearest Node Selection, Extension and Rewiring [22]

At each iteration, the nearest node in the current tree

T

to the (possibly APF-corrected) sample

x_{rand}

is found by

x_{nearest} = \arg \min_{x_{i} \in T} ∥ x_{i} - x_{rand} ∥ .

(6)

A new node is then generated at fixed step size s along the direction from

x_{nearest}

to

x_{rand}

:

x_{new} = x_{nearest} + s \frac{x_{rand} - x_{nearest}}{∥ x_{rand} - x_{nearest} ∥} .

(7)

Selecting the nearest existing node ensures that the tree grows from the closest reachable state toward the new sample, minimizing unnecessary detours.

To validate the straight-line extension, we discretize the segment between

p_{1} = x_{nearest}

and

p_{2} = x_{new}

into intermediate points for collision checks; the interpolation formula and parameter definitions are given in Appendix A.2.

If the extension succeeds, nearby nodes are examined for cost reduction via rewiring. We compare cumulative path costs and reassign parents through

x_{new}

when doing so decreases the total cost while remaining collision-free; the explicit inequalities are listed in Appendix A.2.

3.4. Dynamic Velocity/Acceleration Constraints and Collision Checking

3.4.1. Geometric Precision Checking

We define an L2 geometric-precision layer

collision_point (p, O)

that returns true if a query point

p = {(x, y, z)}^{T}

lies inside any obstacle in the set

O = O_{scyl} \cup O_{obb}

. During planning, every candidate or predicted waypoint is validated by this layer. This layer acts as a geometric filter, rejecting any candidate or predicted waypoint that lies inside a modeled obstacle and ensuring that both static and predicted motions are physically feasible in the environment model.

The concrete point-in-obstacle tests for super-quadric cylinders and oriented bounding boxes, together with parameter definitions, are provided in Appendix A.3.

Collision-checking scheme.

To guarantee both path safety and kinematic executability, two checks are performed in the RRT*-APF expansion:

1.: Static linear discrete check - Along the straight line between two nodes, sample at equal intervals and apply the L2 geometric test to each point (sampling formula and symbols in Appendix A.4).
2.: Dynamic predictive check - Using the UAV kinematics, propagate the state forward for several look-ahead steps; each predicted waypoint is, again, subjected to the L2 test. The resulting recursion and terminal velocity assignment are summarized in Appendix A.5.

The two-stage check first rules out collisions along a static straight line, then predicts motion under dynamic constraints to catch potential collisions that could occur due to the UAV’s inertia or acceleration limits.

3.4.2. Static Linear-Discrete Check

We partition the candidate straight segment from

x_{nearest}

to

x_{new}

into a fixed number of equally spaced samples and apply the L2 geometric test to each. The exact interpolation expression, index range, and symbol definitions are listed in Appendix A.4. If any sampled point lies inside

O_{scyl} \cup O_{obb}

, the segment is rejected; otherwise, the planner proceeds. By sampling evenly along the candidate segment, this check detects any intersection with obstacles that could be missed by testing only the endpoints.

3.4.3. Dynamic Predictive Collision Checking

Under the UAV’s current velocity and maximum acceleration constraints, the straight-line segment between two nodes is validated by predicting uniformly accelerated motion and verifying each predicted waypoint via the L2 geometric-precision layer. Let

v_{nearest}

be the velocity at the nearest node with magnitude

v_{0} = ∥ v_{nearest} ∥

,

a_{max}

the acceleration limit,

D = ∥ x_{new} - x_{nearest} ∥

the inter-node distance, and

Δ t

the time step. The minimum reachable time and the number of look-ahead steps are

\begin{matrix} T_{\min} & = \frac{2 D}{v_{0} + \sqrt{v_{0}^{2} + 2 a_{max} D}}, \\ N & = ⌈\frac{T_{\min}}{Δ t}⌉ . \end{matrix}

(8)

These expressions compute the shortest possible traversal time and the required number of prediction steps to ensure dynamic feasibility given current speed and acceleration limits.

We propagate along the segment with a clipped acceleration recursion and assign a terminal velocity for continuity; the full recursions and symbols (including the terminal assignment) are given in Appendix A.5. The recursion simulates the UAV’s motion step by step, clipping acceleration to physical limits so that predicted positions remain both collision-free and kinematically achievable.

3.5. Goal Determination, Path Retracing, and Termination Criteria [22]

When a newly generated node

x_{new}

satisfies

∥ x_{new} - x_{goal} ∥ < s

and the edge

x_{new} \to x_{goal}

is collision-free, the goal node is appended to the search tree:

\begin{matrix} Cost (x_{goal}) & = Cost (x_{new}) + ∥x_{goal} - x_{new}∥, \\ v_{goal} & = \frac{x_{goal} - x_{new}}{Δ t}, \\ parent (x_{goal}) & = x_{new} . \end{matrix}

(9)

Once

x_{goal}

has been inserted into the tree

T

, the optimal path is retraced recursively from the goal back to the root:

\begin{matrix} P_{0} & = x_{goal}, \\ P_{k + 1} & = parent (P_{k}), \\ k & = 0, 1, \dots, K - 1 . \end{matrix}

(10)

until

P_{K} = x_{start}

. The resulting trajectory, ordered in time, is

P = {[x_{start}, P_{K - 1}, \dots, P_{1}, x_{goal}]}^{T} .

If the iteration count reaches the maximum

N_{max}

without generating any node satisfying

∥ x_{new} - x_{goal} ∥ < s

with a collision-free connection, planning is deemed to have failed and an empty path

P = ⌀

is returned. This termination criterion, governed by

N_{max}

, balances the global search capability against computational efficiency.

Section summary: A hierarchical MA-AADRC-SMC scheme is designed: an adaptive ESO estimates disturbances for feedforward rejection; PD gains scale with bandwidth; an SMC term provides robust compensation; and an SDF-based repulsive field supplies precision obstacle avoidance. A speed-weighted blending integrates avoidance with tracking for agile yet smooth responses.

4. Flight Control Algorithm

4.1. Hierarchical Hybrid Control Algorithm Architecture Overview

The hierarchical control framework integrates environment perception, path planning, formation maintenance, hybrid tracking, and emergency recovery into a unified workflow. Each module exchanges information in real time to ensure safe and reliable flight in dynamic 3D urban environments. The overall architecture is summarized in Figure 4.

4.2. Adaptive Design of the Extended-State Observer (ESO)

4.2.1. Problem Statement

Consider a general nth-order, single-input single-output (SISO) nonlinear system with partially unknown dynamics:

x^{(n)} = f (x, \dot{x}, \dots, x^{(n - 1)}, t) + b u (t) + w (t) .

(11)

where the measurable output is

x \in R

,

u (t) \in R

is the control input,

b > 0

is the (known or identified) input gain, and the lumped total uncertainty is

d (t) : = f (\cdot) + w (t)

.

4.2.2. Extended-State Variables and Observer Structure [23]

We employ an

(n + 1)

-dimensional ESO whose state augments the plant with the total disturbance. The full continuous-time realization, matrices, and symbols are provided in Appendix A.6.

4.2.3. Bandwidth-Pole Alignment Design

We place the observer poles by matching a bandwidth-selected characteristic polynomial. The polynomial and nominal gain expressions are summarized in the Appendix A.7. This placement yields equally spaced poles along the negative real axis, with the bandwidth

ω_{o}

directly determining the observer’s response speed and noise sensitivity.

4.2.4. Adaptive Bandwidth Mechanism

To accelerate convergence under time-varying disturbances, we schedule the ESO bandwidth with an error-driven law; the online gain update follows automatically from this bandwidth. The scheduling law and the resulting time-varying gains are given in Appendix A.8. When the estimation error grows, the bandwidth temporarily increases to speed up correction; as the error shrinks, the bandwidth decreases to reduce noise amplification.

The complete adaptive active disturbance rejection control (AADRC) scheme for UAV trajectory tracking is illustrated in Figure 5, where the ESO, nonlinear state error feedback (NLSEF), and adaptive law interact to reject disturbances in real time.

4.3. Multi-Agent Cooperative Formation and Adaptive Repulsive Force Scheduling

4.3.1. Cooperative Formation Mechanism

Under the leader–follower framework, let the leader’s trajectory

r_{lead} (t)

be provided by the high-level path planner. The desired trajectory for follower i is defined as

r_{d, i} (t) = r_{lead} (t) + Δ r_{i}, i = 2, \dots, N

(12)

where

Δ r_{i}

is the fixed formation offset. Define the unit vector in the following direction as

{\hat{d}}_{i} = \frac{r_{d, i} - x_{i}}{∥ r_{d, i} - x_{i} ∥ + δ},

(13)

with

δ

a small positive constant to avoid division by zero. The nominal speed is then

v_{d, i} = v_{norm} {\hat{d}}_{i} .

(14)

The velocity

v_{d, i}

captures only the ideal geometric formation requirement. When inter-agent distances approach the safety threshold

d_{\min}

, relying solely on

v_{d, i}

may lead to formation collapse or collision. To prevent this, a repulsive force–based velocity compensation mechanism is introduced.

First, the local repulsive force at agent i is constructed as

F_{rep, i} = K_{c} \sum_{j \neq i} \frac{x_{i} - x_{j}}{∥ x_{i} - x_{j} ∥^{2}} Θ (d_{0} - ∥ x_{i} - x_{j} ∥),

(15)

where

Θ (\cdot)

is the Heaviside function,

d_{0} > d_{\min}

is the repulsion activation radius, and

K_{c}

is updated online via the gradient–projection law in Equation (20).

Next, map

F_{rep, i}

linearly to a velocity correction:

Δ v_{i} = k_{obs, i} (∥ v_{i} ∥) F_{rep, i},

(16)

where

k_{obs, i} (\cdot)

is a speed-weighting function (see Section 4.4.4), ensuring more agile repulsive response at higher speeds. The actual velocity command issued by this layer is therefore

v_{i}^{cmd} = v_{d, i} + Δ v_{i} .

(17)

When the average distance

d_{i j}^{avg} < d_{\min}

, Equations (15) and (16), amplified by

K_{c}

, generate sufficient repulsion to restore safe separation; once distances recover,

Δ v_{i}

naturally decays, avoiding unnecessary disturbance to the formation.

4.3.2. Gain Projection Law Based on Group Average Inter-Agent Distance

After defining the ideal formation positions

r_{d, i}

and nominal velocities

v_{d, i}

via Equations (12)–(14), we next evaluate and dynamically feedback the safety separation between the actual formation and this ideal reference. First, the pairwise inter-agent distances are computed as

d_{i j} = ∥ x_{i} - x_{j} ∥,

(18)

and the group’s average inter-agent distance is defined by

d_{i j}^{avg} = \frac{2}{N (N - 1)} \sum_{1 \leq i < j \leq N} d_{i j} .

(19)

When

d_{i j}^{avg}

falls below the preset safety threshold

d_{\min}

, the formation is at risk of “collapse” and the repulsive force must be temporarily amplified; once the distances recover into the safe region, the repulsion should remain moderate. To achieve this while preventing excessive gain growth that could destabilize the formation, we introduce the following gradient-projection update law:

{\dot{K}}_{c} = γ σ (d_{\min} - d_{i j}^{avg}), K_{c} \in [K_{c, \min}, K_{c, max}],

(20)

where

γ > 0

is the learning rate and

σ (\cdot)

is the continuous, piecewise-linear mapping

σ (x) = \{\begin{matrix} x, & x > 0, \\ 0, & x \leq 0 . \end{matrix}

Thus, when the average inter-agent distance is below

d_{\min}

,

{\dot{K}}_{c} > 0

increases the repulsion strength; once the distance returns to the safe zone,

{\dot{K}}_{c} = 0

ensures stable gains. Applying a projection operator constrains

K_{c}

strictly within

[K_{c, \min}, K_{c, max}]

, preventing over-amplification.

4.4. Main Control Law and Sliding-Mode Compensation

4.4.1. Adaptive Active Disturbance Rejection Control (AADRC) Law Design [23,24]

Tracking Differentiator (TD)

The high-level path planner’s output

r_{d, i} (t)

may contain step or high-frequency components; to avoid noise amplification by direct differentiation, we employ a second-order TD to generate smooth references. The TD equations and symbol definitions are provided in Appendix A.9. This preprocessing smooths the raw reference signal, producing differentiable position and velocity commands suitable for feedback control.

Consider a multi-agent system of N UAVs with the ESO estimate for the ith agent

z_{i} = {[z_{1, i}, z_{2, i}, z_{3, i}]}^{⊤} \approx {[x_{i}, {\dot{x}}_{i}, d_{i}]}^{⊤},

where

d_{i} (t)

denotes the total disturbance. Given the raw reference

r_{d, i} (t)

and its TD outputs

(r_{1, i}, r_{2, i})

, define the tracking errors

e_{i} = z_{1, i} - r_{1, i}, {\dot{e}}_{i} = z_{2, i} - r_{2, i} .

The tracking errors represent the deviation in position and velocity between the UAV’s estimated state and the smoothed reference.

To balance rapid response with robustness, we introduce an adaptive control bandwidth

ω_{c, i} (t)

and update the PD gains online as

K_{p, i} (t) = κ_{p} ω_{c, i}^{2}, K_{d, i} (t) = 2 κ_{d} ω_{c, i},

where

κ_{p}, κ_{d} > 0

are tuning coefficients. Adjusting the gains through the control bandwidth allows the controller to speed up or slow down its response in real time.

The resulting MA-AADRC control law is

u_{AADRC, i} = - K_{p, i} e_{i} - K_{d, i} {\dot{e}}_{i} - z_{3, i} .

The PD terms drive the tracking error to zero, while the disturbance estimate

z_{3, i}

cancels the effect of unknown dynamics.

4.4.2. Sliding-Mode Robust Compensator (SMC)

To further suppress intense disturbances and residual model uncertainties, the sliding surface is designed as

s_{i} = {\dot{e}}_{i} + λ e_{i}, λ > 0 .

(21)

A saturation-based strategy is employed to mitigate chattering, yielding the sliding-mode compensation term

\begin{matrix} u_{SMC, i} & = - K_{s} sat (\frac{s_{i}}{ε}), \\ sat (x) & = \{\begin{matrix} x, & | x | < 1, \\ sgn (x), & | x | \geq 1 . \end{matrix} \end{matrix}

(22)

where

K_{s} > 0

is the robust gain and

ε > 0

is the boundary-layer width. The condition

K_{s} > {∥ Δ u_{i} ∥}_{\infty}

ensures finite-time convergence of the sliding variable to the sliding surface [25]. In practice,

K_{s}

may be chosen proportionally to the UAV’s flight speed to reduce energy consumption.

4.4.3. Gradient Potential Field-Based High-Precision Obstacle Avoidance Criterion

In the flight control loop, to achieve high-precision avoidance of arbitrarily shaped obstacles, we introduce a differentiable potential field based on the signed distance function (SDF) and apply piecewise smoothing. Let the signed distance to the ith obstacle be

{SDF}_{i} (p) = \min_{y \in O_{i}} ∥ p - y ∥ (\geq 0 outside O_{i}),

(23)

with the corresponding closest point

p_{i}^{*}

and

d_{i} = {SDF}_{i} (p)

. Define the potential energy function

U (p) = \sum_{i = 1}^{N_{O}} \frac{k_{rep}}{2} {[max (0, \frac{1}{d_{i}} - \frac{1}{d_{0}})]}^{2},

(24)

where

k_{rep} > 0

is the repulsion scale factor and

d_{0}

is the activation radius (the potential is nonzero only if

d_{i} < d_{0}

). The resulting repulsive force field is

\begin{matrix} F_{rep} (p) & = - \nabla U (p) \\ = \sum_{i = 1}^{N_{O}} k_{rep} (\frac{1}{d_{i}} - \frac{1}{d_{0}}) \frac{1}{d_{i}^{2}} (\frac{p - p_{i}^{*}}{d_{i}}) Θ (d_{0} - d_{i}) . \end{matrix}

(25)

4.4.4. Integrated Control Law

After computing the ADRC control input

u_{ADRC, i}

and the SMC compensation

u_{SMC, i}

, the final control input is obtained by velocity-weighting the obstacle repulsive force according to each UAV’s airspeed:

u_{i} = u_{ADRC, i} + u_{SMC, i} + k_{obs, i} (∥ v_{i} ∥) F_{rep, i},

(26)

where

F_{rep, i} = - \nabla U (p_{i})

is the composite gradient-based repulsive force from Equation (18). The speed-adaptive weighting function

k_{obs, i} (∥ v_{i} ∥) = k_{0} (1 + \frac{∥ v_{i} ∥}{v_{ref}})

(27)

increases linearly with the UAV’s current airspeed

∥ v_{i} ∥

, ensuring more agile obstacle-avoidance response at higher speeds.

4.5. Real-Time Multi-Level Collision Detection, Evasive Maneuvers, and Velocity Compensation

4.5.1. Multi-Level Obstacle Collision Detection

To satisfy both real-time performance and safety in high-density obstacle environments, we adopt a four-layer progressive collision detection pipeline:

Spatial hashing (coarse filter) [26]: place obstacle centroids into a 3D hash grid and only keep the cell of the UAV and its 26 neighbors. Formulas and symbols in Appendix A.10.1. This coarse filter quickly reduces the candidate obstacle set from N to a small constant, ensuring constant-time queries.
AABB tree culling [27]: traverse an AABB hierarchy and prune nodes by a conservative bounding-sphere test. Inequality and symbols in Appendix A.10.2. This step prunes large portions of space by discarding nodes in such a way that they are too far from the UAV for a collision to occur, leaving only spatially relevant obstacles.
GJK distance check [28]: estimate the minimum distance between the UAV bounding sphere and each convex obstacle that survived culling. Definition and symbols in Appendix A.10.3. This algorithm precisely computes the closest approach between convex shapes, triggering fine-scale checks when the distance is below a safety margin.
Continuous-time SAT [29]: within a control period, combine the predicted trajectory with an obstacle SDF to solve the first contact time; flag a potential collision if contact occurs before the period ends. Equations and symbols in Appendix A.10.4. This final layer ensures temporal completeness, detecting fast-moving collisions that discrete sampling could miss.

This hierarchical design progressively narrows the candidate set, keeping per-cycle complexity near constant while maintaining strict collision safety.

4.5.2. Collision Avoidance Maneuver

For the nearest obstacle

O^{★}

, compute the gradient of the signed distance function at the first contact point:

n_{★} = \nabla {SDF}_{O^{★}} (p (t_{hit})), {∥ n_{★} ∥}_{2} = 1,

(28)

and construct the equivalent repulsive force:

F_{rep} = - κ_{e} \frac{r_{UAV} + d_{safe} - d_{\min}}{d_{\min}^{2}} n_{★},

(29)

where

κ_{e} > 0

is the energy-type potential gain,

r_{UAV}

the UAV bounding-sphere radius,

d_{safe}

the safety distance, and

d_{\min}

the minimum distance. This repulsive force is injected as an additional input into the MA-AADRC+SMC framework, achieving a soft-constraint avoidance.

To ensure dynamic continuity and rapid exit from the danger zone, the velocity is real-time scaled and biased:

\begin{matrix} v_{k + 1} & = β_{s} v_{k} + α_{s} \frac{F_{rep}}{∥ F_{rep} ∥_{2}} v_{max}, \\ β_{s} & \in (0, 1), α_{s} = 1 - β_{s} . \end{matrix}

(30)

When

d_{\min} \geq d_{safe} + δ_{hyst}, {∥ F_{rep} ∥}_{2} < ε_{rep},

with hysteresis threshold

δ_{hyst} > 0

and repulsive threshold

ε_{rep} > 0

both satisfied, the system exits emergency mode and starts PID velocity tracking to avoid frequent switching. This maneuver integrates seamlessly with the multi-level detection framework, ensuring UAV flight safety and control convergence in dense obstacle environments.

4.5.3. Speed Compensation

Let

v \in R^{3}

be the UAV’s current linear velocity, and define the unit direction toward the desired position as

\hat{d} = \frac{r_{d} - p}{∥ r_{d} - p ∥},

(31)

where

v_{ref}

is the nominal cruise speed. The desired velocity vector is

v^{*} = v_{ref} \hat{d} .

(32)

Define the velocity error and its derivative by

e_{v} = v^{*} - v, {\dot{e}}_{v} = \frac{d}{d t} e_{v} .

(33)

The three-dimensional PID acceleration compensation law is

a_{PID} = K_{p} e_{v} + K_{i} \int_{0}^{t} e_{v} (τ) d τ + K_{d} {\dot{e}}_{v},

(34)

with gains tuned in cascade as

K_{p} = 2 ζ ω_{v}

,

K_{d} = ω_{v}

,

K_{i} = \frac{1}{2} ω_{v} K_{p}

, where

ω_{v}

is the velocity-loop bandwidth and

ζ

the damping ratio. Classical PID controllers have been widely used in quadrotor research due to their simplicity and effectiveness [30]. To prevent integral windup, a conditional integration strategy is employed:

\begin{matrix} \dot{I} (t) & = \{\begin{matrix} e_{v}, & ∥ a_{PID} ∥ < a_{max}, \\ 0, & otherwise, \end{matrix} \\ a_{PID} & = K_{p} e_{v} + K_{i} I + K_{d} {\dot{e}}_{v} . \end{matrix}

(35)

Finally,

a_{PID}

is added to the control law (46) and the result is norm-saturated

∥ a ∥ \leq a_{max}

to respect thrust limits. This vector PID scheme ensures exponential convergence to the desired cruise speed after obstacle avoidance or gust disturbances and, through anti-saturation logic, prevents integral windup, guaranteeing smooth closed-loop transitions.

Section summary: Under identical initial conditions and gust disturbances, the proposed method achieves faster convergence, lower steady-state tracking error, tighter formation keeping, and smaller spacing overshoot than ADRC and PID. The results confirm robustness and real-time feasibility in cluttered 3-D environments.

5. Simulation Experiments

5.1. Experimental Environment and Parameter Settings

All simulations were conducted in MATLAB R2023b on an Intel i5-9300H CPU. Five UAVs performed trajectory tracking and obstacle avoidance inside a cubic workspace of

{[0, 100]}^{3} m^{3}

that contained 17 randomly placed obstacles satisfying a minimum inter-obstacle spacing of 5 m. Every controller was evaluated under identical initial states and an equivalent step-gust wind disturbance.

5.2. Rationale and Micro-Sensitivity

Rationale. The simulation and controller parameters were determined through a hierarchical procedure combining UAV kinematic constraints, control-theoretic considerations, and lightweight empirical tuning. This ensures that each parameter has a clear physical or theoretical basis, and the final set is both feasible and representative of urban UAV operations. Specifically:

(i) Workspace size (

{[0, 100]}^{3}

m³), maximum speed (

v_{max} = 12

m/s), and acceleration limit (

a_{max} = 6

m/s²) were chosen to yield

O (10)

-s missions with multiple obstacle interactions, while remaining within realistic thrust and maneuverability bounds.

(ii) Obstacles combine super-quadric cylinders (poles/trees) and OBBs (containers/low facades), with dimensions drawn from ranges (Table A2) and Poisson-disk spacing to avoid overlaps. This results in a medium clutter density (17 obstacles) that is neither trivial nor infeasible.

(iii) The composite wind field (multi-harmonic + step gust + altitude scaling; Section 2.1.1) captures steady peaks, sudden gusts, and vertical gradients, reflecting the turbulence patterns encountered in low-altitude urban UAV envelopes.

(iv) Planner hyperparameters (s, goal-bias p, and static check samples

n_{checks}

) were tuned to balance path optimality and runtime efficiency. Continuous-time feasibility checks further enforce kinematic validity (Section 3.4).

(v) Controller and observer parameters (Table A6 and Table A7) were selected so that the closed-loop response time is shorter than the trajectory-curvature time scale, thus limiting overshoot while preserving disturbance rejection. Fine-tuning of gains (e.g.,

K_{p}

,

β_{i}

) was performed through lightweight simulation sweeps with three design criteria: (a) stability margin not violated, (b) steady-state error minimized, and (c) robustness maintained under varying wind and obstacle conditions.

Micro-sensitivity (design & criteria). To ensure conclusions do not hinge on a single random draw with minimal compute, we repeat simulations with three independent seeds

{11, 29, 47}

(affecting obstacle placement/sizes and wind phases) and perform two lightweight sweeps: (i) RRT* step size

s \in {1, 3}

(baseline is

s = 2

); (ii) Obstacle count

N_{obs} \in {10, 25}

(baseline is 17). For each setting and controller (Hybrid/ADRC/PID/SMC), we report: (1) The success rate (goal reached within 15 s; same criterion as the main study), (2) The steady-state tracking error averaged over

t \in [6, 15]

s (averaged across UAVs), (3) The minimum inter-UAV separation over the whole horizon (safety threshold 5 m). The summary appears in Table 1.

5.3. Evolution Trend Analysis of Path-Tracking Error

As shown in Figure 6, the average path tracking errors vary noticeably among different control algorithms. The hybrid control algorithm maintains relatively small errors, mostly within 2–3 m, with a transient peak of about 6 m around 4.5 s, followed by rapid stabilization. The sliding-mode control (SMC) initially rises to approximately 9 m, but decreases sharply after 2.5 s and remains below 3 m in the subsequent stage. By contrast, the PID controller reaches nearly 12 m between 5 and 7 s, and although the error reduces afterwards, a residual of 2.5–3 m persists at 15 s, indicating slower convergence. The ADRC, however, shows larger oscillations ranging between 7 and 12 m for most of the time, with only a limited reduction at the end. Overall, the hybrid approach and SMC exhibit relatively better steady-state accuracy, while PID and ADRC demonstrate a less favorable dynamic performance.

As shown in Figure 7, four error metrics of different control algorithms in path tracking are compared: average error, standard deviation of error, maximum error, and stable average error. Among all methods, the hybrid control algorithm achieves the smallest errors, with an average error of approximately 3 m, a stable average error around 2.5 m, and low variability, indicating both good convergence and stability. The SMC shows comparable steady-state performance, with slightly higher average errors (3.5 m) and a stable error near 2.6 m, though its maximum error reaches about 11.5 m. In contrast, PID and ADRC exhibit substantially larger errors, with average errors of 7.8 m and 8.3 m, and stable errors around 5.5 m and 8.1 m, respectively, while their maximum errors approach or exceed 15 m, reflecting larger fluctuations. Overall, the hybrid algorithm demonstrates the most favorable balance between steady-state accuracy and error stability, outperforming the other three methods, while SMC shows secondary advantages, and PID and ADRC are relatively less effective.

As shown in Figure 8, the tracking errors of five UAVs under different control algorithms are compared. It is evident that the hybrid control algorithm consistently delivers the best stability and accuracy across all UAVs, with errors mostly confined to 2–4 m and a rapid convergence after initial disturbances. Although the sliding-mode control (SMC) also achieves relatively fast convergence, its initial peaks typically range from 8 to 12 m, and its steady-state errors are slightly higher than those of the hybrid method. The PID controller exhibits larger transient errors on some UAVs (e.g., peaks exceeding 15 m for UAV2 and UAV3) and converges more slowly. ADRC shows persistent oscillations in most cases, with errors generally fluctuating between 7–12 m, indicating limited convergence. Overall, the hybrid approach maintains the lowest steady-state errors and stronger robustness across multiple platforms, demonstrating superior performance compared with the other three methods.

5.4. Average Formation Spacing Comparison

In UAV formation control tasks, maintaining a stable formation structure is critical for group cooperative flight, obstacle avoidance capability, and error recovery performance. Different control strategies exhibit significant differences in formation stability, dynamic obstacle avoidance, and error convergence. Therefore, it is necessary to conduct systematic experimental analyses to evaluate the applicability and performance advantages of each control method.

As shown in Figure 9, the evolution of average UAV spacing under different control algorithms is compared with the minimum desired spacing. The hybrid control algorithm consistently maintains the desired spacing, with a peak of about 9.5 m, followed by rapid convergence after 5 s and stable tracking near the desired value (5 m). The SMC shows a similar trend, with an initial peak around 11 m, but it quickly converges and stabilizes near 5 m. In contrast, the PID controller produces much larger fluctuations, with peaks exceeding 18 m, and remains above the desired spacing even after 10 s, indicating slower convergence. The ADRC also exhibits large initial oscillations (peak above 14 m), though it gradually converges to the desired spacing around 6 s. Overall, the hybrid algorithm and SMC achieve better performance in maintaining desired spacing and stability, with the hybrid approach exhibiting the smallest overshoot and the most consistent convergence.

In Figure 10, a statistical comparison of UAV spacing under different control algorithms is presented, including the mean value, minimum value, maximum value, and standard deviation, with the minimum desired spacing indicated as a reference. Among all methods, the hybrid control algorithm achieves the most favorable balance, with a mean spacing of about 6.7 m, a minimum spacing close to the desired threshold (5 m), and the smallest variability, as indicated by its low standard deviation. The SMC shows similar performance with a mean spacing of approximately 7.5 m, also maintaining the minimum spacing requirement, though its maximum spacing (11.5 m) is slightly larger. In contrast, PID and ADRC produce significantly larger deviations: PID yields the largest mean spacing (14.5 m) and maximum spacing (19 m), while ADRC results in a mean spacing of 8.8 m and a maximum of 15 m, both indicating larger fluctuations. Overall, the hybrid algorithm demonstrates the best ability to maintain safe and stable inter-UAV spacing, followed by SMC, whereas PID and ADRC are less effective.

5.5. Formation Maintenance Analysis

In multi-UAV cooperative flight missions, formation stability is a key metric for assessing formation-control performance. Different control strategies vary in their ability to match UAV speeds, regulate formation error, and recover after obstacle avoidance, directly impacting the overall stability and execution quality of the formation. In this section, we analyze the formation-keeping error and its temporal evolution under three control methods—Hybrid Control, ADRC, and PID—to elucidate the performance characteristics and applicability of each strategy.

As shown in Figure 11, the formation-keeping errors under different control algorithms are compared over time. The hybrid control algorithm maintains relatively small errors throughout the process, with an initial fluctuation peaking at about 8.5 m, but rapidly decreasing after 5 s and stabilizing within the range of 2–3 m, demonstrating good convergence and stability. The SMC exhibits a similar trend, with an initial peak of approximately 11 m, but it also decreases quickly after 5 s and remains at a low error level. In contrast, the PID controller shows larger errors between 3 and 7 s, with a peak close to 16 m, and converges more slowly, with errors still around 5 m after 10 s. The ADRC produces even larger oscillations, reaching a maximum error of nearly 21 m, and maintains relatively high errors of 6–8 m beyond 10 s. Overall, the hybrid algorithm achieves the best formation-keeping accuracy and convergence performance, followed by SMC, while PID and ADRC are less effective.

As shown in Figure 12, a statistical comparison of formation-keeping errors under different control algorithms is presented, including average error, maximum error, and standard deviation. The hybrid control algorithm achieves the best overall performance, with the smallest average error (3 m), a low maximum error (9 m), and the lowest variability, indicating stable and accurate formation-keeping. The SMC also performs relatively well, with an average error of about 3.5 m and limited variability, though its maximum error reaches approximately 12 m. In contrast, PID and ADRC show significantly larger errors: both methods yield average errors close to 10 m, while their maximum errors reach 16 m and 21 m, respectively, accompanied by higher standard deviations. Overall, the hybrid approach demonstrates the most favorable formation-keeping accuracy and robustness, followed by SMC, while PID and ADRC are comparatively less effective.

As summarized in Table 2, the proposed hybrid control consistently achieves the smallest average and stable errors, the lowest maximum deviations, and the minimum variability across all scenarios. These metrics jointly confirm that the hybrid controller not only ensures high tracking accuracy but also maintains strong robustness against disturbances and fluctuations. In contrast, SMC shows secondary advantages with competitive steady-state performance but larger peak errors, while PID and ADRC exhibit substantially higher errors and variability, reflecting weaker dynamic adaptability. This quantitative evidence further validates the superior effectiveness and stability of the proposed approach compared with conventional methods.

5.6. Comparison of Multiple Path-Planning Schemes

Figure 13 illustrates the planned trajectories for a UAV flying from the start point

(0, 0, 30)

to the goal

(100, 100, 30)

. The layout comprises four views: the top left is the

X - Y

plane (plan view), the top right is a three-dimensional perspective, the bottom-left is the

X - Z

plane (side view), and the bottom right is the

Y - Z

plane (front view). The UAV successfully avoids multiple obstacles in a complex environment, following a smooth path that preserves formation. In the plan view, different paths exhibit varying avoidance strategies, with some showing superior smoothness and clearance. The side and front views confirm consistent altitude control, indicating that the planner maintains the UAVs within an appropriate flight-height band. The 3D perspective further demonstrates how obstacle placement shapes the trajectory, validating the algorithm’s effectiveness in three-dimensional spaces. In Figure 13, Figure 14 and Figure 15, colored solid lines denote the trajectories of five UAVs (UAV1-UAV5), while the dashed line indicates the reference path generated by the path-planning algorithm.

Additionally, two other cases are presented. Figure 14 shows a descent path from

(0, 0, 100)

to

(100, 100, 0)

, where the UAV descends smoothly while avoiding obstacles. Figure 15 depicts a climb from ground level

(0, 0, 0)

to

(100, 100, 100)

, showing gradual, stable ascent with effective obstacle avoidance. Together, these multi-view results confirm the planner’s adaptability and reliability for UAV path control in complex scenarios.

5.7. Lyapunov Convergence Proof and Detailed Analysis of Additional Experimental Results

This section provides a rigorous Lyapunov-based convergence proof for the Hybrid controller under wind-disturbance conditions and offers an in-depth analysis of the remaining simulation outcomes. To illustrate the control performance intuitively, we present the following plots: (i) position-tracking trajectories, (ii) position error and sliding-surface convergence and (iii) the Lyapunov function and its time derivative.

As shown in Figure 16, the time histories of the actual outputs of UAV1-UAV5 (solid lines) and the leader reference (dashed line) are compared along the (a) X, (b) Y, and (c) Z axes. In both horizontal axes, all agents closely track the reference with negligible steady-state error (below 1 m). Along the vertical axis, a transient overshoot of approximately 10–15 m occurs between 2 s and 5 s, after which the tracking errors converge to within

\pm 2

m. These quantitative results confirm that the proposed hybrid controller achieves precise and robust 3D trajectory tracking in a multi-agent formation.

5.7.1. Error and Sliding-Surface Convergence

As shown in Figure 17, at

t \approx 1

s a deliberate external disturbance was applied, causing the position error norm

∥ e ∥

to spike to approximately

5.0

m. The error then decays rapidly to below

0.3

m within the next second and remains bounded between

0.2

and

0.5

m for the remainder of the simulation. Similarly, the sliding surface norm

∥ s ∥

peaks at about 16 at the moment of disturbance and converges to under

1.0

, maintaining values below

0.5

thereafter. These results confirm the finite-time convergence and robust sliding-mode regulation of the proposed hybrid controller in the presence of abrupt disturbances.

5.7.2. Lyapunov Function and Its Time Derivative

As shown in Figure 18, to assess the energy-dissipation capability of the proposed Hybrid Algorithms under extreme initial conditions, a step disturbance was injected at

t = 1 s

. This excitation caused the quadratic Lyapunov function

V (t) = \frac{1}{2} ({∥ e ∥}^{2} + {∥ \dot{e} ∥}^{2})

to increase abruptly, reaching a peak of

10^{1}

at

t = 1.02 s

. Thereafter,

V (t)

decays quasi-exponentially, converging to the

10^{- 2}

level within

5 s

. In addition, a minor secondary fluctuation around

t \approx 9 s

reflects transient energy buffering at a trajectory corner. Correspondingly,

\dot{V} (t)

remains nonpositive throughout, with a sharp negative spike at the moment of excitation, indicating rapid dissipation of the injected energy. These results confirm that the proposed Hybrid Algorithms maintain closed-loop asymptotic stability and high robustness in the face of sudden large disturbances.

These findings validate that the combined energy analysis based on the tracking error e and its derivative

\dot{e}

is effectively captured by the Lyapunov function, providing a solid theoretical foundation for controller parameter tuning and guaranteeing reliable convergence of the multi-UAV formation under wind disturbances. In summary, the comprehensive evaluation of position-tracking performance, sliding-surface convergence, and Lyapunov-function evolution confirms that the proposed Hybrid Control Algorithms achieve rapid transient convergence, minimal steady-state error, and strong robustness against external disturbances.

Section summary: Under identical initial conditions and gust disturbances, the proposed method achieves faster convergence, lower steady-state tracking error, tighter formation keeping, and smaller spacing overshoot than ADRC, SMC, and PID. The results confirm robustness and real-time feasibility in cluttered 3-D environments.

6. Results and Outlook

We address cooperative multi-UAV flight in windy urban 3-D spaces with dense obstacles and present an end-to-end framework from global planning to high-precision formation control. On the planning side, an artificial potential field-guided variant of RRT* embeds velocity and acceleration constraints, together with continuous collision prediction, directly into the sampling, extension, and rewiring stages; a four-layer collision-detection pipeline further reduces the worst-case per-cycle complexity to the expected order of

O (1)

. On the control side, a two-loop MA-AADRC–SMC architecture with an error-driven-bandwidth ESO is devised: the outer, slower loop produces nominal commands, while the inner, faster loop allocates attitude and thrust in real time; a sliding-mode compensator, combined with a speed-weighted vector PID term, endows the system with strong robustness in obstacle avoidance and formation keeping. The two layers are deeply coupled via continuously feasible trajectories and real-time replanning, yielding a closed-loop optimisation scheme. Numerical experiments demonstrate marked improvements in path-tracking accuracy and formation stability under composite wind disturbances and random obstacle layouts, while the Lyapunov function remains monotonically decreasing, corroborating asymptotic closed-loop stability. Compared with classical ADRC and PID baselines, the proposed framework achieves rapid convergence, low steady-state error and safe obstacle avoidance without additional tuning, indicating high potential for engineering deployment.

Beyond simulation, the disturbance, obstacle, and kinematic models we adopt are parameterized to match typical civil multirotors (e.g., DJI Matrice-class) operating in urban airspace. The sensing and control assumptions—GNSS/RTK or visual-inertial odometry for localization, 3D LiDAR or depth cameras for mapping, and PX4/ROS2 middleware for control and communication—are standard in commercial UAV platforms. The proposed architecture therefore aligns with real use cases such as urban infrastructure inspection, post-disaster cooperative mapping, and multi-UAV logistics, suggesting that only minimal adaptation is required for deployment in practice.

It should be noted that the current validation relies entirely on large-scale simulations, which, while comprehensive, cannot fully capture real-world hardware nonlinearities, sensor noise, or communication delays. This limitation underscores the necessity of experimental verification as a crucial next step. In future research, we plan to further investigate the scalability of the proposed framework in large-scale UAV swarms, including scenarios with hundreds of agents, limited communication bandwidth, and GPS-denied environments. In addition, decentralized control strategies and energy-efficient optimization will be considered to enhance robustness and practicality. Hardware-in-the-loop experiments will also be conducted to validate the feasibility of the proposed methods under real-world conditions.

Author Contributions

Conceptualization and algorithm development: Y.Y.; Complex experimental design: S.L.; Writing—original draft and review and editing: R.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China (No. 52305170), Science & Technology Department of Sichuan Province Popularization of Science Program (25KJPX0202), and Artificial Intelligence Key Laboratory of Sichuan Province (2024RYY006).

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Additional Derivations and Implementation Details

Appendix A.1. APF Sampling Details and Distance Definitions

d_{eff} = \sqrt{max {(d_{xy} - R, 0)}^{2} + max {(z - z_{\min}, 0)}^{2}},

(A1)

where:

$d_{xy}$ : horizontal (planar) Euclidean distance from UAV position $x$ to obstacle center $(c_{x}, c_{y})$ ;
R: obstacle’s effective radius in the horizontal plane;
z: UAV altitude;
$z_{\min}$ : obstacle base altitude.

F_{rep} = η (\frac{1}{d_{eff}} - \frac{1}{d_{0}}) \frac{x - c_{obs}}{d_{eff}^{2} ∥ x - c_{obs} ∥}, c_{obs} \approx (c_{x}, c_{y}, (z_{\min} + z_{max}) / 2),

(A2)

where:

$η$ : scaling coefficient for repulsive potential strength;
$d_{0}$ : influence distance threshold for the obstacle’s repulsive field;
$c_{obs}$ : obstacle center in 3-D (horizontal center plus vertical midpoint between $z_{\min}$ and $z_{max}$ );
$z_{max}$ : obstacle top altitude.

Appendix A.2. Interpolation and Rewiring Criteria

Interpolation for collision checking.

We sample intermediate points along the trial segment

p_{1} = x_{nearest} \to p_{2} = x_{new}

as

p (τ) = p_{1} + τ (p_{2} - p_{1}), τ = \frac{i}{n_{checks}}, i = 0, 1, \dots, n_{checks} .

(A3)

where:

$p_{1}, p_{2}$ : endpoints of the candidate straight-line segment ( $x_{nearest}$ to $x_{new}$ );
$n_{checks}$ : number of equally spaced samples along the segment;
i: sample index; $τ = i / n_{checks}$ is the normalized arc parameter.

Rewiring inequalities.

For a neighbor

x_{near}

in the local rewiring neighborhood, we consider the standard cost-reduction tests:

Cost (x_{j}) + ∥ x_{j} - x_{new} ∥ < Cost (x_{new}),

(A4)

and

Cost (x_{new}) + ∥ x_{i} - x_{new} ∥ < Cost (x_{i}) .

(A5)

where:

$Cost (x)$ : cumulative path length from the root to node $x$ in the current tree;
$∥ a - b ∥$ : Euclidean length of segment $(a, b)$ (can be replaced by a time/energy metric if desired);
$x_{j}, x_{i}, x_{near}$ : generic neighbor nodes within the rewiring radius of $x_{new}$ or $x_{nearest}$ .

Appendix A.3. Point-in-Obstacle Tests and Symbols

Super-quadric cylinder membership.

A point

p = {(x, y, z)}^{⊤}

lies inside a super-quadric cylinder if

{({|\frac{x - c_{x}}{r}|}^{\frac{2}{ϵ}} + {|\frac{y - c_{y}}{r}|}^{\frac{2}{ϵ}})}^{ϵ / 2} \leq 1, 0 \leq z \leq h .

(A6)

Symbols:

(c_{x}, c_{y})

is the base center of the cylinder; r is the equivalent radius; h is the height; and

ϵ

is the shape index (

ϵ = 1

reduces to a right cylinder).

Oriented bounding-box (OBB) membership.

Let

\tilde{p} = R^{T} (p - c), R = R (γ),

where

γ

is the yaw rotation of the box and

c = {(c_{x}, c_{y}, 0)}^{⊤}

is the box center (horizontal). Then

p

lies inside the OBB if

| \tilde{x} | \leq \frac{W}{2}, | \tilde{y} | \leq \frac{L}{2}, 0 \leq \tilde{z} \leq h .

(A7)

Symbols:

W, L

are the side lengths in the OBB’s local frame; h is the vertical extent; and

R (γ)

is the planar rotation about z.

Sets.

O_{scyl}

denotes all super-quadric cylinders;

O_{obb}

denotes all OBB obstacles;

O = O_{scyl} \cup O_{obb}

.

Appendix A.4. Static Linear-Discrete Segment Sampling

We sample

n_{checks} + 1

equally spaced points along the segment from

x_{nearest}

to

x_{new}

:

\begin{matrix} p_{i} & = x_{nearest} + \frac{i}{n_{checks}} (x_{new} - x_{nearest}), \\ i & = 0, 1, \dots, n_{checks} . \end{matrix}

(A8)

where:

$p_{1}, p_{2}$ : endpoints of the candidate straight-line segment ( $x_{nearest}$ to $x_{new}$ );
$n_{checks}$ : number of equally spaced samples along the segment;
i: sample index; $τ = i / n_{checks}$ is the normalized arc parameter.

Each

p_{i}

is validated by the L2 geometric predicate

collision_point (p_{i}, O_{scyl} \cup O_{obb})

; if any sample is classified as interior, the segment is rejected.

Appendix A.5. Dynamic Predictive Collision Checking

We first define the unit direction and initial conditions:

\hat{d} = \frac{x_{new} - x_{nearest}}{D}, p_{0} = x_{nearest}, v_{0} = ∥ v_{nearest} ∥ .

For

k = 1, \dots, N

, we recursively predict

\begin{matrix} a_{k} & = clip ((D / T_{\min} - v_{k - 1}) / Δ t, - a_{max}, a_{max}), \\ v_{k} & = v_{k - 1} + a_{k} Δ t, \\ p_{k} & = p_{k - 1} + \hat{d} v_{k} Δ t . \end{matrix}

(A9)

The terminal velocity used for continuity in subsequent steps is

v_{new} = \frac{x_{new} - x_{nearest}}{Δ t} .

(A10)

where:

k: prediction step index ( $1 \to N$ );
N: number of look-ahead steps; $T_{\min}$ : minimum traversal time (defined in the main text);
$D = ∥ x_{new} - x_{nearest} ∥$ : straight-line distance between nodes;
$Δ t$ : simulation/control time step;
$a_{max}$ : maximum admissible acceleration magnitude;
$\hat{d}$ : unit direction from $x_{nearest}$ to $x_{new}$ ;
$v_{k - 1}, v_{k}$ : predicted scalar speeds at steps $k - 1$ and k;
$p_{0}, p_{k}$ : predicted positions at steps 0 and k;
$clip (x, - a_{max}, a_{max})$ : saturation of x to the interval $[- a_{max}, a_{max}]$ ;
$x_{nearest}, x_{new}$ : the current nearest node and the new candidate node;
$v_{new}$ : terminal velocity assigned at $x_{new}$ for continuity.

Appendix A.6. ESO Structure and Symbols

Define the extended state and ESO as:

z (t) = {[z_{1}, z_{2}, \dots, z_{n}, z_{n + 1}]}^{⊤} \approx {[x, \dot{x}, \dots, x^{(n - 1)}, d (t)]}^{⊤},

\dot{z} (t) = A_{o} z (t) + B_{o} u (t) - L (t) (C z (t) - x), y (t) = C z (t),

A_{o} = [\begin{matrix} 0_{n \times n} & I_{n \times n} \\ 0_{1 \times n} & 0 \end{matrix}], B_{o} = [\begin{matrix} 0_{n \times 1} \\ b \end{matrix}], C = [\begin{matrix} 1 & 0_{1 \times n} \end{matrix}], L (t) = {[β_{1} (t), \dots, β_{n + 1} (t)]}^{⊤} .

where:

$x (t)$ : plant output; $u (t)$ : control input; $b > 0$ : input gain;
$d (t)$ : lumped total uncertainty ( $f (\cdot) + w (t)$ );
n: plant order; $z (t)$ : extended state with disturbance channel;
$A_{o}, B_{o}, C$ : ESO realization matrices; $I, 0$ : identity/zero blocks;
$L (t) = {[β_{1} (t), \dots, β_{n + 1} (t)]}^{⊤}$ : time-varying observer gain vector;
$y (t)$ : ESO output (estimated x).

Appendix A.7. Bandwidth-Pole Alignment and Nominal Gains

The characteristic polynomial with bandwidth

ω_{o} > 0

is

p_{o} (s) = s^{n + 1} + β_{1} s^{n} + \dots + β_{n} s + β_{n + 1} = {(s + ω_{o})}^{n + 1},

(A11)

which yields the nominal gains

β_{i}^{★} = (\binom{n + 1}{i}) ω_{o}^{i}, i = 1, \dots, n + 1 .

(A12)

where:

$ω_{o} > 0$ : desired ESO bandwidth;
$p_{o} (s)$ : observer characteristic polynomial;
$β_{i}^{★}$ : nominal gain for the i-th coefficient;
n: plant order; $i = 1, \dots, n + 1$ : index of coefficients.

Appendix A.8. Adaptive Bandwidth Scheduling and Time-Varying Gains

We schedule the bandwidth by an error-driven law and update the gains accordingly:

ω_{o} (t) = ω_{o 0} [1 + κ φ (e (t))], e (t) = z_{1} (t) - x (t),

(A13)

β_{i} (t) = (\binom{n + 1}{i}) ω_{o} {(t)}^{i}, i = 1, \dots, n + 1 .

(A14)

where:

$ω_{o 0} > 0$ : baseline (minimum) observer bandwidth;
$κ > 0$ : scheduling gain; $φ (\cdot)$ : nonnegative error map (here $φ (e) = e^{2}$ );
$e (t) = z_{1} (t) - x (t)$ : ESO output estimation error;
$ω_{o} (t)$ : scheduled bandwidth; $β_{i} (t)$ : time-varying ESO gains;
n: plant order; $i = 1, \dots, n + 1$ : coefficient index.

Appendix A.9. Tracking Differentiator (TD) Equations

For each scalar reference

r_{d, i} (t)

we use the second-order TD:

\begin{matrix} {\dot{r}}_{1, i} & = r_{2, i}, \\ {\dot{r}}_{2, i} & = - ω_{t}^{2} (r_{1, i} - r_{d, i}) - 2 ζ_{t} ω_{t} r_{2, i}, \end{matrix}

(A15)

where:

$r_{d, i} (t)$ : the ith agent’s raw reference (per-axis or scalar component) from the planner;
$r_{1, i} (t), r_{2, i} (t)$ : smoothed desired position and velocity produced by the TD;
$ω_{t} > 0$ : TD bandwidth;
$ζ_{t} \in (0, 1)$ : TD damping ratio;
$i \in {1, \dots, N}$ : agent index (apply TD component-wise for vector references).

Appendix A.10. Four-Layer Collision Detection: Formulas and Symbols

Appendix A.10.1. Layer 1: Spatial Hashing (Coarse Filter) [26]

We map each obstacle centroid

c \in R^{3}

to a grid index by

ι = ⌊\frac{c}{h}⌋ .

(A16)

where:

$c$ : obstacle centroid in world coordinates;
$h > 0$ : cubic cell size (same along $x, y, z$ );
$ι \in Z^{3}$ : integer grid index (componentwise floor).

We retain obstacles in the UAV cell and its 26 neighbors to form the candidate set.

Appendix A.10.2. Layer 2: AABB Tree Culling [27]

Given the UAV center

p_{UAV}

and a node with bounding sphere

(c_{node}, r_{node})

, prune the node if

{∥p_{UAV} - c_{node}∥}_{2} > r_{UAV} + r_{node} + d_{safe} .

(A17)

where:

$p_{UAV}$ : UAV bounding-sphere center;
$r_{UAV}$ : UAV bounding-sphere radius;
$c_{node}$ , $r_{node}$ : AABB-node bounding sphere center and radius;
$d_{safe} > 0$ : safety margin.

Appendix A.10.3. Layer 3: GJK Distance to Convex Obstacles [28]

For a convex obstacle

O_{i}

and the UAV sphere

B (p, r_{UAV})

, define

d_{\min} = \min_{x \in B, y \in O_{i}} {∥ x - y ∥}_{2} .

(A18)

where:

$B (p, r_{UAV})$ : closed ball centered at $p$ with radius $r_{UAV}$ ;
$O_{i}$ : ith convex obstacle;
$d_{\min}$ : minimum separation distance (nonnegative).

If

d_{\min} \leq d_{safe}

, proceed to the fine detection layer.

Appendix A.10.4. Layer 4: Continuous-Time SAT with SDF [29]

Within one control period

Δ t

, use a constant-velocity prediction

p (t) = p_{k} + v_{k} t,

(A19)

and compute the first contact time by

t_{hit} = \min \{t \in (0, Δ t] | {SDF}_{O} [p (t)] \leq r_{UAV}\} .

(A20)

where:

$p_{k}$ : current UAV position (start of the control period);
$v_{k}$ : current predicted velocity (held over the period);
$Δ t > 0$ : control/verification horizon;
${SDF}_{O} (\cdot)$ : signed distance function to obstacle set $O$ (nonnegative outside);
$t_{hit}$ : first contact time (if the set is empty, there is no contact in the period).

If

t_{hit} < Δ t

, a potential collision is flagged.

Appendix B

All key simulation and control parameters are summarised in Table A1–Table A8.

Table A1. Global simulation settings.

Parameter	Value
Total duration, T	15 s
Time step, $d t$	0.001 s
Workspace	${[0, 100]}^{3}$ m
Number of UAVs	5
Number of obstacles	17
Minimum obstacle spacing, $d_{\min}$	5 m

Table A2. Obstacle geometry and sampling parameters.

Parameter	Value
Super-quadric shape index, $ϵ$	1 (cylinder)
Cylinder radius, r	$U (3, 7)$ m
Cylinder height, $h_{cyl}$	$10 + 50 U (0, 1)$ m
OBB minor/major sides, $(W, L)$	$U (6, 14)$ m
OBB height, $h_{obb}$	$10 + 50 U (0, 1)$ m
OBB yaw angle, $γ$	$U (0, 2 π)$ rad
Poisson-disk spacing, $d_{\min}^{obs}$	20 m

Table A3. UAV kinematic limits and nominal flight parameters.

Parameter	Value
Maximum speed, $v_{max}$	12 m/s
Maximum acceleration, $a_{max}$	6 m/s²
UAV radius, $r_{UAV}$	0.25 m
Safety margin, $d_{safe}$	1 m
Nominal reference speed, $v_{ref}$	10 m/s
Cruising slow-down factor	0.7

Table A4. Improved RRT*-APF planner parameters.

Parameter	Value
Step size, s	2 m
Goal-bias probability, p	0.05
Static check samples, $n_{checks}$	20
Iteration limit, $N_{max}$	5000
APF repulsion gain, $η$	1.5
APF threshold, $d_{0}$	8 m
APF gradient step, $α_{apf}$	0.30
Hash-grid cell size, h	5 m
Potential-field gain, $k_{rep}$	100

Table A5. Formation-control and cooperation parameters.

Parameter	Value
Coordination smoothing, $α$	0.15
Repulsion activation radius, $d_{0, form}$	8 m
Cooperative repulsion gain, $K_{c}$	1
Gradient-projection gain, $γ$	0.5
Learning rate, $K_{c, lr}$	2
Small denominator constant, $δ$	0.05 m

Table A6. Observers and baseline controller parameters.

Parameter	Value
ESO gains (Hybrid), $(β_{1}, β_{2}, β_{3})$	(50, 250, 1000)
PD gains (Hybrid), $(K_{p}, K_{d})$	(300, 50)
Sliding-mode gains, $(K_{s}, λ)$	(150, 35)
ESO gains (ADRC), $(β_{1}, β_{2}, β_{3})$	(50, 200, 800)
PD gains (ADRC), $(K_{p, adrc}, K_{d, adrc})$	(500, 350)
PID gains $(K_{p}, K_{i}, K_{d})$	(500, 50, 350)
PID anti-windup limit	100

Table A7. Adaptive and robust module parameters.

Parameter	Value
Baseline ESO bandwidth, $ω_{o 0}$	30 rad/s
Bandwidth scheduling gain, $κ$	2
Adaptive PD coefficients, $(κ_{p}, κ_{d})$	(1, 1)
TD bandwidth / damping, $(ω_{t}, ζ_{t})$	(20 rad/s, 0.7)
Velocity-loop bandwidth / damping, $(ω_{v}, ζ)$	(10 rad/s, 0.75)
Sliding boundary-layer width, $ε$	0.02
Obstacle weighting base, $k_{0}$	1.2

Table A8. Emergency avoidance parameters.

Parameter	Value
Energy-type repulsion gain, $κ_{e}$	150
Velocity blend factor, $β_{s}$	0.85
Hysteresis band, $δ_{hyst}$	0.5 m
Repulsive-force threshold, $ε_{rep}$	0.05

References

Shi, J.; Pei, Z.; Tang, Z.; Hu, D. Design and implementation of an improved active disturbance rejection control system for quadrotor UAV. J. Beijing Univ. Aeronaut. Astronaut. 2021, 47, 1823–1831. [Google Scholar] [CrossRef]
Yang, W.; Lu, J.; Jiang, X.; Wang, Y. Design of quadrotor attitude active-disturbance-rejection controller based on improved ESO. Syst. Eng. Electron. Technol. 2022, 44, 3792–3799. [Google Scholar]
Dou, J.; Kong, X.; Wen, B. Fuzzy active-disturbance-rejection attitude control and stability analysis of quadrotor UAV. Chin. J. Inertial Technol. 2015, 23, 824–830. [Google Scholar] [CrossRef]
Chen, Z.; Zhou, Z.; Su, H.; Li, X. Design of active disturbance rejection attitude controller for tethered UAV with parameter tuning. In Proceedings of the 2023 2nd International Conference on Robotics, Artificial Intelligence and Intelligent Control (RAIIC), Mianyang, China, 11–13 August 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 19–24. [Google Scholar] [CrossRef]
Zhang, J.; Shi, Y.; Ren, C.; Wen, B. Robust sliding-mode control of quadrotor UAV trajectory based on extended state observer. Chin. J. Inertial Technol. 2018, 26, 247–254. [Google Scholar] [CrossRef]
Wang, L.; Li, G.; Wang, Z. Sliding-mode control of under-actuated quadrotor UAV. J. Harbin Eng. Univ. 2012, 33, 1248–1253. [Google Scholar]
An, H. Geometric Sliding-Mode Attitude Control and Disturbance-Rejection for Quadrotor UAV. Ph.D. Thesis, National University of Defense Technology, Changsha, China, 2013. [Google Scholar]
Feng, K.; Jia, Y.; Chai, J.; Zeng, Q.; Hao, Y. Fuzzy adaptive sliding-mode control of quadrotor UAV. J. Flight Mech. 2018, 36, 49–53. [Google Scholar] [CrossRef]
Chen, X.; Liu, K.; Mao, H. UAV trajectory planning based on APF-RRT algorithm. Opto-Electron. Eng. 2022, 29, 17–22. [Google Scholar]
Yin, G.; Zhou, S.; Wu, Q. UAV path planning based on improved RRT algorithm. Acta Electron. Sin. 2017, 45, 1764–1769. [Google Scholar]
Li, M. Research on UAV Mission Planning Method Based on Intelligent Optimization and RRT Algorithm. Master’s Thesis, Nanjing University of Aeronautics and Astronautics, Nanjing, China, 2012. [Google Scholar]
Wang, Y.; Yang, Y.; Li, J.; Tang, Q.; Liu, C. An improved informed RRT*-connect algorithm for UAVs path planning. In Proceedings of the 2024 IEEE International Conference on Unmanned Systems (ICUS), Nanjing, China, 18–20 October 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 325–333. [Google Scholar] [CrossRef]
Xu, X.; Song, Z.; Mao, Y. Global path planning for UAV based on APF-RRT* fusion algorithm. J. Wuhan Univ. Technol. 2021, 43, 72–78. [Google Scholar]
Zhou, L.; Kong, M. Three-dimensional UAV obstacle avoidance based on improved artificial potential field method. J. East China Normal Univ. (Nat. Sci.) 2022, 2022, 54–67. [Google Scholar] [CrossRef]
Yang, L.; Gu, Y.; Bai, Y. Online route planning algorithm for UAV based on improved artificial potential field. Appl. Electron. Technol. 2018, 44, 5–13. [Google Scholar] [CrossRef]
Zhou, L.; Liang, Q.; Zhou, H.; Xiong, W. Improved artificial potential field method for UAV path planning. In Proceedings of the 2022 14th International Conference on Measuring Technology and Mechatronics Automation (ICMTMA), Changsha, China, 15–16 January 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 657–660. [Google Scholar] [CrossRef]
Wang, Y.; Lun, Y.; Wang, H.; Wu, J.; Liu, Y. 6-DOF UAV path planning and tracking control for obstacle avoidance: A deep-learning-based integrated approach. Aerosp. Sci. Technol. 2024, 140, 109320. [Google Scholar] [CrossRef]
Qin, N.; Li, Y.; Huang, T.; Huang, D. Path planning and control of a quadrotor UAV based on an improved APF using parallel search. Int. J. Aerosp. Eng. 2021, 2021, 5524841. [Google Scholar] [CrossRef]
Herrmann, J.; Rudnick-Cohen, E.; Azarm, S. Multi-objective design and path planning optimization of UAVs. In Proceedings of the 16th AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference, Dallas, TX, USA, 22–26 June 2015; AIAA: Reston, VA, USA, 2015; pp. 2015–2322. [Google Scholar] [CrossRef][Green Version]
Low, K.; Pang, B.; Dai, W.; Hu, X. UAV path optimization with an integrated cost assessment model considering third-party risks in metropolitan environments. Reliab. Eng. Syst. Saf. 2022, 222, 108399. [Google Scholar] [CrossRef]
Khatib, O. Real-time obstacle avoidance for manipulators and mobile robots. Int. J. Robot. Res. 1986, 5, 90–98. [Google Scholar] [CrossRef]
Karaman, S.; Frazzoli, E. Sampling-based algorithms for optimal motion planning. Int. J. Robot. Res. 2011, 30, 846–894. [Google Scholar] [CrossRef]
Han, J. From PID to active disturbance rejection control. IEEE Trans. Ind. Electron. 2009, 56, 900–906. [Google Scholar] [CrossRef]
Zhang, D.; Wu, Q.; Yao, X. Bandwidth-based stability analysis of active disturbance rejection control for nonlinear uncertain systems. J. Syst. Sci. Complex. 2018, 31, 1449–1468. [Google Scholar] [CrossRef]
Utkin, V.I. Sliding Modes in Control and Optimization; Springer: Berlin, Germany, 1992. [Google Scholar] [CrossRef]
Ericson, C. Real-Time Collision Detection; Morgan Kaufmann: San Francisco, CA, USA, 2004. [Google Scholar] [CrossRef]
Gottschalk, S.; Lin, M.C.; Manocha, D. OBBTree: A hierarchical structure for rapid interference detection. In Seminal Graphics Papers: Pushing the Boundaries; ACM: New York, NY, USA, 1996; pp. 171–180. [Google Scholar] [CrossRef]
Gilbert, E.G.; Johnson, D.W.; Keerthi, S.S. A fast procedure for computing the distance between complex objects in three-dimensional space. IEEE J. Robot. Autom. 1988, 4, 193–203. [Google Scholar] [CrossRef]
Redon, S.; Galoppo, N.; Lin, M.C. Adaptive dynamics of articulated bodies. ACM Trans. Graph. 2005, 24, 936–945. [Google Scholar] [CrossRef]
Bouabdallah, S.; Noth, A.; Siegwart, R. PID vs. LQ control techniques applied to an indoor micro quadrotor. In Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566), Sendai, Japan, 28 September–2 October 2004; IEEE: Piscataway, NJ, USA, 2004; Volume 3, pp. 2451–2456. [Google Scholar] [CrossRef]

Figure 1. Wind-velocity components

w_{x}, w_{y}, w_{z}

at different altitudes using Equation (1). Multi-scale variations and stronger winds at higher altitudes are visible.

Figure 1. Wind-velocity components

w_{x}, w_{y}, w_{z}

at different altitudes using Equation (1). Multi-scale variations and stronger winds at higher altitudes are visible.

Figure 2. Height-time heatmap of wind magnitude

{∥ w ∥}_{2}

from Equation (1). Gust onset at

t_{0}

and vertical strengthening with altitude are clear.

Figure 2. Height-time heatmap of wind magnitude

{∥ w ∥}_{2}

from Equation (1). Gust onset at

t_{0}

and vertical strengthening with altitude are clear.

Figure 3. Flowchart of the RRT*-APF planner with guided sampling, predictive checks, and rewiring. Outputs a collision-free feasible 3D path.

Figure 4. System workflow from environment modeling to planning, formation, hybrid tracking control, and emergency recovery.

Figure 5. Flowchart of adaptive active disturbance rejection control (AADRC) for UAV trajectory tracking.

Figure 6. Position-tracking error of Hybrid, ADRC, PID, SMC. Hybrid shows smallest peaks and errors.

Figure 7. Mean, std, max, steady error of tracking. Hybrid achieves best overall performance.

Figure 8. Tracking errors of five UAVs. Hybrid limits overshoot and converges uniformly within formation.

Figure 9. Average spacing vs. time with 5 m limit. Hybrid has smallest overshoot and fastest recovery.

Figure 10. Statistics of inter-UAV spacing. Hybrid keeps closest to 5 m with least variability.

Figure 11. Average formation error vs. time. Hybrid bounds deviation and converges fastest.

Figure 12. Mean, max, std of formation error. Hybrid shows smallest values.

Figure 13. Three-dimensional planned path shown in four views. Path skirts obstacles smoothly under limits.

Figure 14. Descent trajectory with obstacle clearance and safe margins.

Figure 15. Climb trajectory balancing ascent rate and obstacle avoidance.

Figure 16. Position tracking along

X, Y, Z

. Horizontal error near zero; vertical recoveries within

\pm 2

m. (a) X-axis; (b) Y-axis; (c) Z-axis.

Figure 16. Position tracking along

X, Y, Z

. Horizontal error near zero; vertical recoveries within

\pm 2

m. (a) X-axis; (b) Y-axis; (c) Z-axis.

Figure 17. Norms of tracking error

∥ e (t) ∥

and sliding surface

∥ s (t) ∥

under a disturbance at

t \approx 1

s. Both quickly decay to small bounds, showing fast convergence and robustness.

Figure 17. Norms of tracking error

∥ e (t) ∥

and sliding surface

∥ s (t) ∥

under a disturbance at

t \approx 1

s. Both quickly decay to small bounds, showing fast convergence and robustness.

Figure 18. Lyapunov function

V (t)

and derivative

\dot{V} (t)

on log scale. V decreases rapidly and

\dot{V} \leq 0

, confirming closed-loop stability.

Figure 18. Lyapunov function

V (t)

and derivative

\dot{V} (t)

on log scale. V decreases rapidly and

\dot{V} \leq 0

, confirming closed-loop stability.

Table 1. Micro-sensitivity results across random seeds and settings. Metrics are reported as mean ± std.

Setting	Controller	Success (%)	Steady-State Error (m)	Min Separation (m)
Baseline ( $s = 2$ , $N_{obs} = 17$ )	Hybrid	$99.85 \pm 0.12$	$2.63 \pm 0.41$	$5.32 \pm 0.22$
	SMC	$99.72 \pm 0.18$	$3.28 \pm 0.47$	$5.27 \pm 0.25$
	PID	$98.62 \pm 0.74$	$4.59 \pm 0.66$	$5.19 \pm 0.31$
	ADRC	$96.24 \pm 1.12$	$7.98 \pm 0.85$	$5.14 \pm 0.38$
Step size $s = 1$	Hybrid	$99.91 \pm 0.09$	$2.45 \pm 0.39$	$5.36 \pm 0.19$
	SMC	$99.78 \pm 0.14$	$3.01 \pm 0.52$	$5.33 \pm 0.21$
	PID	$99.02 \pm 0.36$	$4.32 \pm 0.59$	$5.18 \pm 0.27$
	ADRC	$97.15 \pm 0.88$	$7.84 \pm 0.74$	$5.22 \pm 0.29$
Step size $s = 3$	Hybrid	$99.67 \pm 0.21$	$2.81 \pm 0.48$	$5.24 \pm 0.28$
	SMC	$98.92 \pm 0.56$	$3.65 \pm 0.63$	$5.18 \pm 0.31$
	PID	$97.41 \pm 0.92$	$4.77 \pm 0.71$	$5.09 \pm 0.34$
	ADRC	$95.68 \pm 1.43$	$8.45 \pm 0.92$	$5.05 \pm 0.37$
$N_{obs} = 10$	Hybrid	$99.94 \pm 0.07$	$2.36 \pm 0.36$	$5.42 \pm 0.23$
	SMC	$99.84 \pm 0.11$	$2.97 \pm 0.45$	$5.35 \pm 0.20$
	PID	$99.11 \pm 0.28$	$4.25 \pm 0.58$	$5.15 \pm 0.26$
	ADRC	$97.86 \pm 0.62$	$7.72 \pm 0.67$	$5.25 \pm 0.24$
$N_{obs} = 25$	Hybrid	$99.61 \pm 0.25$	$2.98 \pm 0.53$	$5.18 \pm 0.30$
	SMC	$98.34 \pm 0.72$	$3.79 \pm 0.61$	$5.13 \pm 0.32$
	PID	$96.83 \pm 1.04$	$4.88 \pm 0.76$	$5.06 \pm 0.36$
	ADRC	$95.42 \pm 1.68$	$8.61 \pm 0.98$	$5.02 \pm 0.41$

Table 2. Quantitative performance comparison of different controllers. Metrics are reported in meters (m).

Metric	Hybrid	SMC	PID	ADRC
Path-Tracking Error
Average Error (m)	3.0	3.5	7.8	8.3
Stable Error (m)	2.5	2.6	5.5	8.1
Max Error (m)	6.0	11.5	15.0	15.0
Std. Dev. (m)	0.8	1.5	3.0	3.5
Inter-UAV Spacing
Mean Spacing (m)	6.7	7.5	14.5	8.8
Min Spacing (m)	5.0	5.0	9.0	6.0
Max Spacing (m)	9.5	11.5	19.0	15.0
Std. Dev. (m)	1.0	1.3	2.8	2.2
Formation-Keeping Error
Average Error (m)	3.0	3.5	10.0	10.0
Max Error (m)	9.0	12.0	16.0	21.0
Std. Dev. (m)	1.2	1.6	3.5	4.0

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yan, Y.; Liu, S.; Hao, R. RRT*-APF Path Planning and MA-AADRC-SMC Control for Cooperative 3-D Obstacle Avoidance in Multi-UAV Formations. Drones 2025, 9, 611. https://doi.org/10.3390/drones9090611

AMA Style

Yan Y, Liu S, Hao R. RRT*-APF Path Planning and MA-AADRC-SMC Control for Cooperative 3-D Obstacle Avoidance in Multi-UAV Formations. Drones. 2025; 9(9):611. https://doi.org/10.3390/drones9090611

Chicago/Turabian Style

Yan, Yuehao, Songlin Liu, and Rui Hao. 2025. "RRT*-APF Path Planning and MA-AADRC-SMC Control for Cooperative 3-D Obstacle Avoidance in Multi-UAV Formations" Drones 9, no. 9: 611. https://doi.org/10.3390/drones9090611

APA Style

Yan, Y., Liu, S., & Hao, R. (2025). RRT*-APF Path Planning and MA-AADRC-SMC Control for Cooperative 3-D Obstacle Avoidance in Multi-UAV Formations. Drones, 9(9), 611. https://doi.org/10.3390/drones9090611

Article Menu

RRT*-APF Path Planning and MA-AADRC-SMC Control for Cooperative 3-D Obstacle Avoidance in Multi-UAV Formations

Abstract

1. Introduction

2. Environment Modeling and Kinematic Decoupling

2.1. Dynamic Environmental Disturbances and Obstacle Representation

2.1.1. Composite Wind-Field Disturbance Modeling

2.1.2. Super-Quadric Cylinder Obstacles

2.1.3. Oriented Bounding-Box Rectangular Obstacles

2.2. Attitude–Acceleration Decoupling

3. Path-Planning Algorithm

3.1. Improved RRT*-APF Algorithm

3.2. Random Sampling and APF-Corrected Sampling Strategy

3.3. Nearest Node Selection, Extension and Rewiring [22]

3.4. Dynamic Velocity/Acceleration Constraints and Collision Checking

3.4.1. Geometric Precision Checking

3.4.2. Static Linear-Discrete Check

3.4.3. Dynamic Predictive Collision Checking

3.5. Goal Determination, Path Retracing, and Termination Criteria [22]

4. Flight Control Algorithm

4.1. Hierarchical Hybrid Control Algorithm Architecture Overview

4.2. Adaptive Design of the Extended-State Observer (ESO)

4.2.1. Problem Statement

4.2.2. Extended-State Variables and Observer Structure [23]

4.2.3. Bandwidth-Pole Alignment Design

4.2.4. Adaptive Bandwidth Mechanism

4.3. Multi-Agent Cooperative Formation and Adaptive Repulsive Force Scheduling

4.3.1. Cooperative Formation Mechanism

4.3.2. Gain Projection Law Based on Group Average Inter-Agent Distance

4.4. Main Control Law and Sliding-Mode Compensation

4.4.1. Adaptive Active Disturbance Rejection Control (AADRC) Law Design [23,24]

4.4.2. Sliding-Mode Robust Compensator (SMC)

4.4.3. Gradient Potential Field-Based High-Precision Obstacle Avoidance Criterion

4.4.4. Integrated Control Law

4.5. Real-Time Multi-Level Collision Detection, Evasive Maneuvers, and Velocity Compensation

4.5.1. Multi-Level Obstacle Collision Detection

4.5.2. Collision Avoidance Maneuver

4.5.3. Speed Compensation

5. Simulation Experiments

5.1. Experimental Environment and Parameter Settings

5.2. Rationale and Micro-Sensitivity

5.3. Evolution Trend Analysis of Path-Tracking Error

5.4. Average Formation Spacing Comparison

5.5. Formation Maintenance Analysis

5.6. Comparison of Multiple Path-Planning Schemes

5.7. Lyapunov Convergence Proof and Detailed Analysis of Additional Experimental Results

5.7.1. Error and Sliding-Surface Convergence

5.7.2. Lyapunov Function and Its Time Derivative

6. Results and Outlook

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Additional Derivations and Implementation Details

Appendix A.1. APF Sampling Details and Distance Definitions

Appendix A.2. Interpolation and Rewiring Criteria

Appendix A.3. Point-in-Obstacle Tests and Symbols

Appendix A.4. Static Linear-Discrete Segment Sampling

Appendix A.5. Dynamic Predictive Collision Checking

Appendix A.6. ESO Structure and Symbols

Appendix A.7. Bandwidth-Pole Alignment and Nominal Gains

Appendix A.8. Adaptive Bandwidth Scheduling and Time-Varying Gains

Appendix A.9. Tracking Differentiator (TD) Equations

Appendix A.10. Four-Layer Collision Detection: Formulas and Symbols

Appendix A.10.1. Layer 1: Spatial Hashing (Coarse Filter) [26]

Appendix A.10.2. Layer 2: AABB Tree Culling [27]

Appendix A.10.3. Layer 3: GJK Distance to Convex Obstacles [28]

Appendix A.10.4. Layer 4: Continuous-Time SAT with SDF [29]

Appendix B

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI