1. Introduction
In complex 3-D environments, multi-UAV formations must remain stable while reacting quickly to unknown disturbances and obstacles. Meeting both goals is still challenging for research and practice.
In UAV control, active disturbance rejection control (ADRC) has become a research focus for robust UAV operation owing to its extended state observer (ESO), which estimates and compensates internal and external disturbances online. Shi et al. [
1] employed an improved ESO with a sliding-average low-pass filter in a nested inner–outer loop structure to achieve rapid attitude response without overshoot; Yang et al. [
2] enhanced the ESO’s noise rejection by using a nonlinear smooth GALN function, significantly improving convergence speed and tracking accuracy; Dou et al. [
3] introduced fuzzy logic into the ESO and error-feedback loop, effectively reducing overshoot and output fluctuation; and Chen et al. [
4] optimized ADRC parameters via the integral of time-weighted absolute error (ITAE) criterion combined with a particle swarm optimization (PSO) algorithm, further enhancing tethered UAV tracking precision under cable interference.
Active Disturbance Rejection Control (ADRC) is typically composed of three main components: a Tracking Differentiator (TD), an Extended-State Observer (ESO), and a Nonlinear State Error Feedback (NLSEF). Specifically, the TD is employed to smooth the desired input and generate usable derivative signals; the ESO estimates the system states in real time while compensating for the “total disturbance” (including external perturbations and model uncertainties); and the NLSEF, together with a linear feedback term, achieves dynamic correction of the closed-loop control law. This architecture significantly reduces reliance on an accurate mathematical model and thus provides strong robustness and adaptability. In contrast, sliding-mode control (SMC) is based on the invariance principle, where a sliding surface is designed and a discontinuous control law drives the system trajectory toward and maintains it on this surface, ensuring finite-time convergence and strong disturbance rejection. Its strengths lie in design simplicity and robustness, particularly against both structured and unstructured uncertainties. Nevertheless, SMC often suffers from the chattering phenomenon in practical implementations, while the performance of ADRC critically depends on the tuning of ESO parameters, the selection of disturbance compensation bandwidth, and the smoothing capability of the TD.
Sliding-mode control (SMC), renowned for its strong robustness to parameter uncertainty and external disturbances, has been widely applied to quadrotor attitude control. Zhang et al. [
5] proposed a robust SMC strategy based on ESO that reduced rotor speed jump amplitude by nearly 90 %; Wang et al. [
6] constructed a generalized sliding-mode manifold for the underactuated system and verified closed-loop stability; An et al. [
7] applied geometric SMC within the Lie group–Lie algebra framework to achieve globally stable attitude convergence; and Feng et al. [
8] combined fuzzy adaptive mechanisms with sliding-mode variable-structure control to suppress chattering and enhance disturbance rejection. Although these works significantly improve the robustness of single-vehicle attitude and velocity loops, most experiments remain confined to indoor or static hover scenarios, lacking validation for high-speed three-dimensional formation coordination and real-time obstacle avoidance. Moreover, these controllers have not explicitly incorporated kinematic constraints such as maximum velocity and acceleration, nor analyzed global stability under strong wind fields and communication delays.
In path planning, the rapidly exploring random tree (RRT) algorithm is widely adopted due to its map-free nature, but its global randomness limits convergence efficiency and path quality. Chen et al. [
9] proposed an APF-RRT fusion framework based on goal biasing, improved artificial potential fields, and a genetic algorithm to accelerate convergence and shorten paths; Yin et al. [
10] incorporated dynamic constraints and distance weighting into node expansion and employed B-spline curves for path smoothing; Li et al. [
11] used chaotic sequences and fuzzy inference to dynamically adjust sampling parameters and developed bilateral and rolling-window replanning schemes; Wang et al. [
12] introduced bias-extension, probability guidance, and node pruning strategies to significantly speed up tree growth and simplify paths. Traditional artificial potential field (APF) methods are computationally simple and yield smooth paths but often suffer from local minima. Xu et al. [
13] presented an APF-RRT* fusion approach that guides sampling with potential fields and self-tunes the sampling radius for efficient global planning; Zhou and Kong [
14] adopted hexagonal guidance and virtual subgoals to escape local traps; Yang et al. [
15] developed an online planner based on reference-route attractive fields and adaptive disturbance factors to ensure safety in dynamic environments; Zhou et al. [
16] introduced angular factors and adaptive weights into attractive and repulsive functions to suppress oscillations and overcome local optima. Existing RRT*/APF enhancements primarily focus on sampling efficiency and geometric path smoothing, rarely incorporating kinematic constraints such as velocity/acceleration or continuous-time collision prediction in the expansion and rewiring phases; generated paths often require post-processing before execution. Furthermore, most algorithms are validated only in single-UAV, static-obstacle settings, lacking systematic evaluation of inter-UAV interference, communication bandwidth, and safety-distance requirements.
To enhance autonomous flight capability in complex dynamic environments, the integration of path planning and attitude control has emerged as a key research topic. Wang et al. [
17] constructed a neural-network-based joint planning-tracking framework to achieve real-time obstacle avoidance for six-degree-of-freedom vehicles in complex obstacle fields, while Huang et al. [
18] fused an improved APF algorithm with inverse-dynamics backstepping control to effectively address local optima. Despite these advances, challenges remain in attitude-path modeling consistency, global optimality under multi-objective constraints, and overall system robustness. For example, Rudnick-Cohen et al. [
19] jointly optimized UAV design parameters and path planning to reduce risk and mission time, and Pang et al. [
20] emphasized balancing third-party risk and energy consumption in path optimization. Although deep-learning or multi-objective joint optimization frameworks have attempted to synchronize planning and control, they rely on large-scale offline training with poor interpretability and a high transfer cost; the results are predominantly confined to single-vehicle simulations, lacking multi-UAV coordination and real-wind validation. Offline path optimization trade-offs for risk and energy have not been deeply coupled with real-time control loops, nor has robust analysis against disturbances and uncertainties been fully addressed.
This paper proposes a three-dimensional multi-UAV formation cooperative obstacle-avoidance framework that integrates an APF-guided RRT* planner with kinematic constraints, continuous-time collision prediction, and a four-layer obstacle filtering pipeline, together with a hierarchical hybrid MA–AADRC–SMC controller. The planner produces dynamically feasible and executable trajectories via potential-field-biased sampling and predictive collision checks, while the controller combines an adaptive extended state observer, feedforward correction, and a robust sliding-mode law in a dual-loop structure to achieve high-precision tracking under disturbances. Large-scale simulations and Lyapunov-based analysis in composite urban wind fields and heterogeneous three-dimensional environments validate superior tracking accuracy, formation keeping, and disturbance rejection, supporting reliable applications such as disaster assessment and urban air mobility.
Existing planners mainly focus on sampling efficiency and geometric smoothness, but seldom incorporate kinematic limits or continuous-time safety constraints directly into the planning process. Most controllers are validated only in single-UAV or low-dynamics settings, with weak planning–control coupling under realistic disturbances. To address these gaps, our contributions are (1) An APF-guided RRT* planner with embedded velocity/acceleration constraints and continuous-time collision prediction for executable trajectories; (2) A four-layer obstacle filtering pipeline that preserves temporal completeness with near-constant computational cost in cluttered environments; (3) A hybrid MA–AADRC–SMC controller combining bandwidth-scheduled ESO, sliding-mode robustness, and speed-weighted repulsion for overshoot-suppressing, disturbance-rejecting tracking with enforced safety spacing; (4) A tightly coupled planning–control framework with Lyapunov-guaranteed stability, validated through representative urban scenarios and micro-sensitivity studies confirming robustness.
3. Path-Planning Algorithm
3.1. Improved RRT*-APF Algorithm
Rapidly exploring Random Tree Star (RRT*) is widely adopted for UAV path planning owing to its efficient sampling in high-dimensional configuration spaces and its asymptotic optimality. Nevertheless, the standard algorithm focuses mainly on spatial connectivity and collision-free feasibility, while neglecting indispensable kinematic constraints—such as maximum velocity, acceleration, and turning limits—so that certain planned trajectories become infeasible or unsafe when executed on real vehicles.
To overcome these deficiencies, an improved RRT* augmented with an Artificial Potential Field (APF) is developed. The APF serves as a heuristic guide for random sampling, whereas the UAV kinematic model is embedded directly into both collision checking and path-extension procedures. Consequently, the proposed planner generates obstacle-avoidance trajectories in three-dimensional environments that honour both geometric clearance and kinematic feasibility. During tree expansion, the superposition of goal-attractive and obstacle-repulsive potentials dynamically biases sample selection toward higher-quality segments and expedites convergence of the tree to the goal region. Simultaneously, continuous-time kinematic simulation combined with collision discrimination guarantees that each path segment satisfies prescribed velocity and acceleration bounds of the UAV.
3.2. Random Sampling and APF-Corrected Sampling Strategy
Samples are drawn from the goal with probability p, and otherwise uniformly over the workspace as .
For each super-quadric cylinder obstacle with base centre
, radius
R, and height range
, let
. We define an
effective clearance that blends lateral and vertical separations, and a repulsive force
that grows as the clearance shrinks; both precise expressions are provided in
Appendix A.1 (cf. [
21]). If
, the obstacle contributes a nonzero repulsion. Summing over all obstacles yields the potential-field gradient
Finally, the raw sample is refined using a single gradient step
which biases samples away from obstacles and accelerates planner convergence.
The attractive term pulls the sample toward the goal, while the repulsive term pushes it away from nearby obstacles. Their sum shapes the local sampling bias, guiding expansion into safer, more goal-oriented regions.
The effective clearance metric and the obstacle repulsive force used to compute the APF gradient are detailed in
Appendix A.1. Intuitively,
prevents grazing either the sides or the top of obstacles, and
smoothly increases as the clearance decreases, producing a stable guidance away from hazardous regions. The overall procedure of the improved RRT*-APF planner, including guided sampling, predictive feasibility checks, and rewiring, is summarized in the flowchart shown in
Figure 3.
3.3. Nearest Node Selection, Extension and Rewiring [22]
At each iteration, the nearest node in the current tree
to the (possibly APF-corrected) sample
is found by
A new node is then generated at fixed step size
s along the direction from
to
:
Selecting the nearest existing node ensures that the tree grows from the closest reachable state toward the new sample, minimizing unnecessary detours.
To validate the straight-line extension, we discretize the segment between
and
into intermediate points for collision checks; the interpolation formula and parameter definitions are given in
Appendix A.2.
If the extension succeeds, nearby nodes are examined for cost reduction via rewiring. We compare cumulative path costs and reassign parents through
when doing so decreases the total cost while remaining collision-free; the explicit inequalities are listed in
Appendix A.2.
3.4. Dynamic Velocity/Acceleration Constraints and Collision Checking
3.4.1. Geometric Precision Checking
We define an L2 geometric-precision layer that returns true if a query point lies inside any obstacle in the set . During planning, every candidate or predicted waypoint is validated by this layer. This layer acts as a geometric filter, rejecting any candidate or predicted waypoint that lies inside a modeled obstacle and ensuring that both static and predicted motions are physically feasible in the environment model.
The concrete point-in-obstacle tests for super-quadric cylinders and oriented bounding boxes, together with parameter definitions, are provided in
Appendix A.3.
To guarantee both path safety and kinematic executability, two checks are performed in the RRT*-APF expansion:
- 1.
Static linear discrete check - Along the straight line between two nodes, sample at equal intervals and apply the L2 geometric test to each point (sampling formula and symbols in
Appendix A.4).
- 2.
Dynamic predictive check - Using the UAV kinematics, propagate the state forward for several look-ahead steps; each predicted waypoint is, again, subjected to the L2 test. The resulting recursion and terminal velocity assignment are summarized in
Appendix A.5.
The two-stage check first rules out collisions along a static straight line, then predicts motion under dynamic constraints to catch potential collisions that could occur due to the UAV’s inertia or acceleration limits.
3.4.2. Static Linear-Discrete Check
We partition the candidate straight segment from
to
into a fixed number of equally spaced samples and apply the L2 geometric test to each. The exact interpolation expression, index range, and symbol definitions are listed in
Appendix A.4. If any sampled point lies inside
, the segment is rejected; otherwise, the planner proceeds.
By sampling evenly along the candidate segment, this check detects any intersection with obstacles that could be missed by testing only the endpoints. 3.4.3. Dynamic Predictive Collision Checking
Under the UAV’s current velocity and maximum acceleration constraints, the straight-line segment between two nodes is validated by predicting uniformly accelerated motion and verifying each predicted waypoint via the L2 geometric-precision layer. Let
be the velocity at the nearest node with magnitude
,
the acceleration limit,
the inter-node distance, and
the time step. The minimum reachable time and the number of look-ahead steps are
These expressions compute the shortest possible traversal time and the required number of prediction steps to ensure dynamic feasibility given current speed and acceleration limits.
We propagate along the segment with a clipped acceleration recursion and assign a terminal velocity for continuity; the full recursions and symbols (including the terminal assignment) are given in
Appendix A.5. The recursion simulates the UAV’s motion step by step, clipping acceleration to physical limits so that predicted positions remain both collision-free and kinematically achievable.
3.5. Goal Determination, Path Retracing, and Termination Criteria [22]
When a newly generated node
satisfies
and the edge
is collision-free, the goal node is appended to the search tree:
Once
has been inserted into the tree
, the optimal path is retraced recursively from the goal back to the root:
until
. The resulting trajectory, ordered in time, is
If the iteration count reaches the maximum without generating any node satisfying with a collision-free connection, planning is deemed to have failed and an empty path is returned. This termination criterion, governed by , balances the global search capability against computational efficiency.
5. Simulation Experiments
5.1. Experimental Environment and Parameter Settings
All simulations were conducted in MATLAB R2023b on an Intel i5-9300H CPU. Five UAVs performed trajectory tracking and obstacle avoidance inside a cubic workspace of that contained 17 randomly placed obstacles satisfying a minimum inter-obstacle spacing of 5 m. Every controller was evaluated under identical initial states and an equivalent step-gust wind disturbance.
5.2. Rationale and Micro-Sensitivity
Rationale. The simulation and controller parameters were determined through a hierarchical procedure combining UAV kinematic constraints, control-theoretic considerations, and lightweight empirical tuning. This ensures that each parameter has a clear physical or theoretical basis, and the final set is both feasible and representative of urban UAV operations. Specifically:
(i) Workspace size ( m3), maximum speed ( m/s), and acceleration limit ( m/s2) were chosen to yield -s missions with multiple obstacle interactions, while remaining within realistic thrust and maneuverability bounds.
(ii) Obstacles combine super-quadric cylinders (poles/trees) and OBBs (containers/low facades), with dimensions drawn from ranges (
Table A2) and Poisson-disk spacing to avoid overlaps. This results in a medium clutter density (17 obstacles) that is neither trivial nor infeasible.
(iii) The composite wind field (multi-harmonic + step gust + altitude scaling;
Section 2.1.1) captures steady peaks, sudden gusts, and vertical gradients, reflecting the turbulence patterns encountered in low-altitude urban UAV envelopes.
(iv) Planner hyperparameters (
s, goal-bias
p, and static check samples
) were tuned to balance path optimality and runtime efficiency. Continuous-time feasibility checks further enforce kinematic validity (
Section 3.4).
(v) Controller and observer parameters (
Table A6 and
Table A7) were selected so that the closed-loop response time is shorter than the trajectory-curvature time scale, thus limiting overshoot while preserving disturbance rejection. Fine-tuning of gains (e.g.,
,
) was performed through lightweight simulation sweeps with three design criteria: (a) stability margin not violated, (b) steady-state error minimized, and (c) robustness maintained under varying wind and obstacle conditions.
Micro-sensitivity (design & criteria). To ensure conclusions do not hinge on a single random draw
with minimal compute, we repeat simulations with three independent seeds
(affecting obstacle placement/sizes and wind phases) and perform two lightweight sweeps: (i) RRT* step size
(baseline is
); (ii) Obstacle count
(baseline is 17). For each setting and controller (Hybrid/ADRC/PID/SMC), we report: (1) The success rate (goal reached within 15 s; same criterion as the main study), (2) The steady-state tracking error averaged over
s (averaged across UAVs), (3) The minimum inter-UAV separation over the whole horizon (safety threshold 5 m). The summary appears in
Table 1.
5.3. Evolution Trend Analysis of Path-Tracking Error
As shown in
Figure 6, the average path tracking errors vary noticeably among different control algorithms. The hybrid control algorithm maintains relatively small errors, mostly within 2–3 m, with a transient peak of about 6 m around 4.5 s, followed by rapid stabilization. The sliding-mode control (SMC) initially rises to approximately 9 m, but decreases sharply after 2.5 s and remains below 3 m in the subsequent stage. By contrast, the PID controller reaches nearly 12 m between 5 and 7 s, and although the error reduces afterwards, a residual of 2.5–3 m persists at 15 s, indicating slower convergence. The ADRC, however, shows larger oscillations ranging between 7 and 12 m for most of the time, with only a limited reduction at the end. Overall, the hybrid approach and SMC exhibit relatively better steady-state accuracy, while PID and ADRC demonstrate a less favorable dynamic performance.
As shown in
Figure 7, four error metrics of different control algorithms in path tracking are compared: average error, standard deviation of error, maximum error, and stable average error. Among all methods, the hybrid control algorithm achieves the smallest errors, with an average error of approximately 3 m, a stable average error around 2.5 m, and low variability, indicating both good convergence and stability. The SMC shows comparable steady-state performance, with slightly higher average errors (3.5 m) and a stable error near 2.6 m, though its maximum error reaches about 11.5 m. In contrast, PID and ADRC exhibit substantially larger errors, with average errors of 7.8 m and 8.3 m, and stable errors around 5.5 m and 8.1 m, respectively, while their maximum errors approach or exceed 15 m, reflecting larger fluctuations. Overall, the hybrid algorithm demonstrates the most favorable balance between steady-state accuracy and error stability, outperforming the other three methods, while SMC shows secondary advantages, and PID and ADRC are relatively less effective.
As shown in
Figure 8, the tracking errors of five UAVs under different control algorithms are compared. It is evident that the hybrid control algorithm consistently delivers the best stability and accuracy across all UAVs, with errors mostly confined to 2–4 m and a rapid convergence after initial disturbances. Although the sliding-mode control (SMC) also achieves relatively fast convergence, its initial peaks typically range from 8 to 12 m, and its steady-state errors are slightly higher than those of the hybrid method. The PID controller exhibits larger transient errors on some UAVs (e.g., peaks exceeding 15 m for UAV2 and UAV3) and converges more slowly. ADRC shows persistent oscillations in most cases, with errors generally fluctuating between 7–12 m, indicating limited convergence. Overall, the hybrid approach maintains the lowest steady-state errors and stronger robustness across multiple platforms, demonstrating superior performance compared with the other three methods.
5.4. Average Formation Spacing Comparison
In UAV formation control tasks, maintaining a stable formation structure is critical for group cooperative flight, obstacle avoidance capability, and error recovery performance. Different control strategies exhibit significant differences in formation stability, dynamic obstacle avoidance, and error convergence. Therefore, it is necessary to conduct systematic experimental analyses to evaluate the applicability and performance advantages of each control method.
As shown in
Figure 9, the evolution of average UAV spacing under different control algorithms is compared with the minimum desired spacing. The hybrid control algorithm consistently maintains the desired spacing, with a peak of about 9.5 m, followed by rapid convergence after 5 s and stable tracking near the desired value (5 m). The SMC shows a similar trend, with an initial peak around 11 m, but it quickly converges and stabilizes near 5 m. In contrast, the PID controller produces much larger fluctuations, with peaks exceeding 18 m, and remains above the desired spacing even after 10 s, indicating slower convergence. The ADRC also exhibits large initial oscillations (peak above 14 m), though it gradually converges to the desired spacing around 6 s. Overall, the hybrid algorithm and SMC achieve better performance in maintaining desired spacing and stability, with the hybrid approach exhibiting the smallest overshoot and the most consistent convergence.
In
Figure 10, a statistical comparison of UAV spacing under different control algorithms is presented, including the mean value, minimum value, maximum value, and standard deviation, with the minimum desired spacing indicated as a reference. Among all methods, the hybrid control algorithm achieves the most favorable balance, with a mean spacing of about 6.7 m, a minimum spacing close to the desired threshold (5 m), and the smallest variability, as indicated by its low standard deviation. The SMC shows similar performance with a mean spacing of approximately 7.5 m, also maintaining the minimum spacing requirement, though its maximum spacing (11.5 m) is slightly larger. In contrast, PID and ADRC produce significantly larger deviations: PID yields the largest mean spacing (14.5 m) and maximum spacing (19 m), while ADRC results in a mean spacing of 8.8 m and a maximum of 15 m, both indicating larger fluctuations. Overall, the hybrid algorithm demonstrates the best ability to maintain safe and stable inter-UAV spacing, followed by SMC, whereas PID and ADRC are less effective.
5.5. Formation Maintenance Analysis
In multi-UAV cooperative flight missions, formation stability is a key metric for assessing formation-control performance. Different control strategies vary in their ability to match UAV speeds, regulate formation error, and recover after obstacle avoidance, directly impacting the overall stability and execution quality of the formation. In this section, we analyze the formation-keeping error and its temporal evolution under three control methods—Hybrid Control, ADRC, and PID—to elucidate the performance characteristics and applicability of each strategy.
As shown in
Figure 11, the formation-keeping errors under different control algorithms are compared over time. The hybrid control algorithm maintains relatively small errors throughout the process, with an initial fluctuation peaking at about 8.5 m, but rapidly decreasing after 5 s and stabilizing within the range of 2–3 m, demonstrating good convergence and stability. The SMC exhibits a similar trend, with an initial peak of approximately 11 m, but it also decreases quickly after 5 s and remains at a low error level. In contrast, the PID controller shows larger errors between 3 and 7 s, with a peak close to 16 m, and converges more slowly, with errors still around 5 m after 10 s. The ADRC produces even larger oscillations, reaching a maximum error of nearly 21 m, and maintains relatively high errors of 6–8 m beyond 10 s. Overall, the hybrid algorithm achieves the best formation-keeping accuracy and convergence performance, followed by SMC, while PID and ADRC are less effective.
As shown in
Figure 12, a statistical comparison of formation-keeping errors under different control algorithms is presented, including average error, maximum error, and standard deviation. The hybrid control algorithm achieves the best overall performance, with the smallest average error (3 m), a low maximum error (9 m), and the lowest variability, indicating stable and accurate formation-keeping. The SMC also performs relatively well, with an average error of about 3.5 m and limited variability, though its maximum error reaches approximately 12 m. In contrast, PID and ADRC show significantly larger errors: both methods yield average errors close to 10 m, while their maximum errors reach 16 m and 21 m, respectively, accompanied by higher standard deviations. Overall, the hybrid approach demonstrates the most favorable formation-keeping accuracy and robustness, followed by SMC, while PID and ADRC are comparatively less effective.
As summarized in
Table 2, the proposed hybrid control consistently achieves the smallest average and stable errors, the lowest maximum deviations, and the minimum variability across all scenarios. These metrics jointly confirm that the hybrid controller not only ensures high tracking accuracy but also maintains strong robustness against disturbances and fluctuations. In contrast, SMC shows secondary advantages with competitive steady-state performance but larger peak errors, while PID and ADRC exhibit substantially higher errors and variability, reflecting weaker dynamic adaptability. This quantitative evidence further validates the superior effectiveness and stability of the proposed approach compared with conventional methods.
5.6. Comparison of Multiple Path-Planning Schemes
Figure 13 illustrates the planned trajectories for a UAV flying from the start point
to the goal
. The layout comprises four views: the top left is the
plane (plan view), the top right is a three-dimensional perspective, the bottom-left is the
plane (side view), and the bottom right is the
plane (front view). The UAV successfully avoids multiple obstacles in a complex environment, following a smooth path that preserves formation. In the plan view, different paths exhibit varying avoidance strategies, with some showing superior smoothness and clearance. The side and front views confirm consistent altitude control, indicating that the planner maintains the UAVs within an appropriate flight-height band. The 3D perspective further demonstrates how obstacle placement shapes the trajectory, validating the algorithm’s effectiveness in three-dimensional spaces. In
Figure 13,
Figure 14 and
Figure 15, colored solid lines denote the trajectories of five UAVs (UAV1-UAV5), while the dashed line indicates the reference path generated by the path-planning algorithm.
Additionally, two other cases are presented.
Figure 14 shows a descent path from
to
, where the UAV descends smoothly while avoiding obstacles.
Figure 15 depicts a climb from ground level
to
, showing gradual, stable ascent with effective obstacle avoidance. Together, these multi-view results confirm the planner’s adaptability and reliability for UAV path control in complex scenarios.
5.7. Lyapunov Convergence Proof and Detailed Analysis of Additional Experimental Results
This section provides a rigorous Lyapunov-based convergence proof for the Hybrid controller under wind-disturbance conditions and offers an in-depth analysis of the remaining simulation outcomes. To illustrate the control performance intuitively, we present the following plots: (i) position-tracking trajectories, (ii) position error and sliding-surface convergence and (iii) the Lyapunov function and its time derivative.
As shown in
Figure 16, the time histories of the actual outputs of UAV1-UAV5 (solid lines) and the leader reference (dashed line) are compared along the (a) X, (b) Y, and (c) Z axes. In both horizontal axes, all agents closely track the reference with negligible steady-state error (below 1 m). Along the vertical axis, a transient overshoot of approximately 10–15 m occurs between 2 s and 5 s, after which the tracking errors converge to within
m. These quantitative results confirm that the proposed hybrid controller achieves precise and robust 3D trajectory tracking in a multi-agent formation.
5.7.1. Error and Sliding-Surface Convergence
As shown in
Figure 17, at
s a deliberate external disturbance was applied, causing the position error norm
to spike to approximately
m. The error then decays rapidly to below
m within the next second and remains bounded between
and
m for the remainder of the simulation. Similarly, the sliding surface norm
peaks at about 16 at the moment of disturbance and converges to under
, maintaining values below
thereafter. These results confirm the finite-time convergence and robust sliding-mode regulation of the proposed hybrid controller in the presence of abrupt disturbances.
5.7.2. Lyapunov Function and Its Time Derivative
As shown in
Figure 18, to assess the energy-dissipation capability of the proposed
Hybrid Algorithms under extreme initial conditions, a step disturbance was injected at
. This excitation caused the quadratic Lyapunov function
to increase abruptly, reaching a peak of
at
. Thereafter,
decays quasi-exponentially, converging to the
level within
. In addition, a minor secondary fluctuation around
reflects transient energy buffering at a trajectory corner. Correspondingly,
remains nonpositive throughout, with a sharp negative spike at the moment of excitation, indicating rapid dissipation of the injected energy. These results confirm that the proposed
Hybrid Algorithms maintain closed-loop asymptotic stability and high robustness in the face of sudden large disturbances.
These findings validate that the combined energy analysis based on the tracking error e and its derivative is effectively captured by the Lyapunov function, providing a solid theoretical foundation for controller parameter tuning and guaranteeing reliable convergence of the multi-UAV formation under wind disturbances. In summary, the comprehensive evaluation of position-tracking performance, sliding-surface convergence, and Lyapunov-function evolution confirms that the proposed Hybrid Control Algorithms achieve rapid transient convergence, minimal steady-state error, and strong robustness against external disturbances.
Section summary: Under identical initial conditions and gust disturbances, the proposed method achieves faster convergence, lower steady-state tracking error, tighter formation keeping, and smaller spacing overshoot than ADRC, SMC, and PID. The results confirm robustness and real-time feasibility in cluttered 3-D environments.
6. Results and Outlook
We address cooperative multi-UAV flight in windy urban 3-D spaces with dense obstacles and present an end-to-end framework from global planning to high-precision formation control. On the planning side, an artificial potential field-guided variant of RRT* embeds velocity and acceleration constraints, together with continuous collision prediction, directly into the sampling, extension, and rewiring stages; a four-layer collision-detection pipeline further reduces the worst-case per-cycle complexity to the expected order of . On the control side, a two-loop MA-AADRC–SMC architecture with an error-driven-bandwidth ESO is devised: the outer, slower loop produces nominal commands, while the inner, faster loop allocates attitude and thrust in real time; a sliding-mode compensator, combined with a speed-weighted vector PID term, endows the system with strong robustness in obstacle avoidance and formation keeping. The two layers are deeply coupled via continuously feasible trajectories and real-time replanning, yielding a closed-loop optimisation scheme. Numerical experiments demonstrate marked improvements in path-tracking accuracy and formation stability under composite wind disturbances and random obstacle layouts, while the Lyapunov function remains monotonically decreasing, corroborating asymptotic closed-loop stability. Compared with classical ADRC and PID baselines, the proposed framework achieves rapid convergence, low steady-state error and safe obstacle avoidance without additional tuning, indicating high potential for engineering deployment.
Beyond simulation, the disturbance, obstacle, and kinematic models we adopt are parameterized to match typical civil multirotors (e.g., DJI Matrice-class) operating in urban airspace. The sensing and control assumptions—GNSS/RTK or visual-inertial odometry for localization, 3D LiDAR or depth cameras for mapping, and PX4/ROS2 middleware for control and communication—are standard in commercial UAV platforms. The proposed architecture therefore aligns with real use cases such as urban infrastructure inspection, post-disaster cooperative mapping, and multi-UAV logistics, suggesting that only minimal adaptation is required for deployment in practice.
It should be noted that the current validation relies entirely on large-scale simulations, which, while comprehensive, cannot fully capture real-world hardware nonlinearities, sensor noise, or communication delays. This limitation underscores the necessity of experimental verification as a crucial next step. In future research, we plan to further investigate the scalability of the proposed framework in large-scale UAV swarms, including scenarios with hundreds of agents, limited communication bandwidth, and GPS-denied environments. In addition, decentralized control strategies and energy-efficient optimization will be considered to enhance robustness and practicality. Hardware-in-the-loop experiments will also be conducted to validate the feasibility of the proposed methods under real-world conditions.