Dynamic Self-Organization and Safe Navigation for Hierarchical Embodied Swarms
Highlights
- A hierarchical embodied swarm framework with corridor-driven split–merge reconfiguration and feasibility projection enables coordinated multi-UAV navigation in complex environments.
- Multi-seed simulations show that LLM-assisted decisions remain feasible under the same projection layer and improve recovery in the most constrained alternating-gate scenario through stronger semantic split–merge reasoning.
- High-level semantic reasoning can be integrated into UAV swarms without sacrificing geometric and kinematic executability when explicit feasibility and safety constraints are enforced.
- The proposed framework provides a practical basis for multi-UAV search, inspection, and other missions in narrow passages and dense obstacle fields that require online reconfiguration and safe cooperative navigation.
Abstract
1. Introduction
- Hierarchical swarm decision architecture and rule-based mode design: A hierarchical embodied swarm architecture with leader, subleader, and follower roles is established, together with a rule-based mode design that organizes corridor negotiation, split–merge reconfiguration, and task-mode switching in a unified decision loop.
- Safe navigation mechanism for cluttered environments: A safety-oriented navigation mechanism is developed, including LiDAR-based obstacle repulsion, occlusion-aware attraction regulation, and geometric safety projection, so that the swarm can maintain coordinated motion in narrow and obstacle-dense environments.
- LLM-assisted decision making under rule constraints: An LLM-assisted top-layer decision interface is introduced to connect semantic reasoning with swarm structural control. By combining language-guided decision proposals with rule constraints and feasibility projection, only geometrically and kinematically executable actions are delivered to the bottom-layer controller. The multi-seed complex-scenario evaluation further shows that this semantic layer is most useful when repeated split–merge reasoning is required, as in the narrow alternating gate benchmark.
2. Notation and System Modeling
2.1. Notation
- The total number of UAVs is N, and the UAV indices are .
- Continuous time is denoted by ; the high-level decision layer is updated at discrete epochs indexed by with sampling step .
- The number of sub-swarms is , and the partition set is .
- The sth sub-swarm is denoted by with .
- Vectors are boldface: position, velocity, acceleration, and control input are denoted as and , respectively.
2.2. Hierarchical Organization and State Reporting
2.3. UAV Dynamics
2.4. LiDAR Perception Model
3. High-Level Dynamic Self-Organization and Decision Making
3.1. Action Sets and Task Modes
3.2. Rule-Based Decisions: Split and Merge
3.3. LLM Decision and Feasibility Projection
4. Low-Level Controller Design
4.1. Reference Trajectory and Attraction Term
4.2. Separation, Cohesion, and Following Coupling
4.3. LiDAR-Based Obstacle Repulsion
4.4. Safe Navigation Pipeline
- Collect Pack data from each sub-swarm and update high-level state summaries.
- Generate structural candidate actions via rules or the LLM, then project to feasible action .
- Update task mode (navigate/encircle), sub-swarm waypoints, and reference formations; execute dynamic follower reassignment (see below).
- Synthesize low-level control terms and apply acceleration saturation.
5. Simulation and Analysis
5.1. Protocol and Metric Definitions
5.2. Simulation 1: Rule-Based and LLM Decision Comparison
5.3. Simulation 2: Validation of the Risk-Control Mechanism
5.4. Simulation 3: Navigation in More Complex Scenarios
- default_corridor: The standard scenario from Simulations 1–2. Two diagonal interior blockers (, ) and one central separator (, ) create a two-branch passage; the goal is at . This serves as the reference baseline.
- narrow_alternating_gates: Five single-sided gate walls alternate between upper and lower halves of the channel at and , each leaving only a sub- gap on the opposite side. The swarm must make repeated, correctly directed split decisions; the goal is at .
- dense_cross_blocks: Eight rectangular blocks are arranged at irregular positions, forming cross-shaped obstacles at and along the channel. Multiple simultaneous branching choices and frequent split–merge cycles are required; the goal is at .
- off_axis_goal: Two pairs of symmetric lateral blockers and one central separator redirect the swarm, and the goal is placed at (off the channel axis) to test goal-directed recovery when the required heading deviates from the corridor main axis.
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A. Three-Dimensional Perspective Views of Simulation 1

Appendix B. Lyapunov Error-Boundedness Analysis of the Smooth Controller Core
Appendix C. Simulation Parameter Summary
| Parameter | Value | Description |
|---|---|---|
| Dynamics and sensing | ||
| 20, | Swarm size; integration step | |
| UAV mass | ||
| , | LiDAR range; repulsion trigger radius | |
| LiDAR angular resolution ( = 360 rays) | ||
| Acceleration saturation | ||
| Force-synthesis gains | ||
| , | Attraction position/velocity gains | |
| , | LiDAR repulsion base gain; emergency gain | |
| , | Short-range boost factor and exponent | |
| , , | Separation; intra-swarm cohesion; intra-swarm repulsion gains | |
| , , | Corridor centering; follower–subleader coupling gains | |
| Centroid cohesion gain | ||
| , | Safety distance; occlusion attenuation factor | |
| High-level structural decisions | ||
| Split–merge cooldown | ||
| , | Encircle enter/exit thresholds | |
| , | Near-goal and inter-swarm merge thresholds | |
| Follower reassignment | ||
| , | Subleader- and waypoint-distance cost weights | |
| , | Minimum improvement threshold; per-follower cooldown | |
| 3 | Maximum number of transfers per planner update | |
| LLM interface | ||
| 50 steps () | LLM invocation interval | |
| Decoding temperature | 0 (greedy) | Deterministic LLM decoding setting |
| Max tokens | 64 | Maximum output length |
| Timeout | Per-call inference timeout | |
References
- Skorobogatov, G.; Barrado, C.; Salamí, E. Multiple UAV systems: A survey. Unmanned Syst. 2020, 8, 149–169. [Google Scholar] [CrossRef]
- Javed, S.; Hassan, A.; Ahmad, R.; Ahmed, W.; Ahmed, R.; Saadat, A.; Guizani, M. State-of-the-art and future research challenges in UAV swarms. IEEE Internet Things J. 2024, 11, 19023–19045. [Google Scholar] [CrossRef]
- Wang, Y. Reinforcement Learning-Based Methods for Cooperative Control of UAVs: Challenges and Perspectives. Guid. Navig. Control 2025, 5, 435–438. [Google Scholar] [CrossRef]
- Reynolds, C.W. Flocks, herds and schools: A distributed behavioral model. ACM Siggraph Comput. Graph. 1987, 21, 25–34. [Google Scholar] [CrossRef]
- Vicsek, T.; Czirók, A.; Ben-Jacob, E.; Cohen, I.; Shochet, O. Novel type of phase transition in a system of self-driven particles. Phys. Rev. Lett. 1995, 75, 1226. [Google Scholar] [CrossRef]
- Shafiq, M.; Ali, Z.A.; Israr, A.; Alkhammash, E.H.; Hadjouni, M. A multi-colony social learning approach for the self-organization of a swarm of UAVs. Drones 2022, 6, 104. [Google Scholar] [CrossRef]
- Chen, B.; Hu, J.; Ghosh, B.K. Finite-time tracking control of heterogeneous multi-AUV systems with partial measurements and intermittent communication. Sci. China Inf. Sci. 2024, 67, 152202. [Google Scholar] [CrossRef]
- Xu, B.; Bai, G.; Liu, T.; Fang, Y.; Zhang, Y.a.; Tao, J. An improved swarm model with informed agents to prevent swarm-splitting. Chaos Solitons Fractals 2023, 169, 113296. [Google Scholar] [CrossRef]
- Zhang, C.; Ji, L.; Yang, S.; Guo, X.; Li, H. Distributed optimal consensus control for multiagent systems based on event-triggered and prioritized experience replay strategies. Sci. China Inf. Sci. 2025, 68, 112206. [Google Scholar] [CrossRef]
- Zhu, H.; Juhl, J.; Ferranti, L.; Alonso-Mora, J. Distributed multi-robot formation splitting and merging in dynamic environments. In Proceedings of the 2019 International Conference on Robotics and Automation (ICRA); IEEE: Piscataway, NJ, USA, 2019; pp. 9080–9086. [Google Scholar] [CrossRef]
- Zhu, K.; Han, B.; Zhang, T. Multi-UAV distributed collaborative coverage for target search using heuristic strategy. Guid. Navig. Control 2021, 1, 2150002. [Google Scholar] [CrossRef]
- Zhang, X.; Ding, W.; Wang, Y.; Luo, Y.; Zhang, Z.; Xiao, J. Bio-inspired self-organized fission-fusion control algorithm for UAV swarm. Aerospace 2022, 9, 714. [Google Scholar] [CrossRef]
- Wang, N.; Jia, W.; Wu, H.; Wang, Y. Event-triggered self-organizing swarm control of distributed unmanned surface vehicles. IEEE Trans. Intell. Transp. Syst. 2025, 26, 3431–3445. [Google Scholar] [CrossRef]
- Sendra-Arranz, R.; Gutierrez, A.; Christensen, A.L. Evolution of transferable and self-organized communication modules for solving multiple swarm robotics tasks. IEEE Trans. Cybern. 2026, 56, 595–608. [Google Scholar] [CrossRef]
- Li, J.; Fang, Y.; Cheng, H.; Wang, Z.; Wu, Z.; Zeng, M. Large-scale fixed-wing UAV swarm system control with collision avoidance and formation maneuver. IEEE Syst. J. 2023, 17, 744–755. [Google Scholar] [CrossRef]
- Zhu, W.; Ouz, S.; Heinrich, M.K.; Allwright, M.; Wahby, M.; Christensen, A.L.; Garone, E.; Dorigo, M. Self-organizing nervous systems for robot swarms. Sci. Robot. 2024, 9, eadl5161. [Google Scholar] [CrossRef] [PubMed]
- de Groot, O.; Ferranti, L.; Gavrila, D.M.; Alonso-Mora, J. Topology-driven parallel trajectory optimization in dynamic environments. IEEE Trans. Robot. 2025, 41, 110–126. [Google Scholar] [CrossRef]
- Zhang, G.; Li, Z.; Li, J.; Huang, J. Robust Event-Triggered Path Following Control for a Rotor-Assisted Vehicle With Output Constraints: Theory and Experiment. IEEE Trans. Ind. Electron. 2026, 73, 10594–10604. [Google Scholar] [CrossRef]
- Wang, M.; Niu, Y.; Wang, B.; Zhang, W.; Wang, C. A survey on learning motion planning and control for mobile robots: Toward embodied intelligence. IEEE Trans. Neural Netw. Learn. Syst. 2026. Early access. [Google Scholar] [CrossRef]
- Feng, Z.; Xue, R.; Yuan, L.; Yu, Y.; Ding, N.; Liu, M.; Gao, B.; Sun, J.; Zheng, X.; Wang, G. Multi-agent embodied AI: Advances and future directions. arXiv 2025. [Google Scholar] [CrossRef]
- Liu, S.; Zhang, H.; Qi, Y.; Wang, P.; Zhang, Y.; Wu, Q. AerialVLN: Vision-and-Language Navigation for UAVs. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: Piscataway, NJ, USA, 2023; pp. 15384–15394. [Google Scholar] [CrossRef]
- Guo, M.; Wu, M.; He, J.; Li, S.; Li, H.; Tao, C. BEDI: A comprehensive benchmark for evaluating embodied agents on UAVs. ISPRS J. Photogramm. Remote Sens. 2026, 232, 910–936. [Google Scholar] [CrossRef]
- Wang, X.; Yang, D.; Liao, Y.; Zheng, W.; Dai, B.; Wu, W.; Li, H.; Liu, S. UAV-Flow Colosseo: A real-world benchmark for flying-on-a-word UAV imitation learning. arXiv 2025, arXiv:2505.15725. [Google Scholar] [CrossRef]
- Agal, S.; Odedra, N.D. Decentralized reinforcement learning for scalable embodied intelligence in robotic swarms. Embodied Intell. Robot. 2025, X, 1–16. [Google Scholar] [CrossRef]
- Ahn, M.; Brohan, A.; Brown, N.; Chebotar, Y.; Cortes, O.; David, B.; Finn, C.; Fu, C.; Gopalakrishnan, K.; Hausman, K.; et al. Do as I can, not as I say: Grounding language in robotic affordances. In Proceedings of the 6th Conference on Robot Learning (CoRL 2022), Auckland, New Zealand, 16 August 2022. [Google Scholar] [CrossRef]
- Huang, W.; Xia, F.; Xiao, T.; Chan, H.; Liang, J.; Florence, P.; Zeng, A.; Tompson, J.; Mordatch, I.; Chebotar, Y.; et al. Inner monologue: Embodied reasoning through planning with language models. In Proceedings of the 6th Conference on Robot Learning (CoRL 2022), Auckland, New Zealand, 16 August 2022. [Google Scholar] [CrossRef]
- Kannan, S.S.; Venkatesh, V.L.N.; Min, B. SMART-LLM: Smart multi-agent robot task planning using large language models. In Proceedings of the 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS); IEEE: Abu Dhabi, UAE, 2024; pp. 12140–12147. [Google Scholar] [CrossRef]
- Yu, H.; Wang, C.; Niu, Y.; Wu, L. MUTP-LLM: Empowering Multi-UAV Task Planning with Large Language Models. Guid. Navig. Control 2025, 5, 477–489. [Google Scholar] [CrossRef]
- Zhu, M.; Pang, T.; Gao, M.; Xu, H.; Yu, G.; Xia, F. A novel UAV swarm decision-making system under unified task and spatial view. In Proceedings of the International Conference on Artificial Intelligence, Internet of Things and Cloud Computing Technology (AIoTC), Guilin, China, 8–10 August 2025. [Google Scholar] [CrossRef]
- Hai, X.; Qiu, H.; Wen, C.; Feng, Q. A novel distributed situation awareness consensus approach for UAV swarm systems. IEEE Trans. Intell. Transp. Syst. 2023, 24, 14706–14717. [Google Scholar] [CrossRef]
- Wang, Z.; Li, J.; Li, J.; Liu, C. A decentralized decision-making algorithm of UAV swarm with information fusion strategy. Expert Syst. Appl. 2024, 237, 121444. [Google Scholar] [CrossRef]
- Wei, Z.; Wei, R. An information aggregation decision making method for UAV swarm intelligence system based on joint communication and proximal strategy. Expert Syst. Appl. 2026, 298, 129617. [Google Scholar] [CrossRef]
- Shahid, S.; Zhen, Z.; Javaid, U.; Wen, L. Offense-defense distributed decision making for swarm vs. swarm confrontation while attacking the aircraft carriers. Drones 2022, 6, 271. [Google Scholar] [CrossRef]
- Xia, W.; Zhou, Z.; Jiang, W.; Zhang, Y. Dynamic UAV swarm confrontation: An imitation based on mobile adaptive networks. IEEE Trans. Aerosp. Electron. Syst. 2023, 59, 7183–7202. [Google Scholar] [CrossRef]
- Jiang, Q.; Yan, Y.; Dai, Y.; Yang, Z.; Cao, H.; Wang, B.; Ma, X. Autonomous task planning of intelligent unmanned aerial vehicle swarm based on deep deterministic policy gradient. Drones 2025, 9, 272. [Google Scholar] [CrossRef]
- Zheng, Z.; Wei, C.; Duan, H. UAV swarm air combat maneuver decision-making method based on multi-agent reinforcement learning and transferring. Sci. China Inf. Sci. 2024, 67, 180204. [Google Scholar] [CrossRef]
- Chen, W.; Hai, X.; Hu, Y.; Feng, Q.; Wang, Z. Hierarchical decision-making framework for multi-UAV task assignment via enhanced pigeon-inspired optimization. Guid. Navig. Control 2023, 3, 2350028. [Google Scholar] [CrossRef]
- Roy, D.; Maitra, M.; Bhattacharya, S. Exploration of multiple unknown areas by swarm of robots utilizing virtual-region-based splitting and merging technique. IEEE Trans. Autom. Sci. Eng. 2021, 19, 3459–3470. [Google Scholar] [CrossRef]
- Gazi, V.; Passino, K.M. Stability analysis of swarms. IEEE Trans. Autom. Control 2003, 48, 692–697. [Google Scholar] [CrossRef]
- Olfati-Saber, R. Flocking for multi-agent dynamic systems: Algorithms and theory. IEEE Trans. Autom. Control 2006, 51, 401–420. [Google Scholar] [CrossRef]
- Tanner, H.G.; Jadbabaie, A.; Pappas, G.J. Flocking in fixed and switching networks. IEEE Trans. Autom. Control 2007, 52, 863–868. [Google Scholar] [CrossRef]









| Symbol | Meaning |
|---|---|
| N | Total number of UAVs. |
| t; | Continuous time; discrete decision epoch and sampling step. |
| Position, velocity, and acceleration of UAV i. | |
| Control input and mass of UAV i. | |
| Number of sub-swarms, partition set, and sth sub-swarm. | |
| Subleader index and local waypoint of sub-swarm s. | |
| Global target point. | |
| Centroid of sub-swarm s and global centroid. |
| Variant | (m) | (m) | (m) | (m) | |||
|---|---|---|---|---|---|---|---|
| rule | |||||||
| qwen2.5:3b | |||||||
| qwen3.5:4b |
| Variant | (m) | (m) | Path Stretch | (m) | |||
|---|---|---|---|---|---|---|---|
| safety_stack_on | |||||||
| safety_stack_off |
| Variant | (m) | (m) | Path Stretch | (m) | |||
|---|---|---|---|---|---|---|---|
| Full safety stack | |||||||
| w/o occ. attenuation | |||||||
| w/o acc. bound |
| Scenario | (m) | (m) | ||||||
|---|---|---|---|---|---|---|---|---|
| default_corridor | ||||||||
| narrow_alternating_gates | ||||||||
| dense_cross_blocks | ||||||||
| off_axis_goal |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Wu, L.; Wei, C. Dynamic Self-Organization and Safe Navigation for Hierarchical Embodied Swarms. Drones 2026, 10, 453. https://doi.org/10.3390/drones10060453
Wu L, Wei C. Dynamic Self-Organization and Safe Navigation for Hierarchical Embodied Swarms. Drones. 2026; 10(6):453. https://doi.org/10.3390/drones10060453
Chicago/Turabian StyleWu, Lanbo, and Chen Wei. 2026. "Dynamic Self-Organization and Safe Navigation for Hierarchical Embodied Swarms" Drones 10, no. 6: 453. https://doi.org/10.3390/drones10060453
APA StyleWu, L., & Wei, C. (2026). Dynamic Self-Organization and Safe Navigation for Hierarchical Embodied Swarms. Drones, 10(6), 453. https://doi.org/10.3390/drones10060453

