Reinforcement-Learning-Based Geographic Routing Considering Future Evolution of Link States for UAV Networks
Highlights
- This paper proposes a reinforcement-learning-based geographic routing protocol that incorporates a new multi-parameter fusion link-state evaluation method and a new routing hole bypass method.
- Simulation results show that compared to existing geographic routing protocols, the proposed one achieves higher packet reception rate, lower energy consumption and end-to-end latency.
- Considering the future evolution of link states helps the geographic routing protocol effectively cope with link fluctuations caused by high-speed movement of UAVs, thereby more suitable for highly dynamic UAV networks.
Abstract
1. Introduction
2. Related Works
3. System Model
3.1. Mobility Model
3.2. Channel Model
3.3. Network Performance Metrics
3.4. Problem Formulation
4. Description of the Proposed Protocol
4.1. Multi-Parameter Fusion-Based Link-State Evaluation
4.2. Reinforcement-Learning-Based Geographic Routing
4.3. The Routing-Hole Bypass Method
5. Simulation Setup
5.1. Protocols for Comparison
5.2. Simulation Parameters
5.3. Implementing Hardware-in-the-Loop Protocol Simulation
6. Simulation Results of Protocols
6.1. Simulation Results
6.2. HIL Simulation Results
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Fan, C.; Zhou, X.; Zhang, T.; Yi, W.; Liu, Y. Cache-enabled UAV emergency communication networks: Performance analysis with stochastic geometry. IEEE Trans. Veh. Technol. 2023, 72, 9308–9321. [Google Scholar] [CrossRef]
- Du, Z.; Luo, C.; Min, G.; Wu, J.; Luo, C.; Pu, J.; Li, S. A survey on autonomous and intelligent swarms of uncrewed aerial vehicles (UAVs). IEEE Trans. Intell. Transp. Syst. 2025, 26, 14477–14500. [Google Scholar] [CrossRef]
- Lakew, D.S.; Sa’ad, U.; Dao, N.-N.; Na, W.; Cho, S. Routing in flying ad hoc networks: A comprehensive survey. IEEE Commun. Surv. Tutor. 2020, 22, 1071–1120. [Google Scholar] [CrossRef]
- Yuan, X.; Su, J.; Xia, Y. A survey on routing in flying ad hoc networks: Scenario characteristics, multi-dimensional classification and future prospects. Chin. J. Comput. 2025, 48, 3000–3030. [Google Scholar]
- Zhou, Z.; Tang, J.; Feng, W.; Zhao, N.; Yang, Z.; Wong, K.-K. Optimized routing protocol through exploitation of trajectory knowledge for UAV swarms. IEEE Trans. Veh. Technol. 2024, 73, 15499–15512. [Google Scholar] [CrossRef]
- Ferronato, J.J.; Trentin, M.A.S. Analysis of routing protocols OLSR, AODV and ZRP in real urban vehicular scenario with density variation. IEEE Lat. Am. Trans. 2017, 6 15, 1727–1734. [Google Scholar] [CrossRef]
- Khanh, Q.V.; Chehri, A.; Nam, V.H.; Hue, C.T.M.; Quy, N.M. Performance evaluation of routing protocol for 6G UAV communication networks. In Proceedings of the 2024 IEEE 99th Vehicular Technology Conference (VTC2024-Spring), Singapore, 24–27 June 2024; IEEE: New York, NY, USA, 2024. [Google Scholar]
- Tan, X.; Zuo, Z.; Su, S.; Guo, X.; Sun, X.; Jiang, D. Performance analysis of routing protocols for UAV communication networks. IEEE Access 2020, 8, 92212–92224. [Google Scholar] [CrossRef]
- Bujari, A.; Palazzi, C.E.; Ronzani, D. A comparison of stateless position-based packet routing algorithms for FANETs. IEEE Trans. Mob. Comput. 2018, 17, 2468–2482. [Google Scholar] [CrossRef]
- Rosati, S.; Krużelecki, K.; Heitz, G.; Floreano, D.; Rimoldi, B. Dynamic routing for flying ad hoc networks. IEEE Trans. Veh. Technol. 2016, 65, 1690–1700. [Google Scholar] [CrossRef]
- Gangopadhyay, S.; Jain, V.K. A position-based modified OLSR routing protocol for flying ad hoc networks. IEEE Trans. Veh. Technol. 2023, 72, 12087–12098. [Google Scholar] [CrossRef]
- Wang, C.-M.; Yang, S.; Dong, W.-Y.; Zhao, W.; Lin, W. A distributed hybrid proactive–reactive ant colony routing protocol for highly dynamic FANETs with link quality prediction. IEEE Trans. Veh. Technol. 2025, 74, 1817–1822. [Google Scholar] [CrossRef]
- Cui, Y.; Tian, H.; Chen, C.; Ni, W.; Wu, H.; Nie, G. New geographical routing protocol for three-dimensional flying ad hoc network based on new effective transmission range. IEEE Trans. Veh. Technol. 2023, 72, 16135–16147. [Google Scholar] [CrossRef]
- Singh, V.; Sharma, K.P.; Verma, H.K.; Kumar, G.; Balusamy, B.; Rani, S.; Jiang, W.; Çırpan, H.A. A-Geo: Adaptive geographic routing for consumer FANETs in next-generation communication. IEEE Trans. Consum. Electron. 2025, 71, 11034–11043. [Google Scholar] [CrossRef]
- Cui, Y.; Liu, L.; Zuo, X.; Yang, X.; Zhang, W.; Hou, Z.; Feng, Z. Seeing is better than hearing: Sensing-assisted efficient routing scheme for UAV network. In Proceedings of the IEEE/CIC International Conference on Communications in China, Shanghai, China, 26–28 August 2025; IEEE: New York, NY, USA, 2025. [Google Scholar]
- Jiang, M.; Zhang, Q.; Feng, Z.; Han, Z.; Li, W. Mobility prediction based virtual routing for ad hoc UAV network. In Proceedings of the 2019 IEEE Global Communications Conference (GLOBECOM), Waikoloa, HI, USA, 9–13 December 2019; IEEE: New York, NY, USA, 2019. [Google Scholar]
- Asadpour, M.; Hummel, K.A.; Giustiniano, D.; Draskovic, S. Route or carry: Motion-driven packet forwarding in micro aerial vehicle networks. IEEE Trans. Mob. Comput. 2017, 16, 843–856. [Google Scholar] [CrossRef]
- Zhou, T.; Yan, F.; Shen, F.; Xia, W.; Shen, L. A geographic location prediction-based routing algorithm for flying ad hoc networks. In Proceedings of the 2023 IEEE/CIC International Conference on Communications in China (ICCC), Dalian, China, 10–12 August 2023; IEEE: New York, NY, USA, 2023. [Google Scholar]
- Li, X.; Sun, H. Prediction-based reactive-greedy routing protocol for flying ad hoc networks. Wirel. Netw. 2025, 31, 2893–2907. [Google Scholar] [CrossRef]
- Jung, W.-S.; Yim, J.; Ko, Y.-B. QGeo: Q-learning-based geographic ad hoc routing protocol for unmanned robotic networks. IEEE Commun. Lett. 2017, 21, 2258–2261. [Google Scholar] [CrossRef]
- Wei, C.; Wang, Y.; Wang, X.; Tang, Y. QFAGR: A Q-learning-based fast adaptive geographic routing protocol for flying ad hoc networks. In Proceedings of the IEEE Global Communications Conference, Kuala Lumpur, Malaysia, 4–8 December 2023; IEEE: New York, NY, USA, 2023. [Google Scholar]
- Huang, S.; Tang, J.; Zhou, Z.; Yang, G.; Davydov, M.V.; Wong, K.K. A Q-learning and fuzzy logic based routing protocol for UAV networks. In Proceedings of the 2024 16th International Conference on Wireless Communications and Signal Processing (WCSP), Hefei, China, 23–25 October 2024; IEEE: New York, NY, USA, 2024. [Google Scholar]
- Wu, M.; Jiang, B.; Chen, S.; Xu, H.; Pang, T.; Gao, M.; Xia, F. Traj-Q-GPSR: A trajectory-informed and Q-learning enhanced GPSR protocol for mission-oriented FANETs. Drones 2025, 9, 489. [Google Scholar] [CrossRef]
- Clausen, T.; Jacquet, P. Optimized Link State Routing Protocol (OLSR); RFC 3626; The Internet Engineering Task Force (IETF): Fremont, CA, USA, 2003. [Google Scholar]
- Perkins, C.; Belding-Royer, E.; Das, S. Ad Hoc On-Demand Distance Vector (AODV) Routing; RFC 3561; The Internet Engineering Task Force (IETF): Fremont, CA, USA, 2003. [Google Scholar]
- Karp, B.; Kung, H.-T. GPSR: Greedy perimeter stateless routing for wireless networks. In Proceedings of the 6th ACM Annual International Conference on Mobile Computing and Networking (MobiCom), Boston, MA, USA, 6–11 October 2000; Association for Computing Machinery: New York, NY, USA, 2000. [Google Scholar]
- Ede, B.; Kaplan, B.; Kahraman, İ.; Keşir, S.; Yarkan, S.; Ekti, A.R.; Baykaş, T.; Görçin, A.; Çırpan, H.A. Measurement-based large-scale statistical modeling of air-to-air wireless UAV channels via novel time–frequency analysis. IEEE Wirel. Commun. Lett. 2022, 11, 136–140. [Google Scholar] [CrossRef]
- Hua, B.; Han, L.; Deng, Q.; Zhu, Q.; Li, H.; Qu, Y.; Briso-Rodríguez, C.; Mao, K. AAV air-to-air channel: Statistical properties and experimental verification. IEEE Internet Things J. 2025, 12, 25790–25803. [Google Scholar] [CrossRef]
- Xu, M.; Liu, W.; Xu, C.; Zhang, Y.; Zhang, K.; Feng, Y.; Xia, Y.; Huang, D. Implementing hardware-in-the-loop protocol simulation for UAV Networks. In Proceedings of the 28th Asia Pacific Conference on Communications (APCC), Sydney, Australia, 9–22 November 2023; IEEE: New York, NY, USA, 2023. [Google Scholar]










| Parameter | Value |
|---|---|
| Network lifetime (s) | 120 |
| Deployment areas (m3) | 4003, 6003, 8003, 10003, 12003 |
| Number of UAVs | 20, 30, 40, 50, 60 |
| Communication range (m) | 250, 300, 350, 400, 450 |
| Mobility model | RWP |
| Propagation loss model | Log-normal integrates both shadowing and Rayleigh fading |
| Channel model parameters | η = 2, σ = 2 dB, σR = 1 dB |
| Receiver sensitivity | −98 dBm |
| Beacon interval | 3 s |
| MAC | IEEE 802.11b |
| Maximum flight speed (m/s) | 5, 10, 20, 40, 60 |
| Minimum beacon interval (s) | 1 |
| Packet size | 200 bits |
| Bandwidth | 384 kHz |
| Data rate | 250 bps |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Xu, M.; Xia, Y.; Liu, W.; Huang, D. Reinforcement-Learning-Based Geographic Routing Considering Future Evolution of Link States for UAV Networks. Drones 2026, 10, 150. https://doi.org/10.3390/drones10020150
Xu M, Xia Y, Liu W, Huang D. Reinforcement-Learning-Based Geographic Routing Considering Future Evolution of Link States for UAV Networks. Drones. 2026; 10(2):150. https://doi.org/10.3390/drones10020150
Chicago/Turabian StyleXu, Ming, Yu Xia, Wei Liu, and Daqing Huang. 2026. "Reinforcement-Learning-Based Geographic Routing Considering Future Evolution of Link States for UAV Networks" Drones 10, no. 2: 150. https://doi.org/10.3390/drones10020150
APA StyleXu, M., Xia, Y., Liu, W., & Huang, D. (2026). Reinforcement-Learning-Based Geographic Routing Considering Future Evolution of Link States for UAV Networks. Drones, 10(2), 150. https://doi.org/10.3390/drones10020150

