Reinforcement Learning-Based Handover Algorithm for 5G/6G AI-RAN
Abstract
1. Introduction
- For RL agent training, we propose to use data that has undergone primary multilinear processing: tensor decomposition and clustering of factor matrices;
- We propose to train the RL agent based on identified trends after multilinear data processing;
- We develop a method in which the RL agent manages the HO process based on user classification by mobility type;
- Simulation results show that in a ping-pong HO scenario, the proposed algorithm avoids a large number of HOs while maintaining a high data rate.
1.1. Formulation of HO Problem
1.2. Review of Existing Articles
1.2.1. Application of AI/ML Algorithms in Mobile Networks
1.2.2. Application of RL Algorithm in Mobility Management
2. Materials and Methods
2.1. Description of the Overall Concept for HO Management
2.2. Proposed HO Algorithm Based on RL
- Initializing HO to any neighbor BS;
- Deciding not to initiate any HO.
3. Results
3.1. Simulation Setup
3.2. Results
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| 3GPP | Third Generation Partnership Project |
| 4G | Fourth Generation |
| 5G | Fifth Generation |
| 6G | Sixth Generation |
| AI | Artificial Intelligence |
| ANN | Artificial Neural Network |
| B5G | Beyond 5G |
| BS | Base Station |
| CPD | Canonical Polyadic Decomposition |
| CHO | Conditional Handover |
| CMAB | Contextual Multi-Arm Bandit |
| DBSCAN | Density-Based Spatial Clustering of Applications with Noise |
| DRL | Deep Reinforcement Learning |
| DNN | Deep Neural Network |
| DDRL | Double Deep Reinforcement Learning |
| DDQN | Double Deep Q-Network |
| FLC | Fuzzy Logic Control |
| GPU | Graphical Processing Unit |
| HM | Hysteresis Margin |
| HO | Handover |
| LSTM | Long Short-Term Memory |
| LTE | Long-Term Evolution |
| Near-RT | Near-Real-Time |
| Non-RT | Non-Real-Time |
| NR | New Radio |
| ML | Machine Learning |
| O-RAN | Open Radio Access Network |
| PPO | Proximal Policy Optimization |
| QoE | Quality of Experience |
| QoS | Quality of Service |
| ReLU | Rectified Linear Unit |
| RIC | RAN Intelligent Controller |
| RL | Reinforcement Learning |
| RSRP | Reference Signal Received Power |
| RSRQ | Reference Signal Received Quality |
| RT | Real Time |
| SINR | Signal-to-Noise and Interference Ratio |
| SL | Supervised Learning |
| SNR | Signal-to-Noise Ratio |
| TTT | Time-to-Trigger |
| UAV | Unmanned Aerial Vehicle |
| UDSC | Ultra-Dense Small Cells |
| UE | User Equipment |
References
- Agiwal, M.; Roy, A.; Saxena, N. Next Generation 5G Wireless Networks: A Comprehensive Survey. IEEE Commun. Surv. Tutor. 2016, 18, 1617–1655. [Google Scholar] [CrossRef]
- Shafi, M.; Molisch, A.F.; Smith, P.J.; Haustein, T.; Zhu, P.; De Silva, P.; Tufvesson, F.; Benfebbour, A.; Wunder, G. 5G: A Tutorial Overview of Standards, Trials, Challenges, Deployment, and Practice. IEEE J. Sel. Areas Commun. 2017, 35, 1201–1221. [Google Scholar] [CrossRef]
- Polese, M.; Giordani, M.; Zorzi, M. 3GPP NR: The Standard for 5G Cellular Networks. In 5G Italy White eBook: From Research to Market; Marsan, M.A., Melazzi, N.B., Buzzi, S., Eds.; Consorzio Nazionale Interuniversitario per le Telecomunicazioni: Parma, Italy, 2018; pp. 69–78. ISBN 978-8-8321-7001-6. [Google Scholar]
- Kozlov, S.; Spirina, E.; Ashaev, I.; Bukharina, A.; Gaysin, A. Novel Modification of the Collective Dynamic Routing Method for Sensors’ Communication in Wi-Fi Public Networks. Sensors 2022, 22, 8602. [Google Scholar] [CrossRef] [PubMed]
- O-RAN Alliance, O-RAN: Towards an Open and Smart RAN. Available online: https://mediastorage.o-ran.org/white-papers/O-RAN.White-Paper-2018-10.pdf (accessed on 7 November 2025).
- Singh, S.K.; Singh, R.; Kumbhani, B. The Evolution of Radio Access Network towards Open-RAN: Challenges and Opportunities. In Proceedings of the 2020 IEEE Wireless Communications and Networking Conference Workshops (WCNCW), Seoul, Republic of Korea, 6–9 April 2020; pp. 1–6. [Google Scholar]
- Balasubramanian, B.; Daniels, E.S.; Hiltunen, M.; Jana, R.; Joshi, K.; Tran, T.X.; Wang, C. RIC: A RAN Intelligent Controller Platform for AI-Enabled Cellular Networks. IEEE Internet Comput. 2021, 25, 7–17. [Google Scholar] [CrossRef]
- Taleb, T.; Benzaïd, C.; Addad, R.A.; Samdanis, K. AI/ML for Beyond 5G Systems: Concepts, Technology Enablers & Solutions. Comput. Netw. 2023, 237, 110044. [Google Scholar] [CrossRef]
- Tanveer, J.; Haider, A.; Ali, R.; Kim, A. An Overview of Reinforcement Learning Algorithms for Handover Management in 5G Ultra-Dense Small Cell Networks. Appl. Sci. 2022, 12, 426. [Google Scholar] [CrossRef]
- Agarwal, B.; Irmer, R.; Lister, D.; Muntean, G.M. Open RAN for 6G Networks: Architecture, Use Cases and Open Issues. IEEE Commun. Surv. Tutor. 2025, 27, 2881–2924. [Google Scholar] [CrossRef]
- Basaran, O.T.; Zafar, H.; Kasparick, M.; Dressler, F.; Stańczak, S. Next-Gen AI-on-RAN: AI-native, Interoperable, and GPU-Accelerated Testbed Towards 6G Open-RAN. In Proceedings of the 2025 IEEE International Conference on Communications (ICC), Montreal, QC, Canada, 8–12 June 2025; pp. 1–6. [Google Scholar]
- 5G NR, NR and NG-RAN Overall Description Stage-2, Document TS 38.300 v15.8.0. 2019. Available online: https://www.etsi.org/deliver/etsi_ts/138300_138399/138300/15.08.00_60/ts_138300v150800p.pdf (accessed on 7 November 2025).
- 5G NR, Radio Resource Control (RRC) Protocol Specification, Document TS 38.331 v15.8.0. 2019. Available online: https://www.etsi.org/deliver/etsi_ts/138300_138399/138331/15.08.00_60/ts_138331v150800p.pdf (accessed on 7 November 2025).
- Martikainen, H.; Viering, I.; Lobinger, A.; Jokela, T. On the Basics of Conditional Handover for 5G Mobility. In Proceedings of the 2018 IEEE 29th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC), Bologna, Italy, 9–12 September 2018; pp. 1–7. [Google Scholar]
- Kim, C.; Dudin, A.; Dudin, S.; Dudina, O. Mathematical model of operation of a cell of a mobile communication network with adaptive modulation schemes and handover of mobile users. IEEE Access 2021, 9, 106933–106946. [Google Scholar] [CrossRef]
- Stanczak, J.; Karabulut, U.; Awada, A. Conditional Handover in 5G—Principles, Future Use Cases and FR2 Performance. In Proceedings of the 2022 International Wireless Communications and Mobile Computing (IWCMC), Dubrovnik, Croatia, 30 May–3 June 2022; pp. 1–6. [Google Scholar]
- Lin, H.-P.; Juang, R.-T.; Lin, D.-B. Validation of an improved location-based handover algorithm using GSM measurement data. IEEE Trans. Mob. Comput. 2005, 4, 530–536. [Google Scholar] [CrossRef]
- Evolved Universal Terrestrial Radio Access (E-UTRA); Mobility Enhancements in Heterogeneous Networks, Document TR 36.839 v11.1.0. 2013. Available online: https://www.3gpp.org/ftp/Specs/archive/36_series/36.839/36839-b10.zip (accessed on 7 November 2025).
- Tayyab, M.; Gelabert, X.; Jäntti, R. A Survey on Handover Management: From LTE to NR. IEEE Access 2019, 7, 118907–118930. [Google Scholar] [CrossRef]
- Haghrah, A.; Abdollahi, M.P.; Azarhava, H.; Niya, J.M. A Survey on the Handover Management in 5G-NR Cellular Networks: Aspects, Approaches and Challenges. EURASIP J. Wirel. Commun. Netw. 2023, 2023, 1–57. [Google Scholar] [CrossRef]
- Alraih, S.; Nordin, R.; Abu-Samah, A.; Shayea, I.; Abdullah, N.F. A Survey on Handover Optimization in Beyond 5G Mobile Networks: Challenges and Solutions. IEEE Access 2023, 11, 59317–59345. [Google Scholar] [CrossRef]
- Mollel, M.S.; Abubakar, A.I.; Ozturk, M.; Kaijage, S.F.; Kisangiri, M.; Hussain, S.; Imran, M.A.; Abbasi, Q.H. A Survey of Machine Learning Applications to Handover Management in 5G and Beyond. IEEE Access 2021, 9, 45770–45802. [Google Scholar] [CrossRef]
- Mahamod, U.; Mohamad, H.; Shayea, I.; Othman, M.; Asuhaimi, F.A. Handover Parameter for Self-Optimisation in 6G Mobile Networks: A Survey. Alex. Eng. J. 2023, 78, 104–119. [Google Scholar] [CrossRef]
- Saoud, B.; Shayea, I.; Alnakhli, M.A.; Mohamad, H. Mobility and Handover Management in 5G/6G Networks: Challenges, Innovations, and Sustainable Solutions. Technologies 2025, 13, 352. [Google Scholar] [CrossRef]
- Cabral de Brito Guerra, T.; Dantas, Y.R.; Sousa, V.A., Jr. A Machine Learning Approach for Handover in LTE Networks with Signal Obstructions. J. Commun. Inf. Syst. 2020, 35, 271–289. [Google Scholar] [CrossRef]
- Mei, L.; Gou, J.; Cai, Y.; Cao, H.; Liu, Y. Realtime Mobile Bandwidth and Handoff Predictions in 4G/5G Networks. Comput. Netw. 2022, 204, 108736. [Google Scholar] [CrossRef]
- Alhammadi, A.; Hassan, W.H.; El-Saleh, A.A.; Shayea, I.; Mohamad, H.; Saad, W.K. Intelligent Coordinated Self-Optimizing Handover Scheme for 4G/5G Heterogeneous Networks. ICT Express 2023, 9, 276–281. [Google Scholar] [CrossRef]
- Asad, S.M.; Klaine, P.V.; Rais, R.N.B.; Mollel, M.S.; Hussain, S.; Abbasi, Q.H.; Imran, M.A. Context-Aware Handover Skipping for Train Passengers in Next Generation Wireless Networks. J. Commun. Netw. 2023, 25, 285–298. [Google Scholar] [CrossRef]
- Alraih, S.; Nordin, R.; Abu-Samah, A.; Shayea, I.; Abdullah, N.F. ML-Based Self-Optimization Handover Technique for Beyond 5G Mobile Network. IEEE Access 2025, 13, 8568–8584. [Google Scholar] [CrossRef]
- Karmakar, R.; Kaddoum, G.; Chattopadhyay, S. Mobility Management in 5G and Beyond: A Novel Smart Handover With Adaptive Time-to-Trigger and Hysteresis Margin. IEEE Trans. Mob. Comput. 2023, 22, 5995–6010. [Google Scholar] [CrossRef]
- Prananto, B.H.; Iskandar; Hendrawan; Kurniawan, A. LSTM Neural Network Algorithm for Handover Improvement in a Non-Ideal Network Using O-RAN Near-RT RIC. IEICE Trans. Commun. 2024, E107-B, 458–469. [Google Scholar] [CrossRef]
- Dzaferagic, M.; Xavier, B.M.; Collins, D.; D’Onofrio, V.; Martinello, M.; Ruffini, M. ML-Based Handover Prediction Over a Real O-RAN Deployment Using RAN Intelligent Controller. IEEE Trans. Netw. Serv. Manag. 2025, 22, 635–647. [Google Scholar] [CrossRef]
- Gain, M.; Raha, A.D.; Dam, S.K.; Amirjon, A.; Kim, K.; Hong, C.S. AI-Driven Proactive Handover Optimization for NextG O-RAN Systems. In Proceedings of the 2025 25th Asia-Pacific Network Operations and Management Symposium (APNOMS), Kaohsiung, Taiwan, 22–24 September 2025; pp. 1–6. [Google Scholar]
- Mollel, M.S.; Abubakar, A.I.; Ozturk, M.; Kaijage, S.; Kisangiri, M.; Zoha, A.; Imran, M.A.; Abbasi, Q.H. Intelligent Handover Decision Scheme using Double Deep Reinforcement Learning. Phys. Commun. 2020, 42, 101133. [Google Scholar] [CrossRef]
- Mollel, M.S.; Kaijage, S.; Kisangiri, M. Deep Reinforcement Learning based Handover Management for Millimeter Wave Communication. Int. J. Adv. Comput. Sci. Appl. 2021, 12, 784–791. [Google Scholar] [CrossRef]
- Prado, A.; Stöckeler, F.; Mehmeti, F.; Krämer, P.; Kellerer, W. Enabling Proportionally-Fair Mobility Management With Reinforcement Learning in 5G Networks. IEEE J. Sel. Areas Commun. 2023, 41, 1845–1858. [Google Scholar] [CrossRef]
- Yajnanarayana, V.; Rydén, H.; Hévizi, L. 5G Handover using Reinforcement Learning. In Proceedings of the 2020 IEEE 3rd 5G World Forum (5GWF), Bangalore, India, 10–12 September 2020; pp. 349–354. [Google Scholar]
- Dai, J.; Mahboob, S.; Wang, H.; Liu, L. Intelligent Handover Management Enabled by O-RAN and Deep Reinforcement Learning. In Proceedings of the 2024 IEEE 100th Vehicular Technology Conference (VTC2024-Fall), Washington, DC, USA, 7–10 October 2024; pp. 1–6. [Google Scholar]
- Korobkov, A.A.; Gaysin, A.K.; Safiullin, I.A.; Ashaev, I.P.; Nadeev, A.F. Interaction Model of O-RAN Radio Access Network Elements for Mobility Management. Electromagn. Waves Electron. Syst. 2025, 30, 79–92. [Google Scholar]
- Kolda, T.G.; Bader, B.W. Tensor Decompositions and Applications. SIAM Rev. 2009, 51, 455–500. [Google Scholar] [CrossRef]
- Schubert, E.; Sander, J.; Ester, M.; Kriegel, H.-P.; Xu, X. DBSCAN Revisited, Revisited: Why and How You Should (Still) Use DBSCAN. ACM Trans. Datab. Syst. 2017, 42, 1–21. [Google Scholar] [CrossRef]
- Murphy, K.P. Naive Bayes Classifiers. Univ. Br. Columbia 2006, 18, 1–8. [Google Scholar]
- Hills, J.; Lines, J.; Baranauskas, E.; Mapp, J.; Bagnall, A. Classification of Time Series by Shapelet Transformation. Data Min. Knowl. Discov. 2014, 28, 851–881. [Google Scholar] [CrossRef]
- Ashaev, I.P.; Safiullin, I.A.; Gaysin, A.K.; Nadeev, A.F.; Korobkov, A.A. An Approach for Using a Tensor-Based Method for Mobility-User Pattern Determining. Inventions 2024, 9, 1. [Google Scholar] [CrossRef]
- Hendrawan, H.; Zain, A.R.; Lestari, S. Performance Evaluation of A2-A4-RSRQ and A3-RSRP Handover Algorithms in LTE Network. J. Elektron. dan Telekomun. 2019, 19, 64–74. [Google Scholar] [CrossRef]









| Parameter Name | Value |
|---|---|
| Number of BSs | 2 |
| Number of UEs with high mobility | 5 |
| Number of static UEs (scenario 3) | 10 |
| Transmit power of BSs, dBm | 30 |
| Channel model | 3GPP Urban Macro |
| HO threshold for baseline algorithm, dBm | 3 |
| Antenna mode | Isotropic |
| Center frequency, GHz | 3.5 |
| Channel bandwidth, MHz | 20 |
| Mobility model | Constant velocity |
| HARQ transmission | Enabled |
| RRC model | Ideal |
| Traffic model | UDP, 2.56 Mbps |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Safiullin, I.A.; Ashaev, I.P.; Korobkov, A.A.; Gaysin, A.K.; Nadeev, A.F. Reinforcement Learning-Based Handover Algorithm for 5G/6G AI-RAN. Inventions 2026, 11, 8. https://doi.org/10.3390/inventions11010008
Safiullin IA, Ashaev IP, Korobkov AA, Gaysin AK, Nadeev AF. Reinforcement Learning-Based Handover Algorithm for 5G/6G AI-RAN. Inventions. 2026; 11(1):8. https://doi.org/10.3390/inventions11010008
Chicago/Turabian StyleSafiullin, Ildar A., Ivan P. Ashaev, Alexey A. Korobkov, Artur K. Gaysin, and Adel F. Nadeev. 2026. "Reinforcement Learning-Based Handover Algorithm for 5G/6G AI-RAN" Inventions 11, no. 1: 8. https://doi.org/10.3390/inventions11010008
APA StyleSafiullin, I. A., Ashaev, I. P., Korobkov, A. A., Gaysin, A. K., & Nadeev, A. F. (2026). Reinforcement Learning-Based Handover Algorithm for 5G/6G AI-RAN. Inventions, 11(1), 8. https://doi.org/10.3390/inventions11010008

