A Q-Learning-Based Link-Aware Routing Protocol for Underwater Wireless Sensor Networks

Li, Xinyang; Wu, Yanbo; Zhu, Min; Ren, Jie

doi:10.3390/jmse13122374

This is an early access version, the complete PDF, HTML, and XML versions will be available soon.

Open AccessArticle

A Q-Learning-Based Link-Aware Routing Protocol for Underwater Wireless Sensor Networks

¹

Ocean Acoustic Technology Laboratory, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, China

²

University of Chinese Academy of Sciences, Beijing 100049, China

³

Beijing Engineering Technology Research Center of Ocean Acoustic Equipment, Beijing 100190, China

⁴

State Key Laboratory of Acoustics, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, China

^*

Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2025, 13(12), 2374; https://doi.org/10.3390/jmse13122374 (registering DOI)

Submission received: 10 November 2025 / Revised: 3 December 2025 / Accepted: 11 December 2025 / Published: 14 December 2025

(This article belongs to the Special Issue Underwater Acoustic Communication and Marine Robot Networks)

Download Versions Notes

Abstract

In Underwater Wireless Sensor Networks (UWSNs) with mobile nodes, the mobility of the nodes leads to dynamic changes in the network topology. Thus, pre-established routing paths may become invalid and next-hop nodes may be unavailable due to link disruptions. This implies that routing decisions for mobile UWSNs that do not account for changes in the connectivity state of communication links cannot guarantee reliable packet delivery. In this study, a Q-learning-based link-aware routing (QLAR) protocol designed for mobile UWSNs is proposed. The proposed QLAR protocol introduces the Link Expiration Time (LET) into the reward function of the Q-learning algorithm as a critical decision metric, thereby guiding the agent to prioritize more stable communication links with longer expected lifetime. In addition, multiple decision metrics are dynamically predicted and updated by actively perceiving and acquiring information from neighbor nodes through periodic control packet interactions. To achieve a balance among these metrics, the Entropy Weight Method (EWM) is employed to adaptively adjust their weights in response to real-time network conditions. Comprehensive simulation results demonstrate that QLAR outperforms existing routing protocols in terms of various performance metrics under different scenarios.

Keywords: Underwater Wireless Sensor Networks (UWSNs); routing protocol; Q-learning; link-aware

Share and Cite

MDPI and ACS Style

Li, X.; Wu, Y.; Zhu, M.; Ren, J. A Q-Learning-Based Link-Aware Routing Protocol for Underwater Wireless Sensor Networks. J. Mar. Sci. Eng. 2025, 13, 2374. https://doi.org/10.3390/jmse13122374

AMA Style

Li X, Wu Y, Zhu M, Ren J. A Q-Learning-Based Link-Aware Routing Protocol for Underwater Wireless Sensor Networks. Journal of Marine Science and Engineering. 2025; 13(12):2374. https://doi.org/10.3390/jmse13122374

Chicago/Turabian Style

Li, Xinyang, Yanbo Wu, Min Zhu, and Jie Ren. 2025. "A Q-Learning-Based Link-Aware Routing Protocol for Underwater Wireless Sensor Networks" Journal of Marine Science and Engineering 13, no. 12: 2374. https://doi.org/10.3390/jmse13122374

APA Style

Li, X., Wu, Y., Zhu, M., & Ren, J. (2025). A Q-Learning-Based Link-Aware Routing Protocol for Underwater Wireless Sensor Networks. Journal of Marine Science and Engineering, 13(12), 2374. https://doi.org/10.3390/jmse13122374

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Q-Learning-Based Link-Aware Routing Protocol for Underwater Wireless Sensor Networks

Abstract

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI