AUV Intelligent Decision-Making System Empowered by Deep Learning: Evolution, Challenges and Future Prospects
Abstract
1. Introduction
1.1. Research Background
1.2. The Development History of AUV Intelligent Decision-Making Systems
1.3. Research Gaps and Motivation
1.4. The Purpose of This Article
1.5. Paper Structure Arrangement
2. Definition and Module Division of Intelligent Decision-Making Systems
2.1. Intelligent Decision Systems: Definitions, Paradigms, and Autonomous Cores
2.2. The Four-Module Deconstruction and DL Function of the Intelligent Decision-Making System
2.2.1. Definition of Information Processing Module
2.2.2. The Information Understanding Module
2.2.3. Definition of Information Judgment Module
2.2.4. Definition of Output Module
2.2.5. Module Splitting and Mix
2.3. The Flexibility of Deconstructing Intelligent Decision-Making Systems
2.4. Module Collaboration and System Integration
3. Modules for Autonomous Decision-Making Empowered by Deep Learning
3.1. Information Processing Module
3.1.1. The Evolution of Information Processing Module
3.1.2. Applications of DL in Information Processing
3.1.3. Development Summary and Future Projections of Information Processing Module
3.2. Information Understanding Module
3.2.1. The Evolution of Information Understanding Module
3.2.2. Applications of Deep Learning in Information Understanding
3.2.3. Development Summary and Future Predictions of Information Understanding Module
3.3. Information Judgment Module
3.3.1. The Evolution of Information Judgment Module
3.3.2. The Evolution of Task-Driven Decision-Making Schemes
3.3.3. Applications of Deep Learning in Information Analysis
3.3.4. Development Summary and Future Projections of Information Judgment Module
3.4. Output Module
3.4.1. Evolution of Output Module
3.4.2. Applications of Deep Learning in Information Output
3.4.3. Integration and Separation of Output Modules and Information Judgment Modules
3.4.4. Development Summaries and Future Projections of Out Put Module
4. Application Analysis and Technology Selection of AUV Intelligent Decision-Making System Empowered by Deep Learning
4.1. Division Criteria: Orthogonal Deconstruction of Task Complexity and Environmental Uncertainty
4.2. Scene Analysis and Deep-Learning Technology Selection
4.2.1. Simple Tasks, Structured Environments
4.2.2. Simple Tasks, Unstructured Environment
4.2.3. Complex Tasks, Structured Environments
4.2.4. Complex Tasks, Unstructured Environment
4.3. Key Points and Insights of This Chapter
5. Challenges, Frontiers and Future Prospects
5.1. Challenges and Frontiers
5.1.1. Dual Scarcity of Underwater Perception Data
5.1.2. Black Box Problem
5.1.3. Limitations on Computational Capacity
5.1.4. Fragmentation of Applications
5.2. Frontier Technology Trends and Future Prospects
5.2.1. Underwater Foundation Models and Self-Supervised Learning
5.2.2. Physical-Data Dual-Driven and Trusted AI
5.2.3. Offline Learning and Sim-to-Real Efficient Migration
5.3. Future Outlook: Moving Towards Integrated and Clustered Underwater Intelligence
5.3.1. Robot Architecture Focused on the Underwater Domain
5.3.2. Perception–Cognition–Decision-Making Integrated Underwater Agents
5.3.3. From Individual Intelligence to Distributed Cluster Intelligence
5.3.4. Operational Paradigms of the Next Decade
6. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Wu, H.; Chen, Y.; Yang, Q.; Yan, B.; Yang, X. A Review of Underwater Robot Localization in Confined Spaces. J. Mar. Sci. Eng. 2024, 12, 428. [Google Scholar] [CrossRef]
- Stefanidou, A.; Politi, E.; Chronis, C.; Dimitrakopoulos, G.; Varlamis, I. A Deep Reinforcement Learning Approach for Navigation and Control of Autonomous Underwater Vehicles in Complex Environments. In Proceedings of the 18th International Conference on Control, Automation, Robotics and Vision (ICARCV), Dubai, United Arab Emirates, 12–15 December 2024; pp. 750–755. [Google Scholar]
- Choi, H.S.; Lee, P.-M. Development of a System Architecture for an Advanced Autonomous Underwater Vehicle, ORCA. In Proceedings of the 4th Conference of International Conference on Control, Automation and Systems, Bangkok, Thailand, 25–27 August 2004. [Google Scholar]
- Chemori, A. Control of Complex Robotic Systems: Challenges, Design and Experiments. In Proceedings of the 22nd International Conference on Methods and Models in Automation and Robotics (MMAR), Międzyzdroje, Poland, 28–31 August 2017; pp. 622–631. [Google Scholar]
- Li, A.; Guo, S.; Liu, M.; Yin, H. Hydrodynamic Characteristic-Based Adaptive Model Predictive Control for the Spherical Underwater Robot under Ocean Current Disturbance. Machines 2022, 10, 798. [Google Scholar] [CrossRef]
- Kozhubaev, Y.; Belyaev, V.; Murashov, Y.; Prokofev, O. Controlling of Unmanned Underwater Vehicles Using the Dynamic Planning of Symmetric Trajectory Based on Machine Learning for Marine Resources Exploration. Symmetry 2023, 15, 1783. [Google Scholar] [CrossRef]
- Sun, X.; Liu, L.; Dong, J. Underwater Image Enhancement with Encoding-Decoding Deep CNN Networks. In Proceedings of the IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computed, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI), San Francisco, CA, USA, 4–8 August 2017; pp. 1–6. [Google Scholar]
- Yuan, J.; Wang, H.; Zhang, H.; Lin, C.; Yu, D.; Li, C. AUV Obstacle Avoidance Planning Based on Deep Reinforcement Learning. J. Mar. Sci. Eng. 2021, 9, 1166. [Google Scholar] [CrossRef]
- Cao, X.; Sun, C.; Yan, M. Target Search Control of AUV in Underwater Environment with Deep Reinforcement Learning. IEEE Access 2019, 7, 96549–96559. [Google Scholar] [CrossRef]
- Singh, P.; Sehgal, A. Computer Vision, Sonar and Sensor Fusion for Autonomous Underwater Vehicle (AUV) Navigation. In Proceedings of the Conference and Planning Committee Meeting, Seville, Spain, 28–31 March 2006. [Google Scholar]
- Yang, Y.; Xiao, Y.; Li, T. A Survey of Autonomous Underwater Vehicle Formation: Performance, Formation Control, and Communication Capability. IEEE Commun. Surv. Tutor. 2021, 23, 815–841. [Google Scholar] [CrossRef]
- Spaho, E.; Matsuo, K.; Barolli, L.; Xhafa, F. Robot Control Architectures: A Survey. In Information Technology Convergence: Security, Robotics, Automations and Communication; Springer: Dordrecht, The Netherlands, 2013. [Google Scholar] [CrossRef]
- Baranidharan, V.; Hari Murugesh, K.; Gokulvasanth, K.; Rahul Vikash, K.; Jane Mystika, D.; Renugadevi, S. Modeling and Control Techniques for Autonomous Underwater Vehicles—A Comprehensive Review. In Proceedings of the International Conference on Recent Advances in Science and Engineering Technology (ICRASET), Mandya, India, 21–22 November 2024; pp. 1–7. [Google Scholar]
- Sunbeam, M. Deep Learning for Visual Navigation of Underwater Robots. arXiv 2023, arXiv:2310.19495. [Google Scholar] [CrossRef]
- Wang, X.; Zhang, H.; Liu, H.; Lewis, F.L. Control Oriented Reinforcement Learning: A Survey of Recent Progress and Applications. Int. J. Robust Nonlinear Control 2025. [Google Scholar] [CrossRef]
- Yu, D.; Lin, C.; Liu, N. Logic-Optimization Behavior Tree Algorithm for Enhanced Autonomous Underwater Vehicle Cooperative Decision-Making. Complex Intell. Syst. 2025, 11, 406. [Google Scholar] [CrossRef]
- Zhao, S.; Yuh, J. Experimental Study on Advanced Underwater Robot Control. IEEE Trans. Robot. 2005, 21, 695–703. [Google Scholar] [CrossRef]
- Wickramasinghe, C.S.; Marino, D.L.; Manic, M. ResNet Autoencoders for Unsupervised Feature Learning from High-Dimensional Data: Deep Models Resistant to Performance Degradation. IEEE Access 2021, 9, 40511–40520. [Google Scholar] [CrossRef]
- Shao, R.; Li, W.; Zhang, L.; Zhang, R.; Liu, Z.; Chen, R.; Nie, L. Large VLM-Based Vision-Language-Action Models for Robotic Manipulation: A Survey. arXiv 2025, arXiv:2508.13073. [Google Scholar]
- Buchholz, M.; Carlucho, I.; Grimaldi, M.; Petillot, Y.R. Distributed AI Agents for Cognitive Underwater Robot Autonomy. arXiv 2025, arXiv:2507.23735. [Google Scholar] [CrossRef]
- Skorobogatov, G.; Barrado, C.; Salamí, E. Multiple UAV Systems: A Survey. Unmanned Syst. 2020, 8, 149–169. [Google Scholar] [CrossRef]
- Zitkovich, B.; Yu, T.; Xu, S.; Xu, P.; Xiao, T.; Xia, F.; Wu, J.; Wohlhart, P.; Welker, S.; Wahid, A.; et al. RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control. In Proceedings of the 7th Conference on Robot Learning, Atlanta, GA, USA, 2 December 2023. [Google Scholar]
- Wang, Z.; Zhu, Y.; Yan, Y.; Tian, X.; Shao, X.; Li, M.; Li, W.; Su, G.; Cui, W.; Fan, D. UnderwaterVLA: Dual-Brain Vision-Language-Action Architecture for Autonomous Underwater Navigation. arXiv 2025, arXiv:2509.22441. [Google Scholar]
- Goyal, A.; Hadfield, H.; Yang, X.; Blukis, V.; Ramos, F. VLA-0: Building State-of-the-Art VLAs with Zero Modification. arXiv 2025, arXiv:2510.13054. [Google Scholar]
- Li, Q.; Chang, B.; Mei, W.; Chen, Z. Integrated Sensing, Computing, Communication, and Control for Time-Sequence-Based Semantic Communications. arXiv 2025, arXiv:2505.03127. [Google Scholar]
- Jaffe, J.S. Computer Modeling and the Design of Optimal Underwater Imaging Systems. IEEE J. Ocean. Eng. 1990, 15, 101–111. [Google Scholar] [CrossRef]
- Tian, Y.; Xu, Y.; Zhou, J. Underwater Image Enhancement Method Based on Feature Fusion Neural Network. IEEE Access 2022, 10, 107536–107548. [Google Scholar] [CrossRef]
- Luan, X.; Hou, G.; Sun, Z.; Wang, Y.; Song, D.; Wang, S. Underwater Color Image Enhancement Using Combining Schemes. Mar. Technol. Soc. J. 2014, 48, 57–62. [Google Scholar] [CrossRef]
- Zhang, S.; Wang, T.; Dong, J.; Yu, H. Underwater Image Enhancement via Extended Multi-Scale Retinex. Neurocomputing 2017, 245, 1–9. [Google Scholar] [CrossRef]
- Therrien, C.W.; Frack, K.L.; Ruiz Fontes, N. A Short-Time Wiener Filter for Noise Removal in Underwater Acoustic Data. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Munich, Germany, 21–24 April 1997; Volume 1, pp. 543–546. [Google Scholar]
- Sree Vidhya, K.S.; Deepthi, P.S. A Comprehensive Analysis of Underwater Image Processing Based on Deep Learning Techniques. In Proceedings of the International Conference on Control, Communication and Computing (ICCC), Thiruvananthapuram, India, 19–21 May 2023; pp. 1–6. [Google Scholar]
- Wang, Y.; Zhang, J.; Cao, Y.; Wang, Z. A Deep CNN Method for Underwater Image Enhancement. In Proceedings of the IEEE International Conference on Image Processing (ICIP), Beijing, China, 17 September 2017; pp. 1382–1386. [Google Scholar]
- Li, J.; Skinner, K.A.; Eustice, R.M.; Johnson-Roberson, M. WaterGAN: Unsupervised Generative Network to Enable Real-Time Color Correction of Monocular Underwater Images. IEEE Robot. Autom. Lett. 2017, 3, 387–394. [Google Scholar] [CrossRef]
- Zhu, J.-Y.; Park, T.; Isola, P.; Efros, A.A. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. arXiv 2020, arXiv:1703.10593. [Google Scholar]
- Du, R.; Li, W.; Chen, S.; Li, C.; Zhang, Y. Unpaired Underwater Image Enhancement Based on CycleGAN. Information 2022, 13, 1. [Google Scholar] [CrossRef]
- Steiniger, Y.; Kraus, D.; Meisen, T. Survey on Deep Learning Based Computer Vision for Sonar Imagery. Eng. Appl. Artif. Intell. 2022, 114, 105157. [Google Scholar] [CrossRef]
- Wang, Z.; Xue, T.; Wang, Y.; Li, J.; Zhang, H.; Xu, Z.; Xu, G. Enhancing Object Detection Accuracy in Underwater Sonar Images through Deep Learning-Based Denoising. arXiv 2025, arXiv:2503.01655. [Google Scholar] [CrossRef]
- Huang, Y.; Li, W.; Yuan, F. Speckle Noise Reduction in Sonar Image Based on Adaptive Redundant Dictionary. J. Mar. Sci. Eng. 2020, 8, 761. [Google Scholar] [CrossRef]
- Liu, P.; Wang, L.; He, G.; Zhao, L. A Survey on Active Deep Learning: From Model-Driven to Data-Driven. ACM Comput. Surv. 2022, 54, 1–34. [Google Scholar] [CrossRef]
- Drap, P.; Merad, D.; Boï, J.-M.; Mahiddine, A.; Peloso, D.; Chemisky, B.; Seguin, E.; Alcala, F.; Bianchimani, O. Underwater Multimodal Survey: Merging Optical and Acoustic Data. In Underwater Seascapes; Musard, O., Le Dû-Blayo, L., Francour, P., Beurier, J.-P., Feunteun, E., Talassinos, L., Eds.; Springer International Publishing: Cham, Switzerland, 2014; pp. 221–238. ISBN 978-3-319-03439-3. [Google Scholar]
- Bach, N.G. Underwater Image Enhancement with Physical-Based Denoising Diffusion Implicit Models. J. Image Graph. 2025, 13, 2013–2230. [Google Scholar] [CrossRef]
- Macatangay, X.; Gabriel, S.A.; Hoseinnezhad, R.; Fowler, A.; Bab-Hadiashar, A. Machine Learning for Modeling Underwater Vehicle Dynamics: Overview and Insights. IEEE Access 2024, 12, 139486–139504. [Google Scholar] [CrossRef]
- Qu, P.; Li, T.; Zhou, L.; Jin, S.; Liang, Z.; Zhao, W.; Zhang, W. DAMNet: Dual Attention Mechanism Deep Neural Network for Underwater Biological Image Classification. IEEE Access 2023, 11, 6000–6009. [Google Scholar] [CrossRef]
- Arora, A.; Kumar, A. HOG and SIFT Transformation Algorithms for the Underwater Image Fusion. In Proceedings of the IEEE International Conference on Technology, Research, and Innovation for Betterment of Society (TRIBES), Raipur, India, 17–19 December 2021; pp. 1–5. [Google Scholar]
- Pachaiyappan, P.; Chidambaram, G.; Jahid, A.; Alsharif, M.H. Enhancing Underwater Object Detection and Classification Using Advanced Imaging Techniques: A Novel Approach with Diffusion Models. Sustainability 2024, 16, 7488. [Google Scholar] [CrossRef]
- Prasetyo, E.; Suciati, N.; Fatichah, C. Multi-Level Residual Network VGGNet for Fish Species Classification. J. King Saud Univ.-Comput. Inf. Sci. 2022, 24, 5286–5295. [Google Scholar] [CrossRef]
- Reliable Object Recognition Using Deep Transfer Learning for Marine Transportation Systems with Underwater Surveillance Semantic Scholar. Available online: https://www.semanticscholar.org/paper/Reliable-Object-Recognition-Using-Deep-Transfer-for-Moghimi-Mohanna/46ad1003d1d3ce7edfa4be357373fa918b6d2244 (accessed on 25 October 2025).
- Li, G.; Wang, F.; Zhou, L.; Jin, S.; Xie, X.; Ding, C.; Pan, X.; Zhang, W. MCANet: Multi-Channel Attention Network with Multi-Color Space Encoder for Underwater Image Classification. Comput. Electr. Eng. 2023, 108, 108724. [Google Scholar] [CrossRef]
- Nawarathne, U.; Kumari, H.; Kumari, H. Underwater Waste Detection Using Deep Learning A Performance Comparison of YOLOv7 to 10 and Faster RCNN. arXiv 2025, arXiv:2507.18967. [Google Scholar] [CrossRef]
- Wang, L.; Chen, L.-Z.; Peng, B.; Lin, Y.-T. Improved YOLOv5 Algorithm for Real-Time Prediction of Fish Yield in All Cage Schools. J. Mar. Sci. Eng. 2024, 12, 195. [Google Scholar] [CrossRef]
- Lei, F.; Tang, F.; Li, S. Underwater Target Detection Algorithm Based on Improved YOLOv5. J. Mar. Sci. Eng. 2022, 10, 310. [Google Scholar] [CrossRef]
- Chen, J.; Er, M.J. Dynamic YOLO for Small Underwater Object Detection Artificial Intelligence Review. Artif. Intell. Rev. 2024, 57, 165. [Google Scholar] [CrossRef]
- Shi, Y.; Jia, Y.; Zhang, X. FocusDet: An Efficient Object Detector for Small Object. Sci. Rep. 2024, 14, 10697. [Google Scholar] [CrossRef] [PubMed]
- Ma, D.; Wei, J.; Li, Y.; Zhao, F.; Chen, X.; Hu, Y.; Yu, S.; He, T.; Jin, R.; Li, Z.; et al. MLDet: Towards Efficient and Accurate Deep Learning Method for Marine Litter Detection. Ocean Coast. Manag. 2023, 243, 106765. [Google Scholar] [CrossRef]
- Chen, J.; Tang, J.; Lin, S.; Liang, W.; Su, B.; Yan, J.; Zhou, D.; Wang, L.; Lai, Y.; Yang, B. RMP-Net: A Structural Reparameterization and Subpixel Super-Resolution-Based Marine Scene Segmentation Network. Sec. Ocean Obs. 2022, 9, 1032287. [Google Scholar] [CrossRef]
- Lin, B.; Dong, X. A Multi-Task Segmentation and Classification Network for Remote Ship Hull Inspection. Ocean Eng. 2024, 301, 117608. [Google Scholar] [CrossRef]
- Bidirectional Collaborative Mentoring Network for Marine Organism Detection and Beyond Semantic Scholar. Available online: https://www.semanticscholar.org/paper/Bidirectional-Collaborative-Mentoring-Network-for-Cheng-Wu/1aba94e0a51def387c99cf1350a3ebf221269612 (accessed on 25 October 2025).
- Kirillov, A.; Mintun, E.; Ravi, N.; Mao, H.; Rolland, C.; Gustafson, L.; Xiao, T.; Whitehead, S.; Berg, A.C.; Lo, W.-Y.; et al. Segment Anything. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 2–6 October 2023; pp. 3992–4003. [Google Scholar] [CrossRef]
- Zhang, P.; Yan, T.; Liu, Y.; Lu, H. Fantastic Animals and Where to Find Them: Segment Any Marine Animal with Dual SAM. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 16 June 2024; IEEE: New York, NY, USA, 2024; pp. 2578–2587. [Google Scholar]
- Zhang, S.; Zhao, S.; An, D.; Liu, J.; Wang, H.; Feng, Y.; Li, D.; Zhao, R. Visual SLAM for Underwater Vehicles: A Survey. Comput. Sci. Rev. 2022, 46, 100510. [Google Scholar] [CrossRef]
- DeTone, D.; Malisiewicz, T.; Rabinovich, A. SuperPoint: Self-Supervised Interest Point Detection and Description. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Salt Lake City, UT, USA, 18–22 June 2018; pp. 337–33712. [Google Scholar]
- Sangari, M.S.; Thangaraj, K.; Vanitha, U.; Srikanth, N.; Sathyamoorthy, J.; Renu, K. Deep Learning-Based Object Detection in Underwater Communications System. In Proceedings of the 2nd International Conference on Electrical, Electronics, Information and Communication Technologies (ICEEICT), Trichirappalli, India, 5–7 April 2023; pp. 1–6. [Google Scholar] [CrossRef]
- Balemans, N.; Hellinckx, P.; Latré, S.; Reiter, P.; Steckel, J. S2L-SLAM: Sensor Fusion Driven SLAM Using Sonar, LiDAR and Deep Neural Networks. In Proceedings of the IEEE Sensors Conference, New York, NY, USA, 31 October–4 November 2021; pp. 1–4. [Google Scholar]
- Yu, X.; Sun, Y.; Wang, X.; Zhang, G. End-to-End AUV Motion Planning Method Based on Soft Actor-Critic. Sensors 2021, 21, 5893. [Google Scholar] [CrossRef]
- Sutton, R.S.; Barto, A.G. Reinforcement Learning: An Introduction. IEEE Trans. Neural Netw. 1998, 9, 1054. [Google Scholar] [CrossRef]
- Czum, J.M. Dive into Deep Learning. J. Am. Coll. Radiol. 2020, 17, 637–638. Available online: https://www.jacr.org/article/S1546-1440(20)30146-0/abstract (accessed on 29 October 2025). [CrossRef] [PubMed]
- Guerneve, T.; Subr, K.; Petillot, Y. Underwater 3D Structures as Semantic Landmarks in SONAR Mapping. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada, 24–28 September 2017; pp. 614–619. [Google Scholar]
- Kiran, B.R.; Sobh, I.; Talpaert, V.; Mannion, P.; Sallab, A.A.A.; Yogamani, S.; Pérez, P. Deep Reinforcement Learning for Autonomous Driving: A Survey. IEEE Trans. Intell. Transp. Syst. 2022, 23, 4909–4926. [Google Scholar] [CrossRef]
- Mnih, V.; Kavukcuoglu, K.; Silver, D.; Graves, A.; Antonoglou, I.; Wierstra, D.; Riedmiller, M.A. Playing Atari with Deep Reinforcement Learning. arXiv 2013, arXiv:1312.5602. [Google Scholar] [CrossRef]
- Tabas, S.S.; Samadi, V. Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control. arXiv 2024. [Google Scholar] [CrossRef]
- Shi, W.; Tang, Y.; Jin, M.; Jing, L. An AUV-Assisted Data Gathering Scheme Based on Deep Reinforcement Learning for IoUT. J. Mar. Sci. Eng. 2023, 11, 2279. [Google Scholar] [CrossRef]
- Zhang, Q.; Lin, J.; Sha, Q.; He, B.; Li, G. Deep Interactive Reinforcement Learning for Path Following of Autonomous Underwater Vehicle. IEEE Access 2020, 8, 24258–24268. [Google Scholar] [CrossRef]
- Carroll, K.P.; McClaran, S.R.; Nelson, E.L.; Barnett, D.M.; Friesen, D.K.; William, G.N. AUV Path Planning: An A* Approach to Path Planning with Consideration of Variable Vehicle Speeds and Multiple, Overlapping, Time-Dependent Exclusion Zones. In Proceedings of the Symposium on Autonomous Underwater Vehicle Technology, Washington, DC, USA, 2 June 1992; pp. 79–84. [Google Scholar]
- Li, J.; Zhang, Z. AUV Local Path Planning Based on Fusion of Improved DWA and RRT Algorithms. In Proceedings of the IEEE International Conference on Mechatronics and Automation (ICMA), Harbin, China, 6–9 August 2023; pp. 935–941. [Google Scholar]
- Cao, X.; Chen, L.; Guo, L.; Han, W. AUV Global Security Path Planning Based on a Potential Field Bio-Inspired Neural Network in Underwater Environment. Intell. Autom. Soft Comput. 2021, 27, 1002. [Google Scholar] [CrossRef]
- Liu, T.; Hu, Y.; Xu, H. Deep Reinforcement Learning for Vectored Thruster Autonomous Underwater Vehicle Control. Complexity 2021, 2021, 6649625. [Google Scholar] [CrossRef]
- Andriotis, C.P.; Papakonstantinou, K.G. Managing Engineering Systems with Large State and Action Spaces through Deep Reinforcement Learning. Reliab. Eng. Syst. Saf. 2019, 191, 106483. [Google Scholar] [CrossRef]
- Lillicrap, T.P.; Hunt, J.J.; Pritzel, A.; Heess, N.; Erez, T.; Tassa, Y.; Silver, D.; Wierstra, D. Continuous Control with Deep Reinforcement Learning. arXiv 2019, arXiv:1509.02971. [Google Scholar]
- Schulman, J.; Wolski, F.; Dhariwal, P.; Radford, A.; Klimov, O. Proximal Policy Optimization Algorithms. arXiv 2017, arXiv:1707.06347. [Google Scholar] [CrossRef]
- Yan, J.; Zhang, L.; Yang, X.; Chen, C.; Guan, X. Communication-Aware Motion Planning of AUV in Obstacle-Dense Environment: A Binocular Vision-Based Deep Learning Method. IEEE Trans. Intell. Transp. Syst. 2023, 24, 14927–14943. [Google Scholar] [CrossRef]
- Rahmati, M.; Nadeem, M.; Sadhu, V.; Pompili, D. UW-MARL: Multi-Agent Reinforcement Learning for Underwater Adaptive Sampling Using Autonomous Vehicles. In Proceedings of the International Conference on Underwater Networks & Systems, Atlanta, GA, USA, 23–25 October 2019; pp. 1–5. [Google Scholar] [CrossRef]
- Yang, R.; Zhang, F.; Hou, M. OceanPlan: Hierarchical Planning and Replanning for Natural Language AUV Piloting in Large-Scale Unexplored Ocean Environments. In Proceedings of the 18th International Conference on Underwater Networks & Systems, Sibenik, Croatia, 27 January 2025; Association for Computing Machinery: New York, NY, USA, 2025; pp. 1–5. [Google Scholar]
- Kim, T.-Y.; Choi, W.-S. Autonomous Vehicle Maneuvering Using Vision–LLM Models for Marine Surface Vehicles. J. Mar. Sci. Eng. 2025, 13, 1553. [Google Scholar] [CrossRef]
- Ding, Y.; Xu, J.; Xie, G.; Zhang, S.; Li, Y. Make Your AUV Adaptive: An Environment-Aware Reinforcement Learning Framework for Underwater Tasks. arXiv 2025, arXiv:2506.15082. [Google Scholar] [CrossRef]
- Goldberg, D. Huxley: A Flexible Robot Control Architecture for Autonomous Underwater Vehicles. In Proceedings of the IEEE OCEANS Conference, Santander, Spain, 6–9 June 2011; pp. 1–10. [Google Scholar]
- Wu, Z.; Modi, A.; Mavrogiannis, A.; Joshi, K.; Chopra, N.; Aloimonos, Y.; Karapetyan, N.; Rekleitis, I.; Lin, X. DREAM: Domain-Aware Reasoning for Efficient Autonomous Underwater Monitoring. arXiv 2025, arXiv:2509.13666. [Google Scholar] [CrossRef]
- Chen, R.; Blow, D.; Abdullah, A.; Islam, M.J. Word2Wave: Language Driven Mission Programming for Efficient Subsea Deployments of Marine Robots. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Atlanta, GA, USA, 19–23 May 2025; pp. 4107–4114. [Google Scholar]
- Li, Z.; Du, J.; Jiang, C.; Mi, W.; Ren, Y. HA-MARL: Heuristic and APF Assisted Multi-Agent Reinforcement Learning for Wireless Data Sharing in AUV Swarms. In Proceedings of the ICC 2024—IEEE International Conference on Communications, Denver, CO, USA, 9–13 June 2024; pp. 5401–5406. [Google Scholar] [CrossRef]
- Tong, R.; Feng, Y.; Wang, J.; Wu, Z.; Tan, M.; Yu, J. A Survey on Reinforcement Learning Methods in Bionic Underwater Robots. Biomimetics 2023, 8, 168. [Google Scholar] [CrossRef]
- Utkin, V. Sliding Mode Control. Control Syst. Robot. Autom. 1999, 12, 1–319. [Google Scholar]
- Lotfi, F.; Virji, K.; Dudek, N.; Dudek, G. A Comparison of RL-Based and PID Controllers for 6-DOF Swimming Robots: Hybrid Underwater Object Tracking. arXiv 2024, arXiv:2401.16618. [Google Scholar] [CrossRef]
- Alessandro, R.; Roberto, C.; Costanzi, R.; Francesco, F.; Enrico, M. A Dynamic Manipulation Strategy for an Intervention: Autonomous Underwater Vehicle. Adv. Robot. Autom. 2015, 4, 1–16. [Google Scholar]
- Muñoz, F.; Cervantes-Rojas, J.S.; Valdovinos, J.M.; Sandre-Hernández, O.; Salazar, S.; Romero, H. Dynamic Neural Network-Based Adaptive Tracking Control for an Autonomous Underwater Vehicle Subject to Modeling and Parametric Uncertainties. Appl. Sci. 2021, 11, 2797. [Google Scholar] [CrossRef]
- Cimurs, R.; Suh, I.H.; Lee, J.H. Goal-Driven Autonomous Exploration Through Deep Reinforcement Learning. IEEE Robot. Autom. Lett. 2022, 7, 730–737. [Google Scholar] [CrossRef]
- Cortez, W.S.; Vasisht, S.; Tuor, A.; Drgoňa, J.; Vrabie, D. Domain-Aware Control-Oriented Neural Models for Autonomous Underwater Vehicles. arXiv 2022, arXiv:2208.07333. [Google Scholar] [CrossRef]
- Carlucho, I.; De Paula, M.; Barbalata, C.; Acosta, G.G. A Reinforcement Learning Control Approach for Underwater Manipulation under Position and Torque Constraints. In Proceedings of the Global Oceans 2020: Singapore–U.S. Gulf Coast, Biloxi, MS, USA, 5–30 October 2020; pp. 1–7. [Google Scholar]
- Xie, G.; Xu, J.; Ding, Y.; Zhang, Z.; Zhang, S.; Li, Y. Never Too Prim to Swim: An LLM-Enhanced RL-Based Adaptive S-Surface Controller for AUV under Extreme Sea Conditions. arXiv 2025, arXiv:2503.00527. [Google Scholar]
- Battaglia, P.; Hamrick, J.B.; Bapst, V.; Sanchez-Gonzalez, A.; Zambaldi, V.; Malinowski, M.; Tacchetti, A.; Raposo, D.; Santoro, A.; Faulkner, R.; et al. Relational Inductive Biases, Deep Learning, and Graph Networks. arXiv 2018, arXiv:1806.01261. [Google Scholar] [CrossRef]
- Haarnoja, T.; Zhou, A.; Hartikainen, K.; Tucker, G.; Ha, S.; Tan, J.; Kumar, V.; Zhu, H.; Gupta, A.; Abbeel, P.; et al. Soft Actor-Critic Algorithms and Applications. arXiv 2019, arXiv:1812.05905. [Google Scholar] [CrossRef]
- Lyu, X.; Sun, Y.; Wang, L.; Tan, J.; Zhang, L. End-to-End AUV Local Motion Planning Method Based on Deep Reinforcement Learning. J. Mar. Sci. Eng. 2023, 11, 1796. [Google Scholar] [CrossRef]
- Carlucho, I.; De Paula, M.; Wang, S.; Menna, B.V.; Petillot, Y.R.; Acosta, G.G. AUV Position Tracking Control Using End-to-End Deep Reinforcement Learning. In Proceedings of the OCEANS 2018 MTS/IEEE Charleston, Charleston, SC, USA, 22–25 October 2018; pp. 1–8. [Google Scholar]
- Musa, P.; Rafi, F.A.; Lamsani, M. A Review: Contrast-Limited Adaptive Histogram Equalization (CLAHE) Methods to Help the Application of Face Recognition. In Proceedings of the 3rd International Conference on Informatics and Computing (ICIC), Palembang, Indonesia, 17–18 October 2018; pp. 1–6. [Google Scholar]
- Tarling, P.; Cantor, M.; Clapés, A.; Escalera, S. Deep Learning with Self-Supervision and Uncertainty Regularization to Count Fish in Underwater Images. PLoS ONE 2022, 17, e0267759. [Google Scholar] [CrossRef]
- Li, X.; Li, X.; Han, B.; Wang, S.; Chen, K. Application of EfficientNet and YOLOv5 Model in Submarine Pipeline Inspection and a New Decision-Making System. Water 2023, 15, 3386. [Google Scholar] [CrossRef]
- Karimanzira, D.; Renkewitz, H.; Shea, D.; Albiez, J. Object Detection in Sonar Images. Electronics 2020, 9, 1180. [Google Scholar] [CrossRef]
- Ma, Z.; Li, S.; Ding, J.; Zou, B. MHGAN: A Multi-Headed Generative Adversarial Network for Underwater Sonar Image Super-Resolution. IEEE Trans. Geosci. Remote Sens. 2023, 61, 1–16. [Google Scholar] [CrossRef]
- Sudevan, V.; Zayer, F.; Kausar, R.; Javed, S.; Karki, H.; De Masi, G.; Dias, J. snnTrans-DHZ: A Lightweight Spiking Neural Network Architecture for Underwater Image Dehazing. arXiv 2025. [Google Scholar] [CrossRef]
- Yu, H.; Li, X.; Feng, Y.; Han, S. Underwater Vision Enhancement Based on GAN with Dehazing Evaluation. Appl. Intell. 2022, 53, 5664–5680. [Google Scholar] [CrossRef]
- Zhang, C.; Liu, L.; Huang, G.; Wen, H.; Zhou, X.; Wang, Y. WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark. arXiv 2024. [Google Scholar] [CrossRef]
- Wang, M.; Qiu, B.; Zhu, Z.; Ma, L.; Zhou, C. Passive Tracking of Underwater Acoustic Targets Based on Multi-Beam LOFAR and Deep Learning. PLoS ONE 2022, 17, e0273898. [Google Scholar] [CrossRef]
- Liu, F.; Fang, M. Semantic Segmentation of Underwater Images Based on Improved Deeplab. J. Mar. Sci. Eng. 2020, 8, 188. [Google Scholar] [CrossRef]
- Chen, B.; Zhao, W.; Zhang, Q.; Li, M.; Qi, M.; Tang, Y. Semantic Segmentation of Underwater Images Based on the Improved SegFormer. Front. Mar. Sci. 2025, 12, 1522160. [Google Scholar] [CrossRef]
- Gao, W.; Han, M.; Wang, Z.; Deng, L.; Wang, H.; Ren, J. Research on Method of Collision Avoidance Planning for UUV Based on Deep Reinforcement Learning. J. Mar. Sci. Eng. 2023, 11, 2245. [Google Scholar] [CrossRef]
- Vu, Q.V.; Dinh, T.A.; Nguyen, T.V.; Tran, H.V.; Le, H.X.; Pham, H.V.; Kim, T.D.; Nguyen, L. An Adaptive Hierarchical Sliding Mode Controller for Autonomous Underwater Vehicles. Electronics 2021, 10, 2316. [Google Scholar] [CrossRef]
- Peng, G.; Chen, C.L.P.; Yang, C. Neural Networks Enhanced Optimal Admittance Control of Robot–Environment Interaction Using Reinforcement Learning. IEEE Trans. Neural Netw. Learn. Syst. 2022, 33, 4551–4561. [Google Scholar] [CrossRef] [PubMed]
- Kebaili, A.; Lapuyade-Lahorgue, J.; Vera, P.; Ruan, S. AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation. In Proceedings of the IEEE 22nd International Symposium on Biomedical Imaging (ISBI), Houston, TX, USA, 14–17 April 2025; pp. 1–4. [Google Scholar]
- Ochal, M.; Vazquez, J.; Petillot, Y.; Wang, S. A Comparison of Few-Shot Learning Methods for Underwater Optical and Sonar Image Classification. In Proceedings of the Global Oceans 2020: Singapore–U.S. Gulf Coast, Biloxi, MS, USA, 5–30 October 2020; pp. 1–10. [Google Scholar] [CrossRef]
- Wang, J.; Li, Q.; Fang, Z.; Zhou, X.; Tang, Z.; Han, Y.; Ma, Z. YOLOv6-ESG: A Lightweight Seafood Detection Method. J. Mar. Sci. Eng. 2023, 11, 1623. [Google Scholar] [CrossRef]
- Tobin, J.; Fong, R.; Ray, A.; Schneider, J.; Zaremba, W.; Abbeel, P. Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada, 24–28 September 2017; pp. 23–30. [Google Scholar] [CrossRef]
- Li, B.; Qi, P.; Liu, B.; Di, S.; Liu, J.; Pei, J.; Yi, J.; Zhou, B. Trustworthy AI: From Principles to Practices. ACM Comput. Surv. 2023, 55, 1–46. [Google Scholar] [CrossRef]
- Chaffre, T.; Wheare, J.; Lammas, A.; Santos, P.; Le Chenadec, G.; Sammut, K.; Clement, B. Sim-to-Real Transfer of Adaptive Control Parameters for AUV Stabilisation under Current Disturbance. Int. J. Robot. Res. 2025, 44, 407–430. [Google Scholar] [CrossRef]


| Traditional/DL | Name | Technology | Summary of the Technical Route |
|---|---|---|---|
| Solution based on physical models and traditional filtering | Jaffe–McGlamery Model [31] | Physics-based model for underwater vision recovery | Recovers images by inversely solving the physical process of light propagation in water, modeling forward scattering, backscattering, and absorption effects. |
| Contrast Stretching [33] | Signal processing enhancement technique | Enhances image contrast based on statistical properties, with low computational cost and no reliance on complex physical models. | |
| Retinex Theory [34] | Signal processing enhancement technique | Enhances image quality based on statistical properties, with low computational cost and no reliance on complex physical models. | |
| Wiener Filtering [35] | Signal processing enhancement technique (for acoustic signals) | Denoises acoustic signals based on statistical properties, with low computational cost and no reliance on complex physical models. | |
| Deep-learning solution | UIE-Net [37] | CNN for underwater image enhancement | Achieves synergistic optimization of color correction and defogging through dual-task joint training, marking an early application of CNNs in this field. |
| WaterGAN [38] | GAN for underwater image processing | Generates paired training data through unsupervised adversarial training, effectively addressing the scarcity of underwater datasets. | |
| CycleGAN [39] | Non-paired image model | Lowers data acquisition barriers by enabling model training without strictly corresponding paired clear-degraded image sets. | |
| UW-CycleGAN [40] | CycleGAN framework for underwater image enhancement | Enables high-quality underwater image enhancement without explicit paired data by learning image-to-image translation between unpaired degraded and clear image sets, using cyclic consistency loss. | |
| Multi-frame denoising technique with OPD [42] | Multi-frame denoising for underwater sonar images | Fuses multi-frame data, treated as images processed by different denoising algorithms, to achieve better denoising results in underwater sonar imaging. | |
| PINN (Physics-Informed Neural Networks) [43] | Neural network with embedded physical models | Embeds physical models (e.g., optical or acoustic propagation) as inductive biases into neural network structures or loss functions, enhancing model stability and interpretability by preventing physically unrealistic outcomes. |
| Traditional/DL | Name | Technology | Summary of the Technical Route |
|---|---|---|---|
| Traditional Solution | SIFT, HOG [10] | Hand-designed feature extractor | Extracts simple, predefined features for basic environmental understanding. |
| Traditional Visual SLAM | Geometric feature-based SLAM | Relies on geometric features like corners, prone to failure in weak-texture scenes. | |
| Deep-Learning Solution | MLR-VGGNet [50] | CNN architecture | Combining the VGGNet backbone with multi-layer residual, asymmetric and depthwise separable convolutions to optimize fish classification and reduce model parameters. |
| The method based on mResNet [51] | CNN architecture | Underwater target recognition method based on mResNet and optimized feature engineering. | |
| DAMNet [52] | CNN with attention mechanism | Utilizes advanced attention mechanisms for complex biological image classification. | |
| MCANet [53] | CNN with attention mechanism | Utilizes advanced attention mechanisms for complex biological image classification. | |
| Faster R-CNN [54] | Two-stage deep-learning detection framework | Widely applied for underwater object detection and segmentation. | |
| YOLO improved variants [10,55,56] | Deep-learning detection framework | Mainstream for underwater object monitoring due to simplicity, open-source nature, and ease of deployment. | |
| FocusDet [57] | Fine-grained architecture for small object detection | Specialized for monitoring small objects like underwater trash. | |
| MLDet [58] | Fine-grained architecture for small object detection | Specialized for monitoring underwater trash. | |
| MTHI-Net [60] | Encoder–Decoder architecture | By using multi-task learning to hierarchically segment images, performance is enhanced, demonstrating innovation and potential in the field of image segmentation. | |
| BCMNet [61] | Encoder–Decoder architecture | Through bidirectional contrastive representation learning, more effective motion representations can be extracted from multimodal data, | |
| Dual-SAM [63] | Specialized foundation model for segmentation | Underwater-specific variant for fine-grained segmentation based on SAM. | |
| SuperPoint [8] | Feature-learning network | Learns robust high-level features for improved visual odometry and pose estimation. | |
| RCNN [64] | Deep learning for loop closure detection | Breaks through in loop closure detection by using probabilistic appearance recognition to eliminate cumulative errors in SLAM. | |
| S2L-SLAM [65] | Deep-learning model for multimodal sensor fusion | Converts sonar data to LiDAR point clouds, enabling LiDAR SLAM in challenging environments and dynamic sensor selection. | |
| SONAR-CAD for Underwater Semantic 3D Mapping [66] | Deep learning for multimodal sensor fusion and semantic SLAM | Fuses visual and sonar data, adding high-level semantic information to maps through segmentation and object recognition. |
| Whether to Integrate the Two Modules | Traditional/DL | Name | Technology | Summary of the Technical Route |
|---|---|---|---|---|
| No | Traditional | Huxley [81] | Hierarchical expert system (state machines, behavior trees) | Organizes task flows using modular control layers with predefined state machines and behavior trees. |
| A * Path Planning Approach [77] | Graph search algorithm DWA APF | Used for local real-time obstacle avoidance in traditional approaches. | ||
| Deep Learning | DRL-Guided Autonomous Exploration with Waypoint Navigation [88] | DRL agent | Autonomously plans waypoints and performs exploration in unknown underwater cave environments without prior maps. | |
| Word2Wave [90] | VLM SLM | Real-time programming and parameter configuration for AUV tasks. | ||
| DREAM [91] | VLM | The VLM-driven underwater autonomous monitoring system integrates multimodal perception, cognitive planning based on thought chains, and low-level control | ||
| RL Adaptive Underwater Arm Control [89] | Actor–Critic structure with DNNs | Demonstrates that RL controllers can outperform MPC in fine physical interaction. | ||
| UW-MARL [92] | Q-learning MARL | MARL with distributed Q-learning for adaptive underwater sampling, coordinating autonomous vehicles via shared Q-values. | ||
| HA-MARL [93] | APF MAPPO | It enhances multi-AUV data sharing by integrating APF for path planning and a Tabu-Search task scheduler into MAPPO. | ||
| UnderwaterVLA [28] | VLM VLA | The dual-brain architecture and zero-data training enable robust autonomous navigation of underwater VLA. | ||
| Yes | OceanPlan [94] | LLM planner, HTN task planner, DQN motion planner, replanner | Addresses efficient and robust AUV navigation in unknown oceans via natural language instructions. | |
| Autonomous Vehicle Maneuvering [81] | LLM-guided path planning | Achieves real-time environmental adaptive LLM-guided path planning by integrating cognitive, decision-making, path planning, and control functions. |
| Whether to Integrate the Two Modules | Traditional/DL | Name | Technology | Summary of the Technical Route |
|---|---|---|---|---|
| No | Traditional | PID Controller [97] | PID | Provides simple and effective stable tracking for predefined paths. |
| SMC [98] | Sliding Mode Control | Offers robust control to suppress external disturbances like sea currents. | ||
| Inverse Kinematics + PID [79] | Inverse Kinematics, PID | Calculates and follows joint trajectories for manipulator arms. | ||
| Deep Learning | DNCS [99] | DNN | Online learning of unknown system dynamics to adaptively adjust control gains for tracking. | |
| Domain-aware Control-oriented Neural Models for Autonomous Underwater Vehicles [100] | Physics-Informed Neural Network | Embeds hydrodynamic priors into the network to improve generalization. | ||
| RL Adaptive Underwater Arm Control [89] | Actor–Critic structure with DNNs | Demonstrates that RL controllers can outperform traditional MPC for fine physical interaction. | ||
| Yes | OceanPlan [94] | LLM planner, HTN task planner, DQN motion planner, replanner | Addresses efficient and robust AUV navigation in unknown oceans via natural language instructions. | |
| Autonomous Vehicle Maneuvering [81] | LLM-guided path planning | Achieves real-time environmental adaptive LLM-guided path planning by integrating cognitive, decision-making, path planning, and control functions. |
| Simple Task | Complex Task | |
|---|---|---|
| Structured environment | Scene One: Routine operations for efficiency and cost optimization | Scene Three: Pre-determined operations for high-precision physical interaction |
| Unstructured environment | Scene Two: Goal-oriented behavior with strong environmental robustness | Scene Four: A fully autonomous system oriented towards unknown exploration and dynamic interaction |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ding, Q.; Ye, L.; Chen, H.; Liu, H.; Liang, A.; Cui, W. AUV Intelligent Decision-Making System Empowered by Deep Learning: Evolution, Challenges and Future Prospects. Technologies 2025, 13, 586. https://doi.org/10.3390/technologies13120586
Ding Q, Ye L, Chen H, Liu H, Liang A, Cui W. AUV Intelligent Decision-Making System Empowered by Deep Learning: Evolution, Challenges and Future Prospects. Technologies. 2025; 13(12):586. https://doi.org/10.3390/technologies13120586
Chicago/Turabian StyleDing, Qiulin, Lugang Ye, Hao Chen, Hongyuan Liu, Aoming Liang, and Weicheng Cui. 2025. "AUV Intelligent Decision-Making System Empowered by Deep Learning: Evolution, Challenges and Future Prospects" Technologies 13, no. 12: 586. https://doi.org/10.3390/technologies13120586
APA StyleDing, Q., Ye, L., Chen, H., Liu, H., Liang, A., & Cui, W. (2025). AUV Intelligent Decision-Making System Empowered by Deep Learning: Evolution, Challenges and Future Prospects. Technologies, 13(12), 586. https://doi.org/10.3390/technologies13120586

