Next Article in Journal
Effect of Scour on Hydrodynamic Pressure of Offshore Monopile and Site Response Under Seismic Loads
Previous Article in Journal
MSC-YOLO: An Accurate and Effective Maritime Ship Detection Model Based on Improved YOLOv11n
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

E2E-AUD: An End-to-End Adaptive Underwater Detection Framework Integrating Physical Priors and Frequency-Adaptive Learning

by
Wenhao Zhou
1,2,
Junbao Zeng
1,*,
Shuo Li
1 and
Yuexing Zhang
1
1
Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China
2
University of Chinese Academy of Sciences, Beijing 100049, China
*
Author to whom correspondence should be addressed.
J. Mar. Sci. Eng. 2026, 14(12), 1067; https://doi.org/10.3390/jmse14121067
Submission received: 7 May 2026 / Revised: 3 June 2026 / Accepted: 5 June 2026 / Published: 7 June 2026
(This article belongs to the Section Ocean Engineering)

Abstract

Underwater detection is crucial for the autonomous operation of Autonomous Underwater Vehicles (AUVs). However, underwater environments pose significant challenges, including severe image degradation, complex target deformation, and densely distributed small objects. Most existing methods treat image enhancement as an independent preprocessing module and rely on fixed-shape convolution kernels for feature extraction, which often leads to inconsistent optimization objectives and limited capability in handling irregular targets and fine-grained small-object details. To address these issues, we propose an End-to-End Adaptive Underwater Detection framework (E2E-AUD). Specifically, a lightweight image enhancement module, UnitModule, is embedded into the detection network so that enhancement can be jointly optimized with detection and directly serve downstream feature learning. In addition, linear deformable convolution (LDConv) is introduced into the backbone to adaptively model polymorphic targets, while Haar wavelet downsampling (HWD) is adopted to preserve boundary and texture information through frequency-domain analysis. Experiments on the DUO and URPC datasets demonstrate that E2E-AUD achieves superior performance over both general-purpose and underwater-specific detectors. Specifically, on the DUO dataset, our model reaches 86.2% mAP50 and 67.8% mAP50-95, outperforming the recent YOLOv12 by 3.0% and 2.7%, respectively. On the highly turbid URPC dataset, it achieves 84.3% mAP50 and 50.8% mAP50-95, surpassing the competitive underwater-specific detector LEFEN by notable margins in strict localization metrics. Furthermore, E2E-AUD maintains a real-time inference speed of 21.8 FPS with highly constrained computational complexity (9.4 GFLOPs), proving its exceptional balance between detection accuracy and deployment efficiency compared to previous methods.
Keywords: underwater object detection; end-to-end learning; physics-aware enhancement; wavelet-based downsampling; deformable convolution underwater object detection; end-to-end learning; physics-aware enhancement; wavelet-based downsampling; deformable convolution

Share and Cite

MDPI and ACS Style

Zhou, W.; Zeng, J.; Li, S.; Zhang, Y. E2E-AUD: An End-to-End Adaptive Underwater Detection Framework Integrating Physical Priors and Frequency-Adaptive Learning. J. Mar. Sci. Eng. 2026, 14, 1067. https://doi.org/10.3390/jmse14121067

AMA Style

Zhou W, Zeng J, Li S, Zhang Y. E2E-AUD: An End-to-End Adaptive Underwater Detection Framework Integrating Physical Priors and Frequency-Adaptive Learning. Journal of Marine Science and Engineering. 2026; 14(12):1067. https://doi.org/10.3390/jmse14121067

Chicago/Turabian Style

Zhou, Wenhao, Junbao Zeng, Shuo Li, and Yuexing Zhang. 2026. "E2E-AUD: An End-to-End Adaptive Underwater Detection Framework Integrating Physical Priors and Frequency-Adaptive Learning" Journal of Marine Science and Engineering 14, no. 12: 1067. https://doi.org/10.3390/jmse14121067

APA Style

Zhou, W., Zeng, J., Li, S., & Zhang, Y. (2026). E2E-AUD: An End-to-End Adaptive Underwater Detection Framework Integrating Physical Priors and Frequency-Adaptive Learning. Journal of Marine Science and Engineering, 14(12), 1067. https://doi.org/10.3390/jmse14121067

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop