This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer
by
Bingxun Zhao
Bingxun Zhao and
Yuan Chen
Yuan Chen *
School of Airspace Science and Engineering, Shandong University, Weihai 264209, China
*
Author to whom correspondence should be addressed.
Computers 2026, 15(1), 43; https://doi.org/10.3390/computers15010043 (registering DOI)
Submission received: 22 December 2025
/
Revised: 4 January 2026
/
Accepted: 8 January 2026
/
Published: 10 January 2026
Abstract
Underwater vision is fundamental to ocean exploration, yet it is frequently impaired by underwater degradation including low contrast, color distortion and blur, thereby presenting significant challenges for underwater object detection (UOD). Most existing methods employ underwater image enhancement as a preprocessing step to improve visual quality prior to detection. However, image enhancement and object detection are optimized for fundamentally different objectives, and directly cascading them leads to feature distribution mismatch. Moreover, prevailing dual-branch architectures process enhancement and detection independently, overlooking multi-scale interactions across domains and thus constraining the learning of cross-domain feature representation. To overcome these limitations, We propose an underwater cross-domain fusion Transformer detector (UCF-DETR). UCF-DETR jointly leverages image enhancement and object detection by exploiting the complementary information from the enhanced and original image domains. Specifically, an underwater image enhancement module is employed to improve visibility. We then design a cross-domain feature pyramid to integrate fine-grained structural details from the enhanced domain with semantic representations from the original domain. Cross-domain query interaction mechanism is introduced to model inter-domain query relationships, leading to accurate object localization and boundary delineation. Extensive experiments on the challenging DUO and UDD benchmarks demonstrate that UCF-DETR consistently outperforms state-of-the-art methods for UOD.
Share and Cite
MDPI and ACS Style
Zhao, B.; Chen, Y.
Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer. Computers 2026, 15, 43.
https://doi.org/10.3390/computers15010043
AMA Style
Zhao B, Chen Y.
Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer. Computers. 2026; 15(1):43.
https://doi.org/10.3390/computers15010043
Chicago/Turabian Style
Zhao, Bingxun, and Yuan Chen.
2026. "Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer" Computers 15, no. 1: 43.
https://doi.org/10.3390/computers15010043
APA Style
Zhao, B., & Chen, Y.
(2026). Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer. Computers, 15(1), 43.
https://doi.org/10.3390/computers15010043
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.