Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer

Zhao, Bingxun; Chen, Yuan

doi:10.3390/computers15010043

This is an early access version, the complete PDF, HTML, and XML versions will be available soon.

Open AccessArticle

Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer

by

Bingxun Zhao

and

Yuan Chen

^*

School of Airspace Science and Engineering, Shandong University, Weihai 264209, China

^*

Author to whom correspondence should be addressed.

Computers 2026, 15(1), 43; https://doi.org/10.3390/computers15010043 (registering DOI)

Submission received: 22 December 2025 / Revised: 4 January 2026 / Accepted: 8 January 2026 / Published: 10 January 2026

(This article belongs to the Special Issue Advanced Image Processing and Computer Vision (2nd Edition))

Download Versions Notes

Abstract

Underwater vision is fundamental to ocean exploration, yet it is frequently impaired by underwater degradation including low contrast, color distortion and blur, thereby presenting significant challenges for underwater object detection (UOD). Most existing methods employ underwater image enhancement as a preprocessing step to improve visual quality prior to detection. However, image enhancement and object detection are optimized for fundamentally different objectives, and directly cascading them leads to feature distribution mismatch. Moreover, prevailing dual-branch architectures process enhancement and detection independently, overlooking multi-scale interactions across domains and thus constraining the learning of cross-domain feature representation. To overcome these limitations, We propose an underwater cross-domain fusion Transformer detector (UCF-DETR). UCF-DETR jointly leverages image enhancement and object detection by exploiting the complementary information from the enhanced and original image domains. Specifically, an underwater image enhancement module is employed to improve visibility. We then design a cross-domain feature pyramid to integrate fine-grained structural details from the enhanced domain with semantic representations from the original domain. Cross-domain query interaction mechanism is introduced to model inter-domain query relationships, leading to accurate object localization and boundary delineation. Extensive experiments on the challenging DUO and UDD benchmarks demonstrate that UCF-DETR consistently outperforms state-of-the-art methods for UOD.

Keywords: underwater object detection; underwater image enhancement; joint inference; cross-domain

Share and Cite

MDPI and ACS Style

Zhao, B.; Chen, Y. Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer. Computers 2026, 15, 43. https://doi.org/10.3390/computers15010043

AMA Style

Zhao B, Chen Y. Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer. Computers. 2026; 15(1):43. https://doi.org/10.3390/computers15010043

Chicago/Turabian Style

Zhao, Bingxun, and Yuan Chen. 2026. "Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer" Computers 15, no. 1: 43. https://doi.org/10.3390/computers15010043

APA Style

Zhao, B., & Chen, Y. (2026). Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer. Computers, 15(1), 43. https://doi.org/10.3390/computers15010043

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer

Abstract

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI