Next Article in Journal
Privacy-Preserving Set Intersection Protocol Based on SM2 Oblivious Transfer
Previous Article in Journal
Preparation for Inclusive and Technology-Enhanced Pedagogy: A Cluster Analysis of Secondary Special Education Teachers
Previous Article in Special Issue
ConWave-LoRA: Concept Fusion in Customized Diffusion Models with Contrastive Learning and Wavelet Filtering
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer

School of Airspace Science and Engineering, Shandong University, Weihai 264209, China
*
Author to whom correspondence should be addressed.
Computers 2026, 15(1), 43; https://doi.org/10.3390/computers15010043 (registering DOI)
Submission received: 22 December 2025 / Revised: 4 January 2026 / Accepted: 8 January 2026 / Published: 10 January 2026
(This article belongs to the Special Issue Advanced Image Processing and Computer Vision (2nd Edition))

Abstract

Underwater vision is fundamental to ocean exploration, yet it is frequently impaired by underwater degradation including low contrast, color distortion and blur, thereby presenting significant challenges for underwater object detection (UOD). Most existing methods employ underwater image enhancement as a preprocessing step to improve visual quality prior to detection. However, image enhancement and object detection are optimized for fundamentally different objectives, and directly cascading them leads to feature distribution mismatch. Moreover, prevailing dual-branch architectures process enhancement and detection independently, overlooking multi-scale interactions across domains and thus constraining the learning of cross-domain feature representation. To overcome these limitations, We propose an underwater cross-domain fusion Transformer detector (UCF-DETR). UCF-DETR jointly leverages image enhancement and object detection by exploiting the complementary information from the enhanced and original image domains. Specifically, an underwater image enhancement module is employed to improve visibility. We then design a cross-domain feature pyramid to integrate fine-grained structural details from the enhanced domain with semantic representations from the original domain. Cross-domain query interaction mechanism is introduced to model inter-domain query relationships, leading to accurate object localization and boundary delineation. Extensive experiments on the challenging DUO and UDD benchmarks demonstrate that UCF-DETR consistently outperforms state-of-the-art methods for UOD.
Keywords: underwater object detection; underwater image enhancement; joint inference; cross-domain underwater object detection; underwater image enhancement; joint inference; cross-domain

Share and Cite

MDPI and ACS Style

Zhao, B.; Chen, Y. Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer. Computers 2026, 15, 43. https://doi.org/10.3390/computers15010043

AMA Style

Zhao B, Chen Y. Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer. Computers. 2026; 15(1):43. https://doi.org/10.3390/computers15010043

Chicago/Turabian Style

Zhao, Bingxun, and Yuan Chen. 2026. "Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer" Computers 15, no. 1: 43. https://doi.org/10.3390/computers15010043

APA Style

Zhao, B., & Chen, Y. (2026). Joint Inference of Image Enhancement and Object Detection via Cross-Domain Fusion Transformer. Computers, 15(1), 43. https://doi.org/10.3390/computers15010043

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop