Next Article in Journal
Design and Evaluation of a Flexible Substrate-Based Microstrip Sensor for Partial Discharge Detection in High-Voltage Equipment
Previous Article in Journal
3D Pose Estimation Using Virtual Projection Based on 3D Reconstructed Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

MDCL-DETR: Multi-Domain Enhancement and Cross-Layer Feature Fusion for Small Object Detection

School of Computer Science and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
*
Author to whom correspondence should be addressed.
Sensors 2026, 26(11), 3305; https://doi.org/10.3390/s26113305
Submission received: 30 March 2026 / Revised: 8 May 2026 / Accepted: 20 May 2026 / Published: 22 May 2026
(This article belongs to the Section Sensing and Imaging)

Abstract

Small object detection in uncrewed aerial vehicle (UAV) imagery is hindered by limited pixels, insufficient detailed information, and strong background interference, leading to weak feature representation and poor contextual modeling. To address these issues, we propose a multi-domain enhancement and cross-layer feature fusion detection Transformer (MDCL-DETR) with progressive feature processing. First, a multi-domain enhancement module (MDEM) based on CSP (cross stage partial) structure is proposed, which fuses spatial and frequency-domain features in a lightweight manner to enhance object detail and global structures while effectively distinguishing object features from background interference. Second, a cross-layer feature extraction module (CLEM) is introduced to aggregate multi-scale features across layers, alleviate information loss caused by downsampling, and preserve spatial details of small objects while integrating high-level contextual semantics. Meanwhile, a gated Mamba fusion module (GMFM) is proposed, which adopts the Mamba architecture for long-range dependency modeling of multi-scale features and integrates a gating mechanism to realize the dynamic weighted fusion of local details and global context, further improving feature discriminability and global modeling capability. Finally, a fine-grained enhancement module (FGEM) is designed, which leverages feature reorganization and adaptive feature extraction to reinforce and compensate fine-grained features. Extensive experimental results validate the effectiveness and generalization of the proposed method, achieving mAP50 scores of 54.1% and 56.2% on the VisDrone2019 and AI-TOD datasets.
Keywords: multi-domain enhancement; feature fusion; small object detection; uncrewed aerial vehicle (UAV) multi-domain enhancement; feature fusion; small object detection; uncrewed aerial vehicle (UAV)

Share and Cite

MDPI and ACS Style

Hao, T.; Zhang, X.; Zhou, B. MDCL-DETR: Multi-Domain Enhancement and Cross-Layer Feature Fusion for Small Object Detection. Sensors 2026, 26, 3305. https://doi.org/10.3390/s26113305

AMA Style

Hao T, Zhang X, Zhou B. MDCL-DETR: Multi-Domain Enhancement and Cross-Layer Feature Fusion for Small Object Detection. Sensors. 2026; 26(11):3305. https://doi.org/10.3390/s26113305

Chicago/Turabian Style

Hao, Tianran, Xiao Zhang, and Bing Zhou. 2026. "MDCL-DETR: Multi-Domain Enhancement and Cross-Layer Feature Fusion for Small Object Detection" Sensors 26, no. 11: 3305. https://doi.org/10.3390/s26113305

APA Style

Hao, T., Zhang, X., & Zhou, B. (2026). MDCL-DETR: Multi-Domain Enhancement and Cross-Layer Feature Fusion for Small Object Detection. Sensors, 26(11), 3305. https://doi.org/10.3390/s26113305

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop