Next Article in Journal
Three-Dimensional Deformation Field Inversion Based on Fused Monitoring Data of GNSS and InSAR: A Case Study of Jinchuan No. 2 Mining Area
Previous Article in Journal
Hidden Forest in Non-Forest Land: A Remote Sensing-Based Mapping Case in Lithuania
Previous Article in Special Issue
Per-SAM-MCPA: A Lightweight Framework for Individual Tree Crown Segmentation from UAV Imagery
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

From Generic to Adaptive: Similarity-Adaptive Receptive-Field Cross DETR for Remote-Sensing Object Detection

1
HDU-ITMO Joint Institute, Hangzhou Dianzi University, 1158 No. 2 Ave., Baiyang Sub-district, Qiantang District, Hangzhou 310018, China
2
School of Computer Science, Hangzhou Dianzi University, 1158 No. 2 Ave., Baiyang Sub-District, Qiantang District, Hangzhou 310018, China
3
School of Communication Engineering, Zhejiang Key Laboratory of Low Altitude Ubiquitous Networking Technology, Hangzhou Dianzi University, 1158 No. 2 Ave., Baiyang Sub-district, Qiantang District, Hangzhou 310018, China
4
School of Communication Engineering, Hangzhou Dianzi University, 1158 No. 2 Ave., Baiyang Sub-District, Qiantang District, Hangzhou 310018, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2026, 18(10), 1670; https://doi.org/10.3390/rs18101670
Submission received: 5 April 2026 / Revised: 17 May 2026 / Accepted: 19 May 2026 / Published: 21 May 2026

Abstract

Object detection in optical remote sensing imagery faces persistent challenges from severe instance overlap, extreme spatial density, and motion or atmospheric blur. These degradations cause conventional detectors to over-mix neighboring instance features and fail to separate closely packed objects. To address these limitations, we propose SARC-DETR, a detection framework that augments the RT-DETR architecture with two complementary plug-in modules: Similarity Adaptive Convolution (SAC) and Receptive Field Cross Convolution (RCC). SAC introduces a reproducing-kernel-Hilbert-space (RKHS) motivated similarity gate that selectively suppresses responses inconsistent with local feature prototypes, thereby reducing cross-instance interference in overlapped and blurred regions. RCC constructs a large directional receptive field through orthogonal strip-based aggregation and content-adaptive fusion, enabling efficient long-range context capture without quadratic complexity overhead. Both modules can be integrated into existing DETR-style detectors without modifying the detection head or training protocol. On VisDrone2019-DET, SARC-DETR improves APval from 29.7 to 34.8, AP50val from 49.5 to 56.2, and APSval from 19.2 to 24.8. On DIOR, AP rises from 57.9 to 68.4, and on NWPU VHR-10, from 44.4 to 66.5, demonstrating robust cross-dataset generalization. After structural reparameterization, the additional overhead is less than 0.75 M parameters and 0.36 G FLOPs, confirming deployment suitability for UAV and satellite-based remote sensing applications.
Keywords: remote sensing object detection; DETR; receptive field design; similarity-adaptive convolution; dense scene detection; UAV imagery remote sensing object detection; DETR; receptive field design; similarity-adaptive convolution; dense scene detection; UAV imagery

Share and Cite

MDPI and ACS Style

Lin, C.; Fu, Y.; Xu, H.; Teng, X.; Wang, T. From Generic to Adaptive: Similarity-Adaptive Receptive-Field Cross DETR for Remote-Sensing Object Detection. Remote Sens. 2026, 18, 1670. https://doi.org/10.3390/rs18101670

AMA Style

Lin C, Fu Y, Xu H, Teng X, Wang T. From Generic to Adaptive: Similarity-Adaptive Receptive-Field Cross DETR for Remote-Sensing Object Detection. Remote Sensing. 2026; 18(10):1670. https://doi.org/10.3390/rs18101670

Chicago/Turabian Style

Lin, Chenyu, Yunzhan Fu, Hang Xu, Xuyang Teng, and Tingyu Wang. 2026. "From Generic to Adaptive: Similarity-Adaptive Receptive-Field Cross DETR for Remote-Sensing Object Detection" Remote Sensing 18, no. 10: 1670. https://doi.org/10.3390/rs18101670

APA Style

Lin, C., Fu, Y., Xu, H., Teng, X., & Wang, T. (2026). From Generic to Adaptive: Similarity-Adaptive Receptive-Field Cross DETR for Remote-Sensing Object Detection. Remote Sensing, 18(10), 1670. https://doi.org/10.3390/rs18101670

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop