Next Article in Journal
A Novel Color Image Encryption Method Based on Hierarchical Surrogate-Assisted Optimization
Previous Article in Journal
Gait Generation and Motion Implementation of Humanoid Robots Based on Hierarchical Whole-Body Control
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

FD-RTDETR: Frequency Enhancement and Dynamic Sequence-Feature Optimization for Object Detection

1
School of Electronic and Information Engineering, Anhui Jianzhu University, Hefei 230601, China
2
School of Big Data and Artificial Intelligence, Anhui Xinhua University, Hefei 230088, China
*
Author to whom correspondence should be addressed.
Electronics 2025, 14(23), 4715; https://doi.org/10.3390/electronics14234715 (registering DOI)
Submission received: 28 October 2025 / Revised: 25 November 2025 / Accepted: 27 November 2025 / Published: 29 November 2025

Abstract

The high computational complexity of transformer-based detectors leads to slow inference speeds. RT-DETR demonstrates improved performance in these aspects, yet there remains room for enhancement. To achieve more comprehensive feature learning and better coverage of objects across scales, we introduce FD-RTDETR, a refined architecture for end-to-end object detection. We design a Frequency and Attention-based Intra-scale Feature Interaction module for the hybrid encoder, performing dual-path enhancement on high and low frequency features. Simultaneously, we introduce a Dynamic Fusion of Scale Sequence features module for cross-scale feature fusion, which significantly extends the model’s coverage capability across objects at different scales. Using ResNet-18 as the backbone network and evaluating on the COCO 2017 dataset, our FD-RTDETR achieves 45.1 AP, surpassing RT-DETR by 0.7 AP. On the VisDrone2019 dataset, it achieves 47.9 mAP50, outperforming RT-DETR by 1.3. Our method was also tested for generalization on urinary sediment and high-altitude infrared thermal imaging datasets, achieving 0.7 mAP50 and 0.8 mAP50:95 higher than RT-DETR, respectively, and performs better in certain categories.
Keywords: computer vision; object detection; DETR computer vision; object detection; DETR

Share and Cite

MDPI and ACS Style

Wang, W.; Wang, Q.; Zou, K.; Huang, Y.; Xu, Q. FD-RTDETR: Frequency Enhancement and Dynamic Sequence-Feature Optimization for Object Detection. Electronics 2025, 14, 4715. https://doi.org/10.3390/electronics14234715

AMA Style

Wang W, Wang Q, Zou K, Huang Y, Xu Q. FD-RTDETR: Frequency Enhancement and Dynamic Sequence-Feature Optimization for Object Detection. Electronics. 2025; 14(23):4715. https://doi.org/10.3390/electronics14234715

Chicago/Turabian Style

Wang, Wu, Qijin Wang, Kun Zou, Yichi Huang, and Qi Xu. 2025. "FD-RTDETR: Frequency Enhancement and Dynamic Sequence-Feature Optimization for Object Detection" Electronics 14, no. 23: 4715. https://doi.org/10.3390/electronics14234715

APA Style

Wang, W., Wang, Q., Zou, K., Huang, Y., & Xu, Q. (2025). FD-RTDETR: Frequency Enhancement and Dynamic Sequence-Feature Optimization for Object Detection. Electronics, 14(23), 4715. https://doi.org/10.3390/electronics14234715

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop