Next Article in Journal
Adaptive Kalman Filter-Based UWB Location Tracking with Optimized DS-TWR in Workshop Non-Line-of-Sight Environments
Previous Article in Journal
Cybersecure Intelligent Sensor Framework for Smart Buildings: AI-Based Intrusion Detection and Resilience Against IoT Attacks
Previous Article in Special Issue
Extended Dynamic Model for the UR16e 6-Degree-of-Freedom Robotic Manipulator
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

STAIR-DETR: A Synergistic Transformer Integrating Statistical Attention and Multi-Scale Dynamics for UAV Small Object Detection

by
Linna Hu
1,*,
Penghao Xue
2,
Bin Guo
3,
Yiwen Chen
1,
Weixian Zha
1 and
Jiya Tian
4
1
School of Network and Communication Engineering, Jinling Institute of Technology, Nanjing 211169, China
2
School of Software Engineering, Jinling Institute of Technology, Nanjing 211169, China
3
College of Computer and Information Engineering, Xinjiang Agricultural University, Urumqi 830052, China
4
School of Information Engineering, Xinjiang Institute of Technology, Aksu 843100, China
*
Author to whom correspondence should be addressed.
Sensors 2025, 25(24), 7681; https://doi.org/10.3390/s25247681
Submission received: 30 October 2025 / Revised: 11 December 2025 / Accepted: 16 December 2025 / Published: 18 December 2025
(This article belongs to the Special Issue Dynamics and Control System Design for Robotics)

Abstract

Detecting small objects in unmanned aerial vehicle (UAV) imagery remains a challenging task due to the limited target scale, cluttered backgrounds, severe occlusion, and motion blur commonly observed in dynamic aerial environments. This study presents STAIR-DETR, a real-time synergistic detection framework derived from RT-DETR, featuring comprehensive enhancements in feature extraction, resolution transformation, and detection head design. A Statistical Feature Attention (SFA) module is incorporated into the neck to replace the original AIFI, enabling token-level statistical modeling that strengthens fine-grained feature representation while effectively suppressing background interference. The backbone is reinforced with a Diverse Semantic Enhancement Block (DSEB), which employs multi-branch pathways and dynamic convolution to enrich semantic expressiveness without sacrificing spatial precision. To mitigate information loss during scale transformation, an Adaptive Scale Transformation Operator (ASTO) is proposed by integrating Context-Guided Downsampling (CGD) and Dynamic Sampling (DySample), achieving context-aware compression and content-adaptive reconstruction across resolutions. In addition, a high-resolution P2 detection head is introduced to leverage shallow-layer features for accurate classification and localization of extremely small targets. Extensive experiments conducted on the VisDrone2019 dataset demonstrate that STAIR-DETR attains 41.7% mAP@50 and 23.4% mAP@50:95, outperforming contemporary state-of-the-art (SOTA) detectors while maintaining real-time inference efficiency. These results confirm the effectiveness and robustness of STAIR-DETR for precise small object detection in complex UAV-based imaging scenarios.
Keywords: multi-scale semantic integration; real-time object recognition; small object detection; UAV imagery multi-scale semantic integration; real-time object recognition; small object detection; UAV imagery

Share and Cite

MDPI and ACS Style

Hu, L.; Xue, P.; Guo, B.; Chen, Y.; Zha, W.; Tian, J. STAIR-DETR: A Synergistic Transformer Integrating Statistical Attention and Multi-Scale Dynamics for UAV Small Object Detection. Sensors 2025, 25, 7681. https://doi.org/10.3390/s25247681

AMA Style

Hu L, Xue P, Guo B, Chen Y, Zha W, Tian J. STAIR-DETR: A Synergistic Transformer Integrating Statistical Attention and Multi-Scale Dynamics for UAV Small Object Detection. Sensors. 2025; 25(24):7681. https://doi.org/10.3390/s25247681

Chicago/Turabian Style

Hu, Linna, Penghao Xue, Bin Guo, Yiwen Chen, Weixian Zha, and Jiya Tian. 2025. "STAIR-DETR: A Synergistic Transformer Integrating Statistical Attention and Multi-Scale Dynamics for UAV Small Object Detection" Sensors 25, no. 24: 7681. https://doi.org/10.3390/s25247681

APA Style

Hu, L., Xue, P., Guo, B., Chen, Y., Zha, W., & Tian, J. (2025). STAIR-DETR: A Synergistic Transformer Integrating Statistical Attention and Multi-Scale Dynamics for UAV Small Object Detection. Sensors, 25(24), 7681. https://doi.org/10.3390/s25247681

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop