This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
SPTD-YOLO: Small-Object-Aware Pyramidal and Task-Aligned Dynamic YOLO for UAV Small Object Detection
by
Jiarui Liang
Jiarui Liang 1,†
,
Jiachen Yu
Jiachen Yu 1,†,
Mingyang Li
Mingyang Li 1,
Yikui Zhai
Yikui Zhai 2
and
Xiaolin Tian
Xiaolin Tian 1,*
1
Faculty of Innovation Engineering, Macau University of Science and Technology, Avenida Wai Long N°S 100–460, Taipa, Macau, China
2
Department of Intelligent Manufacturing, Wuyi University, Jiangmen 529020, China
*
Author to whom correspondence should be addressed.
†
These authors contributed equally to this work.
Appl. Sci. 2026, 16(12), 6062; https://doi.org/10.3390/app16126062 (registering DOI)
Submission received: 14 May 2026
/
Revised: 7 June 2026
/
Accepted: 11 June 2026
/
Published: 15 June 2026
Abstract
Unmanned aerial vehicle (UAV) object detection plays an essential role in modern visual perception, but it remains challenging because UAV imagery typically contains extremely small, densely distributed objects embedded in complex backgrounds. Conventional detectors, including the recent YOLOv12, are prone to losing critical spatial details during downsampling and often exhibit task misalignment between classification and localization, particularly under severe scale variations. To address these problems, this study proposes SPTD-YOLO, a small-object-aware pyramidal and task-aligned dynamic detector. Specifically, a Small Object Enhanced Pyramid (SOEP) is developed by incorporating SPDConv and CSPOmniKernel to preserve and refine shallow, fine-grained features. In addition, a high-resolution P2 detection layer is introduced to increase spatial grid density and strengthen the structural representation of tiny objects. Furthermore, a Task-Aligned Dynamic Detection Head (TADDH) is designed to decouple and coordinate classification and regression through dynamic convolution and a synergistic dual-gating mechanism. Experiments on VisDrone2019 show that SPTD-YOLO improves mAP@0.5 by 8.37% and mAP@0.5:0.95 by 5.11% over YOLOv12 while maintaining practical efficiency for UAV edge deployment.
Share and Cite
MDPI and ACS Style
Liang, J.; Yu, J.; Li, M.; Zhai, Y.; Tian, X.
SPTD-YOLO: Small-Object-Aware Pyramidal and Task-Aligned Dynamic YOLO for UAV Small Object Detection. Appl. Sci. 2026, 16, 6062.
https://doi.org/10.3390/app16126062
AMA Style
Liang J, Yu J, Li M, Zhai Y, Tian X.
SPTD-YOLO: Small-Object-Aware Pyramidal and Task-Aligned Dynamic YOLO for UAV Small Object Detection. Applied Sciences. 2026; 16(12):6062.
https://doi.org/10.3390/app16126062
Chicago/Turabian Style
Liang, Jiarui, Jiachen Yu, Mingyang Li, Yikui Zhai, and Xiaolin Tian.
2026. "SPTD-YOLO: Small-Object-Aware Pyramidal and Task-Aligned Dynamic YOLO for UAV Small Object Detection" Applied Sciences 16, no. 12: 6062.
https://doi.org/10.3390/app16126062
APA Style
Liang, J., Yu, J., Li, M., Zhai, Y., & Tian, X.
(2026). SPTD-YOLO: Small-Object-Aware Pyramidal and Task-Aligned Dynamic YOLO for UAV Small Object Detection. Applied Sciences, 16(12), 6062.
https://doi.org/10.3390/app16126062
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.