Next Article in Journal
Cutting Tool Wear Condition Monitoring in Milling Using Deep Learning and Data Fusion
Previous Article in Journal
Unified Texture Descriptor in the Form of Color and Machine Learning, Applied to Face Identification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

SPTD-YOLO: Small-Object-Aware Pyramidal and Task-Aligned Dynamic YOLO for UAV Small Object Detection

1
Faculty of Innovation Engineering, Macau University of Science and Technology, Avenida Wai Long N°S 100–460, Taipa, Macau, China
2
Department of Intelligent Manufacturing, Wuyi University, Jiangmen 529020, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Appl. Sci. 2026, 16(12), 6062; https://doi.org/10.3390/app16126062 (registering DOI)
Submission received: 14 May 2026 / Revised: 7 June 2026 / Accepted: 11 June 2026 / Published: 15 June 2026

Abstract

Unmanned aerial vehicle (UAV) object detection plays an essential role in modern visual perception, but it remains challenging because UAV imagery typically contains extremely small, densely distributed objects embedded in complex backgrounds. Conventional detectors, including the recent YOLOv12, are prone to losing critical spatial details during downsampling and often exhibit task misalignment between classification and localization, particularly under severe scale variations. To address these problems, this study proposes SPTD-YOLO, a small-object-aware pyramidal and task-aligned dynamic detector. Specifically, a Small Object Enhanced Pyramid (SOEP) is developed by incorporating SPDConv and CSPOmniKernel to preserve and refine shallow, fine-grained features. In addition, a high-resolution P2 detection layer is introduced to increase spatial grid density and strengthen the structural representation of tiny objects. Furthermore, a Task-Aligned Dynamic Detection Head (TADDH) is designed to decouple and coordinate classification and regression through dynamic convolution and a synergistic dual-gating mechanism. Experiments on VisDrone2019 show that SPTD-YOLO improves mAP@0.5 by 8.37% and mAP@0.5:0.95 by 5.11% over YOLOv12 while maintaining practical efficiency for UAV edge deployment.
Keywords: UAV object detection; YOLOv12; small object detection; feature enhancement; dynamic head; task alignment UAV object detection; YOLOv12; small object detection; feature enhancement; dynamic head; task alignment

Share and Cite

MDPI and ACS Style

Liang, J.; Yu, J.; Li, M.; Zhai, Y.; Tian, X. SPTD-YOLO: Small-Object-Aware Pyramidal and Task-Aligned Dynamic YOLO for UAV Small Object Detection. Appl. Sci. 2026, 16, 6062. https://doi.org/10.3390/app16126062

AMA Style

Liang J, Yu J, Li M, Zhai Y, Tian X. SPTD-YOLO: Small-Object-Aware Pyramidal and Task-Aligned Dynamic YOLO for UAV Small Object Detection. Applied Sciences. 2026; 16(12):6062. https://doi.org/10.3390/app16126062

Chicago/Turabian Style

Liang, Jiarui, Jiachen Yu, Mingyang Li, Yikui Zhai, and Xiaolin Tian. 2026. "SPTD-YOLO: Small-Object-Aware Pyramidal and Task-Aligned Dynamic YOLO for UAV Small Object Detection" Applied Sciences 16, no. 12: 6062. https://doi.org/10.3390/app16126062

APA Style

Liang, J., Yu, J., Li, M., Zhai, Y., & Tian, X. (2026). SPTD-YOLO: Small-Object-Aware Pyramidal and Task-Aligned Dynamic YOLO for UAV Small Object Detection. Applied Sciences, 16(12), 6062. https://doi.org/10.3390/app16126062

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop