A Real-Time Improved YOLOv10 Model for Small and Multi-Scale Ground Target Detection in UAV LiDAR Range Images of Complex Scenes

Zhai, Yu; Zhang, Ziyi; Xie, Sen; Tong, Chunsheng; Luo, Xiuli; Li, Xuan; Wang, Liming; Zhao, Yingliang

doi:10.3390/electronics15010211

This is an early access version, the complete PDF, HTML, and XML versions will be available soon.

Open AccessArticle

A Real-Time Improved YOLOv10 Model for Small and Multi-Scale Ground Target Detection in UAV LiDAR Range Images of Complex Scenes

by

Yu Zhai

^1,2,

Ziyi Zhang

^1,2,

Sen Xie

^1,2,

Chunsheng Tong

^1,2,

Xiuli Luo

^1,2,

Xuan Li

^1,2,

Liming Wang

^1,2 and

Yingliang Zhao

^1,2,*

¹

State key Laboratory of Extreme Environment Optoelectronic Dynamic Measurement Technology and Instrument, Taiyuan 030051, China

²

Shanxi Province Key Laboratory of Intelligent Detection Technology & Equipment, Taiyuan 030051, China

^*

Author to whom correspondence should be addressed.

Electronics 2026, 15(1), 211; https://doi.org/10.3390/electronics15010211 (registering DOI)

Submission received: 21 November 2025 / Revised: 29 December 2025 / Accepted: 29 December 2025 / Published: 1 January 2026

(This article belongs to the Special Issue Image Processing for Intelligent Electronics in Multimedia Systems)

Download Versions Notes

Abstract

Low-altitude Unmanned Aerial Vehicle (UAV) detection using LiDAR range images faces persistent challenges. These include sparse features for long-range targets, large scale variations caused by viewpoint changes, and severe interference from complex backgrounds. To address these issues, we propose an improved detection framework based on YOLOv10. First, we design a Swin-Conv hybrid module that combines sparse attention with deformable convolution. This module enables the network to focus on informative regions and adapt to target geometry. These capabilities jointly strengthen feature extraction for sparse, long-range targets. Second, we introduce Attentional Feature Fusion (AFF) in the neck to replace naïve feature concatenation. AFF employs multi-scale channel attention to softly select and adaptively weight features from different levels, improving robustness to multi-scale targets. In addition, we systematically study how the viewpoint distribution in the training set affects performance. The results show that moderately increasing the proportion of low-elevation-view samples significantly improves detection accuracy. Experiments on a self-built simulated LiDAR range-image dataset demonstrate that our method achieves 88.96% mAP at 54.2 FPS, which is 4.78 percentage points higher than the baseline. Deployment on the Jetson Orin Nano edge device further validates the model’s potential for real-time applications. The proposed method remains robust under noise and complex backgrounds. The proposed approach achieves an effective balance between detection accuracy and computational efficiency, providing a reliable solution for real-time target detection in complex low-altitude environments.

Keywords: object detection; YOLOv10; LiDAR; UAV detection; feature fusion

Share and Cite

MDPI and ACS Style

Zhai, Y.; Zhang, Z.; Xie, S.; Tong, C.; Luo, X.; Li, X.; Wang, L.; Zhao, Y. A Real-Time Improved YOLOv10 Model for Small and Multi-Scale Ground Target Detection in UAV LiDAR Range Images of Complex Scenes. Electronics 2026, 15, 211. https://doi.org/10.3390/electronics15010211

AMA Style

Zhai Y, Zhang Z, Xie S, Tong C, Luo X, Li X, Wang L, Zhao Y. A Real-Time Improved YOLOv10 Model for Small and Multi-Scale Ground Target Detection in UAV LiDAR Range Images of Complex Scenes. Electronics. 2026; 15(1):211. https://doi.org/10.3390/electronics15010211

Chicago/Turabian Style

Zhai, Yu, Ziyi Zhang, Sen Xie, Chunsheng Tong, Xiuli Luo, Xuan Li, Liming Wang, and Yingliang Zhao. 2026. "A Real-Time Improved YOLOv10 Model for Small and Multi-Scale Ground Target Detection in UAV LiDAR Range Images of Complex Scenes" Electronics 15, no. 1: 211. https://doi.org/10.3390/electronics15010211

APA Style

Zhai, Y., Zhang, Z., Xie, S., Tong, C., Luo, X., Li, X., Wang, L., & Zhao, Y. (2026). A Real-Time Improved YOLOv10 Model for Small and Multi-Scale Ground Target Detection in UAV LiDAR Range Images of Complex Scenes. Electronics, 15(1), 211. https://doi.org/10.3390/electronics15010211

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Real-Time Improved YOLOv10 Model for Small and Multi-Scale Ground Target Detection in UAV LiDAR Range Images of Complex Scenes

Abstract

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI