MDPI - Publisher of Open Access Journals

40 pages, 27259 KB

Open AccessArticle

Monocular 3D Position Estimation of a Moving Vehicle Based on a Kalman-Goldschmidt Adaptive Filter

by Diana Kalita, Pavel Lyakhov, Valery Andreev and Denis Butusov

J. Sens. Actuator Netw. 2026, 15(3), 48; https://doi.org/10.3390/jsan15030048 (registering DOI) - 18 Jun 2026

Viewed by 63

Determining the 3D position of a vehicle from a 2D image plays a key role in video surveillance, autonomous driving, and spatial localization. However, localization accuracy can significantly degrade in conditions of incomplete or synthetic measurement noise and keypoint jitter. In this paper, [...] Read more.

Determining the 3D position of a vehicle from a 2D image plays a key role in video surveillance, autonomous driving, and spatial localization. However, localization accuracy can significantly degrade in conditions of incomplete or synthetic measurement noise and keypoint jitter. In this paper, we propose a new iterative 3D position estimation algorithm (KGA). This algorithm includes geometric correction and calibration steps for converting from 2D to 3D coordinates; trajectory prediction and correction using a Kalman filter; and adaptive tuning of the filter parameters using the Goldschmidt algorithm. Experiments confirm that KGA outperforms the standard (FK) and modified (MFK) Kalman filters in accuracy and convergence speed, demonstrating robustness to various camera angles and noise levels. The novelty of this approach lies in the integration of the Goldschmidt algorithm into the Kalman filter to create an adaptation mechanism that dynamically adjusts the measurement noise covariance based on instantaneous innovation magnitude. Unlike end-to-end deep learning trackers or nonlinear filters (EKF/UKF), KGA is designed as a lightweight post-processing stage that can be seamlessly integrated into existing detection pipelines while maintaining the low computational footprint required for UAV-based edge deployment. The algorithm is of practical value for computer vision systems requiring accurate and robust tracking under varying observational conditions, with current implementation suitable for offline or buffered processing, and clear pathways to real-time deployment through code optimization. The algorithm is of practical value for computer vision systems requiring accurate and robust tracking under varying observational conditions. Full article

(This article belongs to the Section Big Data, Computing and Artificial Intelligence)

► Show Figures

Figure 1

26 pages, 2413 KB

Open AccessArticle

UAV-Assisted Preview-Augmented DSMC with Control Barrier Functions for Safe and Robust Trajectory Tracking of AGVs

by Umar Farid, Muhammad Usman Jamil and Zahid Ullah

Machines 2026, 14(6), 696; https://doi.org/10.3390/machines14060696 (registering DOI) - 17 Jun 2026

Viewed by 385

Abstract

Autonomous navigation of a vehicle in an environment where there are obstacles is difficult due to low onboard sensing technology, high measuring noise, and external interference, which collectively result in poor tracking performance of the vehicle’s trajectory and compromise safety. In this paper, [...] Read more.

Autonomous navigation of a vehicle in an environment where there are obstacles is difficult due to low onboard sensing technology, high measuring noise, and external interference, which collectively result in poor tracking performance of the vehicle’s trajectory and compromise safety. In this paper, a UAV-assisted Distributed Sliding Mode Control (DSMC) is proposed to robustly and safely implement path tracking for autonomous ground vehicles (AGVs). The proposed system utilizes an aero-sensor layer for enhanced perception, such as obstacle sensing, reference path preview, and look-ahead trajectory information, and it shares this information with the vehicle via wireless communication. The fundamental scheme, called DSMC, is based on a conventional Sliding Mode Control (SMC) technique and uses UAV preview-based feedback. This allows anticipation of control actions to enhance tracking performance and achieve more timely, smoother obstacle avoidance than baseline SMC. The proposed method is designed to overcome the limitations of traditional SMC strategies, such as chattering and poor responsiveness. The proposed method features continuous nonlinear approximation and damping mechanisms to reduce chattering and improve response characteristics, thereby enhancing stability and reducing oscillations. Strict safety enforcement through constraint is always achieved by keeping the vehicle and obstacles separated by a minimum distance only; that is, a minimum distance is always guaranteed: a Constraint Barrier Function (CBF)-based constraint is used. By combining UAV-assisted perception with DSMC and CBF the system can guarantee its formal safety in the presence of disturbances and sensing uncertainties while maintaining accurate trajectory tracking. Based on our simulation results, the proposed UAV-assisted DSMC method is shown to be significantly superior to conventional SMC and Model Predictive Controller (MPC) in terms of tracking accuracy, control smoothness, and adherence to the safety margin. Our simulation results demonstrate that the proposed method significantly outperforms conventional SMC and MPC control. Specifically, it achieves a 22.9% reduction in RMSE (0.135 m vs. 0.175 m) and 63% lower mean control effort, and it strictly maintains the minimum safety distance under both static and dynamic obstacles. The algorithm runs in real-time with an average execution time of 1.85 ms (>200 Hz), making it highly suitable for embedded deployment. These results highlight the effectiveness of combining UAV-assisted preview, adaptive robust control, and formal safety constraints for reliable autonomous navigation in complex environments. Full article

(This article belongs to the Special Issue Advances in Automotive Mechatronics)

► Show Figures

Figure 1

33 pages, 8778 KB

Open AccessArticle

SPTD-YOLO: Small-Object-Aware Pyramidal and Task-Aligned Dynamic YOLO for UAV Small Object Detection

by Jiarui Liang, Jiachen Yu, Mingyang Li, Yikui Zhai and Xiaolin Tian

Appl. Sci. 2026, 16(12), 6062; https://doi.org/10.3390/app16126062 - 15 Jun 2026

Viewed by 124

Abstract

Unmanned aerial vehicle (UAV) object detection plays an essential role in modern visual perception, but it remains challenging because UAV imagery typically contains extremely small, densely distributed objects embedded in complex backgrounds. Conventional detectors, including the recent YOLOv12, are prone to losing critical [...] Read more.

Unmanned aerial vehicle (UAV) object detection plays an essential role in modern visual perception, but it remains challenging because UAV imagery typically contains extremely small, densely distributed objects embedded in complex backgrounds. Conventional detectors, including the recent YOLOv12, are prone to losing critical spatial details during downsampling and often exhibit task misalignment between classification and localization, particularly under severe scale variations. To address these problems, this study proposes SPTD-YOLO, a small-object-aware pyramidal and task-aligned dynamic detector. Specifically, a Small Object Enhanced Pyramid (SOEP) is developed by incorporating SPDConv and CSPOmniKernel to preserve and refine shallow, fine-grained features. In addition, a high-resolution P₂ detection layer is introduced to increase spatial grid density and strengthen the structural representation of tiny objects. Furthermore, a Task-Aligned Dynamic Detection Head (TADDH) is designed to decouple and coordinate classification and regression through dynamic convolution and a synergistic dual-gating mechanism. Experiments on VisDrone2019 show that SPTD-YOLO improves mAP@0.5 by 8.37% and mAP@0.5:0.95 by 5.11% over YOLOv12 while maintaining practical efficiency for UAV edge deployment. Full article

► Show Figures

Figure 1

32 pages, 2159 KB

Open AccessArticle

Traffic-Predictive Drone Scheduling: Day-Ahead Synchronization of Mobile Depots and Parallel Aerial Sorties in Urban Airspace

by Shihab Hasan, Tarek Sheltami and Ashraf Mahmoud

Drones 2026, 10(6), 461; https://doi.org/10.3390/drones10060461 - 13 Jun 2026

Viewed by 161

Abstract

Urban Unmanned Aerial Vehicle (UAV) logistics operations are frequently constrained by the intersection of limited battery endurance and dynamic ground traffic. When mobile depots are delayed by congestion, onboard drone fleets experience extended idling periods, leading to constrained sortie generation and reduced asset [...] Read more.

Urban Unmanned Aerial Vehicle (UAV) logistics operations are frequently constrained by the intersection of limited battery endurance and dynamic ground traffic. When mobile depots are delayed by congestion, onboard drone fleets experience extended idling periods, leading to constrained sortie generation and reduced asset utilization. To address this bottleneck, this paper introduces a traffic-predictive multi-UAV dispatch framework for deterministic day-ahead planning under modeled urban operating conditions. By coupling a count-derived macroscopic speed surrogate learned using XGBoost with a Particle Swarm Optimization (PSO)–Mixed-Integer Linear Programming (MILP) optimization architecture, the framework synchronizes mobile depot trajectories with forecasted low-congestion windows and pre-allocates endurance-feasible parallel aerial sorties. Controlled computational experiments across 30 synthetic routing instances demonstrate the potential value of this approach within the stated modeling assumptions. Compared to baseline clustered deployments, the traffic-aware framework raises mean fleet utilization from 0.43 to 0.63—a 46.2% relative improvement driven by temporal compression of the mission window rather than an absolute increase in flight hours. Furthermore, the proposed framework reduces total mission completion time by 69.87% relative to the conventional truck-only baseline, while achieving a 29.58% incremental gain over static speed drone deployments. These findings suggest that incorporating predictive ground traffic information into day-ahead UAV scheduling can improve modeled fleet efficiency; however, field validation with measured route-level speeds, real delivery demand, and operational constraints remains necessary before deployment-level claims can be made. Full article

(This article belongs to the Section Innovative Urban Mobility)

► Show Figures

Figure 1

26 pages, 19446 KB

Open AccessArticle

Automated Synthesis of Hierarchical Deep Learning Cascades for Identifying Visually Similar Objects in UAV Imagery

by Dmytro Borovyk, Oleksander Barmak, Pavlo Radiuk and Iurii Krak

Technologies 2026, 14(6), 360; https://doi.org/10.3390/technologies14060360 - 13 Jun 2026

Viewed by 193

Abstract

Accurate identification of visually similar targets in Unmanned Aerial Vehicle (UAV) imagery is hindered by significant inter-class ambiguity and viewpoint variability. While hierarchical deep learning mitigates these challenges, existing architectures relieve manual design, introducing subjectivity and limiting cross-domain scalability. In this work, we [...] Read more.

Accurate identification of visually similar targets in Unmanned Aerial Vehicle (UAV) imagery is hindered by significant inter-class ambiguity and viewpoint variability. While hierarchical deep learning mitigates these challenges, existing architectures relieve manual design, introducing subjectivity and limiting cross-domain scalability. In this work, we propose an objective, data-driven method for the automated synthesis of hierarchical classification structures. Our approach uses a hybrid inter-class proximity metric that integrates geometric distances between latent-feature-space centroids with empirical misclassification probabilities. Using a hierarchical agglomerative clustering algorithm optimized via an inconsistency coefficient, we synthesize a coarse-to-fine cascade that deploys YOLOv11 for feature extraction and FT-Transformers for specialized identification. Experimental validation on the VisDrone2019 and UAV123 datasets demonstrates that the automatically generated hierarchy achieves a peak F1-score of 94.9%, outperforming the monolithic YOLOv11 model by 0.8% and matching human-designed cascades. Sensitivity analysis indicates an optimal hybrid weight range of 0.4–0.6. The findings confirm that our automated synthesis provides high adaptability and reliability for real-time edge AI deployments, ensuring robust performance in dynamic monitoring environments without requiring manual redesign. Full article

(This article belongs to the Special Issue Advanced Technologies in Computer Vision and Applications)

► Show Figures

Figure 1

25 pages, 18006 KB

Open AccessArticle

Multi-UAV Cooperative Localization in Pseudolite-Augmented GNSS-Denied Regions: An Anomaly-Resilient Adaptive Kalman Filter with Group Covariance Compensation

by Chengyan Ji, Xiye Guo, Yuqiu Tang, Xiaohe Han and Yuhang Song

Drones 2026, 10(6), 460; https://doi.org/10.3390/drones10060460 - 12 Jun 2026

Viewed by 261

Abstract

In complex low-altitude environments, unmanned aerial vehicles (UAVs) require reliable positioning, yet Global Navigation Satellite System (GNSS) signals are vulnerable to occlusion and interference. Pseudolite-augmented cooperative localization, which combines ground base-station signals with inter-UAV relative observations, can complement GNSS in such environments. However, [...] Read more.

In complex low-altitude environments, unmanned aerial vehicles (UAVs) require reliable positioning, yet Global Navigation Satellite System (GNSS) signals are vulnerable to occlusion and interference. Pseudolite-augmented cooperative localization, which combines ground base-station signals with inter-UAV relative observations, can complement GNSS in such environments. However, two practical issues remain in real-world deployment: UAV-to-base-station (U-B) and UAV-to-UAV (U-U) observations have markedly different error statistics that a unified noise adjustment cannot handle, and the conservative covariance estimates produced by Covariance Intersection (CI) fusion bias the innovation-based adaptive noise estimation in distributed architectures. To address these issues, this paper proposes a Distributed Group Covariance Compensation Adaptive Kalman Filter (DGCC-AKF) for collaborative enhancement of UAV regional localization. DGCC-AKF establishes a group adaptive mechanism that independently adjusts the noise covariance matrices of U-B and U-U observations, enabling observation-type-level adaptive weighting that suppresses anomalous U-B or U-U measurements at the group level. In addition, a bounded covariance compensation factor is incorporated to alleviate the CI-induced conservatism in the adaptive noise estimation. The proposed method is evaluated on a 2800 km² semi-physical testbed based on the Ground-based High-precision Local Positioning System (GH-LPS) pseudolite network using measured U-B observations and high-dynamic (>300 km/h) flight trajectories collected from a fixed-wing platform across three independent flight sessions. Results demonstrate that under observation fault periods, the proposed method improves 3D positioning accuracy by up to about 75% over single-UAV extended Kalman filter (EKF). Compared with two advanced algorithms in this field, variational Bayesian adaptive Kalman filter (VBAKF) and maximum correntropy criterion Kalman filter (MCC-EKF), it is the only scheme that remains accurate and stable across all UAVs and fault types. The framework provides a practical step toward field deployment for resilient multi-UAV cooperative navigation in pseudolite-augmented GNSS-denied regions. Full article

(This article belongs to the Topic Navigation and Positioning System: Opportunities and Obstacles)

► Show Figures

Figure 1

31 pages, 4488 KB

Open AccessArticle

Weather-Aware Asynchronous Vehicle–UAV Cooperative Scheduling for Distribution Network Inspection via Bi-Level MODDPG–NSGA-II Optimization

by Xiaoyi Liu, Yuhan Yin, Yetong Zhang, Kunxiao Wu, Jianyong Zheng and Fei Mei

Technologies 2026, 14(6), 355; https://doi.org/10.3390/technologies14060355 - 12 Jun 2026

Viewed by 147

Abstract

Extreme weather conditions impose significant challenges on distribution network inspection because UAV flight safety, energy consumption, vehicle mobility, and task coverage are strongly coupled under wind disturbances. To improve inspection efficiency and operational robustness, this paper proposes a weather-aware asynchronous vehicle–UAV cooperative scheduling [...] Read more.

Extreme weather conditions impose significant challenges on distribution network inspection because UAV flight safety, energy consumption, vehicle mobility, and task coverage are strongly coupled under wind disturbances. To improve inspection efficiency and operational robustness, this paper proposes a weather-aware asynchronous vehicle–UAV cooperative scheduling method based on bi-level MODDPG–NSGA-II optimization. First, a dynamic wind field model and a wind-sensitive UAV energy model are established to describe the effects of background wind, vertical wind shear, and local gust disturbances on UAV motion and state-of-charge evolution. Then, an asynchronous vehicle–UAV collaboration mechanism is developed, allowing the vehicle to move toward downstream parking sites after UAV deployment while UAVs perform inspection and cross-site recovery under rendezvous and energy safety constraints. On this basis, a bi-level optimization framework is constructed, in which NSGA-II searches global coordination parameters and MODDPG learns adaptive multi-UAV scheduling policies in continuous decision spaces. Controlled wind-factor experiments show that, with the task scale fixed at 52 inspection tasks, the proposed method maintains 100% task coverage under 0–10 m/s wind conditions. As the reference wind speed increases from 0 m/s to 10 m/s, the mission completion time increases from 40.97 min to 70.24 min, while the minimum residual SOC decreases from 50.32% to 13.82%, which remains above the predefined safety threshold. Repeated stochastic trials and statistical significance analysis further indicate that the proposed method achieves shorter mission time and more stable task coverage than representative baselines under the same experimental conditions. The scope of this study is simulation-level validation; real-world flight tests and hardware-in-the-loop verification will be further investigated in future work. Full article

(This article belongs to the Section Information and Communication Technologies)

22 pages, 706 KB

Open AccessArticle

Fault Recovery in Distribution Cyber–Physical Systems via UAV-Assisted Emergency Communication

by Wei Wang, Hongquan Xu, Chao Fang, Huibin Jia and Yipeng Wu

Energies 2026, 19(12), 2811; https://doi.org/10.3390/en19122811 - 12 Jun 2026

Viewed by 297

Abstract

The escalating frequency of extreme weather events poses severe threats to power system security, often resulting in catastrophic economic and societal consequences. As modern information and communication technologies (ICTs) integrate deeply with power grids, post-disaster communication failures and electrical faults become increasingly interdependent, [...] Read more.

The escalating frequency of extreme weather events poses severe threats to power system security, often resulting in catastrophic economic and societal consequences. As modern information and communication technologies (ICTs) integrate deeply with power grids, post-disaster communication failures and electrical faults become increasingly interdependent, complicating the restoration of distribution cyber–physical systems (CPSs). To bridge the gap where conventional Unmanned Aerial Vehicle (UAV)-enabled emergency communication ignores coordination with power system restoration, this paper proposes a coordinated recovery method featuring a two-stage UAV deployment strategy. First, a coupled cyber–physical model is established to characterize the cross-layer interaction mechanisms. On this basis, a bi-level optimization framework is developed: the upper level formulates a dynamic two-stage UAV deployment strategy to minimize the mobilization of resources, while the lower level executes network topology reconfiguration to maximize weighted load restoration, constrained by the recovered communication coverage. Simulation results on a modified IEEE 33-bus system demonstrate that the proposed method significantly enhances restoration efficiency. Compared with conventional schemes, the cumulative load loss rate is reduced by 15.75% and 2.42% across different scenarios; the two-stage UAV deployment method achieves a time reduction of 67.23%, 21.40% and 71.56%, validating the superior performance of the coordinated recovery strategy in disaster-stricken CPS. Full article

► Show Figures

Figure 1

34 pages, 4240 KB

Open AccessArticle

A Multimodal Data Fusion Algorithm for Urban Low-Altitude UAV Perception

by Bowen Xu, Peinan He, Xu Wang, Yixiao Zhang and Yuanjie Zhao

Drones 2026, 10(6), 457; https://doi.org/10.3390/drones10060457 - 11 Jun 2026

Viewed by 173

Abstract

Accurate Unmanned Aerial Vehicle (UAV) position estimation is the cornerstone of urban low-altitude safety management systems. Time Difference of Arrival (TDOA) and Remote Identification (Remote ID) are widely used surveillance technologies with complementary characteristics. TDOA provides high-rate updates but suffers from geometry-induced horizontal–vertical [...] Read more.

Accurate Unmanned Aerial Vehicle (UAV) position estimation is the cornerstone of urban low-altitude safety management systems. Time Difference of Arrival (TDOA) and Remote Identification (Remote ID) are widely used surveillance technologies with complementary characteristics. TDOA provides high-rate updates but suffers from geometry-induced horizontal–vertical anisotropy and multipath effects, while Remote ID supplies absolute state information yet struggles with intermittent sampling and packet loss. Existing fusion schemes typically address these issues in isolation: sequential filtering manages asynchrony but assumes Gaussian noise, robust estimators suppress outliers at the cost of discarding valid data, and coupled-filter architectures allow vertical anomalies to contaminate horizontal estimates through the Kalman gain cross-coupling. No prior framework jointly handles structural TDOA altitude jumps, stochastic Remote ID timing jitter, and the geometric anisotropy between estimation subspaces within a single coherent pipeline. To bridge this gap, we propose a Hybrid Conditional Kalman Filter (HCKF) framework comprising three integrated modules. First, a kinematics-based temporal alignment module maps asynchronous measurements onto a uniform timeline and predicts missing samples, resolving cross-modal time mismatches. Second, a measurement quality evaluation mechanism detects TDOA altitude steps via robust two-layer stratification and scores Remote ID timing irregularity through a confidence mapping, converting these anomalies into dynamic covariance adjustments and weight caps without discarding observations. Third, a Subspace-Decoupled Fusion strategy exploits the physical insight that TDOA horizontal precision derives from hyperbolic intersection geometry, whereas its vertical estimates suffer from weak observability due to near-coplanar ground-station deployment. By applying entropy-guided weighting in the horizontal plane and a conditional Remote ID-dominant rule in the vertical axis, this design prevents cross-dimensional error propagation. The framework was validated using three real-world flight missions at distinct altitudes (255 m, 345 m, and 440 m) totaling 13.51 km of flight distance, with RTK serving as ground truth. HCKF reduces the Root Mean Square Error by over 40% relative to single-source baselines (95% bootstrap confidence interval: [35.2%, 48.7%]), and paired Wilcoxon signed-rank tests confirm statistically significant improvement (

p < 0.01

) over standard EKF, Covariance Intersection, and Iterative CI across all three tracks. Full article

(This article belongs to the Special Issue Detection, Identification and Tracking of UAVs and Drones: 2nd Edition)

► Show Figures

Figure 1

33 pages, 5566 KB

Open AccessReview

A Review of Reinforcement Learning for Multirotor UAVs from a Hierarchical Control Perspective: Biomimetic Architecture and Sim-to-Real

by Wei Wei, Xubo Zhao, Yongjie Shu, Qingkai Meng, Mingkai Ding, Yunyi Wang and Qingdong Yan

Drones 2026, 10(6), 448; https://doi.org/10.3390/drones10060448 - 8 Jun 2026

Viewed by 226

Abstract

As unmanned aerial vehicle (UAV) systems evolve from automated execution toward autonomous decision-making, multirotor UAVs increasingly face complex dynamics, uncertain sensing conditions, and task-level autonomy demands. Reinforcement learning (RL) has emerged as a promising learning-based paradigm for addressing these challenges. Existing surveys on [...] Read more.

As unmanned aerial vehicle (UAV) systems evolve from automated execution toward autonomous decision-making, multirotor UAVs increasingly face complex dynamics, uncertain sensing conditions, and task-level autonomy demands. Reinforcement learning (RL) has emerged as a promising learning-based paradigm for addressing these challenges. Existing surveys on RL-based UAV control predominantly classify methods from an algorithmic or learning-paradigm perspective, while relatively little attention has been paid to the functional roles of RL policies within the control loop. This often leads to an unclear correspondence between algorithmic characteristics and the requirements of different control layers. To address this gap, this review proposes a biomimetic “spinal cord–cerebellum–cerebrum” framework, organizing existing RL studies into low-level dynamic stabilization, mid-level perception–action coordination, and high-level task planning and decision-making. The proposed hierarchy emphasizes the functional role and intervention depth of RL policies within the control architecture, further supporting a layer-wise analysis of sim-to-real challenges. This review aims to provide a structured understanding of the roles of reinforcement learning in hierarchical UAV control and to highlight future research directions toward robust real-world deployment. Full article

(This article belongs to the Special Issue Advances in Deep Learning for Drones and Its Applications: 2nd Edition)

► Show Figures

Figure 1

17 pages, 1905 KB

Open AccessArticle

DAS-Net: A Lightweight Dynamic Convolution Network with Attention Gates and Deep Supervision for UAV Semantic Segmentation

by Young Jae Kim and Sang-Chul Kim

Appl. Sci. 2026, 16(11), 5688; https://doi.org/10.3390/app16115688 - 5 Jun 2026

Viewed by 156

Abstract

Anti-UAV surveillance demands real-time pixel-level UAV localization on resource-constrained gimbal-mounted platforms, yet existing lightweight segmentation models suffer from low recall that propagates to downstream tracking failure. Building on our prior dataset of 605,045 paired visible-light and infrared images, we extend the lightweight ThinDyUNet [...] Read more.

Anti-UAV surveillance demands real-time pixel-level UAV localization on resource-constrained gimbal-mounted platforms, yet existing lightweight segmentation models suffer from low recall that propagates to downstream tracking failure. Building on our prior dataset of 605,045 paired visible-light and infrared images, we extend the lightweight ThinDyUNet baseline with three architectural improvements: (1) symmetric dynamic convolution applied to both the encoder and decoder, (2) attention gates filtering skip connections, and (3) deep supervision with auxiliary loss heads. The resulting DAS-Net is evaluated under a three-seed Monte Carlo cross-validation protocol on the full 174,008-image test set. DAS-Net achieves a mean test mIoU of 0.6780 and Dice coefficient of 0.7509 across three independent seeds, outperforming the ThinDyUNet baseline by +6.65 percentage points (pp) in mIoU with statistical significance (one-sided paired t-test, p = 0.045, Cohen’s d = 1.74; full variance and significance analysis in the experimental section). DAS-Net matches the best-performing external baseline (UNet) and exceeds the others (MobileUNet, PAN, PSPNet) while using approximately 14.7× fewer parameters than ResNet-34-based variants. DAS-Net runs at 8.83 ms per image on an NVIDIA A6000 GPU (113 FPS) and 38.44 ms on an NVIDIA Jetson AGX Orin (26 FPS at FP16), demonstrating real-time deployability across server-class and embedded edge platforms. Full article

(This article belongs to the Section Computing and Artificial Intelligence)

► Show Figures

Figure 1

16 pages, 3985 KB

Open AccessArticle

Vehicle Detection in Drone Aerial Views Based on Lightweight YOLOv10-IAD

by Lei Zhang, Zhongmin Li and Yufeng Yao

Sensors 2026, 26(11), 3585; https://doi.org/10.3390/s26113585 - 4 Jun 2026

Viewed by 289

Abstract

UAV-based vehicle detection faces challenges of small targets, dense distribution, and occlusions. Built upon YOLOv10n, this paper proposes YOLOv10-IAD by integrating three modules: (1) Involution convolution in the backbone to enlarge the receptive field and enhance spatial perception for small targets; (2) ACmix [...] Read more.

UAV-based vehicle detection faces challenges of small targets, dense distribution, and occlusions. Built upon YOLOv10n, this paper proposes YOLOv10-IAD by integrating three modules: (1) Involution convolution in the backbone to enlarge the receptive field and enhance spatial perception for small targets; (2) ACmix (Attention and Convolution Mixed) in the neck to fuse local details with global context; (3) DyHead (Dynamic Head) that recalibrates features via scale-, space-, and task-aware attention, improving localization for occluded objects. On VisDrone2019 and UAVDT datasets, YOLOv10-IAD improves mAP50 by 3.7% (to 47.2%) and 3.5% (to 52.0%), and recall by 3.1% and 2.0%, respectively, with only a modest increase in parameters (2.9 M) and computational cost. Compared to other YOLO series, it achieves a favorable trade-off between detection accuracy and computational efficiency. These advancements make it suitable for deployment on hardware onboard UAVs for real-time road vehicle detection. Full article

(This article belongs to the Section Vehicular Sensing)

► Show Figures

Figure 1

28 pages, 549 KB

Open AccessArticle

Constrained Optimization and Dynamic Trade-Off Method for Formation Assignment of Heterogeneous UAV Swarms

by Zhenxing Zhang, Liping Hu, Dongwei Zhang, Rennong Yang, Ying Wang and Jialiang Zuo

Drones 2026, 10(6), 428; https://doi.org/10.3390/drones10060428 - 1 Jun 2026

Viewed by 284

Abstract

This paper addresses the formation assignment problem for heterogeneous UAV swarms in dynamic mission environments. A constrained optimization model is constructed in which UAV capabilities are divided into shareable and exclusive types; a neighborhood collaboration decay factor captures the locality of capability complementarity; [...] Read more.

This paper addresses the formation assignment problem for heterogeneous UAV swarms in dynamic mission environments. A constrained optimization model is constructed in which UAV capabilities are divided into shareable and exclusive types; a neighborhood collaboration decay factor captures the locality of capability complementarity; and a Cobb–Douglas production function evaluates position-specific effectiveness under bottleneck constraints. The objective dynamically trades off deployment costs and system risks through threat-adaptive weight adjustment. To solve the model, a Hybrid Adaptive Large Neighborhood Search (HALNS) algorithm is proposed, integrating an adaptive destroy-repair mechanism, a mathematical-programming-based local search, and an incremental re-optimization strategy for rapid dynamic response. Experiments verify that HALNS attains globally optimal solutions on small-scale instances and outperforms mainstream baselines on medium-to-large problems. The collaboration mechanism raises system effectiveness by an average of 34.75% across four mission scenarios. Compared with static re-optimization, the incremental strategy improves dynamic response performance by 58.25% while reducing runtime by up to 56.7%. Sensitivity analyses confirm the robustness of key parameters. This work provides a theoretical and algorithmic foundation for intelligent UAV swarm assignment and reconfiguration. Full article

(This article belongs to the Section Artificial Intelligence in Drones (AID))

► Show Figures

Figure 1

29 pages, 10417 KB

Open AccessArticle

RHG-DETR: Riemannian Hyper-Graph Transformer with Dynamic Receptive Fields for Detecting Special Targets in Degraded UAV Imagery

by Kaipeng Wang, Guanglin He, Wenhao Kong, Yuzhe Fu and Zongze Li

Remote Sens. 2026, 18(11), 1728; https://doi.org/10.3390/rs18111728 - 27 May 2026

Viewed by 419

Abstract

Special target detection in UAV remote sensing imagery is challenged by composite multi-type degradation, which collectively erodes target structure across every stage of a detection pipeline. Existing methods address individual degradation types in isolation and do not generalize to the composite conditions encountered [...] Read more.

Special target detection in UAV remote sensing imagery is challenged by composite multi-type degradation, which collectively erodes target structure across every stage of a detection pipeline. Existing methods address individual degradation types in isolation and do not generalize to the composite conditions encountered in real deployment. We propose the Riemannian Hyper-Graph Detection Transformer (RHG-DETR), a degradation-robust end-to-end framework composed of the Dynamic Receptive-field Hyper-graph Attention Network (DRHANet), the Bi-directional Weighted Adaptive Fusion Network (BWAFN), and the Adaptive Sparse Multi-scale Encoder with Dynamic Normalization (ASMED). DRHANet introduces anisotropic dynamic depthwise separable convolutions to align receptive fields with local structural orientations and Riemannian hyper-graph fusion to aggregate multi-scale features on a manifold, preserving inter-scale angular relations that Euclidean fusion destroys under degradation. BWAFN employs a bi-directional weighted pyramid in which each fusion node learns per-scale contribution weights, correcting cross-scale semantic misalignment that fixed-weight single-pass aggregation cannot recover. ASMED combines saliency-conditioned sparse window attention to suppress background dilution, a spatially gated feed-forward branch to retain pre-attention spatial geometry, and a bounded dynamic normalizer to stabilize activations under extreme illumination and electromagnetic interference. On a self-constructed UAV special-target dataset spanning seven physics-based degradation types, RHG-DETR achieves 78.5% mAP₅₀, a 3.7% absolute gain over RT-DETR at 34.4% lower GFLOPs and 28.8% fewer parameters at 84.2 FPS, outperforming restoration-then-detect pipelines in both accuracy and latency. Consistent improvements on VisDrone2019 and BDD100K confirm cross-domain generalization. Full article

► Show Figures

Figure 1

25 pages, 6763 KB

Open AccessArticle

PHSNet: A Small-Target Infrared Hotspot Detection Network for Photovoltaic Modules in UAV Remote-Sensing Images

by Bingpeng Gao, Yunbo Yang, Xingzhi Chen, Xin Cai and Xinyuan Nan

J. Imaging 2026, 12(6), 221; https://doi.org/10.3390/jimaging12060221 - 25 May 2026

Viewed by 269

Abstract

With the rapid expansion of global photovoltaic (PV) installed capacity, hot spot defects have become a major hidden danger that reduces power generation efficiency and threatens the safe and stable operation of PV stations. Unmanned aerial vehicle (UAV) infrared remote sensing is a [...] Read more.

With the rapid expansion of global photovoltaic (PV) installed capacity, hot spot defects have become a major hidden danger that reduces power generation efficiency and threatens the safe and stable operation of PV stations. Unmanned aerial vehicle (UAV) infrared remote sensing is a key technology for the efficient intelligent monitoring of large-scale PV stations. However, detecting tiny hotspots in such infrared images poses severe challenges. Most of these defects are ultra-small targets with extremely low pixel size and weak contrast, which are easily submerged by complex background noise, leading to prominent issues including low detection accuracy and high miss rates. To address these issues, we propose a lightweight detection network based on YOLO11n, named PHSNet, for PV hotspot detection in UAV infrared images. Its core designs include the dynamic convolution integrated C3k2 (Dy-C3k2) for small target feature enhancement, context-guided downsampling (CG-Down) to alleviate feature loss during downsampling, optimized detection layers, and a lightweight shared deconvolutional detection head (LSDECD) for small target adaptation in low-altitude aerial scenes, forming a full-link optimization architecture for tiny target feature perception. Experiments on a dedicated dataset (4025 images, 25,181 annotations, 92% targets < 20 pixels) show that PHSNet achieves 0.73 AP₅₀ and 0.315 AP, surpassing YOLO11n by 0.1 in AP₅₀ and 0.058 in AP, respectively. With only 1.8 M parameters and 98.8 FPS, it outperforms mainstream lightweight models, including YOLOv8n and RT-DETR-R18, strikes a superior accuracy–efficiency balance, and provides an efficient solution for real-time intelligent monitoring and edge deployment of PV stations. Full article

(This article belongs to the Section Computer Vision and Pattern Recognition)

► Show Figures

Figure 1

Search Results (227)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (227)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI