Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (220)

Search Parameters:
Keywords = altitude fusion

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
18 pages, 4958 KB  
Article
Adaptive Weighted Factor Graph Optimized Positioning Algorithm Based on Joint GNSS/INS/Vision Residual Detection
by Jin Wang, Jun Zou, Yan Xing, Jin Lu, Pengwu Wan and Jianbo Du
Sensors 2026, 26(12), 3783; https://doi.org/10.3390/s26123783 (registering DOI) - 14 Jun 2026
Abstract
Multi-sensor fusion of GNSS, IMU, and vision sensors has been extensively applied in urban Internet of Things systems and automated driving to improve positioning accuracy in complex environments. However, conventional FGO algorithms are based on fixed sensor weights, which limit their adaptability to [...] Read more.
Multi-sensor fusion of GNSS, IMU, and vision sensors has been extensively applied in urban Internet of Things systems and automated driving to improve positioning accuracy in complex environments. However, conventional FGO algorithms are based on fixed sensor weights, which limit their adaptability to fluctuations in sensor errors caused by environmental changes, thereby compromising positioning performance. To overcome this limitation, a novel multi-sensor adaptive weighted localization algorithm based on joint residuals detection was proposed in this study. The algorithm computes joint residuals by the sliding window accumulation of GNSS, IMU, and vision sensor measurements. By integrating a global weight decay factor into the M-estimation framework, the weights of each sensor were dynamically adjusted, thereby suppressing the effects of outliers on the state estimation. This approach enables high-precision and robust estimation of position, velocity, and attitude. Experimental results demonstrate that, based on validation with the GNSS–Visual–Inertial Navigation System (GVINS) public datasets sports field and complex environments, the proposed method exhibits superior performance in challenging low-altitude economic scenarios such as weak GNSS signals and significant IMU drift—specifically, it improves positioning accuracy by 32.3% and reduces velocity error by 32% compared to traditional FGO algorithms. In scenarios with GNSS signal interference, the system effectively mitigates error accumulation and maintains the stability of position and velocity estimation. The proposed algorithm demonstrates exceptional positioning accuracy and robustness in complex and dynamic environments, making it highly suitable for advanced urban IoT and automated driving applications. Full article
(This article belongs to the Special Issue Multi-Sensor Technology for Tracking, Positioning and Navigation)
Show Figures

Figure 1

33 pages, 3890 KB  
Article
Robust Spatial Georeferencing for UAV-UGV Mobile Mapping Platforms in Urban Canyons via Asymmetric GNSS/UWB Fusion
by Jiajia Chen, Xing’ao Wang, Zhibo Fang, Ming Gao, Ying Xu and Zhiyou Zhang
Remote Sens. 2026, 18(12), 1967; https://doi.org/10.3390/rs18121967 (registering DOI) - 13 Jun 2026
Abstract
Reliable spatial georeferencing of mobile mapping platforms is a fundamental prerequisite for high-fidelity urban remote sensing products such as 3D point clouds and digital twins. However, in deep urban canyons, severe signal occlusion and multipath effects reduce visible GNSS satellites, causing ambiguity resolution [...] Read more.
Reliable spatial georeferencing of mobile mapping platforms is a fundamental prerequisite for high-fidelity urban remote sensing products such as 3D point clouds and digital twins. However, in deep urban canyons, severe signal occlusion and multipath effects reduce visible GNSS satellites, causing ambiguity resolution (AR) failure and degraded observation geometry for UGV-borne systems. Conventional Vehicle-to-Vehicle (V2V) cooperation offers limited improvement due to symmetric ground-level occlusion. To overcome this, we propose an asymmetric GNSS/UWB fusion method that introduces Unmanned Aerial Vehicles (UAVs) as high-altitude dynamic spatial anchors to reconstruct the 3D observation geometry. Two contributions are presented: (i) an asymmetric heterogeneous stochastic model coupling carrier-to-noise ratio (C/N0) and elevation angle to handle the quality disparity between air and ground sensor links, preventing multipath contamination of high-fidelity UAV observations; and (ii) a dynamic baseline constrained least-squares algorithm integrating Ultra-Wideband (UWB) ranging to stabilize GNSS positioning under high-dynamic relative motion. Validated through high-fidelity simulations and field experiments, the method achieves a 98.2% AR success rate and sub-decimeter 3D accuracy under extreme occlusion (≤3 visible satellites), while urban-canyon tests demonstrate 100% positioning availability across all evaluated epochs and reduce the 95th-percentile 3D error from 7.25 m to 0.19 m under the tested single-UAV/single-UGV configuration. The framework supports smart city modeling, 3D reconstruction, and infrastructure monitoring. Full article
25 pages, 18006 KB  
Article
Multi-UAV Cooperative Localization in Pseudolite-Augmented GNSS-Denied Regions: An Anomaly-Resilient Adaptive Kalman Filter with Group Covariance Compensation
by Chengyan Ji, Xiye Guo, Yuqiu Tang, Xiaohe Han and Yuhang Song
Drones 2026, 10(6), 460; https://doi.org/10.3390/drones10060460 (registering DOI) - 12 Jun 2026
Abstract
In complex low-altitude environments, unmanned aerial vehicles (UAVs) require reliable positioning, yet Global Navigation Satellite System (GNSS) signals are vulnerable to occlusion and interference. Pseudolite-augmented cooperative localization, which combines ground base-station signals with inter-UAV relative observations, can complement GNSS in such environments. However, [...] Read more.
In complex low-altitude environments, unmanned aerial vehicles (UAVs) require reliable positioning, yet Global Navigation Satellite System (GNSS) signals are vulnerable to occlusion and interference. Pseudolite-augmented cooperative localization, which combines ground base-station signals with inter-UAV relative observations, can complement GNSS in such environments. However, two practical issues remain in real-world deployment: UAV-to-base-station (U-B) and UAV-to-UAV (U-U) observations have markedly different error statistics that a unified noise adjustment cannot handle, and the conservative covariance estimates produced by Covariance Intersection (CI) fusion bias the innovation-based adaptive noise estimation in distributed architectures. To address these issues, this paper proposes a Distributed Group Covariance Compensation Adaptive Kalman Filter (DGCC-AKF) for collaborative enhancement of UAV regional localization. DGCC-AKF establishes a group adaptive mechanism that independently adjusts the noise covariance matrices of U-B and U-U observations, enabling observation-type-level adaptive weighting that suppresses anomalous U-B or U-U measurements at the group level. In addition, a bounded covariance compensation factor is incorporated to alleviate the CI-induced conservatism in the adaptive noise estimation. The proposed method is evaluated on a 2800 km2 semi-physical testbed based on the Ground-based High-precision Local Positioning System (GH-LPS) pseudolite network using measured U-B observations and high-dynamic (>300 km/h) flight trajectories collected from a fixed-wing platform across three independent flight sessions. Results demonstrate that under observation fault periods, the proposed method improves 3D positioning accuracy by up to about 75% over single-UAV extended Kalman filter (EKF). Compared with two advanced algorithms in this field, variational Bayesian adaptive Kalman filter (VBAKF) and maximum correntropy criterion Kalman filter (MCC-EKF), it is the only scheme that remains accurate and stable across all UAVs and fault types. The framework provides a practical step toward field deployment for resilient multi-UAV cooperative navigation in pseudolite-augmented GNSS-denied regions. Full article
Show Figures

Figure 1

25 pages, 8001 KB  
Article
Landslide Deformation Remote Monitoring in Alpine Mountains Using UAV Photogrammetry and Infrared Thermography: A Case Study in Wumeng Mountain Region, China
by Cong Zhao, Meng Wang, Yueping Yin, Yongbo Tie, Sainan Zhu, Jingtao Liang, Su Zhang, Jianguo Feng, Ban Song and Xueqing Li
Remote Sens. 2026, 18(12), 1961; https://doi.org/10.3390/rs18121961 (registering DOI) - 12 Jun 2026
Abstract
Land surface temperature (LST) is crucial for understanding winter landslide evolution. This study combines Unmanned Aerial Vehicle (UAV) photogrammetry and infrared thermography (IRT) to monitor winter landslides in China’s Wumeng Mountain region. Using the Yangjiazhai landslide—induced by underground coal mining—as a case study, [...] Read more.
Land surface temperature (LST) is crucial for understanding winter landslide evolution. This study combines Unmanned Aerial Vehicle (UAV) photogrammetry and infrared thermography (IRT) to monitor winter landslides in China’s Wumeng Mountain region. Using the Yangjiazhai landslide—induced by underground coal mining—as a case study, we demonstrate significant correlations between IRT-detected LST anomalies and surface cracks: (1) cracks with elevated temperatures are likely connected to subsurface goaf zones; (2) excessively widened cracks show no thermal anomalies due to enhanced air convection. The research reveals that key landslide components have distinct LST signatures, governed by differential soil–rock moisture and crack networks. For accurate high-altitude winter LST acquisition, UAV thermal surveys should be conducted under overcast, fog-free conditions to reduce solar interference. This validates UAV visible–infrared fusion for extracting landslide boundaries, cracks, slumping zones, bedrock patterns, and moisture distribution. The methodology establishes a new pathway for investigating winter landslide deformation and instability, confirming IRT’s operational viability in high-altitude alpine regions. Full article
(This article belongs to the Special Issue Advances in GIS and Remote Sensing Applications in Natural Hazards)
14 pages, 1123 KB  
Article
ESKF-g2o-SLAM: A Stereo Visual–Inertial SLAM with ORB Features and ESKF-Based VIO
by Yiyi Cai, Wenyi Jing, Jingneng Ren, Haodong Bai, Simin Li, Yu Sun and Min Xie
Electronics 2026, 15(12), 2599; https://doi.org/10.3390/electronics15122599 (registering DOI) - 12 Jun 2026
Abstract
With the development of the low-altitude economy, low-altitude intelligent agents such as delivery robots, courier drones, and outdoor cleaning robots are gradually moving towards widespread application. One of the core challenges faced by such systems is localization and mapping in complex scenarios characterized [...] Read more.
With the development of the low-altitude economy, low-altitude intelligent agents such as delivery robots, courier drones, and outdoor cleaning robots are gradually moving towards widespread application. One of the core challenges faced by such systems is localization and mapping in complex scenarios characterized by satellite signal denial and unknown environmental prior information. To address this requirement, this paper proposes ESKF-g2o-SLAM, a stereo visual-inertial SLAM system that integrates an ESKF (Error-State Kalman Filter)-based visual-inertial odometry front-end with an ORB-feature-based g2o graph optimization back-end in a cascaded, loosely coupled manner. The proposed method was evaluated on 11 sequences of the EuRoC dataset and compared with state-of-the-art approaches including ORB-SLAM2 (stereo), MSCKF-VIO, OKVIS, and VINS-Fusion (stereo). Ablation studies show marginal improvements on selected sequences and suggest potential robustness advantages under more challenging visual conditions. Experimental results show that our method achieves competitive accuracy in terms of both Absolute Trajectory Error (ATE) and Relative Pose Error (RPE), exhibiting good robustness and stability. Full article
Show Figures

Figure 1

34 pages, 4235 KB  
Article
A Multimodal Data Fusion Algorithm for Urban Low-Altitude UAV Perception
by Bowen Xu, Peinan He, Xu Wang, Yixiao Zhang and Yuanjie Zhao
Drones 2026, 10(6), 457; https://doi.org/10.3390/drones10060457 - 11 Jun 2026
Viewed by 51
Abstract
Accurate Unmanned Aerial Vehicle (UAV) position estimation is the cornerstone of urban low-altitude safety management systems. Time Difference of Arrival (TDOA) and Remote Identification (Remote ID) are widely used surveillance technologies with complementary characteristics. TDOA provides high-rate updates but suffers from geometry-induced horizontal–vertical [...] Read more.
Accurate Unmanned Aerial Vehicle (UAV) position estimation is the cornerstone of urban low-altitude safety management systems. Time Difference of Arrival (TDOA) and Remote Identification (Remote ID) are widely used surveillance technologies with complementary characteristics. TDOA provides high-rate updates but suffers from geometry-induced horizontal–vertical anisotropy and multipath effects, while Remote ID supplies absolute state information yet struggles with intermittent sampling and packet loss. Existing fusion schemes typically address these issues in isolation: sequential filtering manages asynchrony but assumes Gaussian noise, robust estimators suppress outliers at the cost of discarding valid data, and coupled-filter architectures allow vertical anomalies to contaminate horizontal estimates through the Kalman gain cross-coupling. No prior framework jointly handles structural TDOA altitude jumps, stochastic Remote ID timing jitter, and the geometric anisotropy between estimation subspaces within a single coherent pipeline. To bridge this gap, we propose a Hybrid Conditional Kalman Filter (HCKF) framework comprising three integrated modules. First, a kinematics-based temporal alignment module maps asynchronous measurements onto a uniform timeline and predicts missing samples, resolving cross-modal time mismatches. Second, a measurement quality evaluation mechanism detects TDOA altitude steps via robust two-layer stratification and scores Remote ID timing irregularity through a confidence mapping, converting these anomalies into dynamic covariance adjustments and weight caps without discarding observations. Third, a Subspace-Decoupled Fusion strategy exploits the physical insight that TDOA horizontal precision derives from hyperbolic intersection geometry, whereas its vertical estimates suffer from weak observability due to near-coplanar ground-station deployment . By applying entropy-guided weighting in the horizontal plane and a conditional Remote ID-dominant rule in the vertical axis, this design prevents cross-dimensional error propagation. The framework was validated using three real-world flight missions at distinct altitudes (255 m, 345 m, and 440 m) totaling 13.51 km of flight distance, with RTK serving as ground truth. HCKF reduces the Root Mean Square Error by over 40% relative to single-source baselines (95% bootstrap confidence interval: [35.2%, 48.7%]), and paired Wilcoxon signed-rank tests confirm statistically significant improvement (p<0.01) over standard EKF, Covariance Intersection, and Iterative CI across all three tracks. Full article
23 pages, 20700 KB  
Article
Edge-Deployable RGB–Thermal UAV Monitoring for Wildfires in Power Transmission Corridors
by Biao Wang, Daochun Huang, Yifeng Lin, Xu He, Zhengxian Guo and Bo Hong
Remote Sens. 2026, 18(12), 1869; https://doi.org/10.3390/rs18121869 - 6 Jun 2026
Viewed by 292
Abstract
Early wildfire monitoring in power transmission corridors requires reliable detection of weak fire and smoke cues under complex field conditions and strict edge-computing constraints. To address these issues, this paper proposes an edge-deployable RGB–thermal framework based on visible and thermal infrared (TIR) imaging [...] Read more.
Early wildfire monitoring in power transmission corridors requires reliable detection of weak fire and smoke cues under complex field conditions and strict edge-computing constraints. To address these issues, this paper proposes an edge-deployable RGB–thermal framework based on visible and thermal infrared (TIR) imaging for unmanned aerial vehicle (UAV)-based corridor monitoring, including a spatial detector, YOLO-MMSC, and a temporal-enhanced version, YOLO-MMSC-T. The study also establishes a self-collected corridor-oriented RGB–thermal (RGB–T) dataset to complement public wildfire data. Unlike existing RGB–thermal wildfire datasets that mainly focus on forest or wildland fire scenes, the proposed dataset is specifically organized for complex-background power transmission-corridor monitoring, including continuous UAV sequences, nighttime conditions, smoke/vegetation occlusion, long-range small targets, and hard-negative interference. To the best of our knowledge, this is the first self-collected RGB–thermal wildfire dataset designed for this specific application scenario. The framework integrates a mobile inverted bottleneck convolution (MBConv) lightweight backbone, a Shallow Detail Fusion Module (SDFM) for shallow cross-modal alignment and denoising, a Content-Guided Attention (CGA) module for adaptive fusion, and normalized Wasserstein distance (NWD)-based box regression for long-range small-target localization. Experiments on public and self-collected datasets show that YOLO-MMSC achieves 94.6% mAP@0.5, 95.0% precision, and 93.9% recall while running at 60 FPS on Jetson Orin NX. With temporal fine-tuning, YOLO-MMSC-T reaches a continuous detection rate (CDR) of 95.6% with a jitter index of 2.8×103. Field experiments using a DJI Matrice 4T further indicate a practical operating altitude of 120–180 m. These results support lightweight RGB–thermal remote sensing for real-time wildfire monitoring in complex transmission-corridor environments. Full article
Show Figures

Figure 1

25 pages, 3141 KB  
Article
Ground and Low-Altitude Target Classification in Cluttered Radar Remote Sensing via Velocity-Aware Multi-Feature Fusion
by Peilong Hu, Liyu Tian, Mengze Zhang and Zhongshan Zhang
Remote Sens. 2026, 18(11), 1788; https://doi.org/10.3390/rs18111788 - 1 Jun 2026
Viewed by 131
Abstract
Classification of ground and low-altitude targets in radar remote sensing is challenging because environmental clutter and noise can significantly degrade the discriminability of target echoes, especially under complex outdoor observation conditions. To improve the classification performance for humans, vehicles, and unmanned aerial vehicles [...] Read more.
Classification of ground and low-altitude targets in radar remote sensing is challenging because environmental clutter and noise can significantly degrade the discriminability of target echoes, especially under complex outdoor observation conditions. To improve the classification performance for humans, vehicles, and unmanned aerial vehicles (UAVs), this paper proposes a velocity-aware multi-feature fusion method based on measured radar echo data. First, radar echoes are preprocessed using a wavelet-decomposition-based strategy to suppress clutter and noise while preserving useful target information. Then, multiple complementary features, including wavelet packet energy distribution, spectral entropy, spectral standard deviation, temporal standard deviation, amplitude dispersion coefficient, and relative radar cross-section (RCS), are extracted to characterize the target echoes from different perspectives. Considering the influence of target velocity on Doppler distribution and class separability, the measured data are further divided into different velocity intervals for stratified classification. Based on the fused feature vectors, a long short-term memory (LSTM) network is employed to model feature relationships and perform target classification. Experiments conducted on real measured radar echo data demonstrate that the proposed method achieves classification accuracies of 97.82% for UAVs, 96.00% for vehicles, and a mean interval-level accuracy of 96.94%, indicating its effectiveness for ground and low-altitude target classification in cluttered radar remote sensing environments. Full article
Show Figures

Figure 1

29 pages, 7347 KB  
Article
A Lightweight Multi-Level Feature Fusion Detector for UAV-Based Tiny Personnel Detection in Hilly Road Safety Monitoring and Rescue
by Yanghao Cao, Chaojun Sheng, Haishan Tian, Qi Xiao, Ziyi Zhou, Jiayuan Li, Weiwei Tang and Kui Wang
Remote Sens. 2026, 18(11), 1725; https://doi.org/10.3390/rs18111725 - 27 May 2026
Viewed by 298
Abstract
Target detection systems based on UAV platforms have the advantages of speed, flexibility, and agility in safety monitoring and rescue operations on hilly roads. However, due to the high altitude of aerial imaging, terrain occlusions, and interference from complex backgrounds, trapped individuals often [...] Read more.
Target detection systems based on UAV platforms have the advantages of speed, flexibility, and agility in safety monitoring and rescue operations on hilly roads. However, due to the high altitude of aerial imaging, terrain occlusions, and interference from complex backgrounds, trapped individuals often appear as visually negligible small targets, leading to high miss rates and delayed responses in traditional detection methods. To address this urgent need, this paper proposes a lightweight small-target detector called FCML-YOLO. First, a frequency-domain feature enhancement (FDFE) module is designed, which extracts frequency-domain features using the discrete cosine transform and enhances global context perception through adaptive global pooling and multi-branch fully connected layers. Then, a content-aware reassembly of features (CARAFE) module is incorporated to preserve fine-grained image details during upsampling. Additionally, a multi-scale feature reconstruction (MSFR) module is developed, which integrates features from multiple scales and reduces redundant information using an adaptive weighting mechanism. Building on this, we construct a lightweight multi-level feature fusion (LMFF) network by removing redundant structures and fully exploiting deep and shallow features. The experimental results on multiple datasets demonstrate that, compared to YOLO11s, FCML-YOLO achieves 4.4% improvement in mAP50 on the self-built OPVM-VIRD dataset. Additionally, the model demonstrates a significant advantage over mainstream detection models on public datasets such as VisDrone, USOD, DOTA, and TinyPerson. Furthermore, experiments are extended to the search-and-rescue-oriented SARD dataset to verify the applicability of FCML-YOLO in UAV-based rescue scenarios. The model is deployed on a self-developed UAV-mounted detection pod system, with the number of parameters reduced by 62.6% compared to the baseline model, achieving real-time performance at 64 frames per second (FPS). Full article
(This article belongs to the Special Issue Small Target Detection, Recognition, and Tracking in Remote Sensing)
Show Figures

Figure 1

30 pages, 10324 KB  
Article
Spatiotemporal Variations in Snow/Ice Cover, Climate Responses and Future Trends in the Headwaters of the Keriya River on the Northern Slope of the Kunlun Mountains
by Weixiang Sun, Jiayi Zheng, Peilin Lan, Haoran Lu and Kun Xing
Sustainability 2026, 18(11), 5385; https://doi.org/10.3390/su18115385 - 27 May 2026
Viewed by 228
Abstract
Against the backdrop of global warming and the ‘warming and wetting’ trend in north-western China, changes in seasonal snowpack and glacial ice in high-altitude cold regions directly impact water security in inland river basins. At present, there is a paucity of systematic research [...] Read more.
Against the backdrop of global warming and the ‘warming and wetting’ trend in north-western China, changes in seasonal snowpack and glacial ice in high-altitude cold regions directly impact water security in inland river basins. At present, there is a paucity of systematic research concerning the long-term evolution of snow and ice cover, multi-scale climate responses and future trends in the source region of the Keriya River on the northern slope of the Kunlun Mountains. To address this, this study utilised Landsat remote sensing imagery and meteorological station data from 2005 to 2024. Employing a multi-model fusion framework that integrates various machine learning and time-series models—including random forests, gradient boosting trees and ARIMA—the research incorporated trend factors, climate cycle identification and probabilistic modelling of extreme events to systematically analyse the spatiotemporal variability of snow/ice coverage and its multiscale coupling relationships with air temperature and precipitation. Given the inherent limitations of optical remote sensing methods in distinguishing between seasonal snow and glacial ice, this study defines the extracted coverage type as snow/ice coverage. Given the inherent limitations of optical remote sensing methods in distinguishing between seasonal snow and glacial ice, this study defines the extracted coverage type as snow/ice coverage. The results indicate that: (1) the annual average snow/ice cover percentage in the study area shows a non-significant decreasing trend (−0.69%/year, p > 0.1); within the year, it exhibits a pattern of accumulation in winter and melting in summer, with a peak in January (average 63.2%) and a trough in August (average 11.6%); (2) snow/ice cover percentage increases significantly with altitude; the annual average SICP in the <2000 m elevation zone is 5.2%; in the 2000–3000 m and 3000–4000 m altitude ranges, this rises to 5.7% and 8.3%, respectively, representing the primary seasonal snow/ice distribution zones; in areas above 6000 m, the annual average reaches 70.3%, constituting a zone of perennial stable snow/ice cover; (3) the relationship between snow/ice and temperature and precipitation exhibits significant time-scale dependence: correlations are weak on an annual scale (temperature R = −0.25, precipitation R = −0.14), but significantly strengthen on a monthly scale and exhibit seasonal differentiation; during the melting season, temperature exerts a dominant negative influence (August R = −0.35), whilst during the accumulation season, solid precipitation provides a positive supplement (February R = 0.34), with the strongest correlation with temperature occurring in September (R = −0.50); (4) it is projected that between 2025 and 2044, snow and ice cover will follow a fluctuating downward trend (averaging an annual decrease of roughly −0.12%), falling to approximately 29% by 2044; at the same time, temperatures are expected to continue rising (+0.035 °C per year), whilst precipitation will increase slightly (+0.4% per year). The results of this study provide a sound scientific basis for formulating sustainable water resource management strategies for the northern flank of the Kunlun Mountains and optimising measures to regulate snowmelt runoff. They are of great importance for safeguarding the stability of the oasis ecological systems in the Keriya River basin and ensuring the sustainable development and utilisation of water resources. Full article
Show Figures

Figure 1

24 pages, 6298 KB  
Article
Siamese-ViT: A Local–Global Feature Fusion Method for Real-Time Visual Navigation of UAVs in Real-World Environments
by Yu Cheng, Xixiang Liu, Shuai Chen and Chuan Xu
Remote Sens. 2026, 18(10), 1556; https://doi.org/10.3390/rs18101556 - 13 May 2026
Viewed by 226
Abstract
Visual scene matching navigation (VSMN) for unmanned aerial vehicles (UAVs) boasts advantages such as high precision, high reliability, and autonomy. The biggest challenge lies in the tension between local fine-grained information and global semantics, as well as limited generalization ability in real-world environments. [...] Read more.
Visual scene matching navigation (VSMN) for unmanned aerial vehicles (UAVs) boasts advantages such as high precision, high reliability, and autonomy. The biggest challenge lies in the tension between local fine-grained information and global semantics, as well as limited generalization ability in real-world environments. While existing Transformer-based cross-view geolocation methods enhance global context modeling capabilities, they still generally face issues such as high demands on training data and computational resources, insufficient fusion of local fine-grained information and global semantics, and real-time performance in real-world complex environment. To address these problems, we propose a scene matching and localization algorithm based on the Siamese-ViT. For feature extraction, we use the ViT model to extract global features and K-means clustering to aggregate local features. Combined with the global features extracted by the ViT, a robust local–global feature representation vector is generated. For feature matching, incremental principal component analysis (IPCA) is used to reduce the dimensionality of the high-dimensional feature space, and a KD-tree is constructed for fast feature retrieval to improve matching efficiency. We validated our algorithm on the University-1652 dataset and a dataset of real-world satellite-drone image pairs. The results show that our Siamese-ViT outperforms other models in both Recall and AP. We conduct flight experiments in real-world environments, capturing drone images of complex scenes, including farmland, urban buildings, and waterways. The results show that, at a flight altitude of 350 m, our algorithm achieves an average absolute value of 6.2063 m for latitude, 6.7552 m for longitude, and 10.1922 m for horizontal error. Therefore, our Siamese-ViT demonstrates ideal overall positioning accuracy. Full article
Show Figures

Figure 1

30 pages, 22665 KB  
Article
An Enhanced Algorithm Integrating YOLOv11 and ByteTrack for Small-Object Detection and Tracking in Low-Altitude Remote Sensing Imagery
by Jianfeng Han, Feijie Sun, Zihan Xu, Lili Song and Jiandong Fang
Remote Sens. 2026, 18(10), 1547; https://doi.org/10.3390/rs18101547 - 13 May 2026
Viewed by 410
Abstract
In vision-based low-altitude unmanned aerial vehicle (UAV) remote sensing, detecting small targets accurately and maintaining stable tracking under fast-motion conditions remain significant challenges. Specifically, small-object detection suffers from low feature representation, while camera motion often induces tracking drift and identity switches. To address [...] Read more.
In vision-based low-altitude unmanned aerial vehicle (UAV) remote sensing, detecting small targets accurately and maintaining stable tracking under fast-motion conditions remain significant challenges. Specifically, small-object detection suffers from low feature representation, while camera motion often induces tracking drift and identity switches. To address these issues, this paper proposes a novel small target detection and tracking algorithm named TCYOLO-SofByteTrack, which integrates an improved YOLOv11 with ByteTrack. The algorithm comprises two core innovative modules: First, the TCYOLO detector is designed by integrating the C3k2-TA feature enhancement module with triplet attention mechanism to achieve cross-dimensional interaction modeling, significantly improving small target feature representation capability and network contextual awareness. A Cross-Scale Feature Fusion Module for UAVs (CCFM-UAV) is constructed to provide precise detection support for small targets at different scales. Second, building upon the ByteTrack framework, the SofByteTrack tracker is designed, which introduces a sparse optical flow-based motion compensation strategy. This strategy estimates and compensates for image displacement caused by UAV motion in real time, ensuring the stability of target bounding boxes under fast-motion conditions, thereby effectively mitigating tracking drift and identity switches. Experimental results demonstrate that the TCYOLO detector achieves a 7.4% improvement in mAP for small target detection compared to the baseline YOLOv11 model. The complete TCYOLO-SofByteTrack tracking algorithm achieves a HOTA score of 45.3%, MOTA of 42.7%, and IDF1 of 57.8%, representing improvements of 4.5%, 5.9%, and 8.0%, respectively, over the baseline methods. Furthermore, the number of successfully tracked targets increased by 37.3%, while identity switches decreased by 23.4%. These results demonstrate the notable advantages of the proposed method in small target detection accuracy, tracking precision, and identity consistency. Its generalization capability is further validated on a custom highway inspection dataset. Moreover, deployment tests on an NVIDIA Jetson Orin NX platform show that, compared to YOLOv11n, the proposed algorithm achieves higher detection accuracy while still meeting real-time processing requirements, highlighting its practical applicability in resource-constrained scenarios. Full article
Show Figures

Figure 1

22 pages, 2017 KB  
Article
Fault-Aware Kalman-Based Method for UAV Altitude Estimation Under Radar Altimeter Anomalies
by Van Dung Vu, Xuan Sinh Mai, Kieu Trang Le, Minh Vu Tran and Thanh Dong Nguyen
Drones 2026, 10(5), 369; https://doi.org/10.3390/drones10050369 - 11 May 2026
Viewed by 339
Abstract
Reliable altitude and vertical speed estimation are fundamental for unmanned aerial vehicle (UAV) autonomous flight, especially during low-altitude operations such as takeoff and landing. Barometric altimeters are widely used due to their low cost, high availability, and good long-term stability, providing smooth altitude [...] Read more.
Reliable altitude and vertical speed estimation are fundamental for unmanned aerial vehicle (UAV) autonomous flight, especially during low-altitude operations such as takeoff and landing. Barometric altimeters are widely used due to their low cost, high availability, and good long-term stability, providing smooth altitude trends over a wide operating range. However, barometric measurements are indirectly inferred from static pressure and are therefore sensitive to local airflow disturbances. In particular, rotor downwash and ground effect-induced pressure perturbations near the surface can introduce significant biases and short-term fluctuations in barometric altitude, which propagate into erroneous vertical speed estimates during critical flight phases. Time-of-flight (TOF) altimeters, such as radar or laser sensors, provide direct above-ground-level (AGL) measurements and are largely insensitive to ground effect-related pressure disturbances. Within their limited operational range, TOF altimeters typically offer higher accuracy and lower short-term noise compared with barometric altitude. Nevertheless, TOF sensors are characterized by a restricted valid measurement range and frequently exhibit non-ideal behaviors in real-world UAV operations, including out-of-range outputs, frozen measurements, and in-range biased readings. These anomalies violate the nominal sensor assumptions used in conventional Kalman filter-based fusion and can significantly degrade estimation performance if not properly handled. This paper proposes a hybrid Kalman–rule-based altitude estimation framework that fuses barometric and TOF altitude measurements to exploit their complementary characteristics while mitigating their respective limitations. A vertical dynamic state-space model is formulated to jointly estimate altitude, vertical velocity, accelerometer bias, and ground height offset. A rule-based anomaly detection and classification module is developed to identify multiple TOF altimeter failure modes observed in operational UAV flights. The detected anomaly states are incorporated into the Kalman filter to adaptively weight, accept, or reject TOF measurements, thereby improving robustness against sensor non-idealities. The proposed approach is validated using 39 real UAV flight logs covering diverse flight regimes, including low-altitude maneuvers, cruise, and autonomous landing. Experimental results show that the proposed framework provides more stable and robust altitude and vertical speed estimation under practical sensor anomaly conditions compared with conventional barometer-only and standard Kalman fusion configurations. These results demonstrate the practical effectiveness of the proposed method for fault-aware altitude estimation in UAV autonomous flight. Full article
Show Figures

Figure 1

22 pages, 6871 KB  
Article
GSC-YOLO: A Pedestrian Detection Method for Low-Light Security Surveillance Scenarios
by Wei Qing, Fan Li, Shuang Li and Pengfei Yin
Sensors 2026, 26(10), 2987; https://doi.org/10.3390/s26102987 - 9 May 2026
Viewed by 664
Abstract
Pedestrian detection in nighttime security surveillance and other low-light visual sensing tasks is an important foundation for intelligent perception in complex environments. Under low-light conditions, visible-light images often suffer from missing texture details, intensified noise, and reduced contrast, which can easily lead to [...] Read more.
Pedestrian detection in nighttime security surveillance and other low-light visual sensing tasks is an important foundation for intelligent perception in complex environments. Under low-light conditions, visible-light images often suffer from missing texture details, intensified noise, and reduced contrast, which can easily lead to insufficient target representation, unstable cross-scale feature fusion, and an increased risk of missed detections. Although multimodal schemes, such as RGB–infrared approaches, can improve detection performance by exploiting modal complementarity, they involve relatively high hardware costs, cross-modal calibration complexity, and system integration overhead, which impose deployment limitations in lightweight or cost-sensitive scenarios. Therefore, developing an efficient pedestrian detection method for low-light monocular RGB scenarios is of clear practical value. This study focuses on low-light monocular RGB pedestrian detection and proposes an application-oriented structurally optimized model, termed GSC-YOLO, built upon YOLOv13. First, GhostNetV3 is introduced as the backbone to enhance multi-scale feature representation under weak-texture conditions. Second, a Semantic–Spatial Alignment (SSA) module is designed to improve information compensation and suppress noise during the feature fusion stage. Finally, C2f_Faster is incorporated into the high-level semantic branch to optimize information flow and reduce redundant computation. On the RGB subsets of the two public datasets, LLVIP and KAIST, GSC-YOLO achieves mAP@0.5:0.95 values of 57.70% and 66.61%, respectively, and Recall values of 89.93% and 90.49%, respectively, consistently outperforming the YOLOv13 baseline. The results demonstrate that, under the experimental settings adopted in this study, the proposed method effectively improves pedestrian perception performance in low-light RGB scenes while maintaining favorable real-time inference capability, and may provide a useful reference for front-end vision sensing research in low-altitude intelligent networks. Full article
Show Figures

Figure 1

27 pages, 20342 KB  
Article
SBG-DDN: Semantic Boundary-Guided Dynamic Detection Network for Small Transmission Tower Detection in Remote Sensing Images
by Kun Cheng, Yunhe Cui, Yifan Liu, Pengyu Yin, Yu Jiang, Ningzhe Liu, Zhe Yin, Jiaping Wu, Tonggang Zhao and Junxiang Tan
Remote Sens. 2026, 18(10), 1452; https://doi.org/10.3390/rs18101452 - 7 May 2026
Viewed by 365
Abstract
Accurate detection of small transmission towers in remote sensing images is critical for low-altitude aviation safety and power-grid monitoring, yet it remains challenging due to complex backgrounds, weak target responses, and severe feature submersion. To address these issues, we propose a Semantic Boundary-Guided [...] Read more.
Accurate detection of small transmission towers in remote sensing images is critical for low-altitude aviation safety and power-grid monitoring, yet it remains challenging due to complex backgrounds, weak target responses, and severe feature submersion. To address these issues, we propose a Semantic Boundary-Guided Dynamic Detection Network (SBG-DDN) that integrates a semantic boundary-guided representation framework with a dynamic detection and localization optimization scheme. Specifically, the proposed method combines a frozen DINOv3 backbone to provide global semantic priors and a CSPDarknet backbone to capture local boundary-sensitive details while further enhancing their interaction through the Semantic-Boundary Fusion Module (SBFM). In addition, a Dynamic Semantic-Boundary Optimization Head (DSBOH) and an Adaptive Structure-Aware Transmission Tower IoU (ASTIoU) loss are introduced to improve multi-scale feature adaptation and geometry-aware localization for sparse and elongated transmission towers. To support broader evaluation in this area, we constructed the Power Transmission Tower Object Detection (PTTOD) dataset, which covers multiple countries and diverse geographic environments. Experimental results on the public SRSPTD dataset and the proposed PTTOD dataset demonstrate the effectiveness of the proposed method. On SRSPTD, SBG-DDN achieves 74.3% mAP@0.5 and 35.1% mAP@0.5:0.95, outperforming existing state-of-the-art detectors. Full article
(This article belongs to the Special Issue Deep Learning-Based Small-Target Detection in Remote Sensing)
Show Figures

Figure 1

Back to TopTop