GLP-VO: A Hybrid Visual Odometry Framework for Low-Altitude UAV Imaging in Complex Urban Environments
Highlights
- A hybrid visual odometry framework (GLP-VO) is proposed, integrating line features with point features to construct a robust geometric constraint model.
- An adaptive weighting strategy is designed to dynamically adjust the contributions of point and line features based on real-time scene texture and structural complexity.
- The proposed method effectively addresses UAV positioning drift in low-texture or dynamic urban environments, significantly outperforming traditional point-based approaches.
- The method provides a low-drift, highly robust autonomous navigation solution for low-altitude urban flight that does not rely on pre-training or GPU-intensive learning modules.
Abstract
1. Introduction
2. Related Work
2.1. Point-Based Visual Odometry Algorithm
2.2. Multi-Feature Fusion VO/vSLAM Algorithm
3. Materials and Methods
3.1. Feature Extraction and Description
3.1.1. Representation of Geometric Structural Features
3.1.2. Feature Descriptor Generation
3.2. Geometric Structure Feature Matching
3.2.1. Descriptor Preprocessing
3.2.2. Similarity-Based Matching
3.2.3. Geometric Validation
3.3. Feature Fusion
3.3.1. Adaptive Weighting Mechanism for Feature Fusion
| Algorithm 1: Feature fusion with adaptive weighting. |
![]() |
3.3.2. Complexity-Based Dynamic Weighting for Geometric Structure Features
3.3.3. Joint Descriptor Construction for Feature Fusion
3.3.4. Online Learning for Dynamic Weight Adjustment
3.4. Joint Pose Estimation and Optimization
3.4.1. Pose Estimation Framework
3.4.2. Joint Pose Optimization
3.4.3. Geometric Constraints in Pose Optimization
3.4.4. Optimization Algorithm
4. Results
4.1. Quantitative Evaluation on the TUM RGB-D Dataset
4.2. Real-World Experiments
4.2.1. Construction of the Experimental Dataset
4.2.2. Initialization Speed Comparison in Real Scenarios
4.2.3. Comparison of Stability and Accuracy in Real Scenarios
4.2.4. Linear Assessment of GLP-VO Function
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Dai, M.; Zheng, E.; Feng, Z.; Qi, L.; Zhuang, J.; Yang, W. Vision-based UAV self-positioning in low-altitude urban environments. IEEE Trans. Image Process. 2023, 33, 493–508. [Google Scholar] [CrossRef] [PubMed]
- Shafafi, K.; Ricardo, M.; Campos, R. Traffic and Obstacle-Aware UAV Positioning in Urban Environments Using Reinforcement Learning. IEEE Access 2024, 12, 188652–188663. [Google Scholar] [CrossRef]
- Zhang, Z.; Cao, Y.; Ding, M.; Zhang, L.; Tao, J. Monocular Vision Based Obstacle Avoidance Trajectory Planning for Unmanned Aerial Vehicle. Aerosp. Sci. Technol. 2020, 106, 106199. [Google Scholar] [CrossRef]
- Song, Y.; Hsu, L. Tightly Coupled Integrated Navigation System Via Factor Graph for UAV Indoor Localization. Aerosp. Sci. Technol. 2021, 108, 106370. [Google Scholar] [CrossRef]
- Vetrella, A.R.; Fasano, G.; Accardo, D. Attitude Estimation for Cooperating UAVs Based on Tight Integration of GNSS and Vision Measurements. Aerosp. Sci. Technol. 2019, 84, 966–974. [Google Scholar] [CrossRef]
- Wei, J.; Karakay, D.; Yilmaz, A. A Gis Aided Approach for Geolocalizing an Unmanned Aerial System Using Deep Learning. In Proceedings of the 2022 IEEE Sensors, Dallas, TX, USA, 30 October–2 November 2022; pp. 1–4. [Google Scholar] [CrossRef]
- Russell, J.S.; Ye, M.; Anderson, B.D.O.; Hmam, H.; Sarunic, P. Cooperative Localisation of a GPS-Denied UAV in 3-Dimensional Space Using Direction of Arrival Measurements. arXiv 2017, arXiv:1703.06261. [Google Scholar] [CrossRef]
- Mur-Artal, R.; Montiel, J.M.M.; Tardós, J.D. ORB-SLAM: A Versatile and Accurate Monocular SLAM System. IEEE Trans. Robot. 2015, 31, 1147–1163. [Google Scholar] [CrossRef]
- Zhang, Y.; Zhu, P.; Ren, W. PL-CVIO: Point-Line Cooperative Visual-Inertial Odometry. arXiv 2023, arXiv:2311.05717. [Google Scholar] [CrossRef]
- Zheng, F.; Zhou, L.; Lin, W.; Liu, J.; Sun, L. LRPL-VIO: A Lightweight and Robust Visual–Inertial Odometry with Point and Line Features. Sensors 2024, 24, 1322. [Google Scholar] [CrossRef]
- Liu, X.; Wang, H.; Yang, S. A Fast Point-Line Visual-Inertial Odometry with Structural Regularity. In Proceedings of the 2023 IEEE 2nd Industrial Electronics Society Annual On-Line Conference (ONCON), Online, 8–10 December 2023; pp. 1–6. [Google Scholar] [CrossRef]
- Gee, A.P.; Mayol-Cuevas, W. Real-Time Model-Based SLAM Using Line Segments. In Advances in Visual Computing; Springer: Berlin/Heidelberg, Germany, 2006; Volume 4292, pp. 354–363. [Google Scholar] [CrossRef]
- Pumarola, A.; Vakhitov, A.; Agudo, A.; Sanfeliu, A.; Moreno-Noguer, F. PL-SLAM: Real-Time Monocular Visual SLAM with Points and Lines. In Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA), Singapore, 29 May–3 June 2017; pp. 4503–4508. [Google Scholar] [CrossRef]
- Yang, G.; Wang, Q.; Liu, P.; Zhang, H. An Improved Monocular PL-SLAM Method with Point-Line Feature Fusion Under Low-Texture Environment. In Proceedings of the 2021 ACM 4th International Conference on Control and Computer Vision, Macau, China, 13–15 August 2021; pp. 119–125. [Google Scholar] [CrossRef]
- Ebrahimi, H.; Yazdaninejadi, A. Comparing the Superimposed Voltage-Current-Based and Superimposed Energy-Based Fault Detection Methods in Series-Compensated Lines: Exploring Challenges and Issues. In Proceedings of the 2022 IEEE International Conference on Protection and Automation of Power Systems (IPAPS), Zahedan, Iran, 19–20 January 2022; pp. 1–10. [Google Scholar] [CrossRef]
- Wu, D.; Wang, M.; Li, Q.; Xu, W.; Zhang, T.; Ma, Z. Visual Odometry with Point and Line Features Based on Underground Tunnel Environment. IEEE Access 2023, 11, 24003–24015. [Google Scholar] [CrossRef]
- Li, C.; Yan, L.; Xia, Y. A Real-Time Visual-Inertial Monocular Odometry by Fusing Point and Line Features. In Proceedings of the 2021 40th Chinese Control Conference (CCC), Shanghai, China, 26–28 July 2021; pp. 4085–4090. [Google Scholar] [CrossRef]
- Tan, W.; Liu, H.; Dong, Z.; Zhang, G.; Bao, H. Robust Monocular SLAM in Dynamic Environments. In Proceedings of the 2013 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), Adelaide, SA, Australia, 1–4 October 2013; pp. 209–218. [Google Scholar]
- Li, J.; Hu, Q.; Ai, M.; Zhong, R. Robust Feature Matching Via Support-Line Voting and Affine-Invariant Ratios. ISPRS J. Photogramm. Remote Sens. 2017, 132, 61–76. [Google Scholar] [CrossRef]
- Chai, Z.; Shi, X.; Zhou, Y.; Xiong, Z. A real-time global re-localization framework for 3D LiDAR SLAM. arXiv 2021, arXiv:2109.00200. [Google Scholar]
- Yang, S.; Scherer, S. Direct Monocular Odometry Using Points and Lines. arXiv 2017, arXiv:1703.06380. [Google Scholar] [CrossRef]
- Forster, C.; Pizzoli, M.; Scaramuzza, D. SVO: Fast Semi-Direct Monocular Visual Odometry. In Proceedings of the 2014 IEEE International Conference on Robotics and Automation (ICRA), Hong Kong, China, 31 May–7 June 2014; pp. 15–22. [Google Scholar] [CrossRef]
- Engel, J.; Schöps, T.; Cremers, D. LSD-SLAM: Large-Scale Direct Monocular SLAM. In Proceedings of the European Conference on Computer Vision (ECCV), Zurich, Switzerland, 6–12 September 2014; Springer: Berlin/Heidelberg, Germany, 2014; pp. 834–849. [Google Scholar]
- Wang, J.; Liu, S.; Zhang, P. A New Line Matching Approach for High-Resolution Line Array Remote Sensing Images. Remote Sens. 2022, 14, 3287. [Google Scholar] [CrossRef]
- Bartoli, A.; Sturm, P. Structure-from-Motion Using Lines: Representation, Triangulation, and Bundle Adjustment. Comput. Vis. Image Underst. 2005, 100, 416–441. [Google Scholar] [CrossRef]
- Lee, S.J.; Hwang, S.S. Elaborate Monocular Point and Line SLAM with Robust Initialization. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 1121–1129. [Google Scholar]
- Zuo, X.; Xie, X.; Liu, Y.; Huang, G. Robust Visual SLAM with Point and Line Features. In Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada, 24–28 September 2017; pp. 1775–1782. [Google Scholar]
- Zhu, Y.; Jin, R.; Lou, T.; Zhao, L. PLD-VINS: RGBD Visual-Inertial SLAM with Point and Line Features. Aerosp. Sci. Technol. 2021, 119, 107185. [Google Scholar] [CrossRef]
- Weiss, S.; Achtelik, M.W.; Lynen, S.; Chli, M.; Siegwart, R. Realtime Onboard Visual-Inertial State Estimation and Self-Calibration of MAVs in Unknown Environments. In Proceedings of the 2012 IEEE International Conference on Robotics and Automation (ICRA), St. Paul, MN, USA, 14–18 May 2012; pp. 957–964. [Google Scholar]
- Lynen, S.; Achtelik, M.W.; Weiss, S.; Chli, M.; Siegwart, R. A Robust and Modular Multi-Sensor Fusion Approach Applied to MAV Navigation. In Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Tokyo, Japan, 3–7 November 2013; pp. 3923–3929. [Google Scholar]
- Yang, Z.; Shen, S. Monocular Visual–Inertial State Estimation with Online Initialization and Camera–IMU Extrinsic Calibration. IEEE Trans. Autom. Sci. Eng. 2017, 14, 39–51. [Google Scholar] [CrossRef]
- Mur-Artal, R.; Tardos, J.D. Visual-Inertial Monocular SLAM with Map Reuse. IEEE Robot. Autom. Lett. 2017, 2, 796–803. [Google Scholar] [CrossRef]
- Bloesch, M.; Omari, S.; Hutter, M.; Siegwart, R. Robust Visual Inertial Odometry Using a Direct EKF-Based Approach. In Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Hamburg, Germany, 28 September–2 October 2015; pp. 298–304. [Google Scholar]
- Liu, Z.; Shi, D.; Li, R.; Qin, W.; Zhang, Y.; Ren, X. PLC-VIO: Visual–Inertial Odometry Based on Point-Line Constraints. IEEE Trans. Autom. Sci. Eng. 2022, 19, 1880–1897. [Google Scholar] [CrossRef]
- Mishra, S.; Palanisamy, P. Autonomous Advanced Aerial Mobility - An End-to-End Autonomy Framework for UAVs and Beyond. IEEE Access 2023, 11, 136318–136349. [Google Scholar] [CrossRef]
- Bakirci, M. A Novel Swarm Unmanned Aerial Vehicle System: Incorporating Autonomous Flight, Real-Time Object Detection, and Coordinated Intelligence for Enhanced Performance. Trait. Signal 2023, 40, 2063–2078. [Google Scholar] [CrossRef]
- Jeong, W.Y.; Lee, K.M. Visual SLAM with Line and Corner Features. In Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, Beijing, China, 9–15 October 2006; pp. 2570–2575. [Google Scholar]













| Sequences | GLP-VO | PL-SLAM | ORB-SLAM | LSD-SLAM | PTAM |
|---|---|---|---|---|---|
| f1_xyz | 0.91 | 1.32 | 1.16 | 8.34 | 1.03 |
| f2_xyz | 0.32 | 1.16 | 0.48 | 1.83 | 0.18 |
| f1_floor | 6.58 | 8.27 | 7.12 | 36.29 | / |
| f2_360_kidnap | 2.26 | 56.43 | 3.31 | / | 2.23 |
| f3_long_office | 1.68 | 4.99 | 3.16 | 34.28 | / |
| f3_nstr_tex_far | / | 35.64 | / | 16.42 | 31.57 |
| f3_nstr_tex_near | 1.84 | 1.31 | 2.37 | 6.39 | 2.53 |
| f3_str_tex_far | 0.62 | 1.07 | 0.83 | 6.57 | 0.73 |
| f3_str_tex_near | 1.64 | 5.74 | 1.34 | / | 0.92 |
| f2_desk_person | 1.88 | 6.22 | 4.96 | 28.58 | / |
| Algorithm Module | Desktop (ms) | Onboard (ms) |
|---|---|---|
| Dual-Feature Extraction | ∼10.2 | ∼31.5 |
| Descriptor Generation | ∼4.5 | ∼12.8 |
| Geometric Matching | ∼5.8 | ∼18.2 |
| Adaptive Feature Fusion | ∼1.4 | ∼4.6 |
| Joint Pose Optimization | ∼6.5 | ∼22.4 |
| Total Execution Time | ∼28.4 ms | ∼89.5 ms |
| Real-time Performance | ∼35 FPS | ∼11 FPS |
| Sequences | Scene Description | GLP-VO | ORB-SLAM2 | PL-SLAM |
|---|---|---|---|---|
| Lab 1 | Dense urban environment (buildings) | 1.23 | 2.34 | 1.45 |
| Lab 2 | Complex city streets (varied structures) | 0.11 | 1.02 | 0.34 |
| Lab 3 | Semi-rural outskirts (farmland) | 4.46 | 4.78 | 5.21 |
| Configuration | Feature Extraction & Matching | Adaptive Feature Fusion | Optimization | f1_xyz (cm) | f2_360_kidnap (cm) | Lab 1 (m) | |||
|---|---|---|---|---|---|---|---|---|---|
| ATE | RPE | ATE | RPE | ATE | RPE | ||||
| Base | – | – | – | 1.16 | 0.62 | 3.31 | 1.78 | 9.25 | 3.60 |
| Variant 1 | ✓ | – | – | 1.05 | 0.56 | 2.85 | 1.52 | 7.80 | 3.02 |
| Variant 2 | ✓ | ✓ | – | 0.96 | 0.50 | 2.48 | 1.34 | 6.20 | 2.42 |
| Full GLP-VO | ✓ | ✓ | ✓ | 0.91 | 0.46 | 2.26 | 1.21 | 5.10 | 2.05 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Xu, Y.; Jiang, B.; Huang, L.; Qu, R.; Wang, Z. GLP-VO: A Hybrid Visual Odometry Framework for Low-Altitude UAV Imaging in Complex Urban Environments. Drones 2026, 10, 329. https://doi.org/10.3390/drones10050329
Xu Y, Jiang B, Huang L, Qu R, Wang Z. GLP-VO: A Hybrid Visual Odometry Framework for Low-Altitude UAV Imaging in Complex Urban Environments. Drones. 2026; 10(5):329. https://doi.org/10.3390/drones10050329
Chicago/Turabian StyleXu, Yuxuan, Bo Jiang, Longyang Huang, Ruokun Qu, and Zhiyuan Wang. 2026. "GLP-VO: A Hybrid Visual Odometry Framework for Low-Altitude UAV Imaging in Complex Urban Environments" Drones 10, no. 5: 329. https://doi.org/10.3390/drones10050329
APA StyleXu, Y., Jiang, B., Huang, L., Qu, R., & Wang, Z. (2026). GLP-VO: A Hybrid Visual Odometry Framework for Low-Altitude UAV Imaging in Complex Urban Environments. Drones, 10(5), 329. https://doi.org/10.3390/drones10050329

