Next Article in Journal
Techno-Economic Analysis of Flare Gas to Hydrogen: A Lean and Green Sustainability Approach
Previous Article in Journal
Influence of Rare Earth Elements on the Radiation-Shielding Behavior of Serpentinite-Based Materials
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

DK-SLAM: Monocular Visual SLAM with Deep Keypoint Learning, Tracking, and Loop Closing

1
College of Intelligence Science and Technology, National University of Defense Technology, Changsha 410073, China
2
College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(14), 7838; https://doi.org/10.3390/app15147838 (registering DOI)
Submission received: 24 June 2025 / Revised: 3 July 2025 / Accepted: 10 July 2025 / Published: 13 July 2025
(This article belongs to the Section Robotics and Automation)

Abstract

The performance of visual SLAM in complex, real-world scenarios is often compromised by unreliable feature extraction and matching when using handcrafted features. Although deep learning-based local features excel at capturing high-level information and perform well on matching benchmarks, they struggle with generalization in continuous motion scenes, adversely affecting loop detection accuracy. Our system employs a Model-Agnostic Meta-Learning (MAML) strategy to optimize the training of keypoint extraction networks, enhancing their adaptability to diverse environments. Additionally, we introduce a coarse-to-fine feature tracking mechanism for learned keypoints. It begins with a direct method to approximate the relative pose between consecutive frames, followed by a feature matching method for refined pose estimation. To mitigate cumulative positioning errors, DK-SLAM incorporates a novel online learning module that utilizes binary features for loop closure detection. This module dynamically identifies loop nodes within a sequence, ensuring accurate and efficient localization. Experimental evaluations on publicly available datasets demonstrate that DK-SLAM outperforms leading traditional and learning-based SLAM systems, such as ORB-SLAM3 and LIFT-SLAM. DK-SLAM achieves 17.7% better translation accuracy and 24.2% better rotation accuracy than ORB-SLAM3 on KITTI and 34.2% better translation accuracy on EuRoC. These results underscore the efficacy and robustness of our DK-SLAM in varied and challenging real-world environments.
Keywords: monocular SLAM; deep learning; feature extraction and matching; loop closing monocular SLAM; deep learning; feature extraction and matching; loop closing

Share and Cite

MDPI and ACS Style

Qu, H.; Zhang, L.; Mao, J.; Tie, J.; He, X.; Hu, X.; Shi, Y.; Chen, C. DK-SLAM: Monocular Visual SLAM with Deep Keypoint Learning, Tracking, and Loop Closing. Appl. Sci. 2025, 15, 7838. https://doi.org/10.3390/app15147838

AMA Style

Qu H, Zhang L, Mao J, Tie J, He X, Hu X, Shi Y, Chen C. DK-SLAM: Monocular Visual SLAM with Deep Keypoint Learning, Tracking, and Loop Closing. Applied Sciences. 2025; 15(14):7838. https://doi.org/10.3390/app15147838

Chicago/Turabian Style

Qu, Hao, Lilian Zhang, Jun Mao, Junbo Tie, Xiaofeng He, Xiaoping Hu, Yifei Shi, and Changhao Chen. 2025. "DK-SLAM: Monocular Visual SLAM with Deep Keypoint Learning, Tracking, and Loop Closing" Applied Sciences 15, no. 14: 7838. https://doi.org/10.3390/app15147838

APA Style

Qu, H., Zhang, L., Mao, J., Tie, J., He, X., Hu, X., Shi, Y., & Chen, C. (2025). DK-SLAM: Monocular Visual SLAM with Deep Keypoint Learning, Tracking, and Loop Closing. Applied Sciences, 15(14), 7838. https://doi.org/10.3390/app15147838

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop