A Bidirectional Scoring Strategy-Based Transformation Matrix Estimation of Dynamic Factors in Environmental Sensing
Abstract
:1. Introduction
2. Overall Structure
2.1. Bidirectional Scoring Strategy for the Filtering of Dynamic Feature Points
2.2. Transformation Matrix Estimation
- All values of are calculated.
- The mean w of all is found.
- is judged: If , and are correct similarities, and they are retained; otherwise, they are deleted.
- The filtered point pair with the correct similarity is taken as the initial iterative feature point pair for the RANSAC algorithm.
- The point pair with the correct similarity is used as a candidate matching feature set. Four groups are randomly selected to establish equations and calculate the unknowns in the transformation matrix M for the estimation of the transformation matrix.
- The distances between other feature points and the candidate matching points are calculated by using the transformation model, and the threshold r is set. When the distance is less than this threshold, the feature point is determined to be an inlier; otherwise, it is an outlier.
- The inliers are used to re-estimate the transformation matrix for N iterations.
3. Experiments and Analysis
3.1. Experimental Environment and Datasets
3.2. Feature Point Matching Based on the Bidirectional Scoring Strategy
3.3. Feature Point Filtering Based on the Estimation of a Transformation Matrix
3.4. Three-Dimensional Pose Tracking Accuracy
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
SLAM | Simultaneous localization and mapping |
TUM | Technical University of Munich |
RGB-D | Red–green–blue-depth |
ORB | Oriented FAST and rotated BRIEF |
ORB-SLAM2 | Oriented FAST and rotated BRIEF SLAM II |
ORB-SLAM3 | Oriented FAST and rotated BRIEF SLAM III |
RANSAC | Random sample consensus |
SegNet | Semantic segmentation network |
Mask R-CNN | Mask region-based convolutional neural network |
YOLO | You only look once |
DS-SLAM | Dynamic semantic SLAM |
DynaSLAM | Dynamic SLAM |
CNN | Convolutional neural network |
APE | Absolute pose error |
RMSE | Root-mean-squared error |
SSEs | Sum of squared errors |
References
- Montemerlo, M.; Thrun, S.; Koller, D.; Wegbreit, B. FastSLAM: A factored solution to the simultaneous localization and mapping problem. In Proceedings of the AAAI-02: Eighteenth National Conference on Artificial Intelligence, Edmonton, AL, Canada, 28 July–1 August 2002; Volume 593598. [Google Scholar]
- Chen, H.; Guo, P.; Li, P.; Lee, G.H.; Chirikjian, G. Multi-person 3D pose estimation in crowded scenes based on multi-view geometry. In Proceedings of the Computer Vision–ECCV 2020: 16th European Conference, Proceedings, Part III, Glasgow, UK, 23–28 August 2020; Springer: Berlin/Heidelberg, Germany, 2020; pp. 541–557. [Google Scholar]
- Henein, M.; Zhang, J.; Mahony, R.; Ila, V. Dynamic SLAM: The need for speed. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 31 May–31 August 2020; pp. 2123–2129. [Google Scholar]
- Cheng, Z.Q.; Chen, Y.; Martin, R.R.; Lai, Y.K.; Wang, A. SuperMatching: Feature Matching Using Supersymmetric Geometric Constraints. IEEE Trans. Vis. Comput. Graph. 2013, 19, 1885–1894. [Google Scholar] [CrossRef]
- Lu, X.; Manduchi, R. Wide baseline feature matching using the cross-epipolar ordering constraint. In Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2004, Washington, DC, USA, 27 June–2 July 2004; Volume 1, p. I. [Google Scholar] [CrossRef]
- Yu, C.; Liu, Z.; Liu, X.-J.; Xie, F.; Yang, Y.; Wei, Q.; Fei, Q. DS-SLAM: A semantic visual SLAM towards dynamic environments. In Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain, 1–5 October 2018; pp. 1168–1174. [Google Scholar]
- Mur-Artal, R.; Tardós, J.D. ORB-SLAM2: An open-source SLAM system for monocular, stereo, and RGB-D cameras. IEEE Trans. Robot. 2017, 33, 1255–1262. [Google Scholar] [CrossRef]
- Lopez-Molina, C.; Baets, B.D.; Bustince, H. Quantitative error measures for edge detection. Pattern Recognit. 2013, 46, 1125–1139. [Google Scholar] [CrossRef]
- Derpanis, K.G. Overview of the RANSAC Algorithm. Image Rochester NY 2010, 4, 2–3. [Google Scholar]
- Cashbaugh, J.; Kitts, C. Automatic Calculation of a Transformation Matrix Between Two Frames. IEEE Access 2018, 6, 9614–9622. [Google Scholar] [CrossRef]
- Badrinarayanan, V.; Kendall, A.; Cipolla, R. SegNet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 2481–2495. [Google Scholar] [CrossRef]
- He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2961–2969. [Google Scholar]
- Tian, Y.; Yang, G.; Wang, Z.; Wang, H.; Li, E.; Liang, Z. Apple detection during different growth stages in orchards using the improved YOLO-V3 model. Comput. Electron. Agric. 2019, 157, 417–426. [Google Scholar] [CrossRef]
- Bescos, B.; Fácil, J.M.; Civera, J.; Neira, J. DynaSLAM: Tracking, mapping, and inpainting in dynamic scenes. IEEE Robot. Autom. Lett. 2018, 3, 4076–4083. [Google Scholar] [CrossRef]
- Li, S.; Lee, D. RGB-D SLAM in dynamic environments using static point weighting. IEEE Robot. Autom. Lett. 2017, 2, 2263–2270. [Google Scholar] [CrossRef]
- Zhang, Y.; Jin, R.; Zhou, Z. Understanding bag-of-words model: A statistical framework. Int. J. Mach. Learn. Cybern. 2010, 1, 43–52. [Google Scholar] [CrossRef]
- Wang, L.; Zhang, Y.; Feng, J. On the Euclidean distance of images. IEEE Trans. Pattern Anal. Mach. Intell. 2005, 27, 1334–1339. [Google Scholar] [CrossRef]
- Roy, P.; Dutta, S.; Dey, N.; Dey, G.; Ray, S. Adaptive thresholding: A comparative study. In Proceedings of the 2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT), Kanyakumari, India, 10–11 July 2014. [Google Scholar]
- Campos, C.; Elvira, R.; Rodríguez, J.J.G.; Montiel, J.M.; Tardós, J.D. ORB-SLAM3: An accurate open-source library for visual, visual-inertial, and multimap SLAM. IEEE Trans. Robot. 2021, 37, 1874–1890. [Google Scholar] [CrossRef]
Mean | RMSE | |
---|---|---|
60% | 0.0652 | 0.0693 |
70% | 0.0447 | 0.0481 |
80% | 0.0281 | 0.0235 |
90% | 0.0637 | 0.0693 |
Comparative Experiments | ORB-SLAM3 | DynaSLAM | Proposed | Improvement on DynaSLAM | |||
---|---|---|---|---|---|---|---|
Feature Matching and APE | Matches | RMSE | Matches | RMSE | Matches | RMSE | RMSE |
freiburg3_sitting_halfsphere | 462 | 0.0430 | 198 | 0.0245 | 146 | 0.0240 | 2.91% |
freiburg3_sitting_rpy | 396 | 0.9839 | 176 | 0.9460 | 122 | 0.2591 | 72.60% |
freiburg3_sitting_static | 411 | 0.5759 | 171 | 0.2147 | 133 | 0.1270 | 40.87% |
freiburg3_sitting_xyz | 389 | 0.0270 | 185 | 0.0210 | 145 | 0.0175 | 16.77% |
freiburg3_walking_halfsphere | 367 | 0.7125 | 163 | 0.0230 | 159 | 0.0225 | 2.19% |
freiburg3_walking_rpy | 335 | 0.7551 | 153 | 0.1319 | 110 | 0.1125 | 14.69% |
freiburg3_walking_static | 459 | 2.7401 | 203 | 0.1710 | 172 | 0.1331 | 22.63% |
freiburg3_walking_xyz | 312 | 1.7032 | 189 | 0.0255 | 132 | 0.0235 | 7.68% |
Comparative Experiments | Improvement on ORB-SLAM3 | Improvement on DynaSLAM | ||||||
---|---|---|---|---|---|---|---|---|
APE | Mean | Median | RMSE | See | Mean | Median | RMSE | See |
freiburg3_sitting_halfsphere | 47.43% | 44.94% | 43.07% | 64.64% | 3.59% | 2.28% | 2.91% | 0.34% |
freiburg3_sitting_rpy | 73.69% | 72.76% | 73.66% | 92.97% | 72.63% | 72.92% | 72.60% | 64.42% |
freiburg3_sitting_static | 77.97% | 78.33% | 77.95% | 97.22% | 40.91% | 42.02% | 40.87% | 69.23% |
freiburg3_sitting_xyz | 37.00% | 39.83% | 35.26% | 50.46% | 16.89% | 21.26% | 16.77% | 32.46% |
freiburg3_walking_halfsphere | 97.14% | 97.52% | 96.84% | 99.97% | 2.87% | 1.82% | 2.19% | 2.03% |
freiburg3_walking_rpy | 84.64% | 85.21% | 85.10% | 99.36% | 15.31% | 16.52% | 14.69% | 27.90% |
freiburg3_walking_static | 95.14% | 95.14% | 95.14% | 99.86% | 22.19% | 21.85% | 22.63% | 58.92% |
freiburg3_walking_xyz | 98.71% | 98.78% | 98.62% | 99.99% | 10.63% | 12.03% | 7.68% | 22.65% |
Comparative Experiments | Improvement on ORB-SLAM3 | Improvement on DynaSLAM | ||||||
---|---|---|---|---|---|---|---|---|
RPE | Mean | Median | RMSE | See | Mean | Median | RMSE | See |
rgbd_bonn_balloon | 68.37% | 82.37% | 54.98% | 24.01% | 62.28% | 61.03% | 49.86% | 27.07% |
rgbd_bonn_crowd3 | 63.57% | 68.81% | 52.98% | 3.95% | 39.89% | 43.82% | 31.10% | 3.52% |
rgbd_bonn_person_tracking2 | 37.23% | 31.52% | 30.26% | 22.58% | 37.61% | 50.10% | 23.73% | 4.05% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, B.; Cheng, X.; Wang, J.; Jiao, L. A Bidirectional Scoring Strategy-Based Transformation Matrix Estimation of Dynamic Factors in Environmental Sensing. Remote Sens. 2024, 16, 723. https://doi.org/10.3390/rs16040723
Wang B, Cheng X, Wang J, Jiao L. A Bidirectional Scoring Strategy-Based Transformation Matrix Estimation of Dynamic Factors in Environmental Sensing. Remote Sensing. 2024; 16(4):723. https://doi.org/10.3390/rs16040723
Chicago/Turabian StyleWang, Bo, Xina Cheng, Jialiang Wang, and Licheng Jiao. 2024. "A Bidirectional Scoring Strategy-Based Transformation Matrix Estimation of Dynamic Factors in Environmental Sensing" Remote Sensing 16, no. 4: 723. https://doi.org/10.3390/rs16040723
APA StyleWang, B., Cheng, X., Wang, J., & Jiao, L. (2024). A Bidirectional Scoring Strategy-Based Transformation Matrix Estimation of Dynamic Factors in Environmental Sensing. Remote Sensing, 16(4), 723. https://doi.org/10.3390/rs16040723