Research on Inter-Frame Feature Mismatch Removal Method of VSLAM in Dynamic Scenes
Abstract
:1. Introduction
- Propose an adaptive RANSAC method (ATRANSAC) capable of dynamically adjusting the image feature matching error threshold. Through a certain number of iterations, it identifies the minimum error threshold that satisfies the given inlier matching rate. This method dynamically adjusts the error threshold with relatively fewer iterations compared to traditional RANSAC methods, providing a precise and fast image feature mismatch removal approach.
- The GMS method [12] has a fast detection speed and offers relatively reliable filtering of image feature-matching results. The combination of GMS and ATRANSAC methods effectively removes image feature mismatches. The GMS coarse screening set is downsampled to reduce the computational burden of ATRANSAC, facilitating the identification of the optimal set of image feature matches with very few iterations.
- Conduct detailed testing on the performance and feasibility of GMS-ATRANSAC. First, this proposed method was visually compared to the classical method RANSAC and the latest methods GMS and GMS-RANSAC for image feature matching effects in various scenes within the TUM [13] and KITTI [14] datasets. Subsequently, a comparative analysis of the accuracy and processing speed in the mismatch removal process for the mentioned methods was conducted in dynamic environments. Finally, the proposed method was applied to the initialization thread of ORB-SLAM2 and the ORB-SLAM3 tracking thread for feasibility verification.
2. Related Works
3. Methodology
3.1. GMS-ATRANSAC Framework
- Capture continuous 2D images PA and PB through the camera.
- Extract FAST keypoints in images PA and PB.
- Calculate the BRIEF descriptors corresponding to the FAST keypoints in images PA and PB, completing the ORB feature extraction.
- Obtain a matching set by using brute-force matching [31] based on Hamming distance for ORB features.
- Apply the GMS method to perform coarse screening on the brute-force matching set , obtaining the coarse matching set .
- Downsample the coarse screening matching set to obtain the downsampled matching set .
- Apply the ATRANSAC algorithm proposed in this paper for fine screening in the downsampled matching set
3.2. Adaptive Threshold RANSAC
- Set the initial maximum internal matching rate Pmax, minimum internal matching rate Pmin, error threshold et, and matching set
- Randomly extract four matches from the matching sample set and calculate the homography matrix H.
- Calculate the reprojection point set by projecting the matching point set from image PA to image PB based on the homography matrix H.
- Calculate the error for each matching point using Formula (2) and determine whether it is an internal matching using Equation (3); if it is an internal matching, add it to the internal matching set Mn and record the internal matching count Q.
- Check if the current iteration’s internal matching count Q is less than 4. If it is less than or equal to 4, output the previous internal matching set Mn−1. If Q is greater than 4, proceed to the next step.
- Calculate the internal matching rate P using Equation (4). If update the error threshold et to α times its original value (where α < 1) and repeat from step (2). If , reduce by β.
- If , output the current internal matching set Mn.
Algorithm 1: Adaptive error threshold RANSAC |
Input: Sample set: Initial parameters: Maximum internal matching rate Pmax Minimum internal matching rate Pmin Error Threshold et Error Threshold Coefficient α Maximum inlier ratio adjustment factor β Output: Internal matching set Mn |
|
3.3. GMS-ATRANSAC for Image Feature Mismatch Removal
4. Experimental
4.1. Dataset and Metrics
4.2. Removal of Image Feature Mismatches in Different Scenarios
4.3. Inter-Frame Feature Mismatch Removal Based on GMS-ATRANSAC in ORB-SLAM
4.3.1. Initializer
4.3.2. Tracking
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Kazerouni, I.A.; Fitzgerald, L.; Dooly, G.; Toal, D. A survey of state-of-the-art on visual SLAM. Expert Syst. Appl. 2022, 205, 117734. [Google Scholar] [CrossRef]
- Fuentes-Pacheco, J.; Ruiz-Ascencio, J.; Rendón-Mancha, J.M. Visual simultaneous localization and mapming: A survey. Artif. Intell. Rev. 2015, 43, 55–81. [Google Scholar] [CrossRef]
- Davison, A.J.; Reid, I.D.; Molton, N.D.; Stasse, O. MonoSLAM: Real-time single camera SLAM. IEEE Trans. Pattern Anal. Mach. Intell. 2007, 29, 1052–1067. [Google Scholar] [CrossRef] [PubMed]
- Klein, G.; Murray, D. Parallel tracking and mapping for small AR workspaces. In Proceedings of the 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality, Nara, Japan, 13–16 November 2007. [Google Scholar]
- Forster, C.; Pizzoli, M.; Scaramuzz, D. SVO: Fast semi-direct monocular visual odometry. In Proceedings of the 2014 IEEE International Conference on Robotics and Automation (ICRA), Hong Kong, China, 31 May–7 June 2014. [Google Scholar]
- Rublee, E.; Rabaud, V.; Konolige, K.; Bradski, G. ORB: An efficient alternative to SIFT or SURF. In Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain, 6–13 November 2011. [Google Scholar]
- Mur-Artal, R.; Montiel, J.M.M.; Tardos, J.D. ORB-SLAM: A Versatile and Accurate Monocular SLAM System. IEEE Trans. Robot. 2015, 31, 1147–1163. [Google Scholar] [CrossRef]
- Mur-Artal, R.; Tardós, J.D. Orb-slam2: An open-source slam system for monocular, stereo, and rgb-d cameras. IEEE Trans. Robot. 2017, 33, 1255–1262. [Google Scholar] [CrossRef]
- Campos, C.; Elvira, R.; Rodríguez, J.J.; Montiel, J.M.; Tardós, J.D. Orb-slam3: An accurate open-source library for visual, visual–inertial, and multimap slam. IEEE Trans. Robot. 2021, 37, 1874–1890. [Google Scholar] [CrossRef]
- Kim, T.W. Feature Detection Based on Significancy of Local Features for Image Matching. IEICE Trans. Inf. Syst. 2021, 104, 1510–1513. [Google Scholar] [CrossRef]
- Gao, X.; Zhang, T. Robust RGB-D simultaneous localization and mapping using planar point features. Robot. Auton. Syst. 2015, 72, 1–14. [Google Scholar] [CrossRef]
- Bian, J.W.; Lin, W.Y.; Matsushita, Y. Gms: Grid-based motion statistics for fast, ultra-robust feature correspondence. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017. [Google Scholar]
- Sturm, J.; Engelhard, N.; Endres, F.; Burgard, W.; Cremers, D. A benchmark for the evaluation of RGB-D SLAM systems. In Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, Vilamoura-Algarve, Portugal, 7–12 October 2012. [Google Scholar]
- Geiger, A.; Lenz, P.; Stiller, C.; Urtasun, R. Vision meets robotics: The kitti dataset. Int. J. Robot. Res. 2013, 32, 1231–1237. [Google Scholar] [CrossRef]
- Dubrofsky, E. Homography Estimation. Master’s Thesis, University of British Columbia, Vancouver, BC, Canada, 2009. [Google Scholar]
- Fischler, M.A.; Bolles, R.C. Random Sample Consensus: A paradigm for model fitting with application to image analysis and automated cartography. Commun. ACM 1981, 24, 381–395. [Google Scholar] [CrossRef]
- Hartley, R.; Zisserman, A. Multiple View Geometry in Computer Vision; Cambridge University Press: Cambridge, UK, 2003. [Google Scholar]
- Torr, P.H.S.; Zisserman, A. MLESAC: A New Robust Estimator with Application to Estimating Image Geometry. Comput. Vis. Image Underst. 2000, 78, 138–156. [Google Scholar] [CrossRef]
- Chum, O.; Matas, J.; Kittler, J. Locally optimized RANSAC. In Pattern Recognition: 25th DAGM Symposium, Magdeburg, Germany, September 10–12, 2003. Proceedings 25; Springer: Berlin/Heidelberg, Germany, 2003. [Google Scholar]
- Zhang, Q.; Wu, S.; Wang, W.; Fang, Z. Improving 2D Camera Calibration by LO-RANSAC. Int. J. Inf. Electron. Eng. 2017, 7, 93–98. [Google Scholar] [CrossRef]
- Barath, D.; Matas, J. Graph-cut RANSAC. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018. [Google Scholar]
- Chum, O.; Matas, J. Matching with PROSAC-progressive sample consensus. In Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), San Diego, CA, USA, 20–26 June 2005; Volume 1. [Google Scholar]
- Xin, G.; Zhang, X.; Wang, X.; Song, J. A RGBD SLAM algorithm combining ORB with PROSAC for indoor mobile robot. In Proceedings of the 2015 4th International Conference on Computer Science and Network Technology (ICCSNT), Harbin, China, 19–20 December 2015. [Google Scholar]
- Myatt, D.R.; Torr, P.H.S.; Nasuto, S.J.; Bishop, J.M.; Craddock, R. NAPSAC: High noise, high dimensional robust estimation—it’s in the bag. In Proceedings of the 13th British Machine Vision Conference (BMVC), Cardiff, UK, 2–5 September 2002; pp. 458–467. [Google Scholar]
- Rodehorst, V.; Hellwich, O. Genetic Algorithm SAmple Consensus (GASAC)—A parallel strategy for robust parameter estimation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshop(CVPRW), New York, NY, USA, 17–22 June 2006. [Google Scholar]
- Chum, O.; Matas, J. Randomized RANSAC with Td,d test. In Proceedings of the 13th British Machine Vision Conference (BMVC), Cardiff, UK, 2–5 September 2002; pp. 448–457. [Google Scholar]
- Torr, P. Bayesian Model Estimation and Selection for Epipolar Geometry and Generic Manifold Fitting. Int. J. Comput. Vis. 2002, 50, 35–61. [Google Scholar] [CrossRef]
- Xu, W.; Mulligan, J. Feature Matching under Region-Based Constraints for Robust Epipolar Geometry Estimation. In International Symposium on Visual Computing; Springer: Berlin/Heidelberg, Germany, 2009. [Google Scholar]
- Lan, X.; Guo, B.; Huang, Z.; Zhang, S. An Improved UAV Aerial Image Mosaic Algorithm Based on GMS-RANSAC. In Proceedings of the 2020 IEEE 5th International Conference on Signal and Image Processing (ICSIP), Nanjing, China, 3–5 July 2020. [Google Scholar]
- Zhang, D.; Zhu, J.; Wang, F.; Hu, X.; Ye, X. GMS-RANSAC: A Fast Algorithm for Removing Mismatches Based on ORB-SLAM2. Symmetry 2022, 14, 849. [Google Scholar] [CrossRef]
- Jakubović, A.; Velagić, J. Image feature matching and object detection using brute-force matchers. In Proceedings of the 2018 International Symposium ELMAR, Zadar, Croatia, 16–19 September 2018. [Google Scholar]
Sequences | Metrics | RANSAC | G-R | G-ATR |
---|---|---|---|---|
TUM (Indoor static) | 1.489 | 1.655 | 1.236 | |
0.843 | 0.642 | 0.353 | ||
Processing time (ms) | 94 | 25 | 12 | |
TUM (Indoor dynamic) | 1.275 | 1.212 | 0.802 | |
0.752 | 0.774601 | 0.278 | ||
Processing time (ms) | 95 | 34 | 22 | |
KITTI (Outdoor dynamic) | 1.543 | 1.678 | 1.014 | |
0.778 | 0.654 | 0.229 | ||
Processing time (ms) | 100 | 42 | 29 |
Sequences | for RANSAC (%) | for G-R (%) | for RANSAC (%) | for G-R (%) | for RANSAC (%) | for G-R (%) |
---|---|---|---|---|---|---|
TUM (Indoor static) | 16.9% | 25.3% | 58.1% | 45.0% | 87.2% | 52.0% |
TUM (Indoor dynamic) | 37.1% | 33.8% | 63.0% | 64.1% | 76.8% | 31.0% |
KITTI (Outdoor dynamic) | 34.2% | 39.6% | 70.5% | 64.9% | 71.0% | 31.0% |
AVG. | 29.4% | 32.9% | 63.9% | 58.0% | 78.3% | 38.0% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Yang, Z.; He, Y.; Zhao, K.; Lang, Q.; Duan, H.; Xiong, Y.; Zhang, D. Research on Inter-Frame Feature Mismatch Removal Method of VSLAM in Dynamic Scenes. Sensors 2024, 24, 1007. https://doi.org/10.3390/s24031007
Yang Z, He Y, Zhao K, Lang Q, Duan H, Xiong Y, Zhang D. Research on Inter-Frame Feature Mismatch Removal Method of VSLAM in Dynamic Scenes. Sensors. 2024; 24(3):1007. https://doi.org/10.3390/s24031007
Chicago/Turabian StyleYang, Zhiyong, Yang He, Kun Zhao, Qing Lang, Hua Duan, Yuhong Xiong, and Daode Zhang. 2024. "Research on Inter-Frame Feature Mismatch Removal Method of VSLAM in Dynamic Scenes" Sensors 24, no. 3: 1007. https://doi.org/10.3390/s24031007
APA StyleYang, Z., He, Y., Zhao, K., Lang, Q., Duan, H., Xiong, Y., & Zhang, D. (2024). Research on Inter-Frame Feature Mismatch Removal Method of VSLAM in Dynamic Scenes. Sensors, 24(3), 1007. https://doi.org/10.3390/s24031007