Affiliation:
1. School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China
2. Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
Abstract
Robust and precise visual localization over extended periods of time poses a formidable challenge in the current domain of spatial vision. The primary difficulty lies in effectively addressing significant variations in appearance caused by seasonal changes (summer, winter, spring, autumn) and diverse lighting conditions (dawn, day, sunset, night). With the rapid development of related technologies, more and more relevant datasets have emerged, which has also promoted the progress of 6-DOF visual localization in both directions of autonomous vehicles and handheld devices.This manuscript endeavors to rectify the existing limitations of the current public benchmark for long-term visual localization, especially in the part on the autonomous vehicle challenge. Taking into account that autonomous vehicle datasets are primarily captured by multi-camera rigs with fixed extrinsic camera calibration and consist of serialized image sequences, we present several proposed modifications designed to enhance the rationality and comprehensiveness of the evaluation algorithm. We advocate for standardized preprocessing procedures to minimize the possibility of human intervention influencing evaluation results. These procedures involve aligning the positions of multiple cameras on the vehicle with a predetermined canonical reference system, replacing the individual camera positions with uniform vehicle poses, and incorporating sequence information to compensate for any failed localized poses. These steps are crucial in ensuring a just and accurate evaluation of algorithmic performance. Lastly, we introduce a novel indicator to resolve potential ties in the Schulze ranking among submitted methods. The inadequacies highlighted in this study are substantiated through simulations and actual experiments, which unequivocally demonstrate the necessity and effectiveness of our proposed amendments.
Funder
the National Key R&D Program of China
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference39 articles.
1. Castle, R.O., Klein, G., and Murray, D.W. (October, January 28). Video-Rate Localization in Multiple Maps for Wearable Augmented Reality. Proceedings of the 12th IEEE International Symposium on Wearable Computers (ISWC 2008), Pittsburgh, PA, USA.
2. ORB-SLAM: A Versatile and Accurate Monocular SLAM System;Montiel;IEEE Trans. Robot.,2015
3. NetVLAD: CNN Architecture for Weakly Supervised Place Recognition;Arandjelovic;IEEE Trans. Pattern Anal. Mach. Intell.,2018
4. 24/7 Place Recognition by View Synthesis;Torii;IEEE Trans. Pattern Anal. Mach. Intell.,2018
5. FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance;Cummins;Int. J. Robot. Res.,2008