Abstract
AbstractVisual place recognition (VPR) is the ability to recognize locations in a physical environment based only on visual inputs. It is a challenging task due to perceptual aliasing, viewpoint and appearance variations and complexity of dynamic scenes. Despite promising demonstrations, many state-of-the-art VPR approaches based on artificial neural networks (ANNs) suffer from computational inefficiency. Spiking neural networks (SNNs), on the other hand, implemented on neuromorphic hardware, are reported to have remarkable potential towards more efficient solutions computationally, compared to ANNs. However, the training of the state-of-the-art (SOTA) SNNs for the VPR task is often intractable on large and diverse datasets. To address this, we develop an end-to-end convolutional SNN model for VPR, that leverages back-propagation for tractable training. Rate-based approximations of leaky integrate-and-fire (LIF) neurons are employed during training to enable back-propagation, and the approximation units are replaced with spiking LIF neurons during inference. The proposed method outperforms the SOTA ANNs and SNNs by achieving 78.2% precision at 100% recall on the challenging Nordland dataset, compared with 53% SOTA performance, and exhibits competitive performance on the Oxford RobotCar dataset while being easier to train and faster in both training and inference when compared to other ANN and SNN-based methods.
Publisher
Cold Spring Harbor Laboratory
Reference78 articles.
1. M. Lanham . Learn ARCore-Fundamentals of Google ARCore: Learn to build augmented reality apps for Android, Unity, and the web with Google ARCore 1.0. Packt Publishing Ltd, 2018.
2. T. Reinhardt . Using Global Localization to Improve Navigation, Feb. 2019. URL: https://ai.googleblog.com/2019/02/using-global-localization-to-improve.html. xLast visited on 05/15/2023.
3. T. Weyand , I. Kostrikov , and J. Philbin . Planet-photo geolocation with convolutional neural networks. In Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part VIII 14, pages 37–55. Springer, 2016.
4. P. H. Seo , T. Weyand , J. Sim , and B. Han . Cplanet: Enhancing image geolocalization by combinatorial partitioning of maps. In Proceedings of the European Conference on Computer Vision (ECCV), pages 536–551, 2018.
5. Vision-based mobile indoor assistive navigation aid for blind people;IEEE transactions on mobile computing,2018