Cross-View Outdoor Localization in Augmented Reality by Fusing Map and Satellite Data
-
Published:2023-10-12
Issue:20
Volume:13
Page:11215
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Emmaneel René12, Oswald Martin R.1, de Haan Sjoerd3, Datcu Dragos2
Affiliation:
1. Computer Vision Group, Informatics Institute, Faculty of Science, University of Amsterdam, Science Park 904, 1098 XH Amsterdam, The Netherlands 2. Huawei Technologies Netherlands, 1101 CM Amsterdam, The Netherlands 3. Go Grow AI, 1076 VC Amsterdam, The Netherlands
Abstract
Visual positioning is the task of finding the location of a given image and is necessary for augmented reality applications. Traditional algorithms solve this problem by matching against premade 3D point clouds or panoramic images. Recently, more attention has been given to models that match the ground-level image with overhead imagery. In this paper, we introduce AlignNet, which builds upon previous work to bridge the gap between ground-level and top-level images. By making multiple key insights, we push the model results to achieve up to 4 times higher recall rates on a visual position dataset. We use a fusion of both satellite and map data from OpenStreetMap for this matching by extending the previously available satellite database with corresponding map data. The model pushes the input images through a two-branch U-Net and is able to make matches using a geometric projection module to map the top-level image to the ground-level domain at a given position. By calculating the difference between the projection and ground-level image in a differentiable fashion, we can use a Levenberg–Marquardt (LM) module to iteratively align the estimated position towards the ground-truth position. This sample-wise optimization strategy allows the model to align the position better than if the model has to obtain the location in a single step. We provide key insights into the model’s behavior, which allows us to increase the model’s ability to obtain competitive results on the KITTI cross-view dataset. We compare our obtained results with the state of the art and obtain new best results on 3 of the 9 categories we look at, which include a 57% likelihood of lateral localization within 1 m in a 40 m × 40 m area and a 93% azimuth localization within 3∘ when using a 20∘ rotation noise prior.
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference34 articles.
1. Sattler, T., Leibe, B., and Kobbelt, L. (2011, January 6–13). Fast image-based localization using direct 2D-to-3D matching. Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain. 2. Li, Y., Snavely, N., Huttenlocher, D., and Fua, P. (2012). Proceedings of the European Conference on Computer Vision, Springer. 3. Zeisl, B., Sattler, T., and Pollefeys, M. (2015, January 7–13). Camera Pose Voting for Large-Scale Image-Based Localization. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile. 4. Efficient & Effective Prioritized Matching for Large-Scale Image-Based Localization;Sattler;IEEE Trans. Pattern Anal. Mach. Intell.,2017 5. Daniilidis, K., Maragos, P., and Paragios, N. (2010). Proceedings of the Computer Vision—ECCV 2010: 11th European Conference on Computer Vision, Heraklion, Crete, Greece, September 5–11, 2010, Proceedings, Part IV 11, Springer.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|