Leveraging cross-view geo-localization with ensemble learning and temporal awareness

Author:

Ghanem Abdulrahman, Abdelhay Ahmed, Salah Noor Eldeen, Nour Eldeen Ahmed, Elhenawy Mohammed, Masoud Mahmoud, Hassan Ammar M., Hassan Abdallah A.

Abstract

The Global Navigation Satellite System (GNSS) is unreliable in some situations. To compensate for a poor GNSS signal, an autonomous vehicle can self-localize by matching a ground image against a database of geotagged aerial images. However, this approach is challenging because of the dramatic differences in viewpoint between aerial and ground views, harsh weather and lighting conditions, and the lack of orientation information in training and deployment environments. In this paper, it is shown that previous models in this area are complementary, not competitive, and that each model solves a different aspect of the problem, which calls for a holistic approach. An ensemble model is proposed to aggregate the predictions of multiple independently trained state-of-the-art (SOTA) models. Previous SOTA temporal-aware models used heavyweight networks to fuse the temporal information into the query process. Here, the effect of making the query process temporal-aware is explored and exploited by an efficient meta block: naive history. Because none of the existing benchmark datasets was suitable for extensive temporal-awareness experiments, a new derivative dataset based on the BDD100K dataset is generated. The proposed ensemble model achieves a recall accuracy R@1 (Recall@1: the topmost prediction) of 97.74% on the CVUSA dataset and 91.43% on the CVACT dataset, surpassing the current SOTA. The temporal-awareness algorithm converges to an R@1 of 100% by looking a few steps back in the trip history.
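To make the retrieval setting concrete, the following is a minimal sketch of how Recall@K can be computed from query-to-gallery similarity scores, and how the scores of several independently trained models might be combined. The abstract does not state the aggregation rule used by the proposed ensemble, so the element-wise averaging here is an assumption chosen for simplicity. The function names (`average_scores`, `recall_at_k`) and the toy score matrices are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence

# scores[q][g]: similarity of ground query q to aerial gallery image g
Matrix = List[List[float]]


def average_scores(per_model_scores: Sequence[Matrix]) -> Matrix:
    """Element-wise mean of same-shaped score matrices.

    Assumption: plain averaging stands in for the paper's (unspecified)
    aggregation of model predictions.
    """
    n_models = len(per_model_scores)
    n_queries = len(per_model_scores[0])
    n_gallery = len(per_model_scores[0][0])
    return [
        [sum(m[q][g] for m in per_model_scores) / n_models for g in range(n_gallery)]
        for q in range(n_queries)
    ]


def recall_at_k(scores: Matrix, true_index: Sequence[int], k: int = 1) -> float:
    """Fraction of queries whose true aerial image is among the k best-scoring ones."""
    hits = 0
    for row, truth in zip(scores, true_index):
        # Gallery indices sorted from highest to lowest similarity.
        ranked = sorted(range(len(row)), key=lambda g: row[g], reverse=True)
        hits += truth in ranked[:k]
    return hits / len(scores)


# Toy data (made up): 3 ground queries, 3 aerial images, query i matches image i.
truth = [0, 1, 2]
model_a = [[0.9, 0.1, 0.0], [0.2, 0.3, 0.5], [0.1, 0.2, 0.7]]  # wrong on query 1
model_b = [[0.3, 0.6, 0.1], [0.1, 0.8, 0.1], [0.2, 0.1, 0.7]]  # wrong on query 0
ensemble = average_scores([model_a, model_b])

r1_a = recall_at_k(model_a, truth, k=1)
r1_b = recall_at_k(model_b, truth, k=1)
r1_ensemble = recall_at_k(ensemble, truth, k=1)
```

In this toy case each model misses a different query, so the averaged scores rank all three queries correctly. This only illustrates the sense in which complementary models can help one another; it says nothing about the reported CVUSA or CVACT results.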

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

