UAV’s Status Is Worth Considering: A Fusion Representations Matching Method for Geo-Localization
Author:
Zhu RunzheORCID, Yang MingzeORCID, Yin Ling, Wu Fei, Yang Yuncheng
Abstract
Visual geo-localization plays a crucial role in positioning and navigation for unmanned aerial vehicles, whose goal is to match the same geographic target from different views. This is a challenging task due to the drastic variations in different viewpoints and appearances. Previous methods have been focused on mining features inside the images. However, they underestimated the influence of external elements and the interaction of various representations. Inspired by multimodal and bilinear pooling, we proposed a pioneering feature fusion network (MBF) to address these inherent differences between drone and satellite views. We observe that UAV’s status, such as flight height, leads to changes in the size of image field of view. In addition, local parts of the target scene act a role of importance in extracting discriminative features. Therefore, we present two approaches to exploit those priors. The first module is to add status information to network by transforming them into word embeddings. Note that they concatenate with image embeddings in Transformer block to learn status-aware features. Then, global and local part feature maps from the same viewpoint are correlated and reinforced by hierarchical bilinear pooling (HBP) to improve the robustness of feature representation. By the above approaches, we achieve more discriminative deep representations facilitating the geo-localization more effectively. Our experiments on existing benchmark datasets show significant performance boosting, reaching the new state-of-the-art result. Remarkably, the recall@1 accuracy achieves 89.05% in drone localization task and 93.15% in drone navigation task in University-1652, and shows strong robustness at different flight heights in the SUES-200 dataset.
Funder
The Science and Technology Development Center of the Ministry of Education of China Science and Technology Commission of Shanghai Municipality
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference50 articles.
1. Wang, Y., Li, S., Lin, Y., and Wang, M. (2021). Lightweight Deep Neural Network Method for Water Body Extraction from High-Resolution Remote Sensing Images with Multisensors. Sensors, 21. 2. Suo, C., Zhao, J., Zhang, W., Li, P., Huang, R., Zhu, J., and Tan, X. (2021). Research on UAV Three-Phase Transmission Line Tracking and Localization Method Based on Electric Field Sensor Array. Sensors, 21. 3. Zhu, C., Zhu, J., Bu, T., and Gao, X. (2022). Monitoring and Identification of Road Construction Safety Factors via UAV. Sensors, 22. 4. Chen, C.L., He, R., and Peng, C.C. (2022). Development of an Online Adaptive Parameter Tuning vSLAM Algorithm for UAVs in GPS-Denied Environments. Sensors, 22. 5. Hassan, S.I., Alam, M.M., Zia, M.Y.I., Rashid, M., Illahi, U., and Su’ud, M.M. (2022). Rice Crop Counting Using Aerial Imagery and GIS for the Assessment of Soil Health to Increase Crop Yield. Sensors, 22.
Cited by
16 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|