AGCosPlace: A UAV Visual Positioning Algorithm Based on Transformer-Reference-Cited by-同舟云学术

AGCosPlace: A UAV Visual Positioning Algorithm Based on Transformer

Published:2023-07-28 Issue:8 Volume:7 Page:498
ISSN:2504-446X
Container-title:Drones
language:en
Short-container-title:Drones

Author:

Guo Ya¹,Zhou Yatong¹,Yang Fan¹

Affiliation:

1. School of Electronic and Information Engineering, Hebei University of Technology, 5340 Xiping Road, Beichen District, Tianjin 300401, China

Abstract

To address the limitation and obtain the position of the drone even when the relative poses and intrinsics of the drone camera are unknown, a visual positioning algorithm based on image retrieval called AGCosPlace, which leverages the Transformer architecture to achieve improved performance, is proposed. Our approach involves subjecting the feature map of the backbone to an encoding operation that incorporates attention mechanisms, multi-layer perceptron coding, and a graph network module. This encoding operation allows for better aggregation of the context information present in the image. Subsequently, the aggregation module with dynamic adaptive pooling produces a descriptor with an appropriate dimensionality, which is then passed into the classifier to recognize the position. Considering the complexity associated with labeling visual positioning labels for UAV images, the visual positioning network is trained using the publicly available Google Street View SF-XL dataset. The performance of the trained network model on a custom UAV perspective test set is evaluated. The experimental results demonstrate that our proposed algorithm, which improves upon the ResNet backbone networks on the SF-XL test set, exhibits excellent performance on the UAV test set. The algorithm achieves notable improvements in the four evaluation metrics: R@1, R@5, R@10, and R@20. These results confirm that the trained visual positioning network can effectively be employed in UAV visual positioning tasks.

Funder

Special Foundation for Beijing Tianjin Hebei Basic Research Cooperation

Inner Mongolia Discipline Inspection and Supervision Big Data Laboratory

Publisher

MDPI AG

Subject

Artificial Intelligence,Computer Science Applications,Aerospace Engineering,Information Systems,Control and Systems Engineering

Link

https://www.mdpi.com/2504-446X/7/8/498/pdf

Reference41 articles.

1. DSF-NOMA: UAV-assisted emergency communication technology in a heterogeneous Internet of Things;Liu;IEEE Internet Things J.,2019

2. A compilation of UAV applications for precision agriculture;Sarigiannidis;Comput. Netw.,2020

3. High-level multiple-UAV cinematography tools for covering outdoor events;Mademlis;IEEE Trans. Broadcast.,2019

4. Feroz, S., and Abu Dabous, S. (2021). Uav-based remote sensing applications for bridge condition assessment. Remote Sens., 13.

5. Applications of unmanned aerial vehicle (UAV) in road safety, traffic and highway infrastructure management: Recent advances and challenges;Outay;Transp. Res. Part A Policy Pract.,2020

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. PnP-UGCSuperGlue: deep learning drone image matching algorithm for visual localization;The Journal of Supercomputing;2024-05-01

2. Ethical Considerations in Drone Cybersecurity;Advances in Information Security, Privacy, and Ethics;2024-01-26

3. VL-MFL: UAV Visual Localization Based on Multisource Image Feature Learning;IEEE Transactions on Geoscience and Remote Sensing;2024