Infrared and Visible Image Homography Estimation Based on Feature Correlation Transformers for Enhanced 6G Space–Air–Ground Integrated Network Perception
-
Published:2023-07-13
Issue:14
Volume:15
Page:3535
-
ISSN:2072-4292
-
Container-title:Remote Sensing
-
language:en
-
Short-container-title:Remote Sensing
Author:
Wang Xingyi1, Luo Yinhui1, Fu Qiang1, Rui Yun2, Shu Chang1, Wu Yuezhou1, He Zhige1, He Yuanqing1
Affiliation:
1. School of Computer Science, Civil Aviation Flight University of China, Guanghan 618307, China 2. School of Communication and Electronic Engineering, East China Normal University, Shanghai 200241, China
Abstract
The homography estimation of infrared and visible images, a key technique for assisting perception, is an integral element within the 6G Space–Air–Ground Integrated Network (6G SAGIN) framework. It is widely applied in the registration of these two image types, leading to enhanced environmental perception and improved efficiency in perception computation. However, the traditional estimation methods are frequently challenged by insufficient feature points and the low similarity in features when dealing with these images, which results in poor performance. Deep-learning-based methods have attempted to address these issues by leveraging strong deep feature extraction capabilities but often overlook the importance of precisely guided feature matching in regression networks. Consequently, exactly acquiring feature correlations between multi-modal images remains a complex task. In this study, we propose a feature correlation transformer method, devised to offer explicit guidance for feature matching for the task of homography estimation between infrared and visible images. First, we propose a feature patch, which is used as a basic unit for correlation computation, thus effectively coping with modal differences in infrared and visible images. Additionally, we propose a novel cross-image attention mechanism to identify correlations between varied modal images, thus transforming the multi-source images homography estimation problem into a single-source images problem by achieving source-to-target image mapping in the feature dimension. Lastly, we propose a feature correlation loss (FCL) to induce the network into learning a distinctive target feature map, further enhancing source-to-target image mapping. To validate the effectiveness of the newly proposed components, we conducted extensive experiments to demonstrate the superiority of our method compared with existing methods in both quantitative and qualitative aspects.
Funder
National Key R&D Program of China Science and Technology Plan Project of Sichuan Province Fundamental Research Funds for the Central Universities
Subject
General Earth and Planetary Sciences
Reference53 articles.
1. Liao, Z., Chen, C., Ju, Y., He, C., Jiang, J., and Pei, Q. (2022). Multi-Controller Deployment in SDN-Enabled 6G Space–Air–Ground Integrated Network. Remote Sens., 14. 2. Chen, C., Wang, C., Liu, B., He, C., Cong, L., and Wan, S. (2023). Edge Intelligence Empowered Vehicle Detection and Image Segmentation for Autonomous Vehicles. IEEE Trans. Intell. Transp. Syst., 1–12. 3. Joint Secure Offloading and Resource Allocation for Vehicular Edge Computing Network: A Multi-Agent Deep Reinforcement Learning Approach;Ju;IEEE Trans. Intell. Transp. Syst.,2023 4. A Cooperative Vehicle-Infrastructure System for Road Hazards Detection With Edge Intelligence;Chen;IEEE Trans. Intell. Transp. Syst.,2023 5. Xu, H., Ma, J., Yuan, J., Le, Z., and Liu, W. (2022, January 19–24). Rfnet: Unsupervised network for mutually reinforcing multi-modal image registration and fusion. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|